A fast maintenance algorithm of the discovered high-utility itemsets with transaction deletion
High-utility itemset mining (HUIM) has been recently studied to mine high-utility itemsets (HUIs) from the transactional database by considering more factors such as profit and quantity. Many approaches have been proposed for HUIM from a static database. Fewer studies have been developed to maintain...
Gespeichert in:
Veröffentlicht in: | Intelligent data analysis 2016-01, Vol.20 (4), p.891-913 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | High-utility itemset mining (HUIM) has been recently studied to mine high-utility itemsets (HUIs) from the transactional database by considering more factors such as profit and quantity. Many approaches have been proposed for HUIM from a static database. Fewer studies have been developed to maintain the discovered HUIs in dynamic environment whether transaction insertion or transaction deletion. In the past, the FUP-HUI-DEL and PRE-HUI-DEL algorithms were respectively proposed to effectively maintain the discovered high transaction-weighted utilization itemsets(HTWUIs) and high-utility itemsets (HUIs) when the transactions are consequentially deleted from the original database. The original database is still, however, required to be rescanned when small transaction-weighted utilization itemsets in the original database are necessary to be maintained. In this paper, an efficient algorithm namely HUI-list-DEL is presented to discover HUIs by maintaining the built utility-list structure for transaction deletion in dynamic databases. Based on the designed algorithm, the HUIs can be directly produced without candidate generation or the numerous database scans. Two pruning strategies are also designed to speed up the maintenance approach of HUIs. Substantial experiments show that the proposed maintenance approach for transaction deletion significantly outperforms the previous approaches in terms of execution time, memory consumption and scalability. |
---|---|
ISSN: | 1088-467X 1571-4128 |
DOI: | 10.3233/IDA-160837 |