Multi-Level Linear Location Tree for Efficient Sequential Pattern Mining
Discovering sequential patterns from a large database of sequences is an important problem in the field of knowledge discovery and data mining. Most of the previously used sequential pattern mining data structure follows a complex hash-tree structure which has poor locality. Since the hash-tree uses...
Gespeichert in:
Veröffentlicht in: | Key engineering materials 2005-01, Vol.277-279, p.369-374 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Discovering sequential patterns from a large database of sequences is an important problem in the field of knowledge discovery and data mining. Most of the previously used sequential pattern mining data structure follows a complex hash-tree structure which has poor locality. Since the hash-tree uses the repetition of a simple search instead of a consecutive search, the hash-based sequential pattern mining, though it reduces search time, may exhibit low performance. In this paper, we propose a novel multi-level linear location tree (or MLLT in short) with an excellent locality structure and develop an efficient MLLT-tree based sequential pattern mining algorithm for mining the complete set of frequent sequence by linear copy of nodes. Our performance study shows that MLLT runs considerably faster than the hash-tree based GSP algorithm. |
---|---|
ISSN: | 1013-9826 1662-9795 1662-9795 |
DOI: | 10.4028/www.scientific.net/KEM.277-279.369 |