Frequent item set mining method and device based on MapReduce and array

Bibliographic details
Main author: XUAN SHILI
Format: Patent
Language: chi; eng
Description
Summary: An embodiment of the invention discloses a frequent item set mining method based on MapReduce and an array. The method comprises the following steps: converting a data set into a two-dimensional array; decomposing the two-dimensional array into a plurality of two-dimensional sub-arrays; allocating the sub-arrays to at least two nodes that execute frequent item set mining tasks in parallel, wherein each node mines the sub-frequent item set corresponding to its two-dimensional sub-array and retains its non-frequent item set; and counting and summarizing the sub-frequent item sets and combining the non-frequent item sets to obtain the frequent item set of the data set. With this method, the database is scanned only once and converted into the two-dimensional array, which reduces database scans and shortens I/O time; meanwhile, the array is creatively decomposed into more sub-arrays through a horizontal division method; a MapRedu
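
The steps named in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: every function name (to_array, split_rows, mine_partition, combine, mine) and the parameters min_support and max_size are hypothetical, the partitions are processed sequentially in one process instead of on MapReduce nodes, and local mining is done by brute-force enumeration. The record does not say how the non-frequent item sets are combined, so the reduce step below assumes the standard partition property: an item set that is frequent in the whole data set must be frequent in at least one partition, and the retained non-frequent counts fill in its support from the other partitions.

```python
from collections import Counter
from itertools import combinations


def to_array(transactions):
    """Scan the data set once and encode it as a 0/1 two-dimensional array."""
    items = sorted({item for t in transactions for item in t})
    column = {item: j for j, item in enumerate(items)}
    array = [[0] * len(items) for _ in transactions]
    for i, t in enumerate(transactions):
        for item in t:
            array[i][column[item]] = 1
    return items, array


def split_rows(array, n_parts):
    """Horizontal division: cut the array into contiguous blocks of rows."""
    size = max(1, -(-len(array) // n_parts))  # ceiling division, at least 1
    return [array[i:i + size] for i in range(0, len(array), size)]


def mine_partition(sub_array, items, min_support, max_size):
    """Map step (assumed): count item sets of up to max_size items in one sub-array."""
    counts = Counter()
    for row in sub_array:
        present = [items[j] for j, bit in enumerate(row) if bit]
        for k in range(1, max_size + 1):
            counts.update(combinations(present, k))
    threshold = min_support * len(sub_array)
    frequent = {s: c for s, c in counts.items() if c >= threshold}
    infrequent = {s: c for s, c in counts.items() if c < threshold}
    return frequent, infrequent  # the node retains both sets


def combine(results, n_rows, min_support):
    """Reduce step (assumed): sum local counts for every locally frequent item set."""
    candidates = set()
    for frequent, _ in results:
        candidates.update(frequent)
    totals = Counter()
    for frequent, infrequent in results:
        for s in candidates:
            totals[s] += frequent.get(s, 0) + infrequent.get(s, 0)
    return {s: c for s, c in totals.items() if c >= min_support * n_rows}


def mine(transactions, n_parts=2, min_support=0.5, max_size=3):
    """Run the whole pipeline; min_support is a fraction of all transactions."""
    items, array = to_array(transactions)
    parts = split_rows(array, n_parts)
    results = [mine_partition(p, items, min_support, max_size) for p in parts]
    return combine(results, len(array), min_support)
```

For example, mine([["a", "b"], ["a", "c"], ["a", "b", "c"], ["b"]], n_parts=2, min_support=0.5) returns the item sets that occur in at least two of the four transactions, keyed by sorted tuples of items. In a real deployment the list comprehension in mine would be replaced by map tasks on separate nodes.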