Analysis of Frequent Itemsets Mining Algorithm Againts Models of Different Datasets

Data mining is a study that uses statistical knowledge, mathematical calculations, artificial intelligence methods, machine learning by extracting and identifying useful information and related knowledge from various large databases.. One of them is looking for itemsets combination from the data sta...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of physics. Conference series 2019-11, Vol.1361 (1), p.12036
Hauptverfasser: Muliono, R, Muhathir, Khairina, N, Harahap, M K
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Data mining is a study that uses statistical knowledge, mathematical calculations, artificial intelligence methods, machine learning by extracting and identifying useful information and related knowledge from various large databases.. One of them is looking for itemsets combination from the data stack, the search process can be done using the Apriori Association Rules algorithm, the FPGrowth Association Rule and Closed Association rule. The three algorithms are some of the implementations of frequent itemsets search methods. Two datasets will be tested, namely retail datasets and accidents datasets derived from fimi datasets, with the aim of knowing the behavior of the three algorithms against the dataset model we tested. Each algorithm will test both datasets, with minimum support values for retail datasets ranging from 0.01-0.05 and min-conf 0.01-0.05. Likewise on the accident min-sup and min-conf 0.6-1 datasets. For retail algorithm data with the fastest processing time, the FP-Growth AR algorithm is more than 85% compared to Apriori-AR, followed by AR-Apriori and Apriori-Close-AR. For detailed memory usage results the Apriori-AR algorithm is lighter and outperforms. Accident data the experimental results show that FP-Growth-AR is a little more memory efficient and the fastest process is Apriori-Close, while the Apriori-AR with the longest processing time.
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/1361/1/012036