Apriori and GUHA – Comparing two approaches to data mining with association rules

Two approaches to data mining with association rules are compared – the apriori algorithm and the ASSOC procedure. The first one was developed for market basket analysis at the beginning of 1990s. An association rule is understood as an implication between conjunctions of attribute-value pairs. The...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Intelligent data analysis 2017-01, Vol.21 (4), p.981-1013
Hauptverfasser: Rauch, Jan, Šimůnek, Milan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Two approaches to data mining with association rules are compared – the apriori algorithm and the ASSOC procedure. The first one was developed for market basket analysis at the beginning of 1990s. An association rule is understood as an implication between conjunctions of attribute-value pairs. The ASSOC procedure is an implementation of the GUHA method of mechanizing hypothesis formation developed since the 1960s. ASSOC deals with association rules – general relations of two general Boolean attributes. Arules – a computational environment for mining association rules based on apriori and the 4ft-Miner procedure – an implementation of the ASSOC procedure are discussed and compared. It is shown that the arules approach to missing information does not correspond to Kleene’s approach and this can lead to a large number of misleading rules. It is also shown that a secured completion developed for the ASSOC procedure avoids this problem.
ISSN:1088-467X
1571-4128
DOI:10.3233/IDA-160069