COMPUTATION OF FREQUENT DATA VALUES

Bibliographic details
Main author: RJAIBI, WALID
Format: Patent
Language: eng ; fre
Description
Summary: A method is provided for computing frequent value statistics, such as the top most frequent values in a data column, in a database management system. The method combines hashing techniques with a dynamically varying threshold value to compute the N most frequent values within a data column. The varying threshold allows the method to ignore any data value that is not more frequent than the least frequent data value already in the list. During the column scan, a data value can enter and exit the list of most frequent values depending on its own frequency relative to that of another data value. On completion of the column scan, the list already holds the N most frequent values, obviating the need for a further sort operation. The method is suited for use in database management systems where performance and reliable statistics are valued.
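
The scan described in the summary can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract does not specify data structures, so the function name `top_n_frequent`, the use of a dictionary as the hash table, the choice to keep an exact count for every distinct value, and the arbitrary tie-breaking are all assumptions made for illustration.

```python
def top_n_frequent(column, n):
    """Return a dict {value: count} holding n most frequent values of column.

    Illustrative sketch only; names and data structures are assumptions.
    """
    if n <= 0:
        return {}

    counts = {}     # hash table: value -> frequency seen so far (assumed exact)
    top = {}        # current list of most frequent values: value -> frequency
    threshold = 0   # frequency of the least frequent value once the list is full

    for value in column:
        count = counts.get(value, 0) + 1
        counts[value] = count

        if value in top:
            # Already in the list: refresh its frequency.
            top[value] = count
        elif len(top) < n:
            # The list is not full yet: every new value enters.
            top[value] = count
        elif count > threshold:
            # The value is now more frequent than the least frequent
            # list member, so that member exits and this value enters.
            victim = min(top, key=top.get)
            del top[victim]
            top[value] = count
        else:
            # Not above the threshold: the value is ignored for now.
            continue

        # The threshold varies as the list changes.
        if len(top) == n:
            threshold = min(top.values())

    # No sort over all distinct values is needed at the end of the scan.
    return top


if __name__ == "__main__":
    sample = ["a", "b", "a", "c", "a", "b", "d", "b", "a"]
    print(top_n_frequent(sample, 2))
```

In this sketch the threshold is recomputed with a linear pass over the N list entries, which keeps the code short. A real system would more plausibly track the minimum incrementally, and might bound the memory used by the hash table, neither of which the abstract describes.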