Data clustering

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for clustering data are disclosed. In one aspect, a method includes the actions of receiving feature vectors. The actions further include accessing rules that each relate one or more values of the feat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Pouyan, Maziyar Baran, Fano, Andrew E, Shea, Timothy M, Vinson, David William, Esfahani, Saeideh Shahrokh, Yang, Yao A
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for clustering data are disclosed. In one aspect, a method includes the actions of receiving feature vectors. The actions further include accessing rules that each relate one or more values of the feature vectors to a respective label of a plurality of labels. The actions further include, based on the rules, generating heuristics that each identify related values of the feature vectors. The actions further include, for each of the heuristics, generating a matrix that reflects a similarity of the feature vectors. The actions further include, based on the matrices that each reflects a respective similarity of the feature vectors, generating clusters that each include a subset of the feature vectors. The actions further include, for each cluster, determining a label of the plurality of labels.