Data clustering
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for clustering data are disclosed. In one aspect, a method includes the actions of receiving feature vectors. The actions further include accessing rules that each relate one or more values of the feat...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for clustering data are disclosed. In one aspect, a method includes the actions of receiving feature vectors. The actions further include accessing rules that each relate one or more values of the feature vectors to a respective label of a plurality of labels. The actions further include, based on the rules, generating heuristics that each identify related values of the feature vectors. The actions further include, for each of the heuristics, generating a matrix that reflects a similarity of the feature vectors. The actions further include, based on the matrices that each reflects a respective similarity of the feature vectors, generating clusters that each include a subset of the feature vectors. The actions further include, for each cluster, determining a label of the plurality of labels. |
---|