Unsupervised competition-based encoding
A method collects word-based data corresponding to a first identifier. A first phrase vector is generated for the first identifier by extracting frequency data from the word-based data. A similarity metric is generated corresponding to the first identifier and a second identifier by comparing the fi...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A method collects word-based data corresponding to a first identifier. A first phrase vector is generated for the first identifier by extracting frequency data from the word-based data. A similarity metric is generated corresponding to the first identifier and a second identifier by comparing the first phrase vector of the first identifier to a second phrase vector of the second identifier. A tuple is generated that includes the first identifier and the second identifier using the similarity metric. A machine learning model is trained with the tuple to generate an embedded vector corresponding to the first identifier. |
---|