Unsupervised competition-based encoding

A method collects word-based data corresponding to a first identifier. A first phrase vector is generated for the first identifier by extracting frequency data from the word-based data. A similarity metric is generated corresponding to the first identifier and a second identifier by comparing the fi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ben David, Daniel, Resheff, Yehezkel Shraga, Horesh, Yair
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES PHYSICS SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A method collects word-based data corresponding to a first identifier. A first phrase vector is generated for the first identifier by extracting frequency data from the word-based data. A similarity metric is generated corresponding to the first identifier and a second identifier by comparing the first phrase vector of the first identifier to a second phrase vector of the second identifier. A tuple is generated that includes the first identifier and the second identifier using the similarity metric. A machine learning model is trained with the tuple to generate an embedded vector corresponding to the first identifier.