ANALYZING DOCUMENT VERSIONS TO IDENTIFY SHARED DOCUMENT ELEMENTS USING MACHINE LEARNING

A present invention embodiment analyzes documents. A first document is received comprising a plurality of sentences that each include one or more words. A matrix is populated with the plurality of sentences, wherein each of the one or more words of each sentence in the matrix is encoded as a numeric...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	HE, Liu Yao, HU, Di, LI, Ying, JI, Xiao Feng
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A present invention embodiment analyzes documents. A first document is received comprising a plurality of sentences that each include one or more words. A matrix is populated with the plurality of sentences, wherein each of the one or more words of each sentence in the matrix is encoded as a numerical value. The matrix is processed using a machine learning model to generate a first feature map. The first feature map is compared to a second feature map of a corresponding second document to identify a shared document element between the first document and the second document based on a common feature in the first feature map and the second feature map. The shared document element is indicated via a user interface.