ANALYZING DOCUMENT VERSIONS TO IDENTIFY SHARED DOCUMENT ELEMENTS USING MACHINE LEARNING

A present invention embodiment analyzes documents. A first document is received comprising a plurality of sentences that each include one or more words. A matrix is populated with the plurality of sentences, wherein each of the one or more words of each sentence in the matrix is encoded as a numeric...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: HE, Liu Yao, HU, Di, LI, Ying, JI, Xiao Feng
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A present invention embodiment analyzes documents. A first document is received comprising a plurality of sentences that each include one or more words. A matrix is populated with the plurality of sentences, wherein each of the one or more words of each sentence in the matrix is encoded as a numerical value. The matrix is processed using a machine learning model to generate a first feature map. The first feature map is compared to a second feature map of a corresponding second document to identify a shared document element between the first document and the second document based on a common feature in the first feature map and the second feature map. The shared document element is indicated via a user interface.