IDENTIFICATION OF NEW CONTENT WITHIN A DIGITAL DOCUMENT
A computer-implemented method for electronically identifying new content in a digital document. The method includes receiving a digital document, utilizing a NLP pipeline to identify one or more articles of subject matter content, together with their respective relationships, contained within the di...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A computer-implemented method for electronically identifying new content in a digital document. The method includes receiving a digital document, utilizing a NLP pipeline to identify one or more articles of subject matter content, together with their respective relationships, contained within the digital document. The method further includes generating, by the NLP pipeline, a knowledge graph, based on the one or more relationships between the one or more articles of subject matter content contained within the digital document, and comparing the generated knowledge graph to one or more stored knowledge graphs based on a novelty-criteria, to determine whether the identified one or more articles of subject matter content, together with their respective relationships, are represented in the one or more stored knowledge graphs. The method further includes communicating one or more portions of the digital document that were determined to not be contained within the one or more stored knowledge graphs. |
---|