Natural language processing on semi-structured data

Techniques for performing natural language processing (NLP) on semi-structured data are described. An exemplary method includes receiving a semi-structured document to perform NLP on using a trained NLP model; converting the semi-structured document into a secondary format, wherein the secondary for...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Pushkin, Yahor, Horwood, Graham Vintcent, Chowdhury, Aruni Roy, Marcheggiani, Diego, Xie, Yusheng, Zhu, Xuan, Zhou, Liutong, Vyas, Yogarshi Paritosh, Zarandioon, Saman, Ballesteros Martinez, Miguel, Pang, Bo, Al-Onaizan, Yaser, Mallya Kasaragod, Sunil, Zhang, Yinxiao
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Techniques for performing natural language processing (NLP) on semi-structured data are described. An exemplary method includes receiving a semi-structured document to perform NLP on using a trained NLP model; converting the semi-structured document into a secondary format, wherein the secondary format includes spatial information for tokens of the semi-structured document; flattening the converted, secondary formatted semi-structured document into a Unicode Transformation Format text file; performing NLP on the Unicode Transformation Format text file using the trained NLP model; and providing a result of the NLP to a requester.