INTELLIGENT DATA EXTRACTION SYSTEM AND METHOD

A system and method for automating and improving data extraction from a variety of document types, including both unstructured, structured, and nested content, is disclosed. The system and method incorporate an intelligent machine learning model that is designed to intelligently identify chunks of t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: DHANDAPANI, Vijay, RAJAGOPALAN, Srinivasan Krishnan, KOTNALA, Rahul, MUTHU, Loganathan, CHANDRAN, Manikandan, PRAKASH, Anand Yesuraj, DEB, Simantini, VENKATAPPA, Lokesh, GOPALAN, Peter Ashly, SINGH, Harbhajan, RAMAN, Ramakrishnan, KUMAR, RBSanthosh
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A system and method for automating and improving data extraction from a variety of document types, including both unstructured, structured, and nested content, is disclosed. The system and method incorporate an intelligent machine learning model that is designed to intelligently identify chunks of text, map the fields in the document, and extract multi-record values. The system is designed to operate with little to no human intervention, while offering significant gains in accuracy, data visualization, and efficiency. The architecture applies customized techniques including density-based adaptive text clustering, tabular data extraction based on hierarchical intelligent keyword searches, and natural language processing-based field value selection. z z 03: z X x z O A -i 5 T) z z O O pm z Oz X 3 0O * (C) Z OO< Wz ui C) z 0Q Z i0 Om< C)O o u- 0- --- U)4- Z- 0 -i