Intelligent extraction of information from a document

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing intelligent extraction of information from a document. A computing module receives input data representing an image of a document. The module also receives context data for the document....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Lagunas, Jaime Rodriguez, Arnal, Joan Verdu, Rey, Reynaldo Alberto España, Martorell, Esperanza Eugenia Puigserver, Martín, Sandra Orozco, Tuma, Carlos Gaston Besanson
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing intelligent extraction of information from a document. A computing module receives input data representing an image of a document. The module also receives context data for the document. The context data includes parameters that are descriptive of the document in the image. The module processes the input data and the context data to determine a complexity value that characterizes a level of complexity in identifying information to be extracted from the document. The system selects a machine-learning model to use in extracting information from the document. The model is selected based on the complexity value and from multiple candidate models. The system extracts information from the document using the selected model, including converting a portion of the image of the document that shows typed or handwritten text into a digitized text string.