INFORMATION PROCESSING APPARATUS, IMAGE FORMING APPARATUS, AND INFORMATION PROCESSING METHOD FOR AUTOMATICALLY ORDERING PAGE

Provided is an information processing apparatus for ordering a plurality of scanned page data with high accuracy. The OCR unit performs optical character recognition for character and layout in a page for each of the plurality of page data. The rule order unit classifies the characters and layouts t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: SHOJI, Hidenori
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Provided is an information processing apparatus for ordering a plurality of scanned page data with high accuracy. The OCR unit performs optical character recognition for character and layout in a page for each of the plurality of page data. The rule order unit classifies the characters and layouts that are performed optical character recognition by the OCR unit based on the page ordering rules, extracts the page numbers, and calculates the certainty of the page numbers. The ML order unit classifies the page data of pages with low certainty calculated by the rule order unit by machine learning, and it infers the page number.