INFORMATION PROCESSING APPARATUS, IMAGE FORMING APPARATUS, AND INFORMATION PROCESSING METHOD FOR AUTOMATICALLY ORDERING PAGE
Provided is an information processing apparatus for ordering a plurality of scanned page data with high accuracy. The OCR unit performs optical character recognition for character and layout in a page for each of the plurality of page data. The rule order unit classifies the characters and layouts t...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Provided is an information processing apparatus for ordering a plurality of scanned page data with high accuracy. The OCR unit performs optical character recognition for character and layout in a page for each of the plurality of page data. The rule order unit classifies the characters and layouts that are performed optical character recognition by the OCR unit based on the page ordering rules, extracts the page numbers, and calculates the certainty of the page numbers. The ML order unit classifies the page data of pages with low certainty calculated by the rule order unit by machine learning, and it infers the page number. |
---|