SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS

The ever-increasing volume of unstructured data, mainly documents and in particular scanned documents, calls for a solution that shortens the overall turnaround time in document-centric business processing. The majority of these documents do not strictly follow a specific form...

Bibliographic details
Main authors:	PILLAI, SHRIRAM ; SAHOO, NIHAR RANJAN ; KURHEKAR, PUSHKAR ; MHASHILKAR, KAMLESH ; KSHIRSAGAR, MAHESH ; NIGAM, SHIVANI
Format:	Patent
Language:	eng
Subjects:	CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	PILLAI, SHRIRAM ; SAHOO, NIHAR RANJAN ; KURHEKAR, PUSHKAR ; MHASHILKAR, KAMLESH ; KSHIRSAGAR, MAHESH ; NIGAM, SHIVANI
description	The ever-increasing volume of unstructured data, mainly documents and in particular scanned documents, calls for a solution that shortens the overall turnaround time in document-centric business processing. The majority of these documents do not strictly follow a specific format or template, so a generic OCR solution that works on any kind of document format is needed to enhance the overall efficacy of processes. Embodiments of the present disclosure provide a system and method that extract tabular and text information from scanned documents. More specifically, the method and system extract user-filled tabular data, textual information, selected radio buttons, checked checkboxes, stamps, and barcodes from scanned copies of any filled form, with or without a pre-defined template and without any prior knowledge of the format of the input forms. The system converts the extracted information into a structured form for further analytics and reporting.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2022222284A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2022222284A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2022222284A13</originalsourceid><addsrcrecordid>eNrjZPANjgwOcfVVcPRzUfB1DfHwd1Fw8w9ScAwN8fd1DHF1UfD0A_KBTE9_PwXXiJAgR2cw0y3I31ch2NnRzw-oxsXfOdTX1S8kmIeBNS0xpziVF0pzMyi7uYY4e-imFuTHpxYXJCan5qWWxIcGGxkYgYGFiaOhMXGqAMPPL2k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS</title><source>esp@cenet</source><creator>PILLAI, SHRIRAM ; SAHOO, NIHAR RANJAN ; KURHEKAR, PUSHKAR ; MHASHILKAR, KAMLESH ; KSHIRSAGAR, MAHESH ; NIGAM, SHIVANI</creator><creatorcontrib>PILLAI, SHRIRAM ; SAHOO, NIHAR RANJAN ; KURHEKAR, PUSHKAR ; MHASHILKAR, KAMLESH ; KSHIRSAGAR, MAHESH ; NIGAM, SHIVANI</creatorcontrib><description>The problem of ever-increasing huge volume of unstructured data, mainly documents, and within that the scanned documents, needs to have a solution to expedite the overall turnaround time in document centric business processing. Majority of these documents often do not strictly follow a specific format or a template, and creating a generic OCR solution, which would work on any kind of document format is needed to enhance overall efficacy of processes. Embodiments of the present disclosure provide system and method that extract tabular and text information from scanned documents. More specifically, method and system are provided to extract user filled tabular data, textual information, selected radio-buttons and checked checkboxes, stamps, barcodes from scanned copies of any filled form with or without any template being pre-defined or without any prior knowledge about format of input forms. 
The system converts extracted information in a structured form for further for analytics and reporting.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220714&DB=EPODOC&CC=US&NR=2022222284A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76418</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220714&DB=EPODOC&CC=US&NR=2022222284A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>PILLAI, SHRIRAM</creatorcontrib><creatorcontrib>SAHOO, NIHAR RANJAN</creatorcontrib><creatorcontrib>KURHEKAR, PUSHKAR</creatorcontrib><creatorcontrib>MHASHILKAR, KAMLESH</creatorcontrib><creatorcontrib>KSHIRSAGAR, MAHESH</creatorcontrib><creatorcontrib>NIGAM, SHIVANI</creatorcontrib><title>SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS</title><description>The problem of ever-increasing huge volume of unstructured data, mainly documents, and within that the scanned documents, needs to have a solution to expedite the overall turnaround time in document centric business processing. Majority of these documents often do not strictly follow a specific format or a template, and creating a generic OCR solution, which would work on any kind of document format is needed to enhance overall efficacy of processes. 
Embodiments of the present disclosure provide system and method that extract tabular and text information from scanned documents. More specifically, method and system are provided to extract user filled tabular data, textual information, selected radio-buttons and checked checkboxes, stamps, barcodes from scanned copies of any filled form with or without any template being pre-defined or without any prior knowledge about format of input forms. The system converts extracted information in a structured form for further for analytics and reporting.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPANjgwOcfVVcPRzUfB1DfHwd1Fw8w9ScAwN8fd1DHF1UfD0A_KBTE9_PwXXiJAgR2cw0y3I31ch2NnRzw-oxsXfOdTX1S8kmIeBNS0xpziVF0pzMyi7uYY4e-imFuTHpxYXJCan5qWWxIcGGxkYgYGFiaOhMXGqAMPPL2k</recordid><startdate>20220714</startdate><enddate>20220714</enddate><creator>PILLAI, SHRIRAM</creator><creator>SAHOO, NIHAR RANJAN</creator><creator>KURHEKAR, PUSHKAR</creator><creator>MHASHILKAR, KAMLESH</creator><creator>KSHIRSAGAR, MAHESH</creator><creator>NIGAM, SHIVANI</creator><scope>EVB</scope></search><sort><creationdate>20220714</creationdate><title>SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS</title><author>PILLAI, SHRIRAM ; SAHOO, NIHAR RANJAN ; KURHEKAR, PUSHKAR ; MHASHILKAR, KAMLESH ; KSHIRSAGAR, MAHESH ; NIGAM, 
SHIVANI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2022222284A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>PILLAI, SHRIRAM</creatorcontrib><creatorcontrib>SAHOO, NIHAR RANJAN</creatorcontrib><creatorcontrib>KURHEKAR, PUSHKAR</creatorcontrib><creatorcontrib>MHASHILKAR, KAMLESH</creatorcontrib><creatorcontrib>KSHIRSAGAR, MAHESH</creatorcontrib><creatorcontrib>NIGAM, SHIVANI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>PILLAI, SHRIRAM</au><au>SAHOO, NIHAR RANJAN</au><au>KURHEKAR, PUSHKAR</au><au>MHASHILKAR, KAMLESH</au><au>KSHIRSAGAR, MAHESH</au><au>NIGAM, SHIVANI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS</title><date>2022-07-14</date><risdate>2022</risdate><abstract>The problem of ever-increasing huge volume of unstructured data, mainly documents, and within that the scanned documents, needs to have a solution to expedite the overall turnaround time in document centric business processing. Majority of these documents often do not strictly follow a specific format or a template, and creating a generic OCR solution, which would work on any kind of document format is needed to enhance overall efficacy of processes. Embodiments of the present disclosure provide system and method that extract tabular and text information from scanned documents. 
More specifically, method and system are provided to extract user filled tabular data, textual information, selected radio-buttons and checked checkboxes, stamps, barcodes from scanned copies of any filled form with or without any template being pre-defined or without any prior knowledge about format of input forms. The system converts extracted information into a structured form for further analytics and reporting.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_US2022222284A1
source	esp@cenet
subjects	CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS
title	SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T05%3A03%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=PILLAI,%20SHRIRAM&rft.date=2022-07-14&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2022222284A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true