Identifying matching canonical documents in response to a visual query

A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing te...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Casey, Matthew R, Popat, Ashok C, Petrou, David
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Casey, Matthew R Popat, Ashok C Petrou, David
description	A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical source document matching the visual query that contains the one or more high quality textual strings is identified and retrieved. Then at least a portion of the canonical document is sent to the client system.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_AU2011336445BB2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>AU2011336445BB2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_AU2011336445BB23</originalsourceid><addsrcrecordid>eNrjZHDzTEnNK8lMq8zMS1fITSxJzgAxkhPz8vMykxNzFFLyk0tzgSqKFTLzFIpSiwvy84pTFUryFRIVyjKLS4EqCktTiyp5GFjTEnOKU3mhNDeDiptriLOHbmpBfjxQV2Jyal5qSbxjqJGBoaGxsZmJiamTk5ExkcoAkPs1NA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Identifying matching canonical documents in response to a visual query</title><source>esp@cenet</source><creator>Casey, Matthew R ; Popat, Ashok C ; Petrou, David</creator><creatorcontrib>Casey, Matthew R ; Popat, Ashok C ; Petrou, David</creatorcontrib><description>A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical source document matching the visual query that contains the one or more high quality textual strings is identified and retrieved. Then at least a portion of the canonical document is sent to the client system.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2017</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20170413&DB=EPODOC&CC=AU&NR=2011336445B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,309,781,886,25566,76549</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20170413&DB=EPODOC&CC=AU&NR=2011336445B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Casey, Matthew R</creatorcontrib><creatorcontrib>Popat, Ashok C</creatorcontrib><creatorcontrib>Petrou, David</creatorcontrib><title>Identifying matching canonical documents in response to a visual query</title><description>A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical source document matching the visual query that contains the one or more high quality textual strings is identified and retrieved. Then at least a portion of the canonical document is sent to the client system.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2017</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHDzTEnNK8lMq8zMS1fITSxJzgAxkhPz8vMykxNzFFLyk0tzgSqKFTLzFIpSiwvy84pTFUryFRIVyjKLS4EqCktTiyp5GFjTEnOKU3mhNDeDiptriLOHbmpBfjxQV2Jyal5qSbxjqJGBoaGxsZmJiamTk5ExkcoAkPs1NA</recordid><startdate>20170413</startdate><enddate>20170413</enddate><creator>Casey, Matthew R</creator><creator>Popat, Ashok C</creator><creator>Petrou, David</creator><scope>EVB</scope></search><sort><creationdate>20170413</creationdate><title>Identifying matching canonical documents in response to a visual query</title><author>Casey, Matthew R ; Popat, Ashok C ; Petrou, David</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_AU2011336445BB23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2017</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Casey, Matthew R</creatorcontrib><creatorcontrib>Popat, Ashok C</creatorcontrib><creatorcontrib>Petrou, David</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Casey, Matthew R</au><au>Popat, Ashok C</au><au>Petrou, David</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Identifying matching canonical documents in response to a visual query</title><date>2017-04-13</date><risdate>2017</risdate><abstract>A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical source document matching the visual query that contains the one or more high quality textual strings is identified and retrieved. Then at least a portion of the canonical document is sent to the client system.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_AU2011336445BB2
source	esp@cenet
subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
title	Identifying matching canonical documents in response to a visual query
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T07%3A49%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Casey,%20Matthew%20R&rft.date=2017-04-13&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EAU2011336445BB2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true