Comparing text pages using image features based on word positions

A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is ge...

Detailed description

Bibliographic details
Main authors: PONCIN GUILLAUME, SPASOJEVIC NEMANJA L, BLOOMBERG DAN S
Format: Patent
Language: eng
Subjects:
Online access: Order full text
creator PONCIN GUILLAUME
SPASOJEVIC NEMANJA L
BLOOMBERG DAN S
description A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.
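The steps in the description can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how second words are chosen, how relative positions are encoded, or how similarity is measured. The sketch assumes word positions are given as bounding-box centres, picks the `k` nearest words as the "second words", quantizes their offsets on a grid, hashes them into a signature value, and uses Jaccard overlap of the signature sets as the similarity measure. All names (`word_signature`, `page_signatures`, `similarity`, `k`, `grid`) are hypothetical.

```python
import hashlib
import math
from typing import List, Set, Tuple

Point = Tuple[float, float]  # assumed input: (x, y) centre of a word's bounding box


def word_signature(first: Point, others: List[Point], k: int = 4, grid: float = 10.0) -> str:
    """Signature value for one first word (hypothetical encoding).

    Takes the k words nearest to `first`, expresses each as an offset
    relative to `first`, quantizes the offsets, and hashes the sorted result.
    """
    nearest = sorted(others, key=lambda p: math.dist(first, p))[:k]
    offsets = sorted(
        (round((x - first[0]) / grid), round((y - first[1]) / grid))
        for x, y in nearest
    )
    return hashlib.sha1(repr(offsets).encode("utf-8")).hexdigest()[:16]


def page_signatures(words: List[Point], k: int = 4, grid: float = 10.0) -> Set[str]:
    """One signature per word, each describing the other words relative to it."""
    return {
        word_signature(w, words[:i] + words[i + 1:], k, grid)
        for i, w in enumerate(words)
    }


def similarity(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two signature sets (an assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Usage: a page and a translated copy of it share every signature,
# because each signature depends only on relative positions.
page = [(10, 10), (60, 10), (110, 10), (10, 40), (60, 40)]
shifted = [(x + 200, y + 300) for x, y in page]
print(similarity(page_signatures(page), page_signatures(shifted)))
```

Because the offsets are taken relative to the first word, the signatures in this sketch are unchanged by translating the whole page; scale or skew differences between scans would need extra normalization that is not shown here.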
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US8151187B1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US8151187B1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US8151187B13</originalsourceid><addsrcrecordid>eNrjZHB0zs8tSCzKzEtXKEmtKFEoSExPLVYoLQYJZOYCOQppqYklpUVAwaTE4tQUhfw8hfL8ohSFgvzizJLM_LxiHgbWtMSc4lReKM3NoODmGuLsoZtakB-fWlyQmJyal1oSHxpsYWhqaGhh7mRoTIQSALNYMZM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Comparing text pages using image features based on word positions</title><source>esp@cenet</source><creator>PONCIN GUILLAUME ; SPASOJEVIC NEMANJA L ; BLOOMBERG DAN S</creator><creatorcontrib>PONCIN GUILLAUME ; SPASOJEVIC NEMANJA L ; BLOOMBERG DAN S</creatorcontrib><description>A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. 
The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2012</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20120403&amp;DB=EPODOC&amp;CC=US&amp;NR=8151187B1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20120403&amp;DB=EPODOC&amp;CC=US&amp;NR=8151187B1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>PONCIN GUILLAUME</creatorcontrib><creatorcontrib>SPASOJEVIC NEMANJA L</creatorcontrib><creatorcontrib>BLOOMBERG DAN S</creatorcontrib><title>Comparing text pages using image features based on word positions</title><description>A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. 
Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2012</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHB0zs8tSCzKzEtXKEmtKFEoSExPLVYoLQYJZOYCOQppqYklpUVAwaTE4tQUhfw8hfL8ohSFgvzizJLM_LxiHgbWtMSc4lReKM3NoODmGuLsoZtakB-fWlyQmJyal1oSHxpsYWhqaGhh7mRoTIQSALNYMZM</recordid><startdate>20120403</startdate><enddate>20120403</enddate><creator>PONCIN GUILLAUME</creator><creator>SPASOJEVIC NEMANJA L</creator><creator>BLOOMBERG DAN S</creator><scope>EVB</scope></search><sort><creationdate>20120403</creationdate><title>Comparing text pages using image features based on word positions</title><author>PONCIN GUILLAUME ; SPASOJEVIC NEMANJA L ; BLOOMBERG DAN S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US8151187B13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2012</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>PONCIN 
GUILLAUME</creatorcontrib><creatorcontrib>SPASOJEVIC NEMANJA L</creatorcontrib><creatorcontrib>BLOOMBERG DAN S</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>PONCIN GUILLAUME</au><au>SPASOJEVIC NEMANJA L</au><au>BLOOMBERG DAN S</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Comparing text pages using image features based on word positions</title><date>2012-04-03</date><risdate>2012</risdate><abstract>A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
language eng
recordid cdi_epo_espacenet_US8151187B1
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title Comparing text pages using image features based on word positions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T21%3A23%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=PONCIN%20GUILLAUME&rft.date=2012-04-03&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS8151187B1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true