System and method for performing Unicode matching

System and method for performing Unicode matching for comparing and merging similar data objects having Unicode strings that are equivalent yet not exact matches. Unicode characters are characterized by number of strokes, stroke order, radicals, geometry, phonemes in association with input method ed...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ENDO RICHARD T, ZHENG XIDONG, HAZI ARIEL, WEINBERG PAUL N, YOSPE NATHAN F
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator ENDO RICHARD T
ZHENG XIDONG
HAZI ARIEL
WEINBERG PAUL N
YOSPE NATHAN F
description System and method for performing Unicode matching for comparing and merging similar data objects having Unicode strings that are equivalent yet not exact matches. Unicode characters are characterized by number of strokes, stroke order, radicals, geometry, phonemes in association with input method editor and keyboard characteristics such as location of a character on an IME or keyboard (or number of GUI interface interactions used in entering the character, e.g., via tapping where "a" on a mobile device keyboard takes 1 tap of a key, "b" takes 2 taps). These characteristics associated with code points and IME's/keyboards are utilized to create subdomains for matching and determining "distance" to other Unicode code points (e.g., number of keyboard keys away). Allows for determining whether close, yet incorrect data entry may have taken place. Enables merging of duplicate data objects into master data object where minor differences or spelling errors introduce actually represent duplicate data.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US9275019B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US9275019B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US9275019B23</originalsourceid><addsrcrecordid>eNrjZDAMriwuSc1VSMxLUchNLcnIT1FIyy9SKEgtAlK5mXnpCqF5mcn5KakKuYklyRlAAR4G1rTEnOJUXijNzaDg5hri7KGbWpAfn1pckJicmpdaEh8abGlkbmpgaOlkZEyEEgBCOSt3</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>System and method for performing Unicode matching</title><source>esp@cenet</source><creator>ENDO RICHARD T ; ZHENG XIDONG ; HAZI ARIEL ; WEINBERG PAUL N ; YOSPE NATHAN F</creator><creatorcontrib>ENDO RICHARD T ; ZHENG XIDONG ; HAZI ARIEL ; WEINBERG PAUL N ; YOSPE NATHAN F</creatorcontrib><description>System and method for performing Unicode matching for comparing and merging similar data objects having Unicode strings that are equivalent yet not exact matches. Unicode characters are characterized by number of strokes, stroke order, radicals, geometry, phonemes in association with input method editor and keyboard characteristics such as location of a character on an IME or keyboard (or number of GUI interface interactions used in entering the character, e.g., via tapping where "a" on a mobile device keyboard takes 1 tap of a key, "b" takes 2 taps). These characteristics associated with code points and IME's/keyboards are utilized to create subdomains for matching and determining "distance" to other Unicode code points (e.g., number of keyboard keys away). Allows for determining whether close, yet incorrect data entry may have taken place. Enables merging of duplicate data objects into master data object where minor differences or spelling errors introduce actually represent duplicate data.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL ; PHYSICS</subject><creationdate>2016</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20160301&amp;DB=EPODOC&amp;CC=US&amp;NR=9275019B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20160301&amp;DB=EPODOC&amp;CC=US&amp;NR=9275019B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ENDO RICHARD T</creatorcontrib><creatorcontrib>ZHENG XIDONG</creatorcontrib><creatorcontrib>HAZI ARIEL</creatorcontrib><creatorcontrib>WEINBERG PAUL N</creatorcontrib><creatorcontrib>YOSPE NATHAN F</creatorcontrib><title>System and method for performing Unicode matching</title><description>System and method for performing Unicode matching for comparing and merging similar data objects having Unicode strings that are equivalent yet not exact matches. Unicode characters are characterized by number of strokes, stroke order, radicals, geometry, phonemes in association with input method editor and keyboard characteristics such as location of a character on an IME or keyboard (or number of GUI interface interactions used in entering the character, e.g., via tapping where "a" on a mobile device keyboard takes 1 tap of a key, "b" takes 2 taps). These characteristics associated with code points and IME's/keyboards are utilized to create subdomains for matching and determining "distance" to other Unicode code points (e.g., number of keyboard keys away). Allows for determining whether close, yet incorrect data entry may have taken place. Enables merging of duplicate data objects into master data object where minor differences or spelling errors introduce actually represent duplicate data.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2016</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDAMriwuSc1VSMxLUchNLcnIT1FIyy9SKEgtAlK5mXnpCqF5mcn5KakKuYklyRlAAR4G1rTEnOJUXijNzaDg5hri7KGbWpAfn1pckJicmpdaEh8abGlkbmpgaOlkZEyEEgBCOSt3</recordid><startdate>20160301</startdate><enddate>20160301</enddate><creator>ENDO RICHARD T</creator><creator>ZHENG XIDONG</creator><creator>HAZI ARIEL</creator><creator>WEINBERG PAUL N</creator><creator>YOSPE NATHAN F</creator><scope>EVB</scope></search><sort><creationdate>20160301</creationdate><title>System and method for performing Unicode matching</title><author>ENDO RICHARD T ; ZHENG XIDONG ; HAZI ARIEL ; WEINBERG PAUL N ; YOSPE NATHAN F</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US9275019B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2016</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>ENDO RICHARD T</creatorcontrib><creatorcontrib>ZHENG XIDONG</creatorcontrib><creatorcontrib>HAZI ARIEL</creatorcontrib><creatorcontrib>WEINBERG PAUL N</creatorcontrib><creatorcontrib>YOSPE NATHAN F</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ENDO RICHARD T</au><au>ZHENG XIDONG</au><au>HAZI ARIEL</au><au>WEINBERG PAUL N</au><au>YOSPE NATHAN F</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>System and method for performing Unicode matching</title><date>2016-03-01</date><risdate>2016</risdate><abstract>System and method for performing Unicode matching for comparing and merging similar data objects having Unicode strings that are equivalent yet not exact matches. Unicode characters are characterized by number of strokes, stroke order, radicals, geometry, phonemes in association with input method editor and keyboard characteristics such as location of a character on an IME or keyboard (or number of GUI interface interactions used in entering the character, e.g., via tapping where "a" on a mobile device keyboard takes 1 tap of a key, "b" takes 2 taps). These characteristics associated with code points and IME's/keyboards are utilized to create subdomains for matching and determining "distance" to other Unicode code points (e.g., number of keyboard keys away). Allows for determining whether close, yet incorrect data entry may have taken place. Enables merging of duplicate data objects into master data object where minor differences or spelling errors introduce actually represent duplicate data.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US9275019B2
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
PHYSICS
title System and method for performing Unicode matching
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T17%3A54%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ENDO%20RICHARD%20T&rft.date=2016-03-01&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS9275019B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true