An n-gram-based approach for detecting approximately duplicate database records
Published in: | International journal on digital libraries 2002-05, Vol.3 (4), p.325-331 |
---|---|
Main authors: | Tian, Zengping ; Lu, Hongjun ; Ji, Wenyun ; Zhou, Aoying ; Tian, Zhong |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 331 |
---|---|
container_issue | 4 |
container_start_page | 325 |
container_title | International journal on digital libraries |
container_volume | 3 |
creator | Tian, Zengping ; Lu, Hongjun ; Ji, Wenyun ; Zhou, Aoying ; Tian, Zhong |
description | |
doi_str_mv | 10.1007/s007990100044 |
format | Article |
fullrecord | <record><control><sourceid>pascalfrancis_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1007_s007990100044</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>13759341</sourcerecordid><originalsourceid>FETCH-LOGICAL-c182t-a627515fa37e4fa64d65c2c81f4da3c0ecb35de616496281faf73fff096e6a273</originalsourceid><addsrcrecordid>eNpVkM1LAzEQxYMoWKtH77l4jGby2T2WolYo9KLnZZqPurLdXZIV7H9vyhbEy8zj8XvD8Ai5B_4InNunXEZV8aK5UhdkBkoKBpLzy7PWHMQ1ucn5qyCwADsj22VHO7ZPeGA7zMFTHIbUo_uksU_UhzG4sen2k_3THHAM7ZH676FtXNHU44inIE3B9cnnW3IVsc3h7rzn5OPl-X21Zpvt69tquWEOFmJkaITVoCNKG1REo7zRTrgFROVROh7cTmofDBhVGVFsjFbGGHllgkFh5Zyw6a5Lfc4pxHpI5bt0rIHXpzbqf20U_mHiB8wO25iwc03-C0mrK6lA_gL0PV-u</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>An n-gram-based approach for detecting approximately duplicate database records</title><source>SpringerLink Journals - AutoHoldings</source><creator>Tian, Zengping ; Lu, Hongjun ; Ji, Wenyun ; Zhou, Aoying ; Tian, Zhong</creator><creatorcontrib>Tian, Zengping ; Lu, Hongjun ; Ji, Wenyun ; Zhou, Aoying ; Tian, Zhong</creatorcontrib><identifier>ISSN: 1432-5012</identifier><identifier>EISSN: 1432-1300</identifier><identifier>DOI: 10.1007/s007990100044</identifier><language>eng</language><publisher>Berlin: Springer</publisher><subject>Exact sciences and technology ; Information and communication sciences ; Information processing and retrieval ; Information science. 
Documentation ; Miscellaneous ; Sciences and techniques of general use</subject><ispartof>International journal on digital libraries, 2002-05, Vol.3 (4), p.325-331</ispartof><rights>2002 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c182t-a627515fa37e4fa64d65c2c81f4da3c0ecb35de616496281faf73fff096e6a273</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=13759341$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Tian, Zengping</creatorcontrib><creatorcontrib>Lu, Hongjun</creatorcontrib><creatorcontrib>Ji, Wenyun</creatorcontrib><creatorcontrib>Zhou, Aoying</creatorcontrib><creatorcontrib>Tian, Zhong</creatorcontrib><title>An n-gram-based approach for detecting approximately duplicate database records</title><title>International journal on digital libraries</title><subject>Exact sciences and technology</subject><subject>Information and communication sciences</subject><subject>Information processing and retrieval</subject><subject>Information science. 
Documentation</subject><subject>Miscellaneous</subject><subject>Sciences and techniques of general use</subject><issn>1432-5012</issn><issn>1432-1300</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2002</creationdate><recordtype>article</recordtype><recordid>eNpVkM1LAzEQxYMoWKtH77l4jGby2T2WolYo9KLnZZqPurLdXZIV7H9vyhbEy8zj8XvD8Ai5B_4InNunXEZV8aK5UhdkBkoKBpLzy7PWHMQ1ucn5qyCwADsj22VHO7ZPeGA7zMFTHIbUo_uksU_UhzG4sen2k_3THHAM7ZH676FtXNHU44inIE3B9cnnW3IVsc3h7rzn5OPl-X21Zpvt69tquWEOFmJkaITVoCNKG1REo7zRTrgFROVROh7cTmofDBhVGVFsjFbGGHllgkFh5Zyw6a5Lfc4pxHpI5bt0rIHXpzbqf20U_mHiB8wO25iwc03-C0mrK6lA_gL0PV-u</recordid><startdate>200205</startdate><enddate>200205</enddate><creator>Tian, Zengping</creator><creator>Lu, Hongjun</creator><creator>Ji, Wenyun</creator><creator>Zhou, Aoying</creator><creator>Tian, Zhong</creator><general>Springer</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>200205</creationdate><title>An n-gram-based approach for detecting approximately duplicate database records</title><author>Tian, Zengping ; Lu, Hongjun ; Ji, Wenyun ; Zhou, Aoying ; Tian, Zhong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c182t-a627515fa37e4fa64d65c2c81f4da3c0ecb35de616496281faf73fff096e6a273</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2002</creationdate><topic>Exact sciences and technology</topic><topic>Information and communication sciences</topic><topic>Information processing and retrieval</topic><topic>Information science. 
Documentation</topic><topic>Miscellaneous</topic><topic>Sciences and techniques of general use</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tian, Zengping</creatorcontrib><creatorcontrib>Lu, Hongjun</creatorcontrib><creatorcontrib>Ji, Wenyun</creatorcontrib><creatorcontrib>Zhou, Aoying</creatorcontrib><creatorcontrib>Tian, Zhong</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>International journal on digital libraries</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tian, Zengping</au><au>Lu, Hongjun</au><au>Ji, Wenyun</au><au>Zhou, Aoying</au><au>Tian, Zhong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An n-gram-based approach for detecting approximately duplicate database records</atitle><jtitle>International journal on digital libraries</jtitle><date>2002-05</date><risdate>2002</risdate><volume>3</volume><issue>4</issue><spage>325</spage><epage>331</epage><pages>325-331</pages><issn>1432-5012</issn><eissn>1432-1300</eissn><cop>Berlin</cop><pub>Springer</pub><doi>10.1007/s007990100044</doi><tpages>7</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1432-5012 |
ispartof | International journal on digital libraries, 2002-05, Vol.3 (4), p.325-331 |
issn | 1432-5012 ; 1432-1300 |
language | eng |
recordid | cdi_crossref_primary_10_1007_s007990100044 |
source | SpringerLink Journals - AutoHoldings |
subjects | Exact sciences and technology ; Information and communication sciences ; Information processing and retrieval ; Information science. Documentation ; Miscellaneous ; Sciences and techniques of general use |
title | An n-gram-based approach for detecting approximately duplicate database records |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T13%3A07%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20n-gram-based%20approach%20for%20detecting%20approximately%20duplicate%20database%20records&rft.jtitle=International%20journal%20on%20digital%20libraries&rft.au=Tian,%20Zengping&rft.date=2002-05&rft.volume=3&rft.issue=4&rft.spage=325&rft.epage=331&rft.pages=325-331&rft.issn=1432-5012&rft.eissn=1432-1300&rft_id=info:doi/10.1007/s007990100044&rft_dat=%3Cpascalfrancis_cross%3E13759341%3C/pascalfrancis_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |