Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver
This paper develops a shredded document reassembly algorithm based on character/word detection. A new word compatibility estimation metric and a searching strategy called Greedy Composition and Optimal Matching (GCOM) are proposed to compose documents from their vertically shredded stripes. We reduc...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on multimedia 2020-05, Vol.22 (5), p.1168-1181 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1181 |
---|---|
container_issue | 5 |
container_start_page | 1168 |
container_title | IEEE transactions on multimedia |
container_volume | 22 |
creator | Liang, Yongqing Li, Xin |
description | This paper develops a shredded document reassembly algorithm based on character/word detection. A new word compatibility estimation metric and a searching strategy called Greedy Composition and Optimal Matching (GCOM) are proposed to compose documents from their vertically shredded stripes. We reduce the stripe puzzle reassembly problem to the traveling salesman problem (TSP) on a sparse graph. The word-path compatibility metric takes advantages of the optical character recognition (OCR) to compute the compatibility score among a group of stripes. The global composition strategy, based on an integration of greedy composition and optimal matching, is proposed to search for a maximal Hamiltonian path and the final global reassembly. We demonstrate that our solver outperforms the state-of-the-art puzzle solvers on reassembling stripe shredded documents. |
doi_str_mv | 10.1109/TMM.2019.2941777 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2393778139</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8844125</ieee_id><sourcerecordid>2393778139</sourcerecordid><originalsourceid>FETCH-LOGICAL-c333t-b0adf4d94d3415316a0f82f4db6babf57301d859b4e5ff40de0a8e5f019b10ce3</originalsourceid><addsrcrecordid>eNqNkMFLwzAUh4MoOKd3wUvAo3S-NKlpj1J1ChsTp3gsafPqKl1Tk0zZf2_mhl49vR9573tJPkJOGYwYg-zyeTodxcCyUZwJJqXcIwMWUgQg5X7ISQxRFjM4JEfOvQMwkYAckPYJlXO4LNume6PzhUWtUdMbU62W2Hk697bp0dEXt-m_GqujR-UXdIqhUVHVaTq2iHpNc7PsjWt8Yzo6632zVC2dKl8tfhab9hPtMTmoVevwZFeH5OXu9jm_jyaz8UN-PYkqzrmPSlC6FjoTmguWcHaloE7jcFJelaqsE8mB6TTJSoFJXQvQCCoNMfy-ZFAhH5Lz7d7emo8VOl-8m5XtwpVFzDMuZcpCGRLYTlXWOGexLnobXm3XBYNi47QITouN02LnNCAXW-QLS1O7qsGuwl8MABIOkMkYfuKQpP-fzhuvNu5ys-p8QM-2aIP4h6SpECxO-Dc76ZRy</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2393778139</pqid></control><display><type>article</type><title>Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver</title><source>IEEE Electronic Library (IEL)</source><creator>Liang, Yongqing ; Li, Xin</creator><creatorcontrib>Liang, Yongqing ; Li, Xin</creatorcontrib><description>This paper develops a shredded document reassembly algorithm based on character/word detection. A new word compatibility estimation metric and a searching strategy called Greedy Composition and Optimal Matching (GCOM) are proposed to compose documents from their vertically shredded stripes. We reduce the stripe puzzle reassembly problem to the traveling salesman problem (TSP) on a sparse graph. The word-path compatibility metric takes advantages of the optical character recognition (OCR) to compute the compatibility score among a group of stripes. The global composition strategy, based on an integration of greedy composition and optimal matching, is proposed to search for a maximal Hamiltonian path and the final global reassembly. We demonstrate that our solver outperforms the state-of-the-art puzzle solvers on reassembling stripe shredded documents.</description><identifier>ISSN: 1520-9210</identifier><identifier>EISSN: 1941-0077</identifier><identifier>DOI: 10.1109/TMM.2019.2941777</identifier><identifier>CODEN: ITMUF8</identifier><language>eng</language><publisher>PISCATAWAY: IEEE</publisher><subject>Algorithms ; Compatibility ; Composition ; Computer Science ; Computer Science, Information Systems ; Computer Science, Software Engineering ; global reconstruction from local alignments ; Image reconstruction ; Matching ; Measurement ; Optical character recognition ; Optical character recognition software ; Optimal matching ; Reliability ; Science & Technology ; Semantics ; sequence compatibility measurement ; Shredded document reassembly ; Solvers ; Technology ; Telecommunications ; Traveling salesman problem ; Traveling salesman problems</subject><ispartof>IEEE transactions on multimedia, 2020-05, Vol.22 (5), p.1168-1181</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>6</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000530097200005</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c333t-b0adf4d94d3415316a0f82f4db6babf57301d859b4e5ff40de0a8e5f019b10ce3</citedby><cites>FETCH-LOGICAL-c333t-b0adf4d94d3415316a0f82f4db6babf57301d859b4e5ff40de0a8e5f019b10ce3</cites><orcidid>0000-0002-7282-0476 ; 0000-0002-0144-9489</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8844125$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>315,782,786,798,27931,27932,28255,54765</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8844125$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Liang, Yongqing</creatorcontrib><creatorcontrib>Li, Xin</creatorcontrib><title>Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver</title><title>IEEE transactions on multimedia</title><addtitle>TMM</addtitle><addtitle>IEEE T MULTIMEDIA</addtitle><description>This paper develops a shredded document reassembly algorithm based on character/word detection. A new word compatibility estimation metric and a searching strategy called Greedy Composition and Optimal Matching (GCOM) are proposed to compose documents from their vertically shredded stripes. We reduce the stripe puzzle reassembly problem to the traveling salesman problem (TSP) on a sparse graph. The word-path compatibility metric takes advantages of the optical character recognition (OCR) to compute the compatibility score among a group of stripes. The global composition strategy, based on an integration of greedy composition and optimal matching, is proposed to search for a maximal Hamiltonian path and the final global reassembly. We demonstrate that our solver outperforms the state-of-the-art puzzle solvers on reassembling stripe shredded documents.</description><subject>Algorithms</subject><subject>Compatibility</subject><subject>Composition</subject><subject>Computer Science</subject><subject>Computer Science, Information Systems</subject><subject>Computer Science, Software Engineering</subject><subject>global reconstruction from local alignments</subject><subject>Image reconstruction</subject><subject>Matching</subject><subject>Measurement</subject><subject>Optical character recognition</subject><subject>Optical character recognition software</subject><subject>Optimal matching</subject><subject>Reliability</subject><subject>Science & Technology</subject><subject>Semantics</subject><subject>sequence compatibility measurement</subject><subject>Shredded document reassembly</subject><subject>Solvers</subject><subject>Technology</subject><subject>Telecommunications</subject><subject>Traveling salesman problem</subject><subject>Traveling salesman problems</subject><issn>1520-9210</issn><issn>1941-0077</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><sourceid>AOWDO</sourceid><recordid>eNqNkMFLwzAUh4MoOKd3wUvAo3S-NKlpj1J1ChsTp3gsafPqKl1Tk0zZf2_mhl49vR9573tJPkJOGYwYg-zyeTodxcCyUZwJJqXcIwMWUgQg5X7ISQxRFjM4JEfOvQMwkYAckPYJlXO4LNume6PzhUWtUdMbU62W2Hk697bp0dEXt-m_GqujR-UXdIqhUVHVaTq2iHpNc7PsjWt8Yzo6632zVC2dKl8tfhab9hPtMTmoVevwZFeH5OXu9jm_jyaz8UN-PYkqzrmPSlC6FjoTmguWcHaloE7jcFJelaqsE8mB6TTJSoFJXQvQCCoNMfy-ZFAhH5Lz7d7emo8VOl-8m5XtwpVFzDMuZcpCGRLYTlXWOGexLnobXm3XBYNi47QITouN02LnNCAXW-QLS1O7qsGuwl8MABIOkMkYfuKQpP-fzhuvNu5ys-p8QM-2aIP4h6SpECxO-Dc76ZRy</recordid><startdate>20200501</startdate><enddate>20200501</enddate><creator>Liang, Yongqing</creator><creator>Li, Xin</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AOWDO</scope><scope>BLEPL</scope><scope>DTL</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-7282-0476</orcidid><orcidid>https://orcid.org/0000-0002-0144-9489</orcidid></search><sort><creationdate>20200501</creationdate><title>Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver</title><author>Liang, Yongqing ; Li, Xin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c333t-b0adf4d94d3415316a0f82f4db6babf57301d859b4e5ff40de0a8e5f019b10ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Compatibility</topic><topic>Composition</topic><topic>Computer Science</topic><topic>Computer Science, Information Systems</topic><topic>Computer Science, Software Engineering</topic><topic>global reconstruction from local alignments</topic><topic>Image reconstruction</topic><topic>Matching</topic><topic>Measurement</topic><topic>Optical character recognition</topic><topic>Optical character recognition software</topic><topic>Optimal matching</topic><topic>Reliability</topic><topic>Science & Technology</topic><topic>Semantics</topic><topic>sequence compatibility measurement</topic><topic>Shredded document reassembly</topic><topic>Solvers</topic><topic>Technology</topic><topic>Telecommunications</topic><topic>Traveling salesman problem</topic><topic>Traveling salesman problems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Liang, Yongqing</creatorcontrib><creatorcontrib>Li, Xin</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Web of Science - Science Citation Index Expanded - 2020</collection><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on multimedia</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Liang, Yongqing</au><au>Li, Xin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver</atitle><jtitle>IEEE transactions on multimedia</jtitle><stitle>TMM</stitle><stitle>IEEE T MULTIMEDIA</stitle><date>2020-05-01</date><risdate>2020</risdate><volume>22</volume><issue>5</issue><spage>1168</spage><epage>1181</epage><pages>1168-1181</pages><issn>1520-9210</issn><eissn>1941-0077</eissn><coden>ITMUF8</coden><abstract>This paper develops a shredded document reassembly algorithm based on character/word detection. A new word compatibility estimation metric and a searching strategy called Greedy Composition and Optimal Matching (GCOM) are proposed to compose documents from their vertically shredded stripes. We reduce the stripe puzzle reassembly problem to the traveling salesman problem (TSP) on a sparse graph. The word-path compatibility metric takes advantages of the optical character recognition (OCR) to compute the compatibility score among a group of stripes. The global composition strategy, based on an integration of greedy composition and optimal matching, is proposed to search for a maximal Hamiltonian path and the final global reassembly. We demonstrate that our solver outperforms the state-of-the-art puzzle solvers on reassembling stripe shredded documents.</abstract><cop>PISCATAWAY</cop><pub>IEEE</pub><doi>10.1109/TMM.2019.2941777</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-7282-0476</orcidid><orcidid>https://orcid.org/0000-0002-0144-9489</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-9210 |
ispartof | IEEE transactions on multimedia, 2020-05, Vol.22 (5), p.1168-1181 |
issn | 1520-9210 1941-0077 |
language | eng |
recordid | cdi_proquest_journals_2393778139 |
source | IEEE Electronic Library (IEL) |
subjects | Algorithms Compatibility Composition Computer Science Computer Science, Information Systems Computer Science, Software Engineering global reconstruction from local alignments Image reconstruction Matching Measurement Optical character recognition Optical character recognition software Optimal matching Reliability Science & Technology Semantics sequence compatibility measurement Shredded document reassembly Solvers Technology Telecommunications Traveling salesman problem Traveling salesman problems |
title | Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T16%3A12%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Reassembling%20Shredded%20Document%20Stripes%20Using%20Word-Path%20Metric%20and%20Greedy%20Composition%20Optimal%20Matching%20Solver&rft.jtitle=IEEE%20transactions%20on%20multimedia&rft.au=Liang,%20Yongqing&rft.date=2020-05-01&rft.volume=22&rft.issue=5&rft.spage=1168&rft.epage=1181&rft.pages=1168-1181&rft.issn=1520-9210&rft.eissn=1941-0077&rft.coden=ITMUF8&rft_id=info:doi/10.1109/TMM.2019.2941777&rft_dat=%3Cproquest_RIE%3E2393778139%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2393778139&rft_id=info:pmid/&rft_ieee_id=8844125&rfr_iscdi=true |