Compressing a plurality of documents

Documents are compressed. A partially compressed document is obtained. The partially compressed document includes one or more code words that replace one or more common tokens of a document to be compressed. The one or more common tokens are tokens common to a plurality of documents, and included in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Gnech, Thomas H, Koenig, Steffen, Petrik, Oliver, Roehrig, Jochen, Illner, Regina, Zoellin, Christian
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Gnech, Thomas H
Koenig, Steffen
Petrik, Oliver
Roehrig, Jochen
Illner, Regina
Zoellin, Christian
description Documents are compressed. A partially compressed document is obtained. The partially compressed document includes one or more code words that replace one or more common tokens of a document to be compressed. The one or more common tokens are tokens common to a plurality of documents, and included in a common dictionary. The common dictionary provides a mapping of code words to common tokens. A document associated dictionary is created from non-common tokens of the document to be compressed. The document associated dictionary provides another mapping of other code words to the non-common tokens. A compressed document is created. The creating of the compressed document includes replacing one or more non-common tokens of the partially compressed document with one or more other code words of the document associated dictionary. The compressed document includes the one or more code words of the partially compressed document and the one or more other code words of the document associated dictionary.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US10956440B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US10956440B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US10956440B23</originalsourceid><addsrcrecordid>eNrjZFBxzs8tKEotLs7MS1dIVCjIKS1KzMksqVTIT1NIyU8uzU3NKynmYWBNS8wpTuWF0twMim6uIc4euqkF-fGpxQWJyal5qSXxocGGBpamZiYmBk5GxsSoAQAtcicx</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Compressing a plurality of documents</title><source>esp@cenet</source><creator>Gnech, Thomas H ; Koenig, Steffen ; Petrik, Oliver ; Roehrig, Jochen ; Illner, Regina ; Zoellin, Christian</creator><creatorcontrib>Gnech, Thomas H ; Koenig, Steffen ; Petrik, Oliver ; Roehrig, Jochen ; Illner, Regina ; Zoellin, Christian</creatorcontrib><description>Documents are compressed. A partially compressed document is obtained. The partially compressed document includes one or more code words that replace one or more common tokens of a document to be compressed. The one or more common tokens are tokens common to a plurality of documents, and included in a common dictionary. The common dictionary provides a mapping of code words to common tokens. A document associated dictionary is created from non-common tokens of the document to be compressed. The document associated dictionary provides another mapping of other code words to the non-common tokens. A compressed document is created. The creating of the compressed document includes replacing one or more non-common tokens of the partially compressed document with one or more other code words of the document associated dictionary. The compressed document includes the one or more code words of the partially compressed document and the one or more other code words of the document associated dictionary.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210323&amp;DB=EPODOC&amp;CC=US&amp;NR=10956440B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210323&amp;DB=EPODOC&amp;CC=US&amp;NR=10956440B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Gnech, Thomas H</creatorcontrib><creatorcontrib>Koenig, Steffen</creatorcontrib><creatorcontrib>Petrik, Oliver</creatorcontrib><creatorcontrib>Roehrig, Jochen</creatorcontrib><creatorcontrib>Illner, Regina</creatorcontrib><creatorcontrib>Zoellin, Christian</creatorcontrib><title>Compressing a plurality of documents</title><description>Documents are compressed. A partially compressed document is obtained. The partially compressed document includes one or more code words that replace one or more common tokens of a document to be compressed. The one or more common tokens are tokens common to a plurality of documents, and included in a common dictionary. The common dictionary provides a mapping of code words to common tokens. A document associated dictionary is created from non-common tokens of the document to be compressed. The document associated dictionary provides another mapping of other code words to the non-common tokens. A compressed document is created. The creating of the compressed document includes replacing one or more non-common tokens of the partially compressed document with one or more other code words of the document associated dictionary. The compressed document includes the one or more code words of the partially compressed document and the one or more other code words of the document associated dictionary.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZFBxzs8tKEotLs7MS1dIVCjIKS1KzMksqVTIT1NIyU8uzU3NKynmYWBNS8wpTuWF0twMim6uIc4euqkF-fGpxQWJyal5qSXxocGGBpamZiYmBk5GxsSoAQAtcicx</recordid><startdate>20210323</startdate><enddate>20210323</enddate><creator>Gnech, Thomas H</creator><creator>Koenig, Steffen</creator><creator>Petrik, Oliver</creator><creator>Roehrig, Jochen</creator><creator>Illner, Regina</creator><creator>Zoellin, Christian</creator><scope>EVB</scope></search><sort><creationdate>20210323</creationdate><title>Compressing a plurality of documents</title><author>Gnech, Thomas H ; Koenig, Steffen ; Petrik, Oliver ; Roehrig, Jochen ; Illner, Regina ; Zoellin, Christian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US10956440B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Gnech, Thomas H</creatorcontrib><creatorcontrib>Koenig, Steffen</creatorcontrib><creatorcontrib>Petrik, Oliver</creatorcontrib><creatorcontrib>Roehrig, Jochen</creatorcontrib><creatorcontrib>Illner, Regina</creatorcontrib><creatorcontrib>Zoellin, Christian</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gnech, Thomas H</au><au>Koenig, Steffen</au><au>Petrik, Oliver</au><au>Roehrig, Jochen</au><au>Illner, Regina</au><au>Zoellin, Christian</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Compressing a plurality of documents</title><date>2021-03-23</date><risdate>2021</risdate><abstract>Documents are compressed. A partially compressed document is obtained. The partially compressed document includes one or more code words that replace one or more common tokens of a document to be compressed. The one or more common tokens are tokens common to a plurality of documents, and included in a common dictionary. The common dictionary provides a mapping of code words to common tokens. A document associated dictionary is created from non-common tokens of the document to be compressed. The document associated dictionary provides another mapping of other code words to the non-common tokens. A compressed document is created. The creating of the compressed document includes replacing one or more non-common tokens of the partially compressed document with one or more other code words of the document associated dictionary. The compressed document includes the one or more code words of the partially compressed document and the one or more other code words of the document associated dictionary.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US10956440B2
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Compressing a plurality of documents
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T11%3A11%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Gnech,%20Thomas%20H&rft.date=2021-03-23&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS10956440B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true