USING INDEX PARTITIONING AND RECONCILIATION FOR DATA DEDUPLICATION

The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chun...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	SENGUPTA SUDIPTA, BENTON JAMES ROBERT, LI JIN, KALACH RAN, OLTEAN PAUL ADRIAN, DESAI RONAKKUMAR N
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	SENGUPTA SUDIPTA BENTON JAMES ROBERT LI JIN KALACH RAN OLTEAN PAUL ADRIAN DESAI RONAKKUMAR N
description	The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2016012098A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2016012098A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2016012098A13</originalsourceid><addsrcrecordid>eNrjZHAKDfb0c1fw9HNxjVAIcAwK8Qzx9PcDCTn6uSgEuTr7-zl7-ng6gkQV3PyDFFwcQxwVXFxdQgN8PJ3BwjwMrGmJOcWpvFCam0HZzTXE2UM3tSA_PrW4IDE5NS-1JD402MjA0MzA0MjA0sLR0Jg4VQC78iw5</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>USING INDEX PARTITIONING AND RECONCILIATION FOR DATA DEDUPLICATION</title><source>esp@cenet</source><creator>SENGUPTA SUDIPTA ; BENTON JAMES ROBERT ; LI JIN ; KALACH RAN ; OLTEAN PAUL ADRIAN ; DESAI RONAKKUMAR N</creator><creatorcontrib>SENGUPTA SUDIPTA ; BENTON JAMES ROBERT ; LI JIN ; KALACH RAN ; OLTEAN PAUL ADRIAN ; DESAI RONAKKUMAR N</creatorcontrib><description>The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2016</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20160114&DB=EPODOC&CC=US&NR=2016012098A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20160114&DB=EPODOC&CC=US&NR=2016012098A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SENGUPTA SUDIPTA</creatorcontrib><creatorcontrib>BENTON JAMES ROBERT</creatorcontrib><creatorcontrib>LI JIN</creatorcontrib><creatorcontrib>KALACH RAN</creatorcontrib><creatorcontrib>OLTEAN PAUL ADRIAN</creatorcontrib><creatorcontrib>DESAI RONAKKUMAR N</creatorcontrib><title>USING INDEX PARTITIONING AND RECONCILIATION FOR DATA DEDUPLICATION</title><description>The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2016</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHAKDfb0c1fw9HNxjVAIcAwK8Qzx9PcDCTn6uSgEuTr7-zl7-ng6gkQV3PyDFFwcQxwVXFxdQgN8PJ3BwjwMrGmJOcWpvFCam0HZzTXE2UM3tSA_PrW4IDE5NS-1JD402MjA0MzA0MjA0sLR0Jg4VQC78iw5</recordid><startdate>20160114</startdate><enddate>20160114</enddate><creator>SENGUPTA SUDIPTA</creator><creator>BENTON JAMES ROBERT</creator><creator>LI JIN</creator><creator>KALACH RAN</creator><creator>OLTEAN PAUL ADRIAN</creator><creator>DESAI RONAKKUMAR N</creator><scope>EVB</scope></search><sort><creationdate>20160114</creationdate><title>USING INDEX PARTITIONING AND RECONCILIATION FOR DATA DEDUPLICATION</title><author>SENGUPTA SUDIPTA ; BENTON JAMES ROBERT ; LI JIN ; KALACH RAN ; OLTEAN PAUL ADRIAN ; DESAI RONAKKUMAR N</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2016012098A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2016</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>SENGUPTA SUDIPTA</creatorcontrib><creatorcontrib>BENTON JAMES ROBERT</creatorcontrib><creatorcontrib>LI JIN</creatorcontrib><creatorcontrib>KALACH RAN</creatorcontrib><creatorcontrib>OLTEAN PAUL ADRIAN</creatorcontrib><creatorcontrib>DESAI RONAKKUMAR N</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SENGUPTA SUDIPTA</au><au>BENTON JAMES ROBERT</au><au>LI JIN</au><au>KALACH RAN</au><au>OLTEAN PAUL ADRIAN</au><au>DESAI RONAKKUMAR N</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>USING INDEX PARTITIONING AND RECONCILIATION FOR DATA DEDUPLICATION</title><date>2016-01-14</date><risdate>2016</risdate><abstract>The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_US2016012098A1
source	esp@cenet
subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
title	USING INDEX PARTITIONING AND RECONCILIATION FOR DATA DEDUPLICATION
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T21%3A48%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SENGUPTA%20SUDIPTA&rft.date=2016-01-14&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2016012098A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true