Speeding deduplication using a most wanted digest cache

Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically-positioned within a filesystem, the digest providing a hash value of data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory fo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Wang, Yubing, Gillono, John, Seibel, Christopher A, Armangau, Philippe, Caram, Bruce E
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Wang, Yubing
Gillono, John
Seibel, Christopher A
Armangau, Philippe
Caram, Bruce E
description Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically-positioned within a filesystem, the digest providing a hash value of data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory for the digest, (c) locating an entry in the MWDC using the digest, wherein this locating indicates that the data block has the same data as another data block located elsewhere within the filesystem, the other data block having been previously persistently-stored, the entry having been added to the MWDC in response to the other data block having been deduplicated at least a plurality number of times, (d) locating a mapping structure referenced by the entry located from the MWDC, the mapping structure providing metadata about the other data block, and (e) deduplicating the data block and the other data block with reference to the located mapping structure.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US11093454B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US11093454B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US11093454B23</originalsourceid><addsrcrecordid>eNrjZDAPLkhNTcnMS1dISU0pLcjJTE4syczPUygtBoklKuTmF5colCfmlaSmKKRkpqcCecmJyRmpPAysaYk5xam8UJqbQdHNNcTZQze1ID8-tbggMTk1L7UkPjTY0NDA0tjE1MTJyJgYNQDVIi39</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Speeding deduplication using a most wanted digest cache</title><source>esp@cenet</source><creator>Wang, Yubing ; Gillono, John ; Seibel, Christopher A ; Armangau, Philippe ; Caram, Bruce E</creator><creatorcontrib>Wang, Yubing ; Gillono, John ; Seibel, Christopher A ; Armangau, Philippe ; Caram, Bruce E</creatorcontrib><description>Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically-positioned within a filesystem, the digest providing a hash value of data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory for the digest, (c) locating an entry in the MWDC using the digest, wherein this locating indicates that the data block has the same data as another data block located elsewhere within the filesystem, the other data block having been previously persistently-stored, the entry having been added to the MWDC in response to the other data block having been deduplicated at least a plurality number of times, (d) locating a mapping structure referenced by the entry located from the MWDC, the mapping structure providing metadata about the other data block, and (e) deduplicating the data block and the other data block with reference to the located mapping structure.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210817&amp;DB=EPODOC&amp;CC=US&amp;NR=11093454B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210817&amp;DB=EPODOC&amp;CC=US&amp;NR=11093454B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Wang, Yubing</creatorcontrib><creatorcontrib>Gillono, John</creatorcontrib><creatorcontrib>Seibel, Christopher A</creatorcontrib><creatorcontrib>Armangau, Philippe</creatorcontrib><creatorcontrib>Caram, Bruce E</creatorcontrib><title>Speeding deduplication using a most wanted digest cache</title><description>Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically-positioned within a filesystem, the digest providing a hash value of data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory for the digest, (c) locating an entry in the MWDC using the digest, wherein this locating indicates that the data block has the same data as another data block located elsewhere within the filesystem, the other data block having been previously persistently-stored, the entry having been added to the MWDC in response to the other data block having been deduplicated at least a plurality number of times, (d) locating a mapping structure referenced by the entry located from the MWDC, the mapping structure providing metadata about the other data block, and (e) deduplicating the data block and the other data block with reference to the located mapping structure.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDAPLkhNTcnMS1dISU0pLcjJTE4syczPUygtBoklKuTmF5colCfmlaSmKKRkpqcCecmJyRmpPAysaYk5xam8UJqbQdHNNcTZQze1ID8-tbggMTk1L7UkPjTY0NDA0tjE1MTJyJgYNQDVIi39</recordid><startdate>20210817</startdate><enddate>20210817</enddate><creator>Wang, Yubing</creator><creator>Gillono, John</creator><creator>Seibel, Christopher A</creator><creator>Armangau, Philippe</creator><creator>Caram, Bruce E</creator><scope>EVB</scope></search><sort><creationdate>20210817</creationdate><title>Speeding deduplication using a most wanted digest cache</title><author>Wang, Yubing ; Gillono, John ; Seibel, Christopher A ; Armangau, Philippe ; Caram, Bruce E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US11093454B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang, Yubing</creatorcontrib><creatorcontrib>Gillono, John</creatorcontrib><creatorcontrib>Seibel, Christopher A</creatorcontrib><creatorcontrib>Armangau, Philippe</creatorcontrib><creatorcontrib>Caram, Bruce E</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wang, Yubing</au><au>Gillono, John</au><au>Seibel, Christopher A</au><au>Armangau, Philippe</au><au>Caram, Bruce E</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Speeding deduplication using a most wanted digest cache</title><date>2021-08-17</date><risdate>2021</risdate><abstract>Embodiments are directed to techniques for performing deduplication. A method includes (a) obtaining a digest of a data block logically-positioned within a filesystem, the digest providing a hash value of data of the data block, (b) searching a Most Wanted Digest Cache (MWDC) within system memory for the digest, (c) locating an entry in the MWDC using the digest, wherein this locating indicates that the data block has the same data as another data block located elsewhere within the filesystem, the other data block having been previously persistently-stored, the entry having been added to the MWDC in response to the other data block having been deduplicated at least a plurality number of times, (d) locating a mapping structure referenced by the entry located from the MWDC, the mapping structure providing metadata about the other data block, and (e) deduplicating the data block and the other data block with reference to the located mapping structure.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US11093454B2
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Speeding deduplication using a most wanted digest cache
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T22%3A12%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Wang,%20Yubing&rft.date=2021-08-17&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS11093454B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true