Postings List Compression and Decompression on Mobile Devices

Recent years has seen a tremendous increase in both the performance of handheld devices and the use cases they are required to fullfil. Indeed, operations previously reserved for handling on personal computers have begun being executed on smart phones and tablets instead. This revolutionary developm...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Pedersen, Lars Martin S
Format: Dissertation
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Pedersen, Lars Martin S
description Recent years has seen a tremendous increase in both the performance of handheld devices and the use cases they are required to fullfil. Indeed, operations previously reserved for handling on personal computers have begun being executed on smart phones and tablets instead. This revolutionary development allows one to exploit handheld device hardware in novel applications.Trondheim-based start-up Atbrox is engaged in an EU project where Atbrox focus is search on mobile devices. An important component of search is the inverted index, and within, the per term postings list an encoded list of Unified Resource Identifiers (URI). Decoding of a postings list must be fast in order to not comprimise the user experience, but is also required to hold a small storage footprint. As the first to our knowledge, this thesis attempts to identify the properties of postings list encoding and decoding on handheld devices.Variable-byte coding, Group Varint coding, and Elias gamma coding are implemented in Objective-C. Performance is surveyed by benchmarking three devices out of Apple: A 5th generation iPod, a 4th generation iPad, and an iPad Air. Executions are run from disk to-disk, i.e. by reading a block of data, applying either encoding or decoding, and writing the result to permanent storage. Block sizes are varied. In addition, multithreading is applied during both encoding and decoding and compared to serial executions in an attempt to identify the properties under which each coding scheme performs best.This thesis provides valueable insight to the properties of coding schemes on handheld devices. Among its findings is the varying degree of performance and compression ratio between coding schemes: Group Varint proves to outperform the two others in terms of speed, however, is lacking in terms of compression. Elias gamma code provides the best compression ratio, but is the slowest in both encoding and decoding. Results also prove a strong correspondance between block size and performance, although a point of saturation is reached at 512 KiB. Additionally, block sizes below 512 KiB display an inability to take advantage of multithreading.
format Dissertation
fullrecord <record><control><sourceid>cristin_3HK</sourceid><recordid>TN_cdi_cristin_nora_11250_253665</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>11250_253665</sourcerecordid><originalsourceid>FETCH-cristin_nora_11250_2536653</originalsourceid><addsrcrecordid>eNrjZLANyC8uycxLL1bwySwuUXDOzy0oSi0uzszPU0jMS1FwSU1GEgEi3_ykzJxUoHhZZnJqMQ8Da1piTnEqL5TmZlBwcw1x9tBNLsoEGRufl1-UGG9oaGRqEG9kamxmZmpMhBIAszsvTw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>dissertation</recordtype></control><display><type>dissertation</type><title>Postings List Compression and Decompression on Mobile Devices</title><source>NORA - Norwegian Open Research Archives</source><creator>Pedersen, Lars Martin S</creator><creatorcontrib>Pedersen, Lars Martin S ; Elster, Anne Cathrine ; Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap</creatorcontrib><description>Recent years has seen a tremendous increase in both the performance of handheld devices and the use cases they are required to fullfil. Indeed, operations previously reserved for handling on personal computers have begun being executed on smart phones and tablets instead. This revolutionary development allows one to exploit handheld device hardware in novel applications.Trondheim-based start-up Atbrox is engaged in an EU project where Atbrox focus is search on mobile devices. An important component of search is the inverted index, and within, the per term postings list an encoded list of Unified Resource Identifiers (URI). Decoding of a postings list must be fast in order to not comprimise the user experience, but is also required to hold a small storage footprint. As the first to our knowledge, this thesis attempts to identify the properties of postings list encoding and decoding on handheld devices.Variable-byte coding, Group Varint coding, and Elias gamma coding are implemented in Objective-C. Performance is surveyed by benchmarking three devices out of Apple: A 5th generation iPod, a 4th generation iPad, and an iPad Air. Executions are run from disk to-disk, i.e. by reading a block of data, applying either encoding or decoding, and writing the result to permanent storage. Block sizes are varied. In addition, multithreading is applied during both encoding and decoding and compared to serial executions in an attempt to identify the properties under which each coding scheme performs best.This thesis provides valueable insight to the properties of coding schemes on handheld devices. Among its findings is the varying degree of performance and compression ratio between coding schemes: Group Varint proves to outperform the two others in terms of speed, however, is lacking in terms of compression. Elias gamma code provides the best compression ratio, but is the slowest in both encoding and decoding. Results also prove a strong correspondance between block size and performance, although a point of saturation is reached at 512 KiB. Additionally, block sizes below 512 KiB display an inability to take advantage of multithreading.</description><language>eng</language><publisher>Institutt for datateknikk og informasjonsvitenskap</publisher><creationdate>2013</creationdate><rights>info:eu-repo/semantics/openAccess</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,311,776,881,4038,26544</link.rule.ids><linktorsrc>$$Uhttp://hdl.handle.net/11250/253665$$EView_record_in_NORA$$FView_record_in_$$GNORA$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Pedersen, Lars Martin S</creatorcontrib><title>Postings List Compression and Decompression on Mobile Devices</title><description>Recent years has seen a tremendous increase in both the performance of handheld devices and the use cases they are required to fullfil. Indeed, operations previously reserved for handling on personal computers have begun being executed on smart phones and tablets instead. This revolutionary development allows one to exploit handheld device hardware in novel applications.Trondheim-based start-up Atbrox is engaged in an EU project where Atbrox focus is search on mobile devices. An important component of search is the inverted index, and within, the per term postings list an encoded list of Unified Resource Identifiers (URI). Decoding of a postings list must be fast in order to not comprimise the user experience, but is also required to hold a small storage footprint. As the first to our knowledge, this thesis attempts to identify the properties of postings list encoding and decoding on handheld devices.Variable-byte coding, Group Varint coding, and Elias gamma coding are implemented in Objective-C. Performance is surveyed by benchmarking three devices out of Apple: A 5th generation iPod, a 4th generation iPad, and an iPad Air. Executions are run from disk to-disk, i.e. by reading a block of data, applying either encoding or decoding, and writing the result to permanent storage. Block sizes are varied. In addition, multithreading is applied during both encoding and decoding and compared to serial executions in an attempt to identify the properties under which each coding scheme performs best.This thesis provides valueable insight to the properties of coding schemes on handheld devices. Among its findings is the varying degree of performance and compression ratio between coding schemes: Group Varint proves to outperform the two others in terms of speed, however, is lacking in terms of compression. Elias gamma code provides the best compression ratio, but is the slowest in both encoding and decoding. Results also prove a strong correspondance between block size and performance, although a point of saturation is reached at 512 KiB. Additionally, block sizes below 512 KiB display an inability to take advantage of multithreading.</description><fulltext>true</fulltext><rsrctype>dissertation</rsrctype><creationdate>2013</creationdate><recordtype>dissertation</recordtype><sourceid>3HK</sourceid><recordid>eNrjZLANyC8uycxLL1bwySwuUXDOzy0oSi0uzszPU0jMS1FwSU1GEgEi3_ykzJxUoHhZZnJqMQ8Da1piTnEqL5TmZlBwcw1x9tBNLsoEGRufl1-UGG9oaGRqEG9kamxmZmpMhBIAszsvTw</recordid><startdate>2013</startdate><enddate>2013</enddate><creator>Pedersen, Lars Martin S</creator><general>Institutt for datateknikk og informasjonsvitenskap</general><scope>3HK</scope></search><sort><creationdate>2013</creationdate><title>Postings List Compression and Decompression on Mobile Devices</title><author>Pedersen, Lars Martin S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-cristin_nora_11250_2536653</frbrgroupid><rsrctype>dissertations</rsrctype><prefilter>dissertations</prefilter><language>eng</language><creationdate>2013</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Pedersen, Lars Martin S</creatorcontrib><collection>NORA - Norwegian Open Research Archives</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Pedersen, Lars Martin S</au><format>dissertation</format><genre>dissertation</genre><ristype>THES</ristype><Advisor>Elster, Anne Cathrine</Advisor><Advisor>Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap</Advisor><btitle>Postings List Compression and Decompression on Mobile Devices</btitle><date>2013</date><risdate>2013</risdate><abstract>Recent years has seen a tremendous increase in both the performance of handheld devices and the use cases they are required to fullfil. Indeed, operations previously reserved for handling on personal computers have begun being executed on smart phones and tablets instead. This revolutionary development allows one to exploit handheld device hardware in novel applications.Trondheim-based start-up Atbrox is engaged in an EU project where Atbrox focus is search on mobile devices. An important component of search is the inverted index, and within, the per term postings list an encoded list of Unified Resource Identifiers (URI). Decoding of a postings list must be fast in order to not comprimise the user experience, but is also required to hold a small storage footprint. As the first to our knowledge, this thesis attempts to identify the properties of postings list encoding and decoding on handheld devices.Variable-byte coding, Group Varint coding, and Elias gamma coding are implemented in Objective-C. Performance is surveyed by benchmarking three devices out of Apple: A 5th generation iPod, a 4th generation iPad, and an iPad Air. Executions are run from disk to-disk, i.e. by reading a block of data, applying either encoding or decoding, and writing the result to permanent storage. Block sizes are varied. In addition, multithreading is applied during both encoding and decoding and compared to serial executions in an attempt to identify the properties under which each coding scheme performs best.This thesis provides valueable insight to the properties of coding schemes on handheld devices. Among its findings is the varying degree of performance and compression ratio between coding schemes: Group Varint proves to outperform the two others in terms of speed, however, is lacking in terms of compression. Elias gamma code provides the best compression ratio, but is the slowest in both encoding and decoding. Results also prove a strong correspondance between block size and performance, although a point of saturation is reached at 512 KiB. Additionally, block sizes below 512 KiB display an inability to take advantage of multithreading.</abstract><pub>Institutt for datateknikk og informasjonsvitenskap</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_cristin_nora_11250_253665
source NORA - Norwegian Open Research Archives
title Postings List Compression and Decompression on Mobile Devices
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T19%3A19%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&rft.genre=dissertation&rft.btitle=Postings%20List%20Compression%20and%20Decompression%20on%20Mobile%20Devices&rft.au=Pedersen,%20Lars%20Martin%20S&rft.date=2013&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E11250_253665%3C/cristin_3HK%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true