Succinct indexes for strings, binary relations and multilabeled trees

We define and design succinct indexes for several abstract data types (ADTs). The concept is to design auxiliary data structures that ideally occupy asymptotically less space than the information-theoretic lower bound on the space required to encode the given data, and support an extended set of ope...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on algorithms 2011-09, Vol.7 (4), p.1-27
Hauptverfasser: Barbay, Jérémy, He, Meng, Munro, J. Ian, Satti, Srinivasa Rao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 27
container_issue 4
container_start_page 1
container_title ACM transactions on algorithms
container_volume 7
creator Barbay, Jérémy
He, Meng
Munro, J. Ian
Satti, Srinivasa Rao
description We define and design succinct indexes for several abstract data types (ADTs). The concept is to design auxiliary data structures that ideally occupy asymptotically less space than the information-theoretic lower bound on the space required to encode the given data, and support an extended set of operations using the basic operators defined in the ADT. The main advantage of succinct indexes as opposed to succinct (integrated data/index) encodings is that we make assumptions only on the ADT through which the main data is accessed, rather than the way in which the data is encoded. This allows more freedom in the encoding of the main data. In this article, we present succinct indexes for various data types, namely strings, binary relations and multilabeled trees. Given the support for the interface of the ADTs of these data types, we can support various useful operations efficiently by constructing succinct indexes for them. When the operators in the ADTs are supported in constant time, our results are comparable to previous results, while allowing more flexibility in the encoding of the given data. Using our techniques, we design a succinct encoding that represents a string of length n over an alphabet of size σ using n H k ( S ) + lg σ · o ( n ) + O ( n lg σ/lg lg lg σ) bits to support access/rank/select operations in o ((lg lg σ) 1+ϵ ) time, for any fixed constant ϵ > 0. We also design a succinct text index using n H 0 ( S ) + O ( n lg σ/lg lg σ) bits that supports finding all the occ occurrences of a given pattern of length m in O ( m lg lg σ + occ lg n /lg ϵ σ) time, for any fixed constant 0 < ϵ < 1. Previous results on these two problems either have a lg σ factor instead of lg lg σ in the running time, or are not compressed. Finally, we present succinct encodings of binary relations and multi-labeled trees that are more compact than previous structures.
doi_str_mv 10.1145/2000807.2000820
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_963919237</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>963919237</sourcerecordid><originalsourceid>FETCH-LOGICAL-c339t-b024352e081bef22c7b369d80b269dcd95f3025609cce4790158b47d5fa2e56e3</originalsourceid><addsrcrecordid>eNo9kD1PwzAURS0EEqUws3pjIe2zX5zEI6rKh1SJAZitxHlBRq5TbEeCf0-hFdO5w9WV7mHsWsBCiFItJQA0UC_-KOGEzYQqdVEh4ul_luqcXaT0AYAasZmx9ctkrQs2cxd6-qLEhzHylKML7-mWdy608ZtH8m12Y0i8DT3fTj4733bkqec5EqVLdja0PtHVkXP2dr9-XT0Wm-eHp9XdprCIOhcdyBKVJGhER4OUtu6w0n0DndzD9loNCFJVoK2lstYgVNOVda-GVpKqCOfs5rC7i-PnRCmbrUuWvG8DjVMyukIttMR631wemjaOKUUazC667f6LEWB-fZmjL3P0hT-GtlzA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>963919237</pqid></control><display><type>article</type><title>Succinct indexes for strings, binary relations and multilabeled trees</title><source>ACM Digital Library Complete</source><creator>Barbay, Jérémy ; He, Meng ; Munro, J. Ian ; Satti, Srinivasa Rao</creator><creatorcontrib>Barbay, Jérémy ; He, Meng ; Munro, J. Ian ; Satti, Srinivasa Rao</creatorcontrib><description>We define and design succinct indexes for several abstract data types (ADTs). The concept is to design auxiliary data structures that ideally occupy asymptotically less space than the information-theoretic lower bound on the space required to encode the given data, and support an extended set of operations using the basic operators defined in the ADT. The main advantage of succinct indexes as opposed to succinct (integrated data/index) encodings is that we make assumptions only on the ADT through which the main data is accessed, rather than the way in which the data is encoded. This allows more freedom in the encoding of the main data. In this article, we present succinct indexes for various data types, namely strings, binary relations and multilabeled trees. Given the support for the interface of the ADTs of these data types, we can support various useful operations efficiently by constructing succinct indexes for them. When the operators in the ADTs are supported in constant time, our results are comparable to previous results, while allowing more flexibility in the encoding of the given data. Using our techniques, we design a succinct encoding that represents a string of length n over an alphabet of size σ using n H k ( S ) + lg σ · o ( n ) + O ( n lg σ/lg lg lg σ) bits to support access/rank/select operations in o ((lg lg σ) 1+ϵ ) time, for any fixed constant ϵ &gt; 0. We also design a succinct text index using n H 0 ( S ) + O ( n lg σ/lg lg σ) bits that supports finding all the occ occurrences of a given pattern of length m in O ( m lg lg σ + occ lg n /lg ϵ σ) time, for any fixed constant 0 &lt; ϵ &lt; 1. Previous results on these two problems either have a lg σ factor instead of lg lg σ in the running time, or are not compressed. Finally, we present succinct encodings of binary relations and multi-labeled trees that are more compact than previous structures.</description><identifier>ISSN: 1549-6325</identifier><identifier>EISSN: 1549-6333</identifier><identifier>DOI: 10.1145/2000807.2000820</identifier><language>eng</language><subject>Algorithms ; Asymptotic properties ; Encoding ; Flexibility ; Lower bounds ; Operators ; Strings ; Trees</subject><ispartof>ACM transactions on algorithms, 2011-09, Vol.7 (4), p.1-27</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c339t-b024352e081bef22c7b369d80b269dcd95f3025609cce4790158b47d5fa2e56e3</citedby><cites>FETCH-LOGICAL-c339t-b024352e081bef22c7b369d80b269dcd95f3025609cce4790158b47d5fa2e56e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Barbay, Jérémy</creatorcontrib><creatorcontrib>He, Meng</creatorcontrib><creatorcontrib>Munro, J. Ian</creatorcontrib><creatorcontrib>Satti, Srinivasa Rao</creatorcontrib><title>Succinct indexes for strings, binary relations and multilabeled trees</title><title>ACM transactions on algorithms</title><description>We define and design succinct indexes for several abstract data types (ADTs). The concept is to design auxiliary data structures that ideally occupy asymptotically less space than the information-theoretic lower bound on the space required to encode the given data, and support an extended set of operations using the basic operators defined in the ADT. The main advantage of succinct indexes as opposed to succinct (integrated data/index) encodings is that we make assumptions only on the ADT through which the main data is accessed, rather than the way in which the data is encoded. This allows more freedom in the encoding of the main data. In this article, we present succinct indexes for various data types, namely strings, binary relations and multilabeled trees. Given the support for the interface of the ADTs of these data types, we can support various useful operations efficiently by constructing succinct indexes for them. When the operators in the ADTs are supported in constant time, our results are comparable to previous results, while allowing more flexibility in the encoding of the given data. Using our techniques, we design a succinct encoding that represents a string of length n over an alphabet of size σ using n H k ( S ) + lg σ · o ( n ) + O ( n lg σ/lg lg lg σ) bits to support access/rank/select operations in o ((lg lg σ) 1+ϵ ) time, for any fixed constant ϵ &gt; 0. We also design a succinct text index using n H 0 ( S ) + O ( n lg σ/lg lg σ) bits that supports finding all the occ occurrences of a given pattern of length m in O ( m lg lg σ + occ lg n /lg ϵ σ) time, for any fixed constant 0 &lt; ϵ &lt; 1. Previous results on these two problems either have a lg σ factor instead of lg lg σ in the running time, or are not compressed. Finally, we present succinct encodings of binary relations and multi-labeled trees that are more compact than previous structures.</description><subject>Algorithms</subject><subject>Asymptotic properties</subject><subject>Encoding</subject><subject>Flexibility</subject><subject>Lower bounds</subject><subject>Operators</subject><subject>Strings</subject><subject>Trees</subject><issn>1549-6325</issn><issn>1549-6333</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><recordid>eNo9kD1PwzAURS0EEqUws3pjIe2zX5zEI6rKh1SJAZitxHlBRq5TbEeCf0-hFdO5w9WV7mHsWsBCiFItJQA0UC_-KOGEzYQqdVEh4ul_luqcXaT0AYAasZmx9ctkrQs2cxd6-qLEhzHylKML7-mWdy608ZtH8m12Y0i8DT3fTj4733bkqec5EqVLdja0PtHVkXP2dr9-XT0Wm-eHp9XdprCIOhcdyBKVJGhER4OUtu6w0n0DndzD9loNCFJVoK2lstYgVNOVda-GVpKqCOfs5rC7i-PnRCmbrUuWvG8DjVMyukIttMR631wemjaOKUUazC667f6LEWB-fZmjL3P0hT-GtlzA</recordid><startdate>201109</startdate><enddate>201109</enddate><creator>Barbay, Jérémy</creator><creator>He, Meng</creator><creator>Munro, J. Ian</creator><creator>Satti, Srinivasa Rao</creator><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>201109</creationdate><title>Succinct indexes for strings, binary relations and multilabeled trees</title><author>Barbay, Jérémy ; He, Meng ; Munro, J. Ian ; Satti, Srinivasa Rao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c339t-b024352e081bef22c7b369d80b269dcd95f3025609cce4790158b47d5fa2e56e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Algorithms</topic><topic>Asymptotic properties</topic><topic>Encoding</topic><topic>Flexibility</topic><topic>Lower bounds</topic><topic>Operators</topic><topic>Strings</topic><topic>Trees</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Barbay, Jérémy</creatorcontrib><creatorcontrib>He, Meng</creatorcontrib><creatorcontrib>Munro, J. Ian</creatorcontrib><creatorcontrib>Satti, Srinivasa Rao</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on algorithms</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Barbay, Jérémy</au><au>He, Meng</au><au>Munro, J. Ian</au><au>Satti, Srinivasa Rao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Succinct indexes for strings, binary relations and multilabeled trees</atitle><jtitle>ACM transactions on algorithms</jtitle><date>2011-09</date><risdate>2011</risdate><volume>7</volume><issue>4</issue><spage>1</spage><epage>27</epage><pages>1-27</pages><issn>1549-6325</issn><eissn>1549-6333</eissn><abstract>We define and design succinct indexes for several abstract data types (ADTs). The concept is to design auxiliary data structures that ideally occupy asymptotically less space than the information-theoretic lower bound on the space required to encode the given data, and support an extended set of operations using the basic operators defined in the ADT. The main advantage of succinct indexes as opposed to succinct (integrated data/index) encodings is that we make assumptions only on the ADT through which the main data is accessed, rather than the way in which the data is encoded. This allows more freedom in the encoding of the main data. In this article, we present succinct indexes for various data types, namely strings, binary relations and multilabeled trees. Given the support for the interface of the ADTs of these data types, we can support various useful operations efficiently by constructing succinct indexes for them. When the operators in the ADTs are supported in constant time, our results are comparable to previous results, while allowing more flexibility in the encoding of the given data. Using our techniques, we design a succinct encoding that represents a string of length n over an alphabet of size σ using n H k ( S ) + lg σ · o ( n ) + O ( n lg σ/lg lg lg σ) bits to support access/rank/select operations in o ((lg lg σ) 1+ϵ ) time, for any fixed constant ϵ &gt; 0. We also design a succinct text index using n H 0 ( S ) + O ( n lg σ/lg lg σ) bits that supports finding all the occ occurrences of a given pattern of length m in O ( m lg lg σ + occ lg n /lg ϵ σ) time, for any fixed constant 0 &lt; ϵ &lt; 1. Previous results on these two problems either have a lg σ factor instead of lg lg σ in the running time, or are not compressed. Finally, we present succinct encodings of binary relations and multi-labeled trees that are more compact than previous structures.</abstract><doi>10.1145/2000807.2000820</doi><tpages>27</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1549-6325
ispartof ACM transactions on algorithms, 2011-09, Vol.7 (4), p.1-27
issn 1549-6325
1549-6333
language eng
recordid cdi_proquest_miscellaneous_963919237
source ACM Digital Library Complete
subjects Algorithms
Asymptotic properties
Encoding
Flexibility
Lower bounds
Operators
Strings
Trees
title Succinct indexes for strings, binary relations and multilabeled trees
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T23%3A30%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Succinct%20indexes%20for%20strings,%20binary%20relations%20and%20multilabeled%20trees&rft.jtitle=ACM%20transactions%20on%20algorithms&rft.au=Barbay,%20J%C3%A9r%C3%A9my&rft.date=2011-09&rft.volume=7&rft.issue=4&rft.spage=1&rft.epage=27&rft.pages=1-27&rft.issn=1549-6325&rft.eissn=1549-6333&rft_id=info:doi/10.1145/2000807.2000820&rft_dat=%3Cproquest_cross%3E963919237%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=963919237&rft_id=info:pmid/&rfr_iscdi=true