Inverted file partitioning schemes in multiple disk systems

Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/...

Detailed description

Saved in:
Bibliographic details
Published in: IEEE transactions on parallel and distributed systems 1995-02, Vol.6 (2), p.142-153
Main authors: Byeong-Soo Jeong, Omiecinski, E.
Format: Article
Language: eng
Subjects:
Online access: Order full text
container_end_page 153
container_issue 2
container_start_page 142
container_title IEEE transactions on parallel and distributed systems
container_volume 6
creator Byeong-Soo Jeong
Omiecinski, E.
description Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks is varied and the multiprogramming level is varied.
doi_str_mv 10.1109/71.342125
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_342125</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>342125</ieee_id><sourcerecordid>28493559</sourcerecordid><originalsourceid>FETCH-LOGICAL-c366t-fd891040ca7cb1306175d802507d813f3cd47fb645384cd64a3a7afc6ee827fa3</originalsourceid><addsrcrecordid>eNqFkD1LBDEQhoMoeJ4WtlZbiGCxmu9ksZLDj4MDG62XXHai0f0ysyfcv3dlDy2tZmCeeV54CTll9IoxWlwbdiUkZ1ztkRlTyuacWbE_7lSqvOCsOCRHiO-UMqmonJGbZfsFaYAqC7GGrHdpiEPs2ti-ZujfoAHMYps1m3qI_QhUET8y3OIADR6Tg-BqhJPdnJOX-7vnxWO-enpYLm5XuRdaD3mobDHGU--MXzNBNTOqspQrairLRBC-kiastVTCSl9p6YQzLngNYLkJTszJxeTtU_e5ARzKJqKHunYtdBssuZWFUKr4HxTGioLTEbycQJ86xASh7FNsXNqWjJY_PZaGlVOPI3u-kzr0rg7JtT7i78MYrLXQI3Y2YREA_q6T4xtgcnlH</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>23783920</pqid></control><display><type>article</type><title>Inverted file partitioning schemes in multiple disk systems</title><source>IEEE Electronic Library (IEL)</source><creator>Byeong-Soo Jeong ; Omiecinski, E.</creator><creatorcontrib>Byeong-Soo Jeong ; Omiecinski, E.</creatorcontrib><description>Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. 
We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.&lt; &gt;</description><identifier>ISSN: 1045-9219</identifier><identifier>EISSN: 1558-2183</identifier><identifier>DOI: 10.1109/71.342125</identifier><identifier>CODEN: ITDSEO</identifier><language>eng</language><publisher>Los Alamitos, CA: IEEE</publisher><subject>Applied sciences ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Exact sciences and technology ; File systems ; Frequency ; Information retrieval ; Load management ; Message passing ; Multiprocessing systems ; Parallel architectures ; Scalability ; Software ; Spatial databases ; System performance</subject><ispartof>IEEE transactions on parallel and distributed systems, 1995-02, Vol.6 (2), p.142-153</ispartof><rights>1995 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c366t-fd891040ca7cb1306175d802507d813f3cd47fb645384cd64a3a7afc6ee827fa3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/342125$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/342125$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=3556636$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Byeong-Soo Jeong</creatorcontrib><creatorcontrib>Omiecinski, E.</creatorcontrib><title>Inverted file partitioning schemes in multiple disk 
systems</title><title>IEEE transactions on parallel and distributed systems</title><addtitle>TPDS</addtitle><description>Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.&lt; &gt;</description><subject>Applied sciences</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. 
User interface</subject><subject>Exact sciences and technology</subject><subject>File systems</subject><subject>Frequency</subject><subject>Information retrieval</subject><subject>Load management</subject><subject>Message passing</subject><subject>Multiprocessing systems</subject><subject>Parallel architectures</subject><subject>Scalability</subject><subject>Software</subject><subject>Spatial databases</subject><subject>System performance</subject><issn>1045-9219</issn><issn>1558-2183</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1995</creationdate><recordtype>article</recordtype><recordid>eNqFkD1LBDEQhoMoeJ4WtlZbiGCxmu9ksZLDj4MDG62XXHai0f0ysyfcv3dlDy2tZmCeeV54CTll9IoxWlwbdiUkZ1ztkRlTyuacWbE_7lSqvOCsOCRHiO-UMqmonJGbZfsFaYAqC7GGrHdpiEPs2ti-ZujfoAHMYps1m3qI_QhUET8y3OIADR6Tg-BqhJPdnJOX-7vnxWO-enpYLm5XuRdaD3mobDHGU--MXzNBNTOqspQrairLRBC-kiastVTCSl9p6YQzLngNYLkJTszJxeTtU_e5ARzKJqKHunYtdBssuZWFUKr4HxTGioLTEbycQJ86xASh7FNsXNqWjJY_PZaGlVOPI3u-kzr0rg7JtT7i78MYrLXQI3Y2YREA_q6T4xtgcnlH</recordid><startdate>19950201</startdate><enddate>19950201</enddate><creator>Byeong-Soo Jeong</creator><creator>Omiecinski, E.</creator><general>IEEE</general><general>IEEE Computer Society</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>19950201</creationdate><title>Inverted file partitioning schemes in multiple disk systems</title><author>Byeong-Soo Jeong ; Omiecinski, E.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c366t-fd891040ca7cb1306175d802507d813f3cd47fb645384cd64a3a7afc6ee827fa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1995</creationdate><topic>Applied sciences</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. 
User interface</topic><topic>Exact sciences and technology</topic><topic>File systems</topic><topic>Frequency</topic><topic>Information retrieval</topic><topic>Load management</topic><topic>Message passing</topic><topic>Multiprocessing systems</topic><topic>Parallel architectures</topic><topic>Scalability</topic><topic>Software</topic><topic>Spatial databases</topic><topic>System performance</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Byeong-Soo Jeong</creatorcontrib><creatorcontrib>Omiecinski, E.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on parallel and distributed systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Byeong-Soo Jeong</au><au>Omiecinski, E.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Inverted file partitioning schemes in multiple disk systems</atitle><jtitle>IEEE transactions on parallel and distributed systems</jtitle><stitle>TPDS</stitle><date>1995-02-01</date><risdate>1995</risdate><volume>6</volume><issue>2</issue><spage>142</spage><epage>153</epage><pages>142-153</pages><issn>1045-9219</issn><eissn>1558-2183</eissn><coden>ITDSEO</coden><abstract>Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. 
When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.&lt; &gt;</abstract><cop>Los Alamitos, CA</cop><pub>IEEE</pub><doi>10.1109/71.342125</doi><tpages>12</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1045-9219
ispartof IEEE transactions on parallel and distributed systems, 1995-02, Vol.6 (2), p.142-153
issn 1045-9219
1558-2183
language eng
recordid cdi_ieee_primary_342125
source IEEE Electronic Library (IEL)
subjects Applied sciences
Computer science; control theory; systems
Computer systems and distributed systems. User interface
Exact sciences and technology
File systems
Frequency
Information retrieval
Load management
Message passing
Multiprocessing systems
Parallel architectures
Scalability
Software
Spatial databases
System performance
title Inverted file partitioning schemes in multiple disk systems
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T10%3A26%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Inverted%20file%20partitioning%20schemes%20in%20multiple%20disk%20systems&rft.jtitle=IEEE%20transactions%20on%20parallel%20and%20distributed%20systems&rft.au=Byeong-Soo%20Jeong&rft.date=1995-02-01&rft.volume=6&rft.issue=2&rft.spage=142&rft.epage=153&rft.pages=142-153&rft.issn=1045-9219&rft.eissn=1558-2183&rft.coden=ITDSEO&rft_id=info:doi/10.1109/71.342125&rft_dat=%3Cproquest_RIE%3E28493559%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=23783920&rft_id=info:pmid/&rft_ieee_id=342125&rfr_iscdi=true