Inverted file partitioning schemes in multiple disk systems
Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on parallel and distributed systems 1995-02, Vol.6 (2), p.142-153 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 153 |
---|---|
container_issue | 2 |
container_start_page | 142 |
container_title | IEEE transactions on parallel and distributed systems |
container_volume | 6 |
creator | Byeong-Soo Jeong Omiecinski, E. |
description | Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.< > |
doi_str_mv | 10.1109/71.342125 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_342125</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>342125</ieee_id><sourcerecordid>28493559</sourcerecordid><originalsourceid>FETCH-LOGICAL-c366t-fd891040ca7cb1306175d802507d813f3cd47fb645384cd64a3a7afc6ee827fa3</originalsourceid><addsrcrecordid>eNqFkD1LBDEQhoMoeJ4WtlZbiGCxmu9ksZLDj4MDG62XXHai0f0ysyfcv3dlDy2tZmCeeV54CTll9IoxWlwbdiUkZ1ztkRlTyuacWbE_7lSqvOCsOCRHiO-UMqmonJGbZfsFaYAqC7GGrHdpiEPs2ti-ZujfoAHMYps1m3qI_QhUET8y3OIADR6Tg-BqhJPdnJOX-7vnxWO-enpYLm5XuRdaD3mobDHGU--MXzNBNTOqspQrairLRBC-kiastVTCSl9p6YQzLngNYLkJTszJxeTtU_e5ARzKJqKHunYtdBssuZWFUKr4HxTGioLTEbycQJ86xASh7FNsXNqWjJY_PZaGlVOPI3u-kzr0rg7JtT7i78MYrLXQI3Y2YREA_q6T4xtgcnlH</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>23783920</pqid></control><display><type>article</type><title>Inverted file partitioning schemes in multiple disk systems</title><source>IEEE Electronic Library (IEL)</source><creator>Byeong-Soo Jeong ; Omiecinski, E.</creator><creatorcontrib>Byeong-Soo Jeong ; Omiecinski, E.</creatorcontrib><description>Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.< ></description><identifier>ISSN: 1045-9219</identifier><identifier>EISSN: 1558-2183</identifier><identifier>DOI: 10.1109/71.342125</identifier><identifier>CODEN: ITDSEO</identifier><language>eng</language><publisher>Los Alamitos, CA: IEEE</publisher><subject>Applied sciences ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Exact sciences and technology ; File systems ; Frequency ; Information retrieval ; Load management ; Message passing ; Multiprocessing systems ; Parallel architectures ; Scalability ; Software ; Spatial databases ; System performance</subject><ispartof>IEEE transactions on parallel and distributed systems, 1995-02, Vol.6 (2), p.142-153</ispartof><rights>1995 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c366t-fd891040ca7cb1306175d802507d813f3cd47fb645384cd64a3a7afc6ee827fa3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/342125$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/342125$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=3556636$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Byeong-Soo Jeong</creatorcontrib><creatorcontrib>Omiecinski, E.</creatorcontrib><title>Inverted file partitioning schemes in multiple disk systems</title><title>IEEE transactions on parallel and distributed systems</title><addtitle>TPDS</addtitle><description>Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.< ></description><subject>Applied sciences</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Exact sciences and technology</subject><subject>File systems</subject><subject>Frequency</subject><subject>Information retrieval</subject><subject>Load management</subject><subject>Message passing</subject><subject>Multiprocessing systems</subject><subject>Parallel architectures</subject><subject>Scalability</subject><subject>Software</subject><subject>Spatial databases</subject><subject>System performance</subject><issn>1045-9219</issn><issn>1558-2183</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1995</creationdate><recordtype>article</recordtype><recordid>eNqFkD1LBDEQhoMoeJ4WtlZbiGCxmu9ksZLDj4MDG62XXHai0f0ysyfcv3dlDy2tZmCeeV54CTll9IoxWlwbdiUkZ1ztkRlTyuacWbE_7lSqvOCsOCRHiO-UMqmonJGbZfsFaYAqC7GGrHdpiEPs2ti-ZujfoAHMYps1m3qI_QhUET8y3OIADR6Tg-BqhJPdnJOX-7vnxWO-enpYLm5XuRdaD3mobDHGU--MXzNBNTOqspQrairLRBC-kiastVTCSl9p6YQzLngNYLkJTszJxeTtU_e5ARzKJqKHunYtdBssuZWFUKr4HxTGioLTEbycQJ86xASh7FNsXNqWjJY_PZaGlVOPI3u-kzr0rg7JtT7i78MYrLXQI3Y2YREA_q6T4xtgcnlH</recordid><startdate>19950201</startdate><enddate>19950201</enddate><creator>Byeong-Soo Jeong</creator><creator>Omiecinski, E.</creator><general>IEEE</general><general>IEEE Computer Society</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>19950201</creationdate><title>Inverted file partitioning schemes in multiple disk systems</title><author>Byeong-Soo Jeong ; Omiecinski, E.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c366t-fd891040ca7cb1306175d802507d813f3cd47fb645384cd64a3a7afc6ee827fa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1995</creationdate><topic>Applied sciences</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Exact sciences and technology</topic><topic>File systems</topic><topic>Frequency</topic><topic>Information retrieval</topic><topic>Load management</topic><topic>Message passing</topic><topic>Multiprocessing systems</topic><topic>Parallel architectures</topic><topic>Scalability</topic><topic>Software</topic><topic>Spatial databases</topic><topic>System performance</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Byeong-Soo Jeong</creatorcontrib><creatorcontrib>Omiecinski, E.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on parallel and distributed systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Byeong-Soo Jeong</au><au>Omiecinski, E.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Inverted file partitioning schemes in multiple disk systems</atitle><jtitle>IEEE transactions on parallel and distributed systems</jtitle><stitle>TPDS</stitle><date>1995-02-01</date><risdate>1995</risdate><volume>6</volume><issue>2</issue><spage>142</spage><epage>153</epage><pages>142-153</pages><issn>1045-9219</issn><eissn>1558-2183</eissn><coden>ITDSEO</coden><abstract>Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.< ></abstract><cop>Los Alamitos, CA</cop><pub>IEEE</pub><doi>10.1109/71.342125</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1045-9219 |
ispartof | IEEE transactions on parallel and distributed systems, 1995-02, Vol.6 (2), p.142-153 |
issn | 1045-9219 1558-2183 |
language | eng |
recordid | cdi_ieee_primary_342125 |
source | IEEE Electronic Library (IEL) |
subjects | Applied sciences Computer science control theory systems Computer systems and distributed systems. User interface Exact sciences and technology File systems Frequency Information retrieval Load management Message passing Multiprocessing systems Parallel architectures Scalability Software Spatial databases System performance |
title | Inverted file partitioning schemes in multiple disk systems |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T10%3A26%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Inverted%20file%20partitioning%20schemes%20in%20multiple%20disk%20systems&rft.jtitle=IEEE%20transactions%20on%20parallel%20and%20distributed%20systems&rft.au=Byeong-Soo%20Jeong&rft.date=1995-02-01&rft.volume=6&rft.issue=2&rft.spage=142&rft.epage=153&rft.pages=142-153&rft.issn=1045-9219&rft.eissn=1558-2183&rft.coden=ITDSEO&rft_id=info:doi/10.1109/71.342125&rft_dat=%3Cproquest_RIE%3E28493559%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=23783920&rft_id=info:pmid/&rft_ieee_id=342125&rfr_iscdi=true |