Iterative spatial join
The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the parti...
Gespeichert in:
Veröffentlicht in: | ACM transactions on database systems 2003-09, Vol.28 (3), p.230-256 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 256 |
---|---|
container_issue | 3 |
container_start_page | 230 |
container_title | ACM transactions on database systems |
container_volume | 28 |
creator | Jacox, Edwin H Samet, Hanan |
description | The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory. |
doi_str_mv | 10.1145/937598.937600 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_29088702</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>476406461</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</originalsourceid><addsrcrecordid>eNpdkM1LxDAQR4MouK4e9bx48NZ18p05yuLqwoIXPYc0TaGl29akFfzvjdSTMPAujx_DI-SWwpZSIR-Ra4lmm6EAzsiKSqkLoYQ4JyvgihUSqbwkVym1ACAM6hW5O0whuqn5Cps0Zrpu0w5Nf00uatelcPPHNfnYP7_vXovj28th93QsPEM5FVxpBNTgDSCvgVV1kL70Za2V465Cnc8pFkqURgldAaiAxtSA4ILXiq_Jw7I7xuFzDmmypyb50HWuD8OcLEMwRgPL4v0_sR3m2OffLEXJgQmts1Qsko9DSjHUdozNycVvS8H-JrJLIrsk4j9J91Xr</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>195302477</pqid></control><display><type>article</type><title>Iterative spatial join</title><source>ACM Digital Library Complete</source><creator>Jacox, Edwin H ; Samet, Hanan</creator><creatorcontrib>Jacox, Edwin H ; Samet, Hanan</creatorcontrib><description>The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.</description><identifier>ISSN: 0362-5915</identifier><identifier>EISSN: 1557-4644</identifier><identifier>DOI: 10.1145/937598.937600</identifier><identifier>CODEN: ATDSD3</identifier><language>eng</language><publisher>New York: Association for Computing Machinery</publisher><subject>Algorithms ; Datasets ; Information systems</subject><ispartof>ACM transactions on database systems, 2003-09, Vol.28 (3), p.230-256</ispartof><rights>Copyright Association for Computing Machinery Sep 2003</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</citedby><cites>FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Jacox, Edwin H</creatorcontrib><creatorcontrib>Samet, Hanan</creatorcontrib><title>Iterative spatial join</title><title>ACM transactions on database systems</title><description>The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.</description><subject>Algorithms</subject><subject>Datasets</subject><subject>Information systems</subject><issn>0362-5915</issn><issn>1557-4644</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><recordid>eNpdkM1LxDAQR4MouK4e9bx48NZ18p05yuLqwoIXPYc0TaGl29akFfzvjdSTMPAujx_DI-SWwpZSIR-Ra4lmm6EAzsiKSqkLoYQ4JyvgihUSqbwkVym1ACAM6hW5O0whuqn5Cps0Zrpu0w5Nf00uatelcPPHNfnYP7_vXovj28th93QsPEM5FVxpBNTgDSCvgVV1kL70Za2V465Cnc8pFkqURgldAaiAxtSA4ILXiq_Jw7I7xuFzDmmypyb50HWuD8OcLEMwRgPL4v0_sR3m2OffLEXJgQmts1Qsko9DSjHUdozNycVvS8H-JrJLIrsk4j9J91Xr</recordid><startdate>20030901</startdate><enddate>20030901</enddate><creator>Jacox, Edwin H</creator><creator>Samet, Hanan</creator><general>Association for Computing Machinery</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>7SC</scope><scope>8FD</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20030901</creationdate><title>Iterative spatial join</title><author>Jacox, Edwin H ; Samet, Hanan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Algorithms</topic><topic>Datasets</topic><topic>Information systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jacox, Edwin H</creatorcontrib><creatorcontrib>Samet, Hanan</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on database systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jacox, Edwin H</au><au>Samet, Hanan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Iterative spatial join</atitle><jtitle>ACM transactions on database systems</jtitle><date>2003-09-01</date><risdate>2003</risdate><volume>28</volume><issue>3</issue><spage>230</spage><epage>256</epage><pages>230-256</pages><issn>0362-5915</issn><eissn>1557-4644</eissn><coden>ATDSD3</coden><abstract>The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.</abstract><cop>New York</cop><pub>Association for Computing Machinery</pub><doi>10.1145/937598.937600</doi><tpages>27</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0362-5915 |
ispartof | ACM transactions on database systems, 2003-09, Vol.28 (3), p.230-256 |
issn | 0362-5915 1557-4644 |
language | eng |
recordid | cdi_proquest_miscellaneous_29088702 |
source | ACM Digital Library Complete |
subjects | Algorithms Datasets Information systems |
title | Iterative spatial join |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T20%3A49%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Iterative%20spatial%20join&rft.jtitle=ACM%20transactions%20on%20database%20systems&rft.au=Jacox,%20Edwin%20H&rft.date=2003-09-01&rft.volume=28&rft.issue=3&rft.spage=230&rft.epage=256&rft.pages=230-256&rft.issn=0362-5915&rft.eissn=1557-4644&rft.coden=ATDSD3&rft_id=info:doi/10.1145/937598.937600&rft_dat=%3Cproquest_cross%3E476406461%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=195302477&rft_id=info:pmid/&rfr_iscdi=true |