Iterative spatial join

The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the parti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on database systems 2003-09, Vol.28 (3), p.230-256
Hauptverfasser: Jacox, Edwin H, Samet, Hanan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 256
container_issue 3
container_start_page 230
container_title ACM transactions on database systems
container_volume 28
creator Jacox, Edwin H
Samet, Hanan
description The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.
doi_str_mv 10.1145/937598.937600
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_29088702</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>476406461</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</originalsourceid><addsrcrecordid>eNpdkM1LxDAQR4MouK4e9bx48NZ18p05yuLqwoIXPYc0TaGl29akFfzvjdSTMPAujx_DI-SWwpZSIR-Ra4lmm6EAzsiKSqkLoYQ4JyvgihUSqbwkVym1ACAM6hW5O0whuqn5Cps0Zrpu0w5Nf00uatelcPPHNfnYP7_vXovj28th93QsPEM5FVxpBNTgDSCvgVV1kL70Za2V465Cnc8pFkqURgldAaiAxtSA4ILXiq_Jw7I7xuFzDmmypyb50HWuD8OcLEMwRgPL4v0_sR3m2OffLEXJgQmts1Qsko9DSjHUdozNycVvS8H-JrJLIrsk4j9J91Xr</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>195302477</pqid></control><display><type>article</type><title>Iterative spatial join</title><source>ACM Digital Library Complete</source><creator>Jacox, Edwin H ; Samet, Hanan</creator><creatorcontrib>Jacox, Edwin H ; Samet, Hanan</creatorcontrib><description>The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.</description><identifier>ISSN: 0362-5915</identifier><identifier>EISSN: 1557-4644</identifier><identifier>DOI: 10.1145/937598.937600</identifier><identifier>CODEN: ATDSD3</identifier><language>eng</language><publisher>New York: Association for Computing Machinery</publisher><subject>Algorithms ; Datasets ; Information systems</subject><ispartof>ACM transactions on database systems, 2003-09, Vol.28 (3), p.230-256</ispartof><rights>Copyright Association for Computing Machinery Sep 2003</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</citedby><cites>FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Jacox, Edwin H</creatorcontrib><creatorcontrib>Samet, Hanan</creatorcontrib><title>Iterative spatial join</title><title>ACM transactions on database systems</title><description>The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.</description><subject>Algorithms</subject><subject>Datasets</subject><subject>Information systems</subject><issn>0362-5915</issn><issn>1557-4644</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><recordid>eNpdkM1LxDAQR4MouK4e9bx48NZ18p05yuLqwoIXPYc0TaGl29akFfzvjdSTMPAujx_DI-SWwpZSIR-Ra4lmm6EAzsiKSqkLoYQ4JyvgihUSqbwkVym1ACAM6hW5O0whuqn5Cps0Zrpu0w5Nf00uatelcPPHNfnYP7_vXovj28th93QsPEM5FVxpBNTgDSCvgVV1kL70Za2V465Cnc8pFkqURgldAaiAxtSA4ILXiq_Jw7I7xuFzDmmypyb50HWuD8OcLEMwRgPL4v0_sR3m2OffLEXJgQmts1Qsko9DSjHUdozNycVvS8H-JrJLIrsk4j9J91Xr</recordid><startdate>20030901</startdate><enddate>20030901</enddate><creator>Jacox, Edwin H</creator><creator>Samet, Hanan</creator><general>Association for Computing Machinery</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>7SC</scope><scope>8FD</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20030901</creationdate><title>Iterative spatial join</title><author>Jacox, Edwin H ; Samet, Hanan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-36790970c8093f02dfe5cbcbf76a3ad97d97a62eb958647d006e988f090aec763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Algorithms</topic><topic>Datasets</topic><topic>Information systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jacox, Edwin H</creatorcontrib><creatorcontrib>Samet, Hanan</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on database systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jacox, Edwin H</au><au>Samet, Hanan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Iterative spatial join</atitle><jtitle>ACM transactions on database systems</jtitle><date>2003-09-01</date><risdate>2003</risdate><volume>28</volume><issue>3</issue><spage>230</spage><epage>256</epage><pages>230-256</pages><issn>0362-5915</issn><eissn>1557-4644</eissn><coden>ATDSD3</coden><abstract>The key issue in performing spatial joins is finding the pairs of intersecting rectangles. For unindexed data sets, this is usually resolved by partitioning the data and then performing a plane sweep on the individual partitions. The resulting join can be viewed as a two-step process where the partition corresponds to a hash-based join while the plane-sweep corresponds to a sort-merge join. In this article, we look at extending the idea of the sort-merge join for one-dimensional data to multiple dimensions and introduce the Iterative Spatial Join. As with the sort-merge join, the Iterative Spatial Join is best suited to cases where the data is already sorted. However, as we show in the experiments, the Iterative Spatial Join performs well when internal memory is limited, compared to the partitioning methods. This suggests that the Iterative Spatial Join would be useful for very large data sets or in situations where internal memory is a shared resource and is therefore limited, such as with today's database engines which share internal memory amongst several queries. Furthermore, the performance of the Iterative Spatial Join is predictable and has no parameters which need to be tuned, unlike other algorithms. The Iterative Spatial Join is based on a plane sweep algorithm, which requires the entire data set to fit in internal memory. When internal memory overflows, the Iterative Spatial Join simply makes additional passes on the data, thereby exhibiting only a gradual performance degradation. To demonstrate the use and efficacy of the Iterative Spatial Join, we first examine and analyze current approaches to performing spatial joins, and then give a detailed analysis of the Iterative Spatial Join as well as present the results of extensive testing of the algorithm, including a comparison with partitioning-based spatial join methods. These tests show that the Iterative Spatial Join overcomes the performance limitations of the other algorithms for data sets of all sizes as well as differing amounts of internal memory.</abstract><cop>New York</cop><pub>Association for Computing Machinery</pub><doi>10.1145/937598.937600</doi><tpages>27</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0362-5915
ispartof ACM transactions on database systems, 2003-09, Vol.28 (3), p.230-256
issn 0362-5915
1557-4644
language eng
recordid cdi_proquest_miscellaneous_29088702
source ACM Digital Library Complete
subjects Algorithms
Datasets
Information systems
title Iterative spatial join
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T20%3A49%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Iterative%20spatial%20join&rft.jtitle=ACM%20transactions%20on%20database%20systems&rft.au=Jacox,%20Edwin%20H&rft.date=2003-09-01&rft.volume=28&rft.issue=3&rft.spage=230&rft.epage=256&rft.pages=230-256&rft.issn=0362-5915&rft.eissn=1557-4644&rft.coden=ATDSD3&rft_id=info:doi/10.1145/937598.937600&rft_dat=%3Cproquest_cross%3E476406461%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=195302477&rft_id=info:pmid/&rfr_iscdi=true