Mining Statistically Significant Co-location and Segregation Patterns

In spatial domains, interaction between features gives rise to two types of interaction patterns: co-location and segregation patterns. Existing approaches to finding co-location patterns have several shortcomings: (1) They depend on user specified thresholds for prevalence measures; (2) they do not...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on knowledge and data engineering 2014-05, Vol.26 (5), p.1185-1199
Hauptverfasser: Barua, Sajib, Sander, Jorg
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1199
container_issue 5
container_start_page 1185
container_title IEEE transactions on knowledge and data engineering
container_volume 26
creator Barua, Sajib
Sander, Jorg
description In spatial domains, interaction between features gives rise to two types of interaction patterns: co-location and segregation patterns. Existing approaches to finding co-location patterns have several shortcomings: (1) They depend on user specified thresholds for prevalence measures; (2) they do not take spatial auto-correlation into account; and (3) they may report co-locations even if the features are randomly distributed. Segregation patterns have yet to receive much attention. In this paper, we propose a method for finding both types of interaction patterns, based on a statistical test. We introduce a new definition of co-location and segregation pattern, we propose a model for the null distribution of features so spatial auto-correlation is taken into account, and we design an algorithm for finding both co-location and segregation patterns. We also develop two strategies to reduce the computational cost compared to a naïve approach based on simulations of the data distribution, and we propose an approach to reduce the runtime of our algorithm even further by using an approximation of the neighborhood of features. We evaluate our method empirically using synthetic and real data sets and demonstrate its advantages over a state-of-the-art co-location mining algorithm.
doi_str_mv 10.1109/TKDE.2013.88
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_1549541281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6523223</ieee_id><sourcerecordid>3387986341</sourcerecordid><originalsourceid>FETCH-LOGICAL-c283t-1b25753a52191aaef89c09902096bcaf8c05d8a2634976afcc02e63c75b3a5d53</originalsourceid><addsrcrecordid>eNo9kL1PwzAQxS0EEqWwsbFEYiXFZ8eJPaISPkQRSC2z5bhO5Co4xXaH_vc4CmK6e7rf3Ts9hK4BLwCwuN-8PdYLgoEuOD9BM2CM5wQEnKYeF5AXtKjO0UUIO4wxrzjMUP1unXVdto4q2hCtVn1_zNa2c7ZNwsVsOeT9oNN0cJly22xtOm-6SX-qGI134RKdtaoP5uqvztHXU71ZvuSrj-fX5cMq14TTmENDWMWoYuNTSpmWC42FwASLstGq5RqzLVekpIWoStVqjYkpqa5Yk5a2jM7R7XR374efgwlR7oaDd8lSAisEK4BwSNTdRGk_hOBNK_fefit_lIDlGJQcg5JjUJLzhN9MuDXG_KMlI5QQSn8BnhpjUw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1549541281</pqid></control><display><type>article</type><title>Mining Statistically Significant Co-location and Segregation Patterns</title><source>IEEE Electronic Library (IEL)</source><creator>Barua, Sajib ; Sander, Jorg</creator><creatorcontrib>Barua, Sajib ; Sander, Jorg</creatorcontrib><description>In spatial domains, interaction between features gives rise to two types of interaction patterns: co-location and segregation patterns. Existing approaches to finding co-location patterns have several shortcomings: (1) They depend on user specified thresholds for prevalence measures; (2) they do not take spatial auto-correlation into account; and (3) they may report co-locations even if the features are randomly distributed. Segregation patterns have yet to receive much attention. In this paper, we propose a method for finding both types of interaction patterns, based on a statistical test. We introduce a new definition of co-location and segregation pattern, we propose a model for the null distribution of features so spatial auto-correlation is taken into account, and we design an algorithm for finding both co-location and segregation patterns. We also develop two strategies to reduce the computational cost compared to a naïve approach based on simulations of the data distribution, and we propose an approach to reduce the runtime of our algorithm even further by using an approximation of the neighborhood of features. We evaluate our method empirically using synthetic and real data sets and demonstrate its advantages over a state-of-the-art co-location mining algorithm.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2013.88</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Atmospheric measurements ; Computational modeling ; Data mining ; Data models ; Database Applications ; Database Management ; Indexes ; Information Technology and Systems ; Particle measurements ; Runtime ; Spatial databases ; Systems</subject><ispartof>IEEE transactions on knowledge and data engineering, 2014-05, Vol.26 (5), p.1185-1199</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) May 2014</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c283t-1b25753a52191aaef89c09902096bcaf8c05d8a2634976afcc02e63c75b3a5d53</citedby><cites>FETCH-LOGICAL-c283t-1b25753a52191aaef89c09902096bcaf8c05d8a2634976afcc02e63c75b3a5d53</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6523223$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6523223$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Barua, Sajib</creatorcontrib><creatorcontrib>Sander, Jorg</creatorcontrib><title>Mining Statistically Significant Co-location and Segregation Patterns</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>In spatial domains, interaction between features gives rise to two types of interaction patterns: co-location and segregation patterns. Existing approaches to finding co-location patterns have several shortcomings: (1) They depend on user specified thresholds for prevalence measures; (2) they do not take spatial auto-correlation into account; and (3) they may report co-locations even if the features are randomly distributed. Segregation patterns have yet to receive much attention. In this paper, we propose a method for finding both types of interaction patterns, based on a statistical test. We introduce a new definition of co-location and segregation pattern, we propose a model for the null distribution of features so spatial auto-correlation is taken into account, and we design an algorithm for finding both co-location and segregation patterns. We also develop two strategies to reduce the computational cost compared to a naïve approach based on simulations of the data distribution, and we propose an approach to reduce the runtime of our algorithm even further by using an approximation of the neighborhood of features. We evaluate our method empirically using synthetic and real data sets and demonstrate its advantages over a state-of-the-art co-location mining algorithm.</description><subject>Atmospheric measurements</subject><subject>Computational modeling</subject><subject>Data mining</subject><subject>Data models</subject><subject>Database Applications</subject><subject>Database Management</subject><subject>Indexes</subject><subject>Information Technology and Systems</subject><subject>Particle measurements</subject><subject>Runtime</subject><subject>Spatial databases</subject><subject>Systems</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kL1PwzAQxS0EEqWwsbFEYiXFZ8eJPaISPkQRSC2z5bhO5Co4xXaH_vc4CmK6e7rf3Ts9hK4BLwCwuN-8PdYLgoEuOD9BM2CM5wQEnKYeF5AXtKjO0UUIO4wxrzjMUP1unXVdto4q2hCtVn1_zNa2c7ZNwsVsOeT9oNN0cJly22xtOm-6SX-qGI134RKdtaoP5uqvztHXU71ZvuSrj-fX5cMq14TTmENDWMWoYuNTSpmWC42FwASLstGq5RqzLVekpIWoStVqjYkpqa5Yk5a2jM7R7XR374efgwlR7oaDd8lSAisEK4BwSNTdRGk_hOBNK_fefit_lIDlGJQcg5JjUJLzhN9MuDXG_KMlI5QQSn8BnhpjUw</recordid><startdate>20140501</startdate><enddate>20140501</enddate><creator>Barua, Sajib</creator><creator>Sander, Jorg</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20140501</creationdate><title>Mining Statistically Significant Co-location and Segregation Patterns</title><author>Barua, Sajib ; Sander, Jorg</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c283t-1b25753a52191aaef89c09902096bcaf8c05d8a2634976afcc02e63c75b3a5d53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Atmospheric measurements</topic><topic>Computational modeling</topic><topic>Data mining</topic><topic>Data models</topic><topic>Database Applications</topic><topic>Database Management</topic><topic>Indexes</topic><topic>Information Technology and Systems</topic><topic>Particle measurements</topic><topic>Runtime</topic><topic>Spatial databases</topic><topic>Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Barua, Sajib</creatorcontrib><creatorcontrib>Sander, Jorg</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Barua, Sajib</au><au>Sander, Jorg</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mining Statistically Significant Co-location and Segregation Patterns</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2014-05-01</date><risdate>2014</risdate><volume>26</volume><issue>5</issue><spage>1185</spage><epage>1199</epage><pages>1185-1199</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>In spatial domains, interaction between features gives rise to two types of interaction patterns: co-location and segregation patterns. Existing approaches to finding co-location patterns have several shortcomings: (1) They depend on user specified thresholds for prevalence measures; (2) they do not take spatial auto-correlation into account; and (3) they may report co-locations even if the features are randomly distributed. Segregation patterns have yet to receive much attention. In this paper, we propose a method for finding both types of interaction patterns, based on a statistical test. We introduce a new definition of co-location and segregation pattern, we propose a model for the null distribution of features so spatial auto-correlation is taken into account, and we design an algorithm for finding both co-location and segregation patterns. We also develop two strategies to reduce the computational cost compared to a naïve approach based on simulations of the data distribution, and we propose an approach to reduce the runtime of our algorithm even further by using an approximation of the neighborhood of features. We evaluate our method empirically using synthetic and real data sets and demonstrate its advantages over a state-of-the-art co-location mining algorithm.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TKDE.2013.88</doi><tpages>15</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1041-4347
ispartof IEEE transactions on knowledge and data engineering, 2014-05, Vol.26 (5), p.1185-1199
issn 1041-4347
1558-2191
language eng
recordid cdi_proquest_journals_1549541281
source IEEE Electronic Library (IEL)
subjects Atmospheric measurements
Computational modeling
Data mining
Data models
Database Applications
Database Management
Indexes
Information Technology and Systems
Particle measurements
Runtime
Spatial databases
Systems
title Mining Statistically Significant Co-location and Segregation Patterns
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T02%3A39%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mining%20Statistically%20Significant%20Co-location%20and%20Segregation%20Patterns&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Barua,%20Sajib&rft.date=2014-05-01&rft.volume=26&rft.issue=5&rft.spage=1185&rft.epage=1199&rft.pages=1185-1199&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2013.88&rft_dat=%3Cproquest_RIE%3E3387986341%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1549541281&rft_id=info:pmid/&rft_ieee_id=6523223&rfr_iscdi=true