Maximum interpolable gap length in missing smartphone-based GPS mobility data
Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation a...
Gespeichert in:
Veröffentlicht in: | Transportation (Dordrecht) 2024-02, Vol.51 (1), p.297-327 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 327 |
---|---|
container_issue | 1 |
container_start_page | 297 |
container_title | Transportation (Dordrecht) |
container_volume | 51 |
creator | McCool, Danielle Lugtig, Peter Schouten, Barry |
description | Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors. |
doi_str_mv | 10.1007/s11116-022-10328-2 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2910728898</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2910728898</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-405f7d5b0722cd721934618eb99e028668d6ee1b718b42e0a2a4368554151f153</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwFPAc3SSTXazRynaCi0K6jkk3ew2Zf-ZpGC_vdEVvDmXgeG9NzM_hK4p3FKA4i7QVDkBxgiFjEnCTtCMioKRkmfiFM0AeEk4l_IcXYSwBwBBBZ2hzUZ_uu7QYddH68eh1aa1uNEjbm3fxF2a486F4PoGh077OO6G3hKjg63w8uUVd4NxrYtHXOmoL9FZrdtgr377HL0_PrwtVmT9vHxa3K_JNuNFJBxEXVTCQMHYtioYLTOeU2lNWVpgMs9llVtLTUGl4cyCZppnuRSCp5trKrI5uplyRz98HGyIaj8cfJ9WKlbSFCtlKZOKTaqtH0Lwtlajd-mHo6KgvrGpCZtK2NQPNsWSKZtMIYn7xvq_6H9cX5lcbmQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2910728898</pqid></control><display><type>article</type><title>Maximum interpolable gap length in missing smartphone-based GPS mobility data</title><source>Springer Nature - Complete Springer Journals</source><creator>McCool, Danielle ; Lugtig, Peter ; Schouten, Barry</creator><creatorcontrib>McCool, Danielle ; Lugtig, Peter ; Schouten, Barry</creatorcontrib><description>Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.</description><identifier>ISSN: 0049-4488</identifier><identifier>EISSN: 1572-9435</identifier><identifier>DOI: 10.1007/s11116-022-10328-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Bias ; Economic Geography ; Economics ; Economics and Finance ; Engineering Economics ; Innovation/Technology Management ; Interpolation ; Logistics ; Marketing ; Missing data ; Mobility ; Organization ; Regional/Spatial Science ; Segmentation ; Smartphones ; Sparsity ; Spatial aspects ; Time ; Travel</subject><ispartof>Transportation (Dordrecht), 2024-02, Vol.51 (1), p.297-327</ispartof><rights>The Author(s) 2022</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c347t-405f7d5b0722cd721934618eb99e028668d6ee1b718b42e0a2a4368554151f153</cites><orcidid>0000-0002-7055-7539</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11116-022-10328-2$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11116-022-10328-2$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,27911,27912,41475,42544,51306</link.rule.ids></links><search><creatorcontrib>McCool, Danielle</creatorcontrib><creatorcontrib>Lugtig, Peter</creatorcontrib><creatorcontrib>Schouten, Barry</creatorcontrib><title>Maximum interpolable gap length in missing smartphone-based GPS mobility data</title><title>Transportation (Dordrecht)</title><addtitle>Transportation</addtitle><description>Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.</description><subject>Bias</subject><subject>Economic Geography</subject><subject>Economics</subject><subject>Economics and Finance</subject><subject>Engineering Economics</subject><subject>Innovation/Technology Management</subject><subject>Interpolation</subject><subject>Logistics</subject><subject>Marketing</subject><subject>Missing data</subject><subject>Mobility</subject><subject>Organization</subject><subject>Regional/Spatial Science</subject><subject>Segmentation</subject><subject>Smartphones</subject><subject>Sparsity</subject><subject>Spatial aspects</subject><subject>Time</subject><subject>Travel</subject><issn>0049-4488</issn><issn>1572-9435</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwFPAc3SSTXazRynaCi0K6jkk3ew2Zf-ZpGC_vdEVvDmXgeG9NzM_hK4p3FKA4i7QVDkBxgiFjEnCTtCMioKRkmfiFM0AeEk4l_IcXYSwBwBBBZ2hzUZ_uu7QYddH68eh1aa1uNEjbm3fxF2a486F4PoGh077OO6G3hKjg63w8uUVd4NxrYtHXOmoL9FZrdtgr377HL0_PrwtVmT9vHxa3K_JNuNFJBxEXVTCQMHYtioYLTOeU2lNWVpgMs9llVtLTUGl4cyCZppnuRSCp5trKrI5uplyRz98HGyIaj8cfJ9WKlbSFCtlKZOKTaqtH0Lwtlajd-mHo6KgvrGpCZtK2NQPNsWSKZtMIYn7xvq_6H9cX5lcbmQ</recordid><startdate>20240201</startdate><enddate>20240201</enddate><creator>McCool, Danielle</creator><creator>Lugtig, Peter</creator><creator>Schouten, Barry</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8BJ</scope><scope>8FD</scope><scope>C1K</scope><scope>FQK</scope><scope>FR3</scope><scope>JBE</scope><scope>KR7</scope><scope>SOI</scope><orcidid>https://orcid.org/0000-0002-7055-7539</orcidid></search><sort><creationdate>20240201</creationdate><title>Maximum interpolable gap length in missing smartphone-based GPS mobility data</title><author>McCool, Danielle ; Lugtig, Peter ; Schouten, Barry</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-405f7d5b0722cd721934618eb99e028668d6ee1b718b42e0a2a4368554151f153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Bias</topic><topic>Economic Geography</topic><topic>Economics</topic><topic>Economics and Finance</topic><topic>Engineering Economics</topic><topic>Innovation/Technology Management</topic><topic>Interpolation</topic><topic>Logistics</topic><topic>Marketing</topic><topic>Missing data</topic><topic>Mobility</topic><topic>Organization</topic><topic>Regional/Spatial Science</topic><topic>Segmentation</topic><topic>Smartphones</topic><topic>Sparsity</topic><topic>Spatial aspects</topic><topic>Time</topic><topic>Travel</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>McCool, Danielle</creatorcontrib><creatorcontrib>Lugtig, Peter</creatorcontrib><creatorcontrib>Schouten, Barry</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>International Bibliography of the Social Sciences</collection><collection>Engineering Research Database</collection><collection>International Bibliography of the Social Sciences</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Transportation (Dordrecht)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>McCool, Danielle</au><au>Lugtig, Peter</au><au>Schouten, Barry</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Maximum interpolable gap length in missing smartphone-based GPS mobility data</atitle><jtitle>Transportation (Dordrecht)</jtitle><stitle>Transportation</stitle><date>2024-02-01</date><risdate>2024</risdate><volume>51</volume><issue>1</issue><spage>297</spage><epage>327</epage><pages>297-327</pages><issn>0049-4488</issn><eissn>1572-9435</eissn><abstract>Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11116-022-10328-2</doi><tpages>31</tpages><orcidid>https://orcid.org/0000-0002-7055-7539</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0049-4488 |
ispartof | Transportation (Dordrecht), 2024-02, Vol.51 (1), p.297-327 |
issn | 0049-4488 1572-9435 |
language | eng |
recordid | cdi_proquest_journals_2910728898 |
source | Springer Nature - Complete Springer Journals |
subjects | Bias Economic Geography Economics Economics and Finance Engineering Economics Innovation/Technology Management Interpolation Logistics Marketing Missing data Mobility Organization Regional/Spatial Science Segmentation Smartphones Sparsity Spatial aspects Time Travel |
title | Maximum interpolable gap length in missing smartphone-based GPS mobility data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T20%3A45%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Maximum%20interpolable%20gap%20length%20in%20missing%20smartphone-based%20GPS%20mobility%20data&rft.jtitle=Transportation%20(Dordrecht)&rft.au=McCool,%20Danielle&rft.date=2024-02-01&rft.volume=51&rft.issue=1&rft.spage=297&rft.epage=327&rft.pages=297-327&rft.issn=0049-4488&rft.eissn=1572-9435&rft_id=info:doi/10.1007/s11116-022-10328-2&rft_dat=%3Cproquest_cross%3E2910728898%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2910728898&rft_id=info:pmid/&rfr_iscdi=true |