Maximum interpolable gap length in missing smartphone-based GPS mobility data

Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Transportation (Dordrecht) 2024-02, Vol.51 (1), p.297-327
Hauptverfasser: McCool, Danielle, Lugtig, Peter, Schouten, Barry
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 327
container_issue 1
container_start_page 297
container_title Transportation (Dordrecht)
container_volume 51
creator McCool, Danielle
Lugtig, Peter
Schouten, Barry
description Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.
doi_str_mv 10.1007/s11116-022-10328-2
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2910728898</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2910728898</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-405f7d5b0722cd721934618eb99e028668d6ee1b718b42e0a2a4368554151f153</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwFPAc3SSTXazRynaCi0K6jkk3ew2Zf-ZpGC_vdEVvDmXgeG9NzM_hK4p3FKA4i7QVDkBxgiFjEnCTtCMioKRkmfiFM0AeEk4l_IcXYSwBwBBBZ2hzUZ_uu7QYddH68eh1aa1uNEjbm3fxF2a486F4PoGh077OO6G3hKjg63w8uUVd4NxrYtHXOmoL9FZrdtgr377HL0_PrwtVmT9vHxa3K_JNuNFJBxEXVTCQMHYtioYLTOeU2lNWVpgMs9llVtLTUGl4cyCZppnuRSCp5trKrI5uplyRz98HGyIaj8cfJ9WKlbSFCtlKZOKTaqtH0Lwtlajd-mHo6KgvrGpCZtK2NQPNsWSKZtMIYn7xvq_6H9cX5lcbmQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2910728898</pqid></control><display><type>article</type><title>Maximum interpolable gap length in missing smartphone-based GPS mobility data</title><source>Springer Nature - Complete Springer Journals</source><creator>McCool, Danielle ; Lugtig, Peter ; Schouten, Barry</creator><creatorcontrib>McCool, Danielle ; Lugtig, Peter ; Schouten, Barry</creatorcontrib><description>Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.</description><identifier>ISSN: 0049-4488</identifier><identifier>EISSN: 1572-9435</identifier><identifier>DOI: 10.1007/s11116-022-10328-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Bias ; Economic Geography ; Economics ; Economics and Finance ; Engineering Economics ; Innovation/Technology Management ; Interpolation ; Logistics ; Marketing ; Missing data ; Mobility ; Organization ; Regional/Spatial Science ; Segmentation ; Smartphones ; Sparsity ; Spatial aspects ; Time ; Travel</subject><ispartof>Transportation (Dordrecht), 2024-02, Vol.51 (1), p.297-327</ispartof><rights>The Author(s) 2022</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c347t-405f7d5b0722cd721934618eb99e028668d6ee1b718b42e0a2a4368554151f153</cites><orcidid>0000-0002-7055-7539</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11116-022-10328-2$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11116-022-10328-2$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,27911,27912,41475,42544,51306</link.rule.ids></links><search><creatorcontrib>McCool, Danielle</creatorcontrib><creatorcontrib>Lugtig, Peter</creatorcontrib><creatorcontrib>Schouten, Barry</creatorcontrib><title>Maximum interpolable gap length in missing smartphone-based GPS mobility data</title><title>Transportation (Dordrecht)</title><addtitle>Transportation</addtitle><description>Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.</description><subject>Bias</subject><subject>Economic Geography</subject><subject>Economics</subject><subject>Economics and Finance</subject><subject>Engineering Economics</subject><subject>Innovation/Technology Management</subject><subject>Interpolation</subject><subject>Logistics</subject><subject>Marketing</subject><subject>Missing data</subject><subject>Mobility</subject><subject>Organization</subject><subject>Regional/Spatial Science</subject><subject>Segmentation</subject><subject>Smartphones</subject><subject>Sparsity</subject><subject>Spatial aspects</subject><subject>Time</subject><subject>Travel</subject><issn>0049-4488</issn><issn>1572-9435</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwFPAc3SSTXazRynaCi0K6jkk3ew2Zf-ZpGC_vdEVvDmXgeG9NzM_hK4p3FKA4i7QVDkBxgiFjEnCTtCMioKRkmfiFM0AeEk4l_IcXYSwBwBBBZ2hzUZ_uu7QYddH68eh1aa1uNEjbm3fxF2a486F4PoGh077OO6G3hKjg63w8uUVd4NxrYtHXOmoL9FZrdtgr377HL0_PrwtVmT9vHxa3K_JNuNFJBxEXVTCQMHYtioYLTOeU2lNWVpgMs9llVtLTUGl4cyCZppnuRSCp5trKrI5uplyRz98HGyIaj8cfJ9WKlbSFCtlKZOKTaqtH0Lwtlajd-mHo6KgvrGpCZtK2NQPNsWSKZtMIYn7xvq_6H9cX5lcbmQ</recordid><startdate>20240201</startdate><enddate>20240201</enddate><creator>McCool, Danielle</creator><creator>Lugtig, Peter</creator><creator>Schouten, Barry</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8BJ</scope><scope>8FD</scope><scope>C1K</scope><scope>FQK</scope><scope>FR3</scope><scope>JBE</scope><scope>KR7</scope><scope>SOI</scope><orcidid>https://orcid.org/0000-0002-7055-7539</orcidid></search><sort><creationdate>20240201</creationdate><title>Maximum interpolable gap length in missing smartphone-based GPS mobility data</title><author>McCool, Danielle ; Lugtig, Peter ; Schouten, Barry</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-405f7d5b0722cd721934618eb99e028668d6ee1b718b42e0a2a4368554151f153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Bias</topic><topic>Economic Geography</topic><topic>Economics</topic><topic>Economics and Finance</topic><topic>Engineering Economics</topic><topic>Innovation/Technology Management</topic><topic>Interpolation</topic><topic>Logistics</topic><topic>Marketing</topic><topic>Missing data</topic><topic>Mobility</topic><topic>Organization</topic><topic>Regional/Spatial Science</topic><topic>Segmentation</topic><topic>Smartphones</topic><topic>Sparsity</topic><topic>Spatial aspects</topic><topic>Time</topic><topic>Travel</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>McCool, Danielle</creatorcontrib><creatorcontrib>Lugtig, Peter</creatorcontrib><creatorcontrib>Schouten, Barry</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>International Bibliography of the Social Sciences</collection><collection>Engineering Research Database</collection><collection>International Bibliography of the Social Sciences</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Transportation (Dordrecht)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>McCool, Danielle</au><au>Lugtig, Peter</au><au>Schouten, Barry</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Maximum interpolable gap length in missing smartphone-based GPS mobility data</atitle><jtitle>Transportation (Dordrecht)</jtitle><stitle>Transportation</stitle><date>2024-02-01</date><risdate>2024</risdate><volume>51</volume><issue>1</issue><spage>297</spage><epage>327</epage><pages>297-327</pages><issn>0049-4488</issn><eissn>1572-9435</eissn><abstract>Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11116-022-10328-2</doi><tpages>31</tpages><orcidid>https://orcid.org/0000-0002-7055-7539</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0049-4488
ispartof Transportation (Dordrecht), 2024-02, Vol.51 (1), p.297-327
issn 0049-4488
1572-9435
language eng
recordid cdi_proquest_journals_2910728898
source Springer Nature - Complete Springer Journals
subjects Bias
Economic Geography
Economics
Economics and Finance
Engineering Economics
Innovation/Technology Management
Interpolation
Logistics
Marketing
Missing data
Mobility
Organization
Regional/Spatial Science
Segmentation
Smartphones
Sparsity
Spatial aspects
Time
Travel
title Maximum interpolable gap length in missing smartphone-based GPS mobility data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T20%3A45%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Maximum%20interpolable%20gap%20length%20in%20missing%20smartphone-based%20GPS%20mobility%20data&rft.jtitle=Transportation%20(Dordrecht)&rft.au=McCool,%20Danielle&rft.date=2024-02-01&rft.volume=51&rft.issue=1&rft.spage=297&rft.epage=327&rft.pages=297-327&rft.issn=0049-4488&rft.eissn=1572-9435&rft_id=info:doi/10.1007/s11116-022-10328-2&rft_dat=%3Cproquest_cross%3E2910728898%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2910728898&rft_id=info:pmid/&rfr_iscdi=true