Using Geographic Location-based Public Health Features in Survival Analysis

Time elapsed till an event of interest is often modeled using the survival analysis methodology, which estimates a survival score based on the input features. There is a resurgence of interest in developing more accurate prediction models for time-to-event prediction in personalized healthcare using...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-04
Hauptverfasser: Seidi, Navid, Tripathy, Ardhendu, Das, Sajal K
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Seidi, Navid
Tripathy, Ardhendu
Das, Sajal K
description Time elapsed till an event of interest is often modeled using the survival analysis methodology, which estimates a survival score based on the input features. There is a resurgence of interest in developing more accurate prediction models for time-to-event prediction in personalized healthcare using modern tools such as neural networks. Higher quality features and more frequent observations improve the predictions for a patient, however, the impact of including a patient's geographic location-based public health statistics on individual predictions has not been studied. This paper proposes a complementary improvement to survival analysis models by incorporating public health statistics in the input features. We show that including geographic location-based public health information results in a statistically significant improvement in the concordance index evaluated on the Surveillance, Epidemiology, and End Results (SEER) dataset containing nationwide cancer incidence data. The improvement holds for both the standard Cox proportional hazards model and the state-of-the-art Deep Survival Machines model. Our results indicate the utility of geographic location-based public health features in survival analysis.
doi_str_mv 10.48550/arxiv.2304.07679
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2304_07679</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2802661255</sourcerecordid><originalsourceid>FETCH-LOGICAL-a525-1f9fb54eea908d7dfc074e5a2152b4aea0f713f01f8b9f556a6638b5c153deec3</originalsourceid><addsrcrecordid>eNotz1tLw0AQBeBFECy1P8AnF3xO3Esml8dSbCsWFKzPYZLMtltiEneTYv-9aevTgcNhmI-xBynCKAUQz-h-7TFUWkShSOIku2ETpbUM0kipOzbz_iCEUHGiAPSEvX152-z4itqdw25vS75pS-xt2wQFeqr4x1DUY7smrPs9XxL2gyPPbcM_B3e0R6z5vMH65K2_Z7cGa0-z_5yy7fJlu1gHm_fV62K-CRAUBNJkpoCICDORVkllSpFEBKgkqCJCQmESqY2QJi0yAxBjHOu0gFKCrohKPWWP17MXad45-43ulJ_F-UU8Lp6ui861PwP5Pj-0gxu_9LlKR3ssz_g_ePpZSw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2802661255</pqid></control><display><type>article</type><title>Using Geographic Location-based Public Health Features in Survival Analysis</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Seidi, Navid ; Tripathy, Ardhendu ; Das, Sajal K</creator><creatorcontrib>Seidi, Navid ; Tripathy, Ardhendu ; Das, Sajal K</creatorcontrib><description>Time elapsed till an event of interest is often modeled using the survival analysis methodology, which estimates a survival score based on the input features. There is a resurgence of interest in developing more accurate prediction models for time-to-event prediction in personalized healthcare using modern tools such as neural networks. Higher quality features and more frequent observations improve the predictions for a patient, however, the impact of including a patient's geographic location-based public health statistics on individual predictions has not been studied. This paper proposes a complementary improvement to survival analysis models by incorporating public health statistics in the input features. We show that including geographic location-based public health information results in a statistically significant improvement in the concordance index evaluated on the Surveillance, Epidemiology, and End Results (SEER) dataset containing nationwide cancer incidence data. The improvement holds for both the standard Cox proportional hazards model and the state-of-the-art Deep Survival Machines model. Our results indicate the utility of geographic location-based public health features in survival analysis.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2304.07679</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computer Science - Learning ; Epidemiology ; Geographical locations ; Neural networks ; Prediction models ; Public health ; Statistical models ; Statistics - Applications ; Survival ; Survival analysis</subject><ispartof>arXiv.org, 2023-04</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,785,886,27927</link.rule.ids><backlink>$$Uhttps://doi.org/10.1145/3580252.3586972$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2304.07679$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Seidi, Navid</creatorcontrib><creatorcontrib>Tripathy, Ardhendu</creatorcontrib><creatorcontrib>Das, Sajal K</creatorcontrib><title>Using Geographic Location-based Public Health Features in Survival Analysis</title><title>arXiv.org</title><description>Time elapsed till an event of interest is often modeled using the survival analysis methodology, which estimates a survival score based on the input features. There is a resurgence of interest in developing more accurate prediction models for time-to-event prediction in personalized healthcare using modern tools such as neural networks. Higher quality features and more frequent observations improve the predictions for a patient, however, the impact of including a patient's geographic location-based public health statistics on individual predictions has not been studied. This paper proposes a complementary improvement to survival analysis models by incorporating public health statistics in the input features. We show that including geographic location-based public health information results in a statistically significant improvement in the concordance index evaluated on the Surveillance, Epidemiology, and End Results (SEER) dataset containing nationwide cancer incidence data. The improvement holds for both the standard Cox proportional hazards model and the state-of-the-art Deep Survival Machines model. Our results indicate the utility of geographic location-based public health features in survival analysis.</description><subject>Computer Science - Learning</subject><subject>Epidemiology</subject><subject>Geographical locations</subject><subject>Neural networks</subject><subject>Prediction models</subject><subject>Public health</subject><subject>Statistical models</subject><subject>Statistics - Applications</subject><subject>Survival</subject><subject>Survival analysis</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotz1tLw0AQBeBFECy1P8AnF3xO3Esml8dSbCsWFKzPYZLMtltiEneTYv-9aevTgcNhmI-xBynCKAUQz-h-7TFUWkShSOIku2ETpbUM0kipOzbz_iCEUHGiAPSEvX152-z4itqdw25vS75pS-xt2wQFeqr4x1DUY7smrPs9XxL2gyPPbcM_B3e0R6z5vMH65K2_Z7cGa0-z_5yy7fJlu1gHm_fV62K-CRAUBNJkpoCICDORVkllSpFEBKgkqCJCQmESqY2QJi0yAxBjHOu0gFKCrohKPWWP17MXad45-43ulJ_F-UU8Lp6ui861PwP5Pj-0gxu_9LlKR3ssz_g_ePpZSw</recordid><startdate>20230416</startdate><enddate>20230416</enddate><creator>Seidi, Navid</creator><creator>Tripathy, Ardhendu</creator><creator>Das, Sajal K</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20230416</creationdate><title>Using Geographic Location-based Public Health Features in Survival Analysis</title><author>Seidi, Navid ; Tripathy, Ardhendu ; Das, Sajal K</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a525-1f9fb54eea908d7dfc074e5a2152b4aea0f713f01f8b9f556a6638b5c153deec3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Learning</topic><topic>Epidemiology</topic><topic>Geographical locations</topic><topic>Neural networks</topic><topic>Prediction models</topic><topic>Public health</topic><topic>Statistical models</topic><topic>Statistics - Applications</topic><topic>Survival</topic><topic>Survival analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>Seidi, Navid</creatorcontrib><creatorcontrib>Tripathy, Ardhendu</creatorcontrib><creatorcontrib>Das, Sajal K</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Seidi, Navid</au><au>Tripathy, Ardhendu</au><au>Das, Sajal K</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Using Geographic Location-based Public Health Features in Survival Analysis</atitle><jtitle>arXiv.org</jtitle><date>2023-04-16</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Time elapsed till an event of interest is often modeled using the survival analysis methodology, which estimates a survival score based on the input features. There is a resurgence of interest in developing more accurate prediction models for time-to-event prediction in personalized healthcare using modern tools such as neural networks. Higher quality features and more frequent observations improve the predictions for a patient, however, the impact of including a patient's geographic location-based public health statistics on individual predictions has not been studied. This paper proposes a complementary improvement to survival analysis models by incorporating public health statistics in the input features. We show that including geographic location-based public health information results in a statistically significant improvement in the concordance index evaluated on the Surveillance, Epidemiology, and End Results (SEER) dataset containing nationwide cancer incidence data. The improvement holds for both the standard Cox proportional hazards model and the state-of-the-art Deep Survival Machines model. Our results indicate the utility of geographic location-based public health features in survival analysis.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2304.07679</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-04
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2304_07679
source arXiv.org; Free E- Journals
subjects Computer Science - Learning
Epidemiology
Geographical locations
Neural networks
Prediction models
Public health
Statistical models
Statistics - Applications
Survival
Survival analysis
title Using Geographic Location-based Public Health Features in Survival Analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T23%3A54%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20Geographic%20Location-based%20Public%20Health%20Features%20in%20Survival%20Analysis&rft.jtitle=arXiv.org&rft.au=Seidi,%20Navid&rft.date=2023-04-16&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2304.07679&rft_dat=%3Cproquest_arxiv%3E2802661255%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2802661255&rft_id=info:pmid/&rfr_iscdi=true