A multi-modal approach towards mining social media data during natural disasters -- a case study of Hurricane Irma

Streaming social media provides a real-time glimpse of extreme weather impacts. However, the volume of streaming data makes mining information a challenge for emergency managers, policy makers, and disciplinary scientists. Here we explore the effectiveness of data learned approaches to mine and filt...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mohanty, Somya D, Biggers, Brown, Sayedahmed, Saed, Pourebrahim, Nastaran, Goldstein, Evan B, Bunch, Rick, Chi, Guangqing, Sadri, Fereidoon, McCoy, Tom P, Cosby, Arthur
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Mohanty, Somya D
Biggers, Brown
Sayedahmed, Saed
Pourebrahim, Nastaran
Goldstein, Evan B
Bunch, Rick
Chi, Guangqing
Sadri, Fereidoon
McCoy, Tom P
Cosby, Arthur
description Streaming social media provides a real-time glimpse of extreme weather impacts. However, the volume of streaming data makes mining information a challenge for emergency managers, policy makers, and disciplinary scientists. Here we explore the effectiveness of data learned approaches to mine and filter information from streaming social media data from Hurricane Irma's landfall in Florida, USA. We use 54,383 Twitter messages (out of 784K geolocated messages) from 16,598 users from Sept. 10 - 12, 2017 to develop 4 independent models to filter data for relevance: 1) a geospatial model based on forcing conditions at the place and time of each tweet, 2) an image classification model for tweets that include images, 3) a user model to predict the reliability of the tweeter, and 4) a text model to determine if the text is related to Hurricane Irma. All four models are independently tested, and can be combined to quickly filter and visualize tweets based on user-defined thresholds for each submodel. We envision that this type of filtering and visualization routine can be useful as a base model for data capture from noisy sources such as Twitter. The data can then be subsequently used by policy makers, environmental managers, emergency managers, and domain scientists interested in finding tweets with specific attributes to use during different stages of the disaster (e.g., preparedness, response, and recovery), or for detailed research.
doi_str_mv 10.48550/arxiv.2101.00480
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2101_00480</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2101_00480</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-e40ba1550eddfdff4c9146aacc7aa5e329ecb3103a1121b3ec2aa2abf657e96c3</originalsourceid><addsrcrecordid>eNotj8tOwzAURLNhgQofwKr3B1zsOI9mWVVAK1Vi03008QMsxUlkO0D_nrSwmVkcaTQny54E3xTbsuTPCD_ua5MLLjacF1t-n4Ud-blPjvlRoydMUxihPimN3wg6kneDGz4ojsot2BvtQBppiTlcwYA0h4VoFxGTCZEYI5BCNBTTrC80WjrMITiFwdAxeDxkdxZ9NI__vcrOry_n_YGd3t-O-92Joao5MwXvIJbTRmurrS1UI4oKUKoGSiPzxqhOCi4hRC46aVQO5OhsVdamqZRcZeu_2Zt0OwXnES7tVb69yctfqRVWvQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A multi-modal approach towards mining social media data during natural disasters -- a case study of Hurricane Irma</title><source>arXiv.org</source><creator>Mohanty, Somya D ; Biggers, Brown ; Sayedahmed, Saed ; Pourebrahim, Nastaran ; Goldstein, Evan B ; Bunch, Rick ; Chi, Guangqing ; Sadri, Fereidoon ; McCoy, Tom P ; Cosby, Arthur</creator><creatorcontrib>Mohanty, Somya D ; Biggers, Brown ; Sayedahmed, Saed ; Pourebrahim, Nastaran ; Goldstein, Evan B ; Bunch, Rick ; Chi, Guangqing ; Sadri, Fereidoon ; McCoy, Tom P ; Cosby, Arthur</creatorcontrib><description>Streaming social media provides a real-time glimpse of extreme weather impacts. However, the volume of streaming data makes mining information a challenge for emergency managers, policy makers, and disciplinary scientists. Here we explore the effectiveness of data learned approaches to mine and filter information from streaming social media data from Hurricane Irma's landfall in Florida, USA. We use 54,383 Twitter messages (out of 784K geolocated messages) from 16,598 users from Sept. 10 - 12, 2017 to develop 4 independent models to filter data for relevance: 1) a geospatial model based on forcing conditions at the place and time of each tweet, 2) an image classification model for tweets that include images, 3) a user model to predict the reliability of the tweeter, and 4) a text model to determine if the text is related to Hurricane Irma. All four models are independently tested, and can be combined to quickly filter and visualize tweets based on user-defined thresholds for each submodel. We envision that this type of filtering and visualization routine can be useful as a base model for data capture from noisy sources such as Twitter. The data can then be subsequently used by policy makers, environmental managers, emergency managers, and domain scientists interested in finding tweets with specific attributes to use during different stages of the disaster (e.g., preparedness, response, and recovery), or for detailed research.</description><identifier>DOI: 10.48550/arxiv.2101.00480</identifier><language>eng</language><subject>Computer Science - Information Retrieval ; Computer Science - Learning ; Computer Science - Social and Information Networks</subject><creationdate>2021-01</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2101.00480$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2101.00480$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Mohanty, Somya D</creatorcontrib><creatorcontrib>Biggers, Brown</creatorcontrib><creatorcontrib>Sayedahmed, Saed</creatorcontrib><creatorcontrib>Pourebrahim, Nastaran</creatorcontrib><creatorcontrib>Goldstein, Evan B</creatorcontrib><creatorcontrib>Bunch, Rick</creatorcontrib><creatorcontrib>Chi, Guangqing</creatorcontrib><creatorcontrib>Sadri, Fereidoon</creatorcontrib><creatorcontrib>McCoy, Tom P</creatorcontrib><creatorcontrib>Cosby, Arthur</creatorcontrib><title>A multi-modal approach towards mining social media data during natural disasters -- a case study of Hurricane Irma</title><description>Streaming social media provides a real-time glimpse of extreme weather impacts. However, the volume of streaming data makes mining information a challenge for emergency managers, policy makers, and disciplinary scientists. Here we explore the effectiveness of data learned approaches to mine and filter information from streaming social media data from Hurricane Irma's landfall in Florida, USA. We use 54,383 Twitter messages (out of 784K geolocated messages) from 16,598 users from Sept. 10 - 12, 2017 to develop 4 independent models to filter data for relevance: 1) a geospatial model based on forcing conditions at the place and time of each tweet, 2) an image classification model for tweets that include images, 3) a user model to predict the reliability of the tweeter, and 4) a text model to determine if the text is related to Hurricane Irma. All four models are independently tested, and can be combined to quickly filter and visualize tweets based on user-defined thresholds for each submodel. We envision that this type of filtering and visualization routine can be useful as a base model for data capture from noisy sources such as Twitter. The data can then be subsequently used by policy makers, environmental managers, emergency managers, and domain scientists interested in finding tweets with specific attributes to use during different stages of the disaster (e.g., preparedness, response, and recovery), or for detailed research.</description><subject>Computer Science - Information Retrieval</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Social and Information Networks</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAURLNhgQofwKr3B1zsOI9mWVVAK1Vi03008QMsxUlkO0D_nrSwmVkcaTQny54E3xTbsuTPCD_ua5MLLjacF1t-n4Ud-blPjvlRoydMUxihPimN3wg6kneDGz4ojsot2BvtQBppiTlcwYA0h4VoFxGTCZEYI5BCNBTTrC80WjrMITiFwdAxeDxkdxZ9NI__vcrOry_n_YGd3t-O-92Joao5MwXvIJbTRmurrS1UI4oKUKoGSiPzxqhOCi4hRC46aVQO5OhsVdamqZRcZeu_2Zt0OwXnES7tVb69yctfqRVWvQ</recordid><startdate>20210102</startdate><enddate>20210102</enddate><creator>Mohanty, Somya D</creator><creator>Biggers, Brown</creator><creator>Sayedahmed, Saed</creator><creator>Pourebrahim, Nastaran</creator><creator>Goldstein, Evan B</creator><creator>Bunch, Rick</creator><creator>Chi, Guangqing</creator><creator>Sadri, Fereidoon</creator><creator>McCoy, Tom P</creator><creator>Cosby, Arthur</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210102</creationdate><title>A multi-modal approach towards mining social media data during natural disasters -- a case study of Hurricane Irma</title><author>Mohanty, Somya D ; Biggers, Brown ; Sayedahmed, Saed ; Pourebrahim, Nastaran ; Goldstein, Evan B ; Bunch, Rick ; Chi, Guangqing ; Sadri, Fereidoon ; McCoy, Tom P ; Cosby, Arthur</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-e40ba1550eddfdff4c9146aacc7aa5e329ecb3103a1121b3ec2aa2abf657e96c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Information Retrieval</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Social and Information Networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Mohanty, Somya D</creatorcontrib><creatorcontrib>Biggers, Brown</creatorcontrib><creatorcontrib>Sayedahmed, Saed</creatorcontrib><creatorcontrib>Pourebrahim, Nastaran</creatorcontrib><creatorcontrib>Goldstein, Evan B</creatorcontrib><creatorcontrib>Bunch, Rick</creatorcontrib><creatorcontrib>Chi, Guangqing</creatorcontrib><creatorcontrib>Sadri, Fereidoon</creatorcontrib><creatorcontrib>McCoy, Tom P</creatorcontrib><creatorcontrib>Cosby, Arthur</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Mohanty, Somya D</au><au>Biggers, Brown</au><au>Sayedahmed, Saed</au><au>Pourebrahim, Nastaran</au><au>Goldstein, Evan B</au><au>Bunch, Rick</au><au>Chi, Guangqing</au><au>Sadri, Fereidoon</au><au>McCoy, Tom P</au><au>Cosby, Arthur</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A multi-modal approach towards mining social media data during natural disasters -- a case study of Hurricane Irma</atitle><date>2021-01-02</date><risdate>2021</risdate><abstract>Streaming social media provides a real-time glimpse of extreme weather impacts. However, the volume of streaming data makes mining information a challenge for emergency managers, policy makers, and disciplinary scientists. Here we explore the effectiveness of data learned approaches to mine and filter information from streaming social media data from Hurricane Irma's landfall in Florida, USA. We use 54,383 Twitter messages (out of 784K geolocated messages) from 16,598 users from Sept. 10 - 12, 2017 to develop 4 independent models to filter data for relevance: 1) a geospatial model based on forcing conditions at the place and time of each tweet, 2) an image classification model for tweets that include images, 3) a user model to predict the reliability of the tweeter, and 4) a text model to determine if the text is related to Hurricane Irma. All four models are independently tested, and can be combined to quickly filter and visualize tweets based on user-defined thresholds for each submodel. We envision that this type of filtering and visualization routine can be useful as a base model for data capture from noisy sources such as Twitter. The data can then be subsequently used by policy makers, environmental managers, emergency managers, and domain scientists interested in finding tweets with specific attributes to use during different stages of the disaster (e.g., preparedness, response, and recovery), or for detailed research.</abstract><doi>10.48550/arxiv.2101.00480</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2101.00480
ispartof
issn
language eng
recordid cdi_arxiv_primary_2101_00480
source arXiv.org
subjects Computer Science - Information Retrieval
Computer Science - Learning
Computer Science - Social and Information Networks
title A multi-modal approach towards mining social media data during natural disasters -- a case study of Hurricane Irma
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T14%3A35%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20multi-modal%20approach%20towards%20mining%20social%20media%20data%20during%20natural%20disasters%20--%20a%20case%20study%20of%20Hurricane%20Irma&rft.au=Mohanty,%20Somya%20D&rft.date=2021-01-02&rft_id=info:doi/10.48550/arxiv.2101.00480&rft_dat=%3Carxiv_GOX%3E2101_00480%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true