RweetMiner: Automatic identification and categorization of help requests on twitter during disasters

•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based fe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2021-08, Vol.176, p.114787, Article 114787
Hauptverfasser: Ullah, Irfan, Khan, Sharifullah, Imran, Muhammad, Lee, Young-Koo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 114787
container_title Expert systems with applications
container_volume 176
creator Ullah, Irfan
Khan, Sharifullah
Imran, Muhammad
Lee, Young-Koo
description •Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter. Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.
doi_str_mv 10.1016/j.eswa.2021.114787
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2543511672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421002281</els_id><sourcerecordid>2543511672</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</originalsourceid><addsrcrecordid>eNp9UE1LAzEUDKJgrf4BTwHPW_O52RUvpfgFiiB6DunmpWZpd2uSteivN3U9e3pvhpn3hkHonJIZJbS8bGcQd2bGCKMzSoWq1AGa0ErxolQ1P0QTUktVCKrEMTqJsSWEKkLUBNmXHUB68h2EKzwfUr8xyTfYW-iSd77JqO-w6SzOK6z64L9Hqnf4HdZbHOBjgJgizlza-ZQgYDsE362w9dHEjOMpOnJmHeHsb07R2-3N6-K-eHy-e1jMH4uGsyoVSi4lBaYq5pzkpeCVkJKAEVJItyyFM6qxTjDilORLMNZylo2EmVo5qGo-RRfj3W3of1Ppth9Cl19qJgWXlJaKZRUbVU3oYwzg9Db4jQlfmhK9b1O3et-m3repxzaz6Xo0Qc7_6SHo2HjoGrA-QJO07f1_9h8VnX82</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2543511672</pqid></control><display><type>article</type><title>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</title><source>Elsevier ScienceDirect Journals</source><creator>Ullah, Irfan ; Khan, Sharifullah ; Imran, Muhammad ; Lee, Young-Koo</creator><creatorcontrib>Ullah, Irfan ; Khan, Sharifullah ; Imran, Muhammad ; Lee, Young-Koo</creatorcontrib><description>•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter. Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.114787</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Catastrophic events ; Classification ; Digital media ; Disaster relief ; Disaster response ; Intermediate data ; Intermediate results ; Machine learning ; Relief efforts ; Request tweets ; Social networking sites ; Social networks ; System effectiveness</subject><ispartof>Expert systems with applications, 2021-08, Vol.176, p.114787, Article 114787</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Aug 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</citedby><cites>FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0957417421002281$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Ullah, Irfan</creatorcontrib><creatorcontrib>Khan, Sharifullah</creatorcontrib><creatorcontrib>Imran, Muhammad</creatorcontrib><creatorcontrib>Lee, Young-Koo</creatorcontrib><title>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</title><title>Expert systems with applications</title><description>•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter. Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.</description><subject>Catastrophic events</subject><subject>Classification</subject><subject>Digital media</subject><subject>Disaster relief</subject><subject>Disaster response</subject><subject>Intermediate data</subject><subject>Intermediate results</subject><subject>Machine learning</subject><subject>Relief efforts</subject><subject>Request tweets</subject><subject>Social networking sites</subject><subject>Social networks</subject><subject>System effectiveness</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9UE1LAzEUDKJgrf4BTwHPW_O52RUvpfgFiiB6DunmpWZpd2uSteivN3U9e3pvhpn3hkHonJIZJbS8bGcQd2bGCKMzSoWq1AGa0ErxolQ1P0QTUktVCKrEMTqJsSWEKkLUBNmXHUB68h2EKzwfUr8xyTfYW-iSd77JqO-w6SzOK6z64L9Hqnf4HdZbHOBjgJgizlza-ZQgYDsE362w9dHEjOMpOnJmHeHsb07R2-3N6-K-eHy-e1jMH4uGsyoVSi4lBaYq5pzkpeCVkJKAEVJItyyFM6qxTjDilORLMNZylo2EmVo5qGo-RRfj3W3of1Ppth9Cl19qJgWXlJaKZRUbVU3oYwzg9Db4jQlfmhK9b1O3et-m3repxzaz6Xo0Qc7_6SHo2HjoGrA-QJO07f1_9h8VnX82</recordid><startdate>20210815</startdate><enddate>20210815</enddate><creator>Ullah, Irfan</creator><creator>Khan, Sharifullah</creator><creator>Imran, Muhammad</creator><creator>Lee, Young-Koo</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210815</creationdate><title>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</title><author>Ullah, Irfan ; Khan, Sharifullah ; Imran, Muhammad ; Lee, Young-Koo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Catastrophic events</topic><topic>Classification</topic><topic>Digital media</topic><topic>Disaster relief</topic><topic>Disaster response</topic><topic>Intermediate data</topic><topic>Intermediate results</topic><topic>Machine learning</topic><topic>Relief efforts</topic><topic>Request tweets</topic><topic>Social networking sites</topic><topic>Social networks</topic><topic>System effectiveness</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ullah, Irfan</creatorcontrib><creatorcontrib>Khan, Sharifullah</creatorcontrib><creatorcontrib>Imran, Muhammad</creatorcontrib><creatorcontrib>Lee, Young-Koo</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ullah, Irfan</au><au>Khan, Sharifullah</au><au>Imran, Muhammad</au><au>Lee, Young-Koo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</atitle><jtitle>Expert systems with applications</jtitle><date>2021-08-15</date><risdate>2021</risdate><volume>176</volume><spage>114787</spage><pages>114787-</pages><artnum>114787</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter. Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.114787</doi></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2021-08, Vol.176, p.114787, Article 114787
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_2543511672
source Elsevier ScienceDirect Journals
subjects Catastrophic events
Classification
Digital media
Disaster relief
Disaster response
Intermediate data
Intermediate results
Machine learning
Relief efforts
Request tweets
Social networking sites
Social networks
System effectiveness
title RweetMiner: Automatic identification and categorization of help requests on twitter during disasters
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T09%3A36%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RweetMiner:%20Automatic%20identification%20and%20categorization%20of%20help%20requests%20on%20twitter%20during%20disasters&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Ullah,%20Irfan&rft.date=2021-08-15&rft.volume=176&rft.spage=114787&rft.pages=114787-&rft.artnum=114787&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.114787&rft_dat=%3Cproquest_cross%3E2543511672%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2543511672&rft_id=info:pmid/&rft_els_id=S0957417421002281&rfr_iscdi=true