RweetMiner: Automatic identification and categorization of help requests on twitter during disasters
•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based fe...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2021-08, Vol.176, p.114787, Article 114787 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 114787 |
container_title | Expert systems with applications |
container_volume | 176 |
creator | Ullah, Irfan Khan, Sharifullah Imran, Muhammad Lee, Young-Koo |
description | •Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter.
Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers. |
doi_str_mv | 10.1016/j.eswa.2021.114787 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2543511672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421002281</els_id><sourcerecordid>2543511672</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</originalsourceid><addsrcrecordid>eNp9UE1LAzEUDKJgrf4BTwHPW_O52RUvpfgFiiB6DunmpWZpd2uSteivN3U9e3pvhpn3hkHonJIZJbS8bGcQd2bGCKMzSoWq1AGa0ErxolQ1P0QTUktVCKrEMTqJsSWEKkLUBNmXHUB68h2EKzwfUr8xyTfYW-iSd77JqO-w6SzOK6z64L9Hqnf4HdZbHOBjgJgizlza-ZQgYDsE362w9dHEjOMpOnJmHeHsb07R2-3N6-K-eHy-e1jMH4uGsyoVSi4lBaYq5pzkpeCVkJKAEVJItyyFM6qxTjDilORLMNZylo2EmVo5qGo-RRfj3W3of1Ppth9Cl19qJgWXlJaKZRUbVU3oYwzg9Db4jQlfmhK9b1O3et-m3repxzaz6Xo0Qc7_6SHo2HjoGrA-QJO07f1_9h8VnX82</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2543511672</pqid></control><display><type>article</type><title>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</title><source>Elsevier ScienceDirect Journals</source><creator>Ullah, Irfan ; Khan, Sharifullah ; Imran, Muhammad ; Lee, Young-Koo</creator><creatorcontrib>Ullah, Irfan ; Khan, Sharifullah ; Imran, Muhammad ; Lee, Young-Koo</creatorcontrib><description>•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter.
Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.114787</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Catastrophic events ; Classification ; Digital media ; Disaster relief ; Disaster response ; Intermediate data ; Intermediate results ; Machine learning ; Relief efforts ; Request tweets ; Social networking sites ; Social networks ; System effectiveness</subject><ispartof>Expert systems with applications, 2021-08, Vol.176, p.114787, Article 114787</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Aug 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</citedby><cites>FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0957417421002281$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Ullah, Irfan</creatorcontrib><creatorcontrib>Khan, Sharifullah</creatorcontrib><creatorcontrib>Imran, Muhammad</creatorcontrib><creatorcontrib>Lee, Young-Koo</creatorcontrib><title>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</title><title>Expert systems with applications</title><description>•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter.
Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.</description><subject>Catastrophic events</subject><subject>Classification</subject><subject>Digital media</subject><subject>Disaster relief</subject><subject>Disaster response</subject><subject>Intermediate data</subject><subject>Intermediate results</subject><subject>Machine learning</subject><subject>Relief efforts</subject><subject>Request tweets</subject><subject>Social networking sites</subject><subject>Social networks</subject><subject>System effectiveness</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9UE1LAzEUDKJgrf4BTwHPW_O52RUvpfgFiiB6DunmpWZpd2uSteivN3U9e3pvhpn3hkHonJIZJbS8bGcQd2bGCKMzSoWq1AGa0ErxolQ1P0QTUktVCKrEMTqJsSWEKkLUBNmXHUB68h2EKzwfUr8xyTfYW-iSd77JqO-w6SzOK6z64L9Hqnf4HdZbHOBjgJgizlza-ZQgYDsE362w9dHEjOMpOnJmHeHsb07R2-3N6-K-eHy-e1jMH4uGsyoVSi4lBaYq5pzkpeCVkJKAEVJItyyFM6qxTjDilORLMNZylo2EmVo5qGo-RRfj3W3of1Ppth9Cl19qJgWXlJaKZRUbVU3oYwzg9Db4jQlfmhK9b1O3et-m3repxzaz6Xo0Qc7_6SHo2HjoGrA-QJO07f1_9h8VnX82</recordid><startdate>20210815</startdate><enddate>20210815</enddate><creator>Ullah, Irfan</creator><creator>Khan, Sharifullah</creator><creator>Imran, Muhammad</creator><creator>Lee, Young-Koo</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210815</creationdate><title>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</title><author>Ullah, Irfan ; Khan, Sharifullah ; Imran, Muhammad ; Lee, Young-Koo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-75b51e2782ff5364384550ea4545fb64fa7cdf420f753beadd3232802a97fe893</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Catastrophic events</topic><topic>Classification</topic><topic>Digital media</topic><topic>Disaster relief</topic><topic>Disaster response</topic><topic>Intermediate data</topic><topic>Intermediate results</topic><topic>Machine learning</topic><topic>Relief efforts</topic><topic>Request tweets</topic><topic>Social networking sites</topic><topic>Social networks</topic><topic>System effectiveness</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ullah, Irfan</creatorcontrib><creatorcontrib>Khan, Sharifullah</creatorcontrib><creatorcontrib>Imran, Muhammad</creatorcontrib><creatorcontrib>Lee, Young-Koo</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ullah, Irfan</au><au>Khan, Sharifullah</au><au>Imran, Muhammad</au><au>Lee, Young-Koo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RweetMiner: Automatic identification and categorization of help requests on twitter during disasters</atitle><jtitle>Expert systems with applications</jtitle><date>2021-08-15</date><risdate>2021</risdate><volume>176</volume><spage>114787</spage><pages>114787-</pages><artnum>114787</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•Redefining request under the term “rweet” in the context of social networking sties, as well as defining its primary types and subtypes.•Proposing optimized and effective preprocessing strategy.•Generating n-grams (bag of words) with n = 1, 2, and 3, combining them with each other and rule based features for learning subtle differences between request and non-request tweets, as well as six different types of request tweets.•Store intermediate data to speed up the machine learning development life cycle.•Performance improvement on the request identification and request categorization on Twitter.
Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, shelter, using logistic regression shows promising results and outperforms exiting works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.114787</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2021-08, Vol.176, p.114787, Article 114787 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_journals_2543511672 |
source | Elsevier ScienceDirect Journals |
subjects | Catastrophic events Classification Digital media Disaster relief Disaster response Intermediate data Intermediate results Machine learning Relief efforts Request tweets Social networking sites Social networks System effectiveness |
title | RweetMiner: Automatic identification and categorization of help requests on twitter during disasters |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T09%3A36%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RweetMiner:%20Automatic%20identification%20and%20categorization%20of%20help%20requests%20on%20twitter%20during%20disasters&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Ullah,%20Irfan&rft.date=2021-08-15&rft.volume=176&rft.spage=114787&rft.pages=114787-&rft.artnum=114787&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.114787&rft_dat=%3Cproquest_cross%3E2543511672%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2543511672&rft_id=info:pmid/&rft_els_id=S0957417421002281&rfr_iscdi=true |