A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source

We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text. Emoji suggestions already exist in many cross-platform applications but an emoji is predicted solely based a few prominent words instead of understanding the subject and substance of the text. Throug...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Venkit, Pranav, Karishma, Zeba, Hsu, Chi-Yang, Katiki, Rahul, Huang, Kenneth, Wilson, Shomir, Dudas, Patrick
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Venkit, Pranav
Karishma, Zeba
Hsu, Chi-Yang
Katiki, Rahul
Huang, Kenneth
Wilson, Shomir
Dudas, Patrick
description We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text. Emoji suggestions already exist in many cross-platform applications but an emoji is predicted solely based a few prominent words instead of understanding the subject and substance of the text. Through this paper, we showcase the importance of using Twitter features to help the model understand the sentiment involved and hence to predict the most suitable emoji for the text. Hashtags and Application Sources like Android, etc. are two features which we found to be important yet underused in emoji prediction and Twitter sentiment analysis on the whole. To approach this shortcoming and to further understand emoji behavioral patterns, we propose a more balanced dataset by crawling additional Twitter data, including timestamp, hashtags, and application source acting as additional attributes to the tweet. Our data analysis and neural network model performance evaluations depict that using hashtags and application sources as features allows to encode different information and is effective in emoji prediction.
doi_str_mv 10.48550/arxiv.2103.07833
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2103_07833</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2103_07833</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-7228bd2694eb4398938c5bf7fa5b8a62ba99d2850c7682629d1696dabbfcea533</originalsourceid><addsrcrecordid>eNotjztPwzAYRb0woMIPYMIbCwmOHTs2W6gKRapEpWbpFD6_wCgvxSmPf09JWe69w9WRDkJXGUlzyTm5g_E7fKY0IywlhWTsHO1L_LrrD6Nx_tDc4OorxOker9r-I-Dt6GwwU-g7_ADRWXwcO9dNoT3GLV5DfJ_gLWLoLC6HoQkG5vOJd4HOPDTRXf73AlWPq2q5TjYvT8_LcpOAKFhSUCq1pULlTudMScWk4doXHriWIKgGpSyVnJhCSCqosplQwoLW3jjgjC3Q9Qk7u9XDGFoYf-o_x3p2ZL-lokwb</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source</title><source>arXiv.org</source><creator>Venkit, Pranav ; Karishma, Zeba ; Hsu, Chi-Yang ; Katiki, Rahul ; Huang, Kenneth ; Wilson, Shomir ; Dudas, Patrick</creator><creatorcontrib>Venkit, Pranav ; Karishma, Zeba ; Hsu, Chi-Yang ; Katiki, Rahul ; Huang, Kenneth ; Wilson, Shomir ; Dudas, Patrick</creatorcontrib><description>We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text. Emoji suggestions already exist in many cross-platform applications but an emoji is predicted solely based a few prominent words instead of understanding the subject and substance of the text. Through this paper, we showcase the importance of using Twitter features to help the model understand the sentiment involved and hence to predict the most suitable emoji for the text. Hashtags and Application Sources like Android, etc. are two features which we found to be important yet underused in emoji prediction and Twitter sentiment analysis on the whole. To approach this shortcoming and to further understand emoji behavioral patterns, we propose a more balanced dataset by crawling additional Twitter data, including timestamp, hashtags, and application source acting as additional attributes to the tweet. Our data analysis and neural network model performance evaluations depict that using hashtags and application sources as features allows to encode different information and is effective in emoji prediction.</description><identifier>DOI: 10.48550/arxiv.2103.07833</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2021-03</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2103.07833$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2103.07833$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Venkit, Pranav</creatorcontrib><creatorcontrib>Karishma, Zeba</creatorcontrib><creatorcontrib>Hsu, Chi-Yang</creatorcontrib><creatorcontrib>Katiki, Rahul</creatorcontrib><creatorcontrib>Huang, Kenneth</creatorcontrib><creatorcontrib>Wilson, Shomir</creatorcontrib><creatorcontrib>Dudas, Patrick</creatorcontrib><title>A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source</title><description>We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text. Emoji suggestions already exist in many cross-platform applications but an emoji is predicted solely based a few prominent words instead of understanding the subject and substance of the text. Through this paper, we showcase the importance of using Twitter features to help the model understand the sentiment involved and hence to predict the most suitable emoji for the text. Hashtags and Application Sources like Android, etc. are two features which we found to be important yet underused in emoji prediction and Twitter sentiment analysis on the whole. To approach this shortcoming and to further understand emoji behavioral patterns, we propose a more balanced dataset by crawling additional Twitter data, including timestamp, hashtags, and application source acting as additional attributes to the tweet. Our data analysis and neural network model performance evaluations depict that using hashtags and application sources as features allows to encode different information and is effective in emoji prediction.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotjztPwzAYRb0woMIPYMIbCwmOHTs2W6gKRapEpWbpFD6_wCgvxSmPf09JWe69w9WRDkJXGUlzyTm5g_E7fKY0IywlhWTsHO1L_LrrD6Nx_tDc4OorxOker9r-I-Dt6GwwU-g7_ADRWXwcO9dNoT3GLV5DfJ_gLWLoLC6HoQkG5vOJd4HOPDTRXf73AlWPq2q5TjYvT8_LcpOAKFhSUCq1pULlTudMScWk4doXHriWIKgGpSyVnJhCSCqosplQwoLW3jjgjC3Q9Qk7u9XDGFoYf-o_x3p2ZL-lokwb</recordid><startdate>20210313</startdate><enddate>20210313</enddate><creator>Venkit, Pranav</creator><creator>Karishma, Zeba</creator><creator>Hsu, Chi-Yang</creator><creator>Katiki, Rahul</creator><creator>Huang, Kenneth</creator><creator>Wilson, Shomir</creator><creator>Dudas, Patrick</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210313</creationdate><title>A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source</title><author>Venkit, Pranav ; Karishma, Zeba ; Hsu, Chi-Yang ; Katiki, Rahul ; Huang, Kenneth ; Wilson, Shomir ; Dudas, Patrick</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-7228bd2694eb4398938c5bf7fa5b8a62ba99d2850c7682629d1696dabbfcea533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Venkit, Pranav</creatorcontrib><creatorcontrib>Karishma, Zeba</creatorcontrib><creatorcontrib>Hsu, Chi-Yang</creatorcontrib><creatorcontrib>Katiki, Rahul</creatorcontrib><creatorcontrib>Huang, Kenneth</creatorcontrib><creatorcontrib>Wilson, Shomir</creatorcontrib><creatorcontrib>Dudas, Patrick</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Venkit, Pranav</au><au>Karishma, Zeba</au><au>Hsu, Chi-Yang</au><au>Katiki, Rahul</au><au>Huang, Kenneth</au><au>Wilson, Shomir</au><au>Dudas, Patrick</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source</atitle><date>2021-03-13</date><risdate>2021</risdate><abstract>We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text. Emoji suggestions already exist in many cross-platform applications but an emoji is predicted solely based a few prominent words instead of understanding the subject and substance of the text. Through this paper, we showcase the importance of using Twitter features to help the model understand the sentiment involved and hence to predict the most suitable emoji for the text. Hashtags and Application Sources like Android, etc. are two features which we found to be important yet underused in emoji prediction and Twitter sentiment analysis on the whole. To approach this shortcoming and to further understand emoji behavioral patterns, we propose a more balanced dataset by crawling additional Twitter data, including timestamp, hashtags, and application source acting as additional attributes to the tweet. Our data analysis and neural network model performance evaluations depict that using hashtags and application sources as features allows to encode different information and is effective in emoji prediction.</abstract><doi>10.48550/arxiv.2103.07833</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2103.07833
ispartof
issn
language eng
recordid cdi_arxiv_primary_2103_07833
source arXiv.org
subjects Computer Science - Computation and Language
title A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T00%3A29%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20%60Sourceful'%20Twist:%20Emoji%20Prediction%20Based%20on%20Sentiment,%20Hashtags%20and%20Application%20Source&rft.au=Venkit,%20Pranav&rft.date=2021-03-13&rft_id=info:doi/10.48550/arxiv.2103.07833&rft_dat=%3Carxiv_GOX%3E2103_07833%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true