Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review

A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of advanced computer science & applications 2022, Vol.13 (2)
Hauptverfasser: Ahmad, Gazi Imtiyaz, Singla, Jimmy, Ali, Anis, Reshi, Aijaz Ahmad, Salameh, Anas A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 2
container_start_page
container_title International journal of advanced computer science & applications
container_volume 13
creator Ahmad, Gazi Imtiyaz
Singla, Jimmy
Ali, Anis
Reshi, Aijaz Ahmad
Salameh, Anas A.
description A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research. Systems have been developed using ML techniques to predict the polarity of code-mixed text corpus and to fine tune the existing models to improve their performance.
doi_str_mv 10.14569/IJACSA.2022.0130254
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2652931004</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2652931004</sourcerecordid><originalsourceid>FETCH-LOGICAL-c325t-c5025a21bef24c845f268a450f6877304411b69a70e7dd92b8fb4fcdd61f98123</originalsourceid><addsrcrecordid>eNotUMtqwzAQNKWFhjR_0IOgZ6eSLPlxNKaPlIRCnUJvRpZXtUIiu5Lz-oF-d5Wke9idgdmBmSC4J3hKGI-zx9lbXpT5lGJKp5hEmHJ2FYwo4XHIeYKvzzgNCU6-boOJcyvsJ8ponEaj4HchZKsNoDkIa7T5RkuQrdE_W3BIdRaVYAa98QvlRqyPTjvUKVR0DYQLfYAGCdOgcq8H2XoyM40WBpWd1GKNFuCZNzwM_sH2W4dClHu46S20YJzeAfqAnYb9XXCjxNrB5P-Og8_np2XxGs7fX2ZFPg9lRPkQSu7TCUpqUJTJlHHlUwjGsYrTJIkwY4TUcSYSDEnTZLROVc2UbJqYqCwlNBoHDxff3naniEO16rbWB3MVjTnNIoIx8yp2UUnbOWdBVb3VG2GPFcHVufTqUnp1Kr36Lz36A38FdP4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2652931004</pqid></control><display><type>article</type><title>Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Ahmad, Gazi Imtiyaz ; Singla, Jimmy ; Ali, Anis ; Reshi, Aijaz Ahmad ; Salameh, Anas A.</creator><creatorcontrib>Ahmad, Gazi Imtiyaz ; Singla, Jimmy ; Ali, Anis ; Reshi, Aijaz Ahmad ; Salameh, Anas A.</creatorcontrib><description>A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research. Systems have been developed using ML techniques to predict the polarity of code-mixed text corpus and to fine tune the existing models to improve their performance.</description><identifier>ISSN: 2158-107X</identifier><identifier>EISSN: 2156-5570</identifier><identifier>DOI: 10.14569/IJACSA.2022.0130254</identifier><language>eng</language><publisher>West Yorkshire: Science and Information (SAI) Organization Limited</publisher><subject>Attitudes ; Bilingualism ; Code switching ; Communication ; Corpus analysis ; Corpus linguistics ; Data mining ; Deep learning ; Digital media ; Empathy ; Interjections ; Language ; Linguistic units ; Linguistics ; Machine learning ; Multilingualism ; Natural language processing ; Polarity ; Sentences ; Sentiment analysis ; Social media ; Social networks ; Sociolinguistics ; Words (language) ; Writing</subject><ispartof>International journal of advanced computer science &amp; applications, 2022, Vol.13 (2)</ispartof><rights>2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c325t-c5025a21bef24c845f268a450f6877304411b69a70e7dd92b8fb4fcdd61f98123</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,778,782,4012,27910,27911,27912</link.rule.ids></links><search><creatorcontrib>Ahmad, Gazi Imtiyaz</creatorcontrib><creatorcontrib>Singla, Jimmy</creatorcontrib><creatorcontrib>Ali, Anis</creatorcontrib><creatorcontrib>Reshi, Aijaz Ahmad</creatorcontrib><creatorcontrib>Salameh, Anas A.</creatorcontrib><title>Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review</title><title>International journal of advanced computer science &amp; applications</title><description>A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research. Systems have been developed using ML techniques to predict the polarity of code-mixed text corpus and to fine tune the existing models to improve their performance.</description><subject>Attitudes</subject><subject>Bilingualism</subject><subject>Code switching</subject><subject>Communication</subject><subject>Corpus analysis</subject><subject>Corpus linguistics</subject><subject>Data mining</subject><subject>Deep learning</subject><subject>Digital media</subject><subject>Empathy</subject><subject>Interjections</subject><subject>Language</subject><subject>Linguistic units</subject><subject>Linguistics</subject><subject>Machine learning</subject><subject>Multilingualism</subject><subject>Natural language processing</subject><subject>Polarity</subject><subject>Sentences</subject><subject>Sentiment analysis</subject><subject>Social media</subject><subject>Social networks</subject><subject>Sociolinguistics</subject><subject>Words (language)</subject><subject>Writing</subject><issn>2158-107X</issn><issn>2156-5570</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNotUMtqwzAQNKWFhjR_0IOgZ6eSLPlxNKaPlIRCnUJvRpZXtUIiu5Lz-oF-d5Wke9idgdmBmSC4J3hKGI-zx9lbXpT5lGJKp5hEmHJ2FYwo4XHIeYKvzzgNCU6-boOJcyvsJ8ponEaj4HchZKsNoDkIa7T5RkuQrdE_W3BIdRaVYAa98QvlRqyPTjvUKVR0DYQLfYAGCdOgcq8H2XoyM40WBpWd1GKNFuCZNzwM_sH2W4dClHu46S20YJzeAfqAnYb9XXCjxNrB5P-Og8_np2XxGs7fX2ZFPg9lRPkQSu7TCUpqUJTJlHHlUwjGsYrTJIkwY4TUcSYSDEnTZLROVc2UbJqYqCwlNBoHDxff3naniEO16rbWB3MVjTnNIoIx8yp2UUnbOWdBVb3VG2GPFcHVufTqUnp1Kr36Lz36A38FdP4</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Ahmad, Gazi Imtiyaz</creator><creator>Singla, Jimmy</creator><creator>Ali, Anis</creator><creator>Reshi, Aijaz Ahmad</creator><creator>Salameh, Anas A.</creator><general>Science and Information (SAI) Organization Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7T9</scope><scope>7XB</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>2022</creationdate><title>Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review</title><author>Ahmad, Gazi Imtiyaz ; Singla, Jimmy ; Ali, Anis ; Reshi, Aijaz Ahmad ; Salameh, Anas A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c325t-c5025a21bef24c845f268a450f6877304411b69a70e7dd92b8fb4fcdd61f98123</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Attitudes</topic><topic>Bilingualism</topic><topic>Code switching</topic><topic>Communication</topic><topic>Corpus analysis</topic><topic>Corpus linguistics</topic><topic>Data mining</topic><topic>Deep learning</topic><topic>Digital media</topic><topic>Empathy</topic><topic>Interjections</topic><topic>Language</topic><topic>Linguistic units</topic><topic>Linguistics</topic><topic>Machine learning</topic><topic>Multilingualism</topic><topic>Natural language processing</topic><topic>Polarity</topic><topic>Sentences</topic><topic>Sentiment analysis</topic><topic>Social media</topic><topic>Social networks</topic><topic>Sociolinguistics</topic><topic>Words (language)</topic><topic>Writing</topic><toplevel>online_resources</toplevel><creatorcontrib>Ahmad, Gazi Imtiyaz</creatorcontrib><creatorcontrib>Singla, Jimmy</creatorcontrib><creatorcontrib>Ali, Anis</creatorcontrib><creatorcontrib>Reshi, Aijaz Ahmad</creatorcontrib><creatorcontrib>Salameh, Anas A.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of advanced computer science &amp; applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ahmad, Gazi Imtiyaz</au><au>Singla, Jimmy</au><au>Ali, Anis</au><au>Reshi, Aijaz Ahmad</au><au>Salameh, Anas A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review</atitle><jtitle>International journal of advanced computer science &amp; applications</jtitle><date>2022</date><risdate>2022</risdate><volume>13</volume><issue>2</issue><issn>2158-107X</issn><eissn>2156-5570</eissn><abstract>A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research. Systems have been developed using ML techniques to predict the polarity of code-mixed text corpus and to fine tune the existing models to improve their performance.</abstract><cop>West Yorkshire</cop><pub>Science and Information (SAI) Organization Limited</pub><doi>10.14569/IJACSA.2022.0130254</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2158-107X
ispartof International journal of advanced computer science & applications, 2022, Vol.13 (2)
issn 2158-107X
2156-5570
language eng
recordid cdi_proquest_journals_2652931004
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Attitudes
Bilingualism
Code switching
Communication
Corpus analysis
Corpus linguistics
Data mining
Deep learning
Digital media
Empathy
Interjections
Language
Linguistic units
Linguistics
Machine learning
Multilingualism
Natural language processing
Polarity
Sentences
Sentiment analysis
Social media
Social networks
Sociolinguistics
Words (language)
Writing
title Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T04%3A16%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20Learning%20Techniques%20for%20Sentiment%20Analysis%20of%20Code-Mixed%20and%20Switched%20Indian%20Social%20Media%20Text%20Corpus%20-%20A%20Comprehensive%20Review&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=Ahmad,%20Gazi%20Imtiyaz&rft.date=2022&rft.volume=13&rft.issue=2&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2022.0130254&rft_dat=%3Cproquest_cross%3E2652931004%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2652931004&rft_id=info:pmid/&rfr_iscdi=true