SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis

Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be ach...

Bibliographic details
Published in: Knowledge-based systems 2016-05, Vol.100, p.97-111
Main authors: Khan, Farhan Hassan; Qamar, Usman; Bashir, Saba
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 111
container_issue
container_start_page 97
container_title Knowledge-based systems
container_volume 100
creator Khan, Farhan Hassan
Qamar, Usman
Bashir, Saba
description Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze the public's sentiments, opinions, attitudes, etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical-based methodologies, or a combination of both. In an effort to improve the performance of domain-independent lexicons, this research combines machine learning with a lexical-based approach, introducing a new framework called SWIMS that determines feature weights based on a well-known general-purpose sentiment lexicon, SentiWordNet. A support vector machine is used to learn the feature weights, and an intelligent model selection approach is employed to enhance classification performance. The features are selected based on their subjectivity, and the effects of feature selection with respect to part-of-speech information are studied extensively. Seven benchmark datasets have been used in this research, including the large movie review dataset, the multi-domain sentiment dataset, and the Cornell movie review dataset, all of which are available online. An in-depth performance comparison is conducted with state-of-the-art machine learning approaches and lexical-based methodologies. The evaluation of performance measures shows that the proposed framework outperforms the compared techniques for sentiment analysis.
doi_str_mv 10.1016/j.knosys.2016.02.011
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1816039021</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705116000976</els_id><sourcerecordid>1816039021</sourcerecordid><originalsourceid>FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</originalsourceid><addsrcrecordid>eNp9kEFP3DAQha2qldhC_wEHH7kkHTubxOkBCaFSkEActoijZTuTrdPEWTzJVvvv8Wp75jR6mveeZj7GLgXkAkT1vc__hokOlMukcpA5CPGJrYSqZVavofnMVtCUkNVQijP2lagHACmFWrF-8_rwtPnBNzj6jJYdxr0nbDkttkc3-z3yDs28ROT_0G__zD5suQkt92HGYfBbDDMfpxYHTjgcE1Pg3RSTCrMfj1sTzHAgTxfsS2cGwm__5zl7ufv5-_Y-e3z-9XB785i5dOqclWVd2KJwjS2hlU6ICqywtlOqUAI6tJW0rqq6FqwyVV2Lbg3KGNGo1prKVMU5uzr17uL0tiDNevTk0rEm4LSQFipVFg1Ikazrk9XFiShip3fRjyYetAB9RKt7fUKrj2g1SJ3Qptj1KYbpjb3HqMl5DA5bHxMC3U7-44J3zT-GhQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1816039021</pqid></control><display><type>article</type><title>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Khan, Farhan Hassan ; Qamar, Usman ; Bashir, Saba</creator><creatorcontrib>Khan, Farhan Hassan ; Qamar, Usman ; Bashir, Saba</creatorcontrib><description>Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. 
In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2016.02.011</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Benchmarking ; Classification ; Cornell ; Data mining ; Feature selection ; Knowledge base ; Machine learning ; Movie reviews ; Natural Language Processing (NLP) ; Performance enhancement ; Sentiment analysis ; Speech ; Support Vector Machine ; Support vector machines</subject><ispartof>Knowledge-based systems, 2016-05, Vol.100, 
p.97-111</ispartof><rights>2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</citedby><cites>FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0950705116000976$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Khan, Farhan Hassan</creatorcontrib><creatorcontrib>Qamar, Usman</creatorcontrib><creatorcontrib>Bashir, Saba</creatorcontrib><title>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</title><title>Knowledge-based systems</title><description>Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. 
The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.</description><subject>Benchmarking</subject><subject>Classification</subject><subject>Cornell</subject><subject>Data mining</subject><subject>Feature selection</subject><subject>Knowledge base</subject><subject>Machine learning</subject><subject>Movie reviews</subject><subject>Natural Language Processing (NLP)</subject><subject>Performance enhancement</subject><subject>Sentiment analysis</subject><subject>Speech</subject><subject>Support Vector Machine</subject><subject>Support vector machines</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNp9kEFP3DAQha2qldhC_wEHH7kkHTubxOkBCaFSkEActoijZTuTrdPEWTzJVvvv8Wp75jR6mveeZj7GLgXkAkT1vc__hokOlMukcpA5CPGJrYSqZVavofnMVtCUkNVQijP2lagHACmFWrF-8_rwtPnBNzj6jJYdxr0nbDkttkc3-z3yDs28ROT_0G__zD5suQkt92HGYfBbDDMfpxYHTjgcE1Pg3RSTCrMfj1sTzHAgTxfsS2cGwm__5zl7ufv5-_Y-e3z-9XB785i5dOqclWVd2KJwjS2hlU6ICqywtlOqUAI6tJW0rqq6FqwyVV2Lbg3KGNGo1prKVMU5uzr17uL0tiDNevTk0rEm4LSQFipVFg1Ikazrk9XFiShip3fRjyYetAB9RKt7fUKrj2g1SJ3Qptj1KYbpjb3HqMl5DA5bHxMC3U7-44J3zT-GhQ</recordid><startdate>20160515</startdate><enddate>20160515</enddate><creator>Khan, Farhan Hassan</creator><creator>Qamar, Usman</creator><creator>Bashir, Saba</creator><general>Elsevier 
B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20160515</creationdate><title>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</title><author>Khan, Farhan Hassan ; Qamar, Usman ; Bashir, Saba</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Benchmarking</topic><topic>Classification</topic><topic>Cornell</topic><topic>Data mining</topic><topic>Feature selection</topic><topic>Knowledge base</topic><topic>Machine learning</topic><topic>Movie reviews</topic><topic>Natural Language Processing (NLP)</topic><topic>Performance enhancement</topic><topic>Sentiment analysis</topic><topic>Speech</topic><topic>Support Vector Machine</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khan, Farhan Hassan</creatorcontrib><creatorcontrib>Qamar, Usman</creatorcontrib><creatorcontrib>Bashir, Saba</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khan, Farhan Hassan</au><au>Qamar, Usman</au><au>Bashir, 
Saba</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</atitle><jtitle>Knowledge-based systems</jtitle><date>2016-05-15</date><risdate>2016</risdate><volume>100</volume><spage>97</spage><epage>111</epage><pages>97-111</pages><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2016.02.011</doi><tpages>15</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0950-7051
ispartof Knowledge-based systems, 2016-05, Vol.100, p.97-111
issn 0950-7051
1872-7409
language eng
recordid cdi_proquest_miscellaneous_1816039021
source ScienceDirect Journals (5 years ago - present)
subjects Benchmarking
Classification
Cornell
Data mining
Feature selection
Knowledge base
Machine learning
Movie reviews
Natural Language Processing (NLP)
Performance enhancement
Sentiment analysis
Speech
Support Vector Machine
Support vector machines
title SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T18%3A04%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SWIMS:%20Semi-supervised%20subjective%20feature%20weighting%20and%20intelligent%20model%20selection%20for%20sentiment%20analysis&rft.jtitle=Knowledge-based%20systems&rft.au=Khan,%20Farhan%20Hassan&rft.date=2016-05-15&rft.volume=100&rft.spage=97&rft.epage=111&rft.pages=97-111&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2016.02.011&rft_dat=%3Cproquest_cross%3E1816039021%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1816039021&rft_id=info:pmid/&rft_els_id=S0950705116000976&rfr_iscdi=true