SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis
Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be ach...
Gespeichert in:
Veröffentlicht in: | Knowledge-based systems 2016-05, Vol.100, p.97-111 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 111 |
---|---|
container_issue | |
container_start_page | 97 |
container_title | Knowledge-based systems |
container_volume | 100 |
creator | Khan, Farhan Hassan Qamar, Usman Bashir, Saba |
description | Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis. |
doi_str_mv | 10.1016/j.knosys.2016.02.011 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1816039021</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705116000976</els_id><sourcerecordid>1816039021</sourcerecordid><originalsourceid>FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</originalsourceid><addsrcrecordid>eNp9kEFP3DAQha2qldhC_wEHH7kkHTubxOkBCaFSkEActoijZTuTrdPEWTzJVvvv8Wp75jR6mveeZj7GLgXkAkT1vc__hokOlMukcpA5CPGJrYSqZVavofnMVtCUkNVQijP2lagHACmFWrF-8_rwtPnBNzj6jJYdxr0nbDkttkc3-z3yDs28ROT_0G__zD5suQkt92HGYfBbDDMfpxYHTjgcE1Pg3RSTCrMfj1sTzHAgTxfsS2cGwm__5zl7ufv5-_Y-e3z-9XB785i5dOqclWVd2KJwjS2hlU6ICqywtlOqUAI6tJW0rqq6FqwyVV2Lbg3KGNGo1prKVMU5uzr17uL0tiDNevTk0rEm4LSQFipVFg1Ikazrk9XFiShip3fRjyYetAB9RKt7fUKrj2g1SJ3Qptj1KYbpjb3HqMl5DA5bHxMC3U7-44J3zT-GhQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1816039021</pqid></control><display><type>article</type><title>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Khan, Farhan Hassan ; Qamar, Usman ; Bashir, Saba</creator><creatorcontrib>Khan, Farhan Hassan ; Qamar, Usman ; Bashir, Saba</creatorcontrib><description>Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2016.02.011</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Benchmarking ; Classification ; Cornell ; Data mining ; Feature selection ; Knowledge base ; Machine learning ; Movie reviews ; Natural Language Processing (NLP) ; Performance enhancement ; Sentiment analysis ; Speech ; Support Vector Machine ; Support vector machines</subject><ispartof>Knowledge-based systems, 2016-05, Vol.100, p.97-111</ispartof><rights>2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</citedby><cites>FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0950705116000976$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Khan, Farhan Hassan</creatorcontrib><creatorcontrib>Qamar, Usman</creatorcontrib><creatorcontrib>Bashir, Saba</creatorcontrib><title>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</title><title>Knowledge-based systems</title><description>Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.</description><subject>Benchmarking</subject><subject>Classification</subject><subject>Cornell</subject><subject>Data mining</subject><subject>Feature selection</subject><subject>Knowledge base</subject><subject>Machine learning</subject><subject>Movie reviews</subject><subject>Natural Language Processing (NLP)</subject><subject>Performance enhancement</subject><subject>Sentiment analysis</subject><subject>Speech</subject><subject>Support Vector Machine</subject><subject>Support vector machines</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNp9kEFP3DAQha2qldhC_wEHH7kkHTubxOkBCaFSkEActoijZTuTrdPEWTzJVvvv8Wp75jR6mveeZj7GLgXkAkT1vc__hokOlMukcpA5CPGJrYSqZVavofnMVtCUkNVQijP2lagHACmFWrF-8_rwtPnBNzj6jJYdxr0nbDkttkc3-z3yDs28ROT_0G__zD5suQkt92HGYfBbDDMfpxYHTjgcE1Pg3RSTCrMfj1sTzHAgTxfsS2cGwm__5zl7ufv5-_Y-e3z-9XB785i5dOqclWVd2KJwjS2hlU6ICqywtlOqUAI6tJW0rqq6FqwyVV2Lbg3KGNGo1prKVMU5uzr17uL0tiDNevTk0rEm4LSQFipVFg1Ikazrk9XFiShip3fRjyYetAB9RKt7fUKrj2g1SJ3Qptj1KYbpjb3HqMl5DA5bHxMC3U7-44J3zT-GhQ</recordid><startdate>20160515</startdate><enddate>20160515</enddate><creator>Khan, Farhan Hassan</creator><creator>Qamar, Usman</creator><creator>Bashir, Saba</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20160515</creationdate><title>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</title><author>Khan, Farhan Hassan ; Qamar, Usman ; Bashir, Saba</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c409t-5573b33c9b50d2c1160b1bbf883810feb62bc66fd0b8a6771f408aa198dba6a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Benchmarking</topic><topic>Classification</topic><topic>Cornell</topic><topic>Data mining</topic><topic>Feature selection</topic><topic>Knowledge base</topic><topic>Machine learning</topic><topic>Movie reviews</topic><topic>Natural Language Processing (NLP)</topic><topic>Performance enhancement</topic><topic>Sentiment analysis</topic><topic>Speech</topic><topic>Support Vector Machine</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khan, Farhan Hassan</creatorcontrib><creatorcontrib>Qamar, Usman</creatorcontrib><creatorcontrib>Bashir, Saba</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khan, Farhan Hassan</au><au>Qamar, Usman</au><au>Bashir, Saba</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis</atitle><jtitle>Knowledge-based systems</jtitle><date>2016-05-15</date><risdate>2016</risdate><volume>100</volume><spage>97</spage><epage>111</epage><pages>97-111</pages><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics’ sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2016.02.011</doi><tpages>15</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0950-7051 |
ispartof | Knowledge-based systems, 2016-05, Vol.100, p.97-111 |
issn | 0950-7051 1872-7409 |
language | eng |
recordid | cdi_proquest_miscellaneous_1816039021 |
source | ScienceDirect Journals (5 years ago - present) |
subjects | Benchmarking Classification Cornell Data mining Feature selection Knowledge base Machine learning Movie reviews Natural Language Processing (NLP) Performance enhancement Sentiment analysis Speech Support Vector Machine Support vector machines |
title | SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T18%3A04%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SWIMS:%20Semi-supervised%20subjective%20feature%20weighting%20and%20intelligent%20model%20selection%20for%20sentiment%20analysis&rft.jtitle=Knowledge-based%20systems&rft.au=Khan,%20Farhan%20Hassan&rft.date=2016-05-15&rft.volume=100&rft.spage=97&rft.epage=111&rft.pages=97-111&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2016.02.011&rft_dat=%3Cproquest_cross%3E1816039021%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1816039021&rft_id=info:pmid/&rft_els_id=S0950705116000976&rfr_iscdi=true |