High-Level Context Representation for Emotion Recognition in Images
Emotion recognition is the task of classifying perceived emotions in people. Previous works have utilized various nonverbal cues to extract features from images and correlate them to emotions. Of these cues, situational context is particularly crucial in emotion perception since it can directly infl...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Costa, Willams de Lima Martinez, Estefania Talavera Figueiredo, Lucas Silva Teichrieb, Veronica |
description | Emotion recognition is the task of classifying perceived emotions in people.
Previous works have utilized various nonverbal cues to extract features from
images and correlate them to emotions. Of these cues, situational context is
particularly crucial in emotion perception since it can directly influence the
emotion of a person. In this paper, we propose an approach for high-level
context representation extraction from images. The model relies on a single cue
and a single encoding stream to correlate this representation with emotions.
Our model competes with the state-of-the-art, achieving an mAP of 0.3002 on the
EMOTIC dataset while also being capable of execution on consumer-grade hardware
at approximately 90 frames per second. Overall, our approach is more efficient
than previous models and can be easily deployed to address real-world problems
related to emotion recognition. |
doi_str_mv | 10.48550/arxiv.2305.03500 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2305_03500</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2305_03500</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-6b04774be0e257a723304e244ad6e6e095b401da1e62463eb885fa011a9178673</originalsourceid><addsrcrecordid>eNotz81qwkAUhuHZdFGsF9CVcwNJz_zHZQlWhYAg7sOJnqQDZkYmQfTuxejqe1cfPIx9C8h1YQz8YLr5ay4VmByUAfhk5cZ3_1lFVzrzMoaRbiPf0yXRQGHE0cfA25j4qo9T7-kYu-Cn9oFve-xo-GIfLZ4Hmr93xg5_q0O5yardelv-VhlaB5ltQDunGwKSxqGTSoEmqTWeLFmCpWk0iBMKslJbRU1RmBZBCFwKV1inZmzxup0U9SX5HtO9fmrqSaMeuIZDvA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>High-Level Context Representation for Emotion Recognition in Images</title><source>arXiv.org</source><creator>Costa, Willams de Lima ; Martinez, Estefania Talavera ; Figueiredo, Lucas Silva ; Teichrieb, Veronica</creator><creatorcontrib>Costa, Willams de Lima ; Martinez, Estefania Talavera ; Figueiredo, Lucas Silva ; Teichrieb, Veronica</creatorcontrib><description>Emotion recognition is the task of classifying perceived emotions in people.
Previous works have utilized various nonverbal cues to extract features from
images and correlate them to emotions. Of these cues, situational context is
particularly crucial in emotion perception since it can directly influence the
emotion of a person. In this paper, we propose an approach for high-level
context representation extraction from images. The model relies on a single cue
and a single encoding stream to correlate this representation with emotions.
Our model competes with the state-of-the-art, achieving an mAP of 0.3002 on the
EMOTIC dataset while also being capable of execution on consumer-grade hardware
at approximately 90 frames per second. Overall, our approach is more efficient
than previous models and can be easily deployed to address real-world problems
related to emotion recognition.</description><identifier>DOI: 10.48550/arxiv.2305.03500</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Human-Computer Interaction</subject><creationdate>2023-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2305.03500$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2305.03500$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Costa, Willams de Lima</creatorcontrib><creatorcontrib>Martinez, Estefania Talavera</creatorcontrib><creatorcontrib>Figueiredo, Lucas Silva</creatorcontrib><creatorcontrib>Teichrieb, Veronica</creatorcontrib><title>High-Level Context Representation for Emotion Recognition in Images</title><description>Emotion recognition is the task of classifying perceived emotions in people.
Previous works have utilized various nonverbal cues to extract features from
images and correlate them to emotions. Of these cues, situational context is
particularly crucial in emotion perception since it can directly influence the
emotion of a person. In this paper, we propose an approach for high-level
context representation extraction from images. The model relies on a single cue
and a single encoding stream to correlate this representation with emotions.
Our model competes with the state-of-the-art, achieving an mAP of 0.3002 on the
EMOTIC dataset while also being capable of execution on consumer-grade hardware
at approximately 90 frames per second. Overall, our approach is more efficient
than previous models and can be easily deployed to address real-world problems
related to emotion recognition.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Human-Computer Interaction</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz81qwkAUhuHZdFGsF9CVcwNJz_zHZQlWhYAg7sOJnqQDZkYmQfTuxejqe1cfPIx9C8h1YQz8YLr5ay4VmByUAfhk5cZ3_1lFVzrzMoaRbiPf0yXRQGHE0cfA25j4qo9T7-kYu-Cn9oFve-xo-GIfLZ4Hmr93xg5_q0O5yardelv-VhlaB5ltQDunGwKSxqGTSoEmqTWeLFmCpWk0iBMKslJbRU1RmBZBCFwKV1inZmzxup0U9SX5HtO9fmrqSaMeuIZDvA</recordid><startdate>20230505</startdate><enddate>20230505</enddate><creator>Costa, Willams de Lima</creator><creator>Martinez, Estefania Talavera</creator><creator>Figueiredo, Lucas Silva</creator><creator>Teichrieb, Veronica</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230505</creationdate><title>High-Level Context Representation for Emotion Recognition in Images</title><author>Costa, Willams de Lima ; Martinez, Estefania Talavera ; Figueiredo, Lucas Silva ; Teichrieb, Veronica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-6b04774be0e257a723304e244ad6e6e095b401da1e62463eb885fa011a9178673</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Human-Computer Interaction</topic><toplevel>online_resources</toplevel><creatorcontrib>Costa, Willams de Lima</creatorcontrib><creatorcontrib>Martinez, Estefania Talavera</creatorcontrib><creatorcontrib>Figueiredo, Lucas Silva</creatorcontrib><creatorcontrib>Teichrieb, Veronica</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Costa, Willams de Lima</au><au>Martinez, Estefania Talavera</au><au>Figueiredo, Lucas Silva</au><au>Teichrieb, Veronica</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>High-Level Context Representation for Emotion Recognition in Images</atitle><date>2023-05-05</date><risdate>2023</risdate><abstract>Emotion recognition is the task of classifying perceived emotions in people.
Previous works have utilized various nonverbal cues to extract features from
images and correlate them to emotions. Of these cues, situational context is
particularly crucial in emotion perception since it can directly influence the
emotion of a person. In this paper, we propose an approach for high-level
context representation extraction from images. The model relies on a single cue
and a single encoding stream to correlate this representation with emotions.
Our model competes with the state-of-the-art, achieving an mAP of 0.3002 on the
EMOTIC dataset while also being capable of execution on consumer-grade hardware
at approximately 90 frames per second. Overall, our approach is more efficient
than previous models and can be easily deployed to address real-world problems
related to emotion recognition.</abstract><doi>10.48550/arxiv.2305.03500</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2305.03500 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2305_03500 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition Computer Science - Human-Computer Interaction |
title | High-Level Context Representation for Emotion Recognition in Images |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T20%3A27%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=High-Level%20Context%20Representation%20for%20Emotion%20Recognition%20in%20Images&rft.au=Costa,%20Willams%20de%20Lima&rft.date=2023-05-05&rft_id=info:doi/10.48550/arxiv.2305.03500&rft_dat=%3Carxiv_GOX%3E2305_03500%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |