Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

The scene graph generation (SGG) task involves detecting objects within an image and predicting predicates that represent the relationships between the objects. However, in SGG benchmark datasets, each subject-object pair is annotated with a single predicate even though a single predicate may exhibi...

Detailed description

Bibliographic details
Published in:	arXiv.org 2024-07
Main authors:	Jeon, Jaehyeong; Kim, Kibum; Yoon, Kanghoon; Park, Chanyoung
Format:	Article
Language:	eng
Subjects:	Learning; Predictions; Prototypes; Semantics
Online access:	Full text

container_title	arXiv.org
creator	Jeon, Jaehyeong ; Kim, Kibum ; Yoon, Kanghoon ; Park, Chanyoung
description	The scene graph generation (SGG) task involves detecting objects within an image and predicting predicates that represent the relationships between the objects. However, in SGG benchmark datasets, each subject-object pair is annotated with a single predicate, and existing SGG models are trained to predict the one and only predicate for each pair, even though a single predicate may exhibit diverse semantics (i.e., semantic diversity). This in turn causes the SGG models to overlook the semantic diversity that may exist in a predicate, leading to biased predictions. In this paper, we propose a novel model-agnostic Semantic Diversity-aware Prototype-based Learning (DPL) framework that enables unbiased predictions based on an understanding of the semantic diversity of predicates. Specifically, DPL learns the regions in the semantic space covered by each predicate to distinguish among the various semantics that a single predicate can represent. Extensive experiments demonstrate that our proposed model-agnostic DPL framework brings significant performance improvement to existing SGG models and also effectively understands the semantic diversity of predicates.
format	Article
publisher	Ithaca: Cornell University Library, arXiv.org
creationdate	2024-07-25
rights	2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
oa	free_for_read
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-07
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_3083763953
source	Freely Accessible Journals
subjects	Learning ; Predictions ; Prototypes ; Semantics
title	Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T08%3A17%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Semantic%20Diversity-aware%20Prototype-based%20Learning%20for%20Unbiased%20Scene%20Graph%20Generation&rft.jtitle=arXiv.org&rft.au=Jeon,%20Jaehyeong&rft.date=2024-07-25&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3083763953%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3083763953&rft_id=info:pmid/&rfr_iscdi=true