Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification

In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system w...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Machine vision and applications 2024-11, Vol.35 (6), p.133, Article 133
Hauptverfasser:	Wu, Cheng-Hsuan, Liu, An-Sheng, Chen, Chiung-Tao, Fu, Li-Chen
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptation Communications Engineering Computer Science Datasets Image Processing and Computer Vision Knowledge management Labels Labor Learning Networks Pattern Recognition Surveillance Surveillance systems
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	6
container_start_page	133
container_title	Machine vision and applications
container_volume	35
creator	Wu, Cheng-Hsuan Liu, An-Sheng Chen, Chiung-Tao Fu, Li-Chen
description	In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.
doi_str_mv	10.1007/s00138-024-01613-4
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3113455918</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3113455918</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</originalsourceid><addsrcrecordid>eNp9UMtKxDAUDaLgOPoDrgKuozdN2rRLGZ8w4EbXIU3SMcNMWpN0xC_wt81MBXeu7oHzuJyD0CWFawogbiIAZTWBghOgFWWEH6EZ5awgVFTNMZpBk3ENTXGKzmJcAwAXgs_Q912_Vc5j46L1SfnVxm4zwMobrHufgorJ7SzeWBW88yv86dI7jv0YtCWr0RlrcFTbYbPnuj7g0cdxsGGX8ww2U7gyakgqud7jTMV8giXZ6pPrnD4Q5-ikU5toL37vHL093L8unsjy5fF5cbskugBIpLO14aIrqQDBlOYtLUDRoq06Uyra5ualqZUtG14ZbeqyFazWtm2NBlG1WrE5uppyh9B_jDYmuc5dfH4pGaWMl2VD66wqJpUOfYzBdnIIbqvCl6Qg94PLaXCZB5eHwSXPJjaZYhb7lQ1_0f-4fgAdfYgI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3113455918</pqid></control><display><type>article</type><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><source>SpringerLink Journals - AutoHoldings</source><creator>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</creator><creatorcontrib>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</creatorcontrib><description>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</description><identifier>ISSN: 0932-8092</identifier><identifier>EISSN: 1432-1769</identifier><identifier>DOI: 10.1007/s00138-024-01613-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Adaptation ; Communications Engineering ; Computer Science ; Datasets ; Image Processing and Computer Vision ; Knowledge management ; Labels ; Labor ; Learning ; Networks ; Pattern Recognition ; Surveillance ; Surveillance systems</subject><ispartof>Machine vision and applications, 2024-11, Vol.35 (6), p.133, Article 133</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00138-024-01613-4$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00138-024-01613-4$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Wu, Cheng-Hsuan</creatorcontrib><creatorcontrib>Liu, An-Sheng</creatorcontrib><creatorcontrib>Chen, Chiung-Tao</creatorcontrib><creatorcontrib>Fu, Li-Chen</creatorcontrib><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><title>Machine vision and applications</title><addtitle>Machine Vision and Applications</addtitle><description>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</description><subject>Adaptation</subject><subject>Communications Engineering</subject><subject>Computer Science</subject><subject>Datasets</subject><subject>Image Processing and Computer Vision</subject><subject>Knowledge management</subject><subject>Labels</subject><subject>Labor</subject><subject>Learning</subject><subject>Networks</subject><subject>Pattern Recognition</subject><subject>Surveillance</subject><subject>Surveillance systems</subject><issn>0932-8092</issn><issn>1432-1769</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9UMtKxDAUDaLgOPoDrgKuozdN2rRLGZ8w4EbXIU3SMcNMWpN0xC_wt81MBXeu7oHzuJyD0CWFawogbiIAZTWBghOgFWWEH6EZ5awgVFTNMZpBk3ENTXGKzmJcAwAXgs_Q912_Vc5j46L1SfnVxm4zwMobrHufgorJ7SzeWBW88yv86dI7jv0YtCWr0RlrcFTbYbPnuj7g0cdxsGGX8ww2U7gyakgqud7jTMV8giXZ6pPrnD4Q5-ikU5toL37vHL093L8unsjy5fF5cbskugBIpLO14aIrqQDBlOYtLUDRoq06Uyra5ualqZUtG14ZbeqyFazWtm2NBlG1WrE5uppyh9B_jDYmuc5dfH4pGaWMl2VD66wqJpUOfYzBdnIIbqvCl6Qg94PLaXCZB5eHwSXPJjaZYhb7lQ1_0f-4fgAdfYgI</recordid><startdate>20241101</startdate><enddate>20241101</enddate><creator>Wu, Cheng-Hsuan</creator><creator>Liu, An-Sheng</creator><creator>Chen, Chiung-Tao</creator><creator>Fu, Li-Chen</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20241101</creationdate><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><author>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Adaptation</topic><topic>Communications Engineering</topic><topic>Computer Science</topic><topic>Datasets</topic><topic>Image Processing and Computer Vision</topic><topic>Knowledge management</topic><topic>Labels</topic><topic>Labor</topic><topic>Learning</topic><topic>Networks</topic><topic>Pattern Recognition</topic><topic>Surveillance</topic><topic>Surveillance systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wu, Cheng-Hsuan</creatorcontrib><creatorcontrib>Liu, An-Sheng</creatorcontrib><creatorcontrib>Chen, Chiung-Tao</creatorcontrib><creatorcontrib>Fu, Li-Chen</creatorcontrib><collection>CrossRef</collection><jtitle>Machine vision and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wu, Cheng-Hsuan</au><au>Liu, An-Sheng</au><au>Chen, Chiung-Tao</au><au>Fu, Li-Chen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</atitle><jtitle>Machine vision and applications</jtitle><stitle>Machine Vision and Applications</stitle><date>2024-11-01</date><risdate>2024</risdate><volume>35</volume><issue>6</issue><spage>133</spage><pages>133-</pages><artnum>133</artnum><issn>0932-8092</issn><eissn>1432-1769</eissn><abstract>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00138-024-01613-4</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 0932-8092
ispartof	Machine vision and applications, 2024-11, Vol.35 (6), p.133, Article 133
issn	0932-8092 1432-1769
language	eng
recordid	cdi_proquest_journals_3113455918
source	SpringerLink Journals - AutoHoldings
subjects	Adaptation Communications Engineering Computer Science Datasets Image Processing and Computer Vision Knowledge management Labels Labor Learning Networks Pattern Recognition Surveillance Surveillance systems
title	Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T01%3A12%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Domain%20disentanglement%20and%20contrastive%20learning%20with%20source-guided%20sampling%20for%20unsupervised%20domain%20adaptation%20person%20re-identification&rft.jtitle=Machine%20vision%20and%20applications&rft.au=Wu,%20Cheng-Hsuan&rft.date=2024-11-01&rft.volume=35&rft.issue=6&rft.spage=133&rft.pages=133-&rft.artnum=133&rft.issn=0932-8092&rft.eissn=1432-1769&rft_id=info:doi/10.1007/s00138-024-01613-4&rft_dat=%3Cproquest_cross%3E3113455918%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3113455918&rft_id=info:pmid/&rfr_iscdi=true