Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification

In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system w...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Machine vision and applications 2024-11, Vol.35 (6), p.133, Article 133
Hauptverfasser: Wu, Cheng-Hsuan, Liu, An-Sheng, Chen, Chiung-Tao, Fu, Li-Chen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 6
container_start_page 133
container_title Machine vision and applications
container_volume 35
creator Wu, Cheng-Hsuan
Liu, An-Sheng
Chen, Chiung-Tao
Fu, Li-Chen
description In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.
doi_str_mv 10.1007/s00138-024-01613-4
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3113455918</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3113455918</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</originalsourceid><addsrcrecordid>eNp9UMtKxDAUDaLgOPoDrgKuozdN2rRLGZ8w4EbXIU3SMcNMWpN0xC_wt81MBXeu7oHzuJyD0CWFawogbiIAZTWBghOgFWWEH6EZ5awgVFTNMZpBk3ENTXGKzmJcAwAXgs_Q912_Vc5j46L1SfnVxm4zwMobrHufgorJ7SzeWBW88yv86dI7jv0YtCWr0RlrcFTbYbPnuj7g0cdxsGGX8ww2U7gyakgqud7jTMV8giXZ6pPrnD4Q5-ikU5toL37vHL093L8unsjy5fF5cbskugBIpLO14aIrqQDBlOYtLUDRoq06Uyra5ualqZUtG14ZbeqyFazWtm2NBlG1WrE5uppyh9B_jDYmuc5dfH4pGaWMl2VD66wqJpUOfYzBdnIIbqvCl6Qg94PLaXCZB5eHwSXPJjaZYhb7lQ1_0f-4fgAdfYgI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3113455918</pqid></control><display><type>article</type><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><source>SpringerLink Journals - AutoHoldings</source><creator>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</creator><creatorcontrib>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</creatorcontrib><description>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</description><identifier>ISSN: 0932-8092</identifier><identifier>EISSN: 1432-1769</identifier><identifier>DOI: 10.1007/s00138-024-01613-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Adaptation ; Communications Engineering ; Computer Science ; Datasets ; Image Processing and Computer Vision ; Knowledge management ; Labels ; Labor ; Learning ; Networks ; Pattern Recognition ; Surveillance ; Surveillance systems</subject><ispartof>Machine vision and applications, 2024-11, Vol.35 (6), p.133, Article 133</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00138-024-01613-4$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00138-024-01613-4$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Wu, Cheng-Hsuan</creatorcontrib><creatorcontrib>Liu, An-Sheng</creatorcontrib><creatorcontrib>Chen, Chiung-Tao</creatorcontrib><creatorcontrib>Fu, Li-Chen</creatorcontrib><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><title>Machine vision and applications</title><addtitle>Machine Vision and Applications</addtitle><description>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</description><subject>Adaptation</subject><subject>Communications Engineering</subject><subject>Computer Science</subject><subject>Datasets</subject><subject>Image Processing and Computer Vision</subject><subject>Knowledge management</subject><subject>Labels</subject><subject>Labor</subject><subject>Learning</subject><subject>Networks</subject><subject>Pattern Recognition</subject><subject>Surveillance</subject><subject>Surveillance systems</subject><issn>0932-8092</issn><issn>1432-1769</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9UMtKxDAUDaLgOPoDrgKuozdN2rRLGZ8w4EbXIU3SMcNMWpN0xC_wt81MBXeu7oHzuJyD0CWFawogbiIAZTWBghOgFWWEH6EZ5awgVFTNMZpBk3ENTXGKzmJcAwAXgs_Q912_Vc5j46L1SfnVxm4zwMobrHufgorJ7SzeWBW88yv86dI7jv0YtCWr0RlrcFTbYbPnuj7g0cdxsGGX8ww2U7gyakgqud7jTMV8giXZ6pPrnD4Q5-ikU5toL37vHL093L8unsjy5fF5cbskugBIpLO14aIrqQDBlOYtLUDRoq06Uyra5ualqZUtG14ZbeqyFazWtm2NBlG1WrE5uppyh9B_jDYmuc5dfH4pGaWMl2VD66wqJpUOfYzBdnIIbqvCl6Qg94PLaXCZB5eHwSXPJjaZYhb7lQ1_0f-4fgAdfYgI</recordid><startdate>20241101</startdate><enddate>20241101</enddate><creator>Wu, Cheng-Hsuan</creator><creator>Liu, An-Sheng</creator><creator>Chen, Chiung-Tao</creator><creator>Fu, Li-Chen</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20241101</creationdate><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><author>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Adaptation</topic><topic>Communications Engineering</topic><topic>Computer Science</topic><topic>Datasets</topic><topic>Image Processing and Computer Vision</topic><topic>Knowledge management</topic><topic>Labels</topic><topic>Labor</topic><topic>Learning</topic><topic>Networks</topic><topic>Pattern Recognition</topic><topic>Surveillance</topic><topic>Surveillance systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wu, Cheng-Hsuan</creatorcontrib><creatorcontrib>Liu, An-Sheng</creatorcontrib><creatorcontrib>Chen, Chiung-Tao</creatorcontrib><creatorcontrib>Fu, Li-Chen</creatorcontrib><collection>CrossRef</collection><jtitle>Machine vision and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wu, Cheng-Hsuan</au><au>Liu, An-Sheng</au><au>Chen, Chiung-Tao</au><au>Fu, Li-Chen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</atitle><jtitle>Machine vision and applications</jtitle><stitle>Machine Vision and Applications</stitle><date>2024-11-01</date><risdate>2024</risdate><volume>35</volume><issue>6</issue><spage>133</spage><pages>133-</pages><artnum>133</artnum><issn>0932-8092</issn><eissn>1432-1769</eissn><abstract>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00138-024-01613-4</doi></addata></record>
fulltext fulltext
identifier ISSN: 0932-8092
ispartof Machine vision and applications, 2024-11, Vol.35 (6), p.133, Article 133
issn 0932-8092
1432-1769
language eng
recordid cdi_proquest_journals_3113455918
source SpringerLink Journals - AutoHoldings
subjects Adaptation
Communications Engineering
Computer Science
Datasets
Image Processing and Computer Vision
Knowledge management
Labels
Labor
Learning
Networks
Pattern Recognition
Surveillance
Surveillance systems
title Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T01%3A12%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Domain%20disentanglement%20and%20contrastive%20learning%20with%20source-guided%20sampling%20for%20unsupervised%20domain%20adaptation%20person%20re-identification&rft.jtitle=Machine%20vision%20and%20applications&rft.au=Wu,%20Cheng-Hsuan&rft.date=2024-11-01&rft.volume=35&rft.issue=6&rft.spage=133&rft.pages=133-&rft.artnum=133&rft.issn=0932-8092&rft.eissn=1432-1769&rft_id=info:doi/10.1007/s00138-024-01613-4&rft_dat=%3Cproquest_cross%3E3113455918%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3113455918&rft_id=info:pmid/&rfr_iscdi=true