Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification
In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system w...
Gespeichert in:
Veröffentlicht in: | Machine vision and applications 2024-11, Vol.35 (6), p.133, Article 133 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 6 |
container_start_page | 133 |
container_title | Machine vision and applications |
container_volume | 35 |
creator | Wu, Cheng-Hsuan Liu, An-Sheng Chen, Chiung-Tao Fu, Li-Chen |
description | In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments. |
doi_str_mv | 10.1007/s00138-024-01613-4 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3113455918</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3113455918</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</originalsourceid><addsrcrecordid>eNp9UMtKxDAUDaLgOPoDrgKuozdN2rRLGZ8w4EbXIU3SMcNMWpN0xC_wt81MBXeu7oHzuJyD0CWFawogbiIAZTWBghOgFWWEH6EZ5awgVFTNMZpBk3ENTXGKzmJcAwAXgs_Q912_Vc5j46L1SfnVxm4zwMobrHufgorJ7SzeWBW88yv86dI7jv0YtCWr0RlrcFTbYbPnuj7g0cdxsGGX8ww2U7gyakgqud7jTMV8giXZ6pPrnD4Q5-ikU5toL37vHL093L8unsjy5fF5cbskugBIpLO14aIrqQDBlOYtLUDRoq06Uyra5ualqZUtG14ZbeqyFazWtm2NBlG1WrE5uppyh9B_jDYmuc5dfH4pGaWMl2VD66wqJpUOfYzBdnIIbqvCl6Qg94PLaXCZB5eHwSXPJjaZYhb7lQ1_0f-4fgAdfYgI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3113455918</pqid></control><display><type>article</type><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><source>SpringerLink Journals - AutoHoldings</source><creator>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</creator><creatorcontrib>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</creatorcontrib><description>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</description><identifier>ISSN: 0932-8092</identifier><identifier>EISSN: 1432-1769</identifier><identifier>DOI: 10.1007/s00138-024-01613-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Adaptation ; Communications Engineering ; Computer Science ; Datasets ; Image Processing and Computer Vision ; Knowledge management ; Labels ; Labor ; Learning ; Networks ; Pattern Recognition ; Surveillance ; Surveillance systems</subject><ispartof>Machine vision and applications, 2024-11, Vol.35 (6), p.133, Article 133</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00138-024-01613-4$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00138-024-01613-4$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Wu, Cheng-Hsuan</creatorcontrib><creatorcontrib>Liu, An-Sheng</creatorcontrib><creatorcontrib>Chen, Chiung-Tao</creatorcontrib><creatorcontrib>Fu, Li-Chen</creatorcontrib><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><title>Machine vision and applications</title><addtitle>Machine Vision and Applications</addtitle><description>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</description><subject>Adaptation</subject><subject>Communications Engineering</subject><subject>Computer Science</subject><subject>Datasets</subject><subject>Image Processing and Computer Vision</subject><subject>Knowledge management</subject><subject>Labels</subject><subject>Labor</subject><subject>Learning</subject><subject>Networks</subject><subject>Pattern Recognition</subject><subject>Surveillance</subject><subject>Surveillance systems</subject><issn>0932-8092</issn><issn>1432-1769</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9UMtKxDAUDaLgOPoDrgKuozdN2rRLGZ8w4EbXIU3SMcNMWpN0xC_wt81MBXeu7oHzuJyD0CWFawogbiIAZTWBghOgFWWEH6EZ5awgVFTNMZpBk3ENTXGKzmJcAwAXgs_Q912_Vc5j46L1SfnVxm4zwMobrHufgorJ7SzeWBW88yv86dI7jv0YtCWr0RlrcFTbYbPnuj7g0cdxsGGX8ww2U7gyakgqud7jTMV8giXZ6pPrnD4Q5-ikU5toL37vHL093L8unsjy5fF5cbskugBIpLO14aIrqQDBlOYtLUDRoq06Uyra5ualqZUtG14ZbeqyFazWtm2NBlG1WrE5uppyh9B_jDYmuc5dfH4pGaWMl2VD66wqJpUOfYzBdnIIbqvCl6Qg94PLaXCZB5eHwSXPJjaZYhb7lQ1_0f-4fgAdfYgI</recordid><startdate>20241101</startdate><enddate>20241101</enddate><creator>Wu, Cheng-Hsuan</creator><creator>Liu, An-Sheng</creator><creator>Chen, Chiung-Tao</creator><creator>Fu, Li-Chen</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20241101</creationdate><title>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</title><author>Wu, Cheng-Hsuan ; Liu, An-Sheng ; Chen, Chiung-Tao ; Fu, Li-Chen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-fe8d47f517073ac4b120a12b6fd5a1b6135d8ae5946dcd85b738cebbdc076bca3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Adaptation</topic><topic>Communications Engineering</topic><topic>Computer Science</topic><topic>Datasets</topic><topic>Image Processing and Computer Vision</topic><topic>Knowledge management</topic><topic>Labels</topic><topic>Labor</topic><topic>Learning</topic><topic>Networks</topic><topic>Pattern Recognition</topic><topic>Surveillance</topic><topic>Surveillance systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wu, Cheng-Hsuan</creatorcontrib><creatorcontrib>Liu, An-Sheng</creatorcontrib><creatorcontrib>Chen, Chiung-Tao</creatorcontrib><creatorcontrib>Fu, Li-Chen</creatorcontrib><collection>CrossRef</collection><jtitle>Machine vision and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wu, Cheng-Hsuan</au><au>Liu, An-Sheng</au><au>Chen, Chiung-Tao</au><au>Fu, Li-Chen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification</atitle><jtitle>Machine vision and applications</jtitle><stitle>Machine Vision and Applications</stitle><date>2024-11-01</date><risdate>2024</risdate><volume>35</volume><issue>6</issue><spage>133</spage><pages>133-</pages><artnum>133</artnum><issn>0932-8092</issn><eissn>1432-1769</eissn><abstract>In recent years, fully supervised Person re-id methods have already been well developed. Still, they cannot be easily applied to real-life applications because of the domain gap between real-world databases and training datasets. And annotating ground truth label for the entire surveillance system with multiple cameras and videos are labor-intensive and impracticable in the real application. Besides, as the awareness of the right to privacy is rising, it becomes more challenging to collect sufficient training data from the public. Thence, the difficulty of constructing a new dataset for deployment not only arises from the labor cost of labeling but also because the raw data from the public are hard to come by. To be better adapted to real-life system deployment, we proposed an unsupervised domain adaptation based method, which involves Domain Disentanglement Network and Source-Guided Contrastive learning (SGCL). DD-Net first narrows down the domain gap between two datasets, and then SGCL utilizes the labeled source dataset as the clue to guide the training on the target domain. With these two modules, the knowledge transfer can be completed successfully from the training dataset to real-world scenarios. The conducted experiment shows that the proposed method is competitive with the state-of-the-art methods on two public datasets and even outperforms them under the setting of the small-scale target dataset. Therefore, not only the Person Re-ID, but also the object tracking in video or surveillance system can benefit from our new approach when we went to deploy to different environments.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00138-024-01613-4</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0932-8092 |
ispartof | Machine vision and applications, 2024-11, Vol.35 (6), p.133, Article 133 |
issn | 0932-8092 1432-1769 |
language | eng |
recordid | cdi_proquest_journals_3113455918 |
source | SpringerLink Journals - AutoHoldings |
subjects | Adaptation Communications Engineering Computer Science Datasets Image Processing and Computer Vision Knowledge management Labels Labor Learning Networks Pattern Recognition Surveillance Surveillance systems |
title | Domain disentanglement and contrastive learning with source-guided sampling for unsupervised domain adaptation person re-identification |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T01%3A12%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Domain%20disentanglement%20and%20contrastive%20learning%20with%20source-guided%20sampling%20for%20unsupervised%20domain%20adaptation%20person%20re-identification&rft.jtitle=Machine%20vision%20and%20applications&rft.au=Wu,%20Cheng-Hsuan&rft.date=2024-11-01&rft.volume=35&rft.issue=6&rft.spage=133&rft.pages=133-&rft.artnum=133&rft.issn=0932-8092&rft.eissn=1432-1769&rft_id=info:doi/10.1007/s00138-024-01613-4&rft_dat=%3Cproquest_cross%3E3113455918%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3113455918&rft_id=info:pmid/&rfr_iscdi=true |