Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing
Face Anti-Spoofing (FAS) is crucial for securing face recognition systems against presentation attacks. With advancements in sensor manufacture and multi-modal learning techniques, many multi-modal FAS approaches have emerged. However, they face challenges in generalizing to unseen attacks and deplo...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Lin, Xun Wang, Shuai Cai, Rizhao Liu, Yizhong Fu, Ying Yu, Zitong Tang, Wenzhong Kot, Alex |
description | Face Anti-Spoofing (FAS) is crucial for securing face recognition systems
against presentation attacks. With advancements in sensor manufacture and
multi-modal learning techniques, many multi-modal FAS approaches have emerged.
However, they face challenges in generalizing to unseen attacks and deployment
conditions. These challenges arise from (1) modality unreliability, where some
modality sensors like depth and infrared undergo significant domain shifts in
varying environments, leading to the spread of unreliable information during
cross-modal feature fusion, and (2) modality imbalance, where training overly
relies on a dominant modality hinders the convergence of others, reducing
effectiveness against attack types that are indistinguishable sorely using the
dominant modality. To address modality unreliability, we propose the
Uncertainty-Guided Cross-Adapter (U-Adapter) to recognize unreliably detected
regions within each modality and suppress the impact of unreliable regions on
other modalities. For modality imbalance, we propose a Rebalanced Modality
Gradient Modulation (ReGrad) strategy to rebalance the convergence speed of all
modalities by adaptively adjusting their gradients. Besides, we provide the
first large-scale benchmark for evaluating multi-modal FAS performance under
domain generalization scenarios. Extensive experiments demonstrate that our
method outperforms state-of-the-art methods. Source code and protocols will be
released on https://github.com/OMGGGGG/mmdg. |
doi_str_mv | 10.48550/arxiv.2402.19298 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2402_19298</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2402_19298</sourcerecordid><originalsourceid>FETCH-LOGICAL-a678-a760de231d8f5a261eda01728d7d0a9f0ed625596a626dfedad25fed3c60853e3</originalsourceid><addsrcrecordid>eNotj8tOwzAQRb1hgQofwAr_QIIfteOwqypaQK2QaPbRNDOuLBknciivrye0rI6urnSkw9iNFOXcGSPuIH-Fj1LNhSplrWp3yZ53x2HINI4cEvJX2kOE1NE9b_pPyDjyNSXKEMMPId8e43sotj1C5CvoiC_StHdD3_uQDlfswkMc6fqfM9asHprlY7F5WT8tF5sCbOUKqKxAUlqi8waUlYQgZKUcViig9oLQKmNqC1ZZ9NOLykzQnRXOaNIzdnvWnmLaIYc3yN_tX1R7itK_HQtHfQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing</title><source>arXiv.org</source><creator>Lin, Xun ; Wang, Shuai ; Cai, Rizhao ; Liu, Yizhong ; Fu, Ying ; Yu, Zitong ; Tang, Wenzhong ; Kot, Alex</creator><creatorcontrib>Lin, Xun ; Wang, Shuai ; Cai, Rizhao ; Liu, Yizhong ; Fu, Ying ; Yu, Zitong ; Tang, Wenzhong ; Kot, Alex</creatorcontrib><description>Face Anti-Spoofing (FAS) is crucial for securing face recognition systems
against presentation attacks. With advancements in sensor manufacture and
multi-modal learning techniques, many multi-modal FAS approaches have emerged.
However, they face challenges in generalizing to unseen attacks and deployment
conditions. These challenges arise from (1) modality unreliability, where some
modality sensors like depth and infrared undergo significant domain shifts in
varying environments, leading to the spread of unreliable information during
cross-modal feature fusion, and (2) modality imbalance, where training overly
relies on a dominant modality hinders the convergence of others, reducing
effectiveness against attack types that are indistinguishable sorely using the
dominant modality. To address modality unreliability, we propose the
Uncertainty-Guided Cross-Adapter (U-Adapter) to recognize unreliably detected
regions within each modality and suppress the impact of unreliable regions on
other modalities. For modality imbalance, we propose a Rebalanced Modality
Gradient Modulation (ReGrad) strategy to rebalance the convergence speed of all
modalities by adaptively adjusting their gradients. Besides, we provide the
first large-scale benchmark for evaluating multi-modal FAS performance under
domain generalization scenarios. Extensive experiments demonstrate that our
method outperforms state-of-the-art methods. Source code and protocols will be
released on https://github.com/OMGGGGG/mmdg.</description><identifier>DOI: 10.48550/arxiv.2402.19298</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-02</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2402.19298$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2402.19298$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Lin, Xun</creatorcontrib><creatorcontrib>Wang, Shuai</creatorcontrib><creatorcontrib>Cai, Rizhao</creatorcontrib><creatorcontrib>Liu, Yizhong</creatorcontrib><creatorcontrib>Fu, Ying</creatorcontrib><creatorcontrib>Yu, Zitong</creatorcontrib><creatorcontrib>Tang, Wenzhong</creatorcontrib><creatorcontrib>Kot, Alex</creatorcontrib><title>Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing</title><description>Face Anti-Spoofing (FAS) is crucial for securing face recognition systems
against presentation attacks. With advancements in sensor manufacture and
multi-modal learning techniques, many multi-modal FAS approaches have emerged.
However, they face challenges in generalizing to unseen attacks and deployment
conditions. These challenges arise from (1) modality unreliability, where some
modality sensors like depth and infrared undergo significant domain shifts in
varying environments, leading to the spread of unreliable information during
cross-modal feature fusion, and (2) modality imbalance, where training overly
relies on a dominant modality hinders the convergence of others, reducing
effectiveness against attack types that are indistinguishable sorely using the
dominant modality. To address modality unreliability, we propose the
Uncertainty-Guided Cross-Adapter (U-Adapter) to recognize unreliably detected
regions within each modality and suppress the impact of unreliable regions on
other modalities. For modality imbalance, we propose a Rebalanced Modality
Gradient Modulation (ReGrad) strategy to rebalance the convergence speed of all
modalities by adaptively adjusting their gradients. Besides, we provide the
first large-scale benchmark for evaluating multi-modal FAS performance under
domain generalization scenarios. Extensive experiments demonstrate that our
method outperforms state-of-the-art methods. Source code and protocols will be
released on https://github.com/OMGGGGG/mmdg.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAQRb1hgQofwAr_QIIfteOwqypaQK2QaPbRNDOuLBknciivrye0rI6urnSkw9iNFOXcGSPuIH-Fj1LNhSplrWp3yZ53x2HINI4cEvJX2kOE1NE9b_pPyDjyNSXKEMMPId8e43sotj1C5CvoiC_StHdD3_uQDlfswkMc6fqfM9asHprlY7F5WT8tF5sCbOUKqKxAUlqi8waUlYQgZKUcViig9oLQKmNqC1ZZ9NOLykzQnRXOaNIzdnvWnmLaIYc3yN_tX1R7itK_HQtHfQ</recordid><startdate>20240229</startdate><enddate>20240229</enddate><creator>Lin, Xun</creator><creator>Wang, Shuai</creator><creator>Cai, Rizhao</creator><creator>Liu, Yizhong</creator><creator>Fu, Ying</creator><creator>Yu, Zitong</creator><creator>Tang, Wenzhong</creator><creator>Kot, Alex</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240229</creationdate><title>Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing</title><author>Lin, Xun ; Wang, Shuai ; Cai, Rizhao ; Liu, Yizhong ; Fu, Ying ; Yu, Zitong ; Tang, Wenzhong ; Kot, Alex</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a678-a760de231d8f5a261eda01728d7d0a9f0ed625596a626dfedad25fed3c60853e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Lin, Xun</creatorcontrib><creatorcontrib>Wang, Shuai</creatorcontrib><creatorcontrib>Cai, Rizhao</creatorcontrib><creatorcontrib>Liu, Yizhong</creatorcontrib><creatorcontrib>Fu, Ying</creatorcontrib><creatorcontrib>Yu, Zitong</creatorcontrib><creatorcontrib>Tang, Wenzhong</creatorcontrib><creatorcontrib>Kot, Alex</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lin, Xun</au><au>Wang, Shuai</au><au>Cai, Rizhao</au><au>Liu, Yizhong</au><au>Fu, Ying</au><au>Yu, Zitong</au><au>Tang, Wenzhong</au><au>Kot, Alex</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing</atitle><date>2024-02-29</date><risdate>2024</risdate><abstract>Face Anti-Spoofing (FAS) is crucial for securing face recognition systems
against presentation attacks. With advancements in sensor manufacture and
multi-modal learning techniques, many multi-modal FAS approaches have emerged.
However, they face challenges in generalizing to unseen attacks and deployment
conditions. These challenges arise from (1) modality unreliability, where some
modality sensors like depth and infrared undergo significant domain shifts in
varying environments, leading to the spread of unreliable information during
cross-modal feature fusion, and (2) modality imbalance, where training overly
relies on a dominant modality hinders the convergence of others, reducing
effectiveness against attack types that are indistinguishable sorely using the
dominant modality. To address modality unreliability, we propose the
Uncertainty-Guided Cross-Adapter (U-Adapter) to recognize unreliably detected
regions within each modality and suppress the impact of unreliable regions on
other modalities. For modality imbalance, we propose a Rebalanced Modality
Gradient Modulation (ReGrad) strategy to rebalance the convergence speed of all
modalities by adaptively adjusting their gradients. Besides, we provide the
first large-scale benchmark for evaluating multi-modal FAS performance under
domain generalization scenarios. Extensive experiments demonstrate that our
method outperforms state-of-the-art methods. Source code and protocols will be
released on https://github.com/OMGGGGG/mmdg.</abstract><doi>10.48550/arxiv.2402.19298</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2402.19298 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2402_19298 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T21%3A02%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Suppress%20and%20Rebalance:%20Towards%20Generalized%20Multi-Modal%20Face%20Anti-Spoofing&rft.au=Lin,%20Xun&rft.date=2024-02-29&rft_id=info:doi/10.48550/arxiv.2402.19298&rft_dat=%3Carxiv_GOX%3E2402_19298%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |