Improving anti-spoofing with octave spectrum and short-term spectral statistics information
The long-term window based features have been found to be effective for spoofing attack detection. One such important countermeasure is constant-Q cepstral coefficients (CQCC) that is derived from constant-Q transform. During its extraction, the octave power spectrum is converted to the linear power...
Gespeichert in:
Veröffentlicht in: | Applied acoustics 2020-01, Vol.157, p.107017, Article 107017 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The long-term window based features have been found to be effective for spoofing attack detection. One such important countermeasure is constant-Q cepstral coefficients (CQCC) that is derived from constant-Q transform. During its extraction, the octave power spectrum is converted to the linear power spectrum by performing uniform resampling. However, the information from the octave power spectrum is different from that carried by the linear power spectrum. We believe that the octave power spectrum can offer complementary information to the linear power spectrum for spoofing attack detection. In this regard, we propose to combine the coefficients generated using both linear and octave power spectrum. The combined feature is referred to as extended CQCC (eCQCC), which is hypothesized to have better discriminative information for detection of spoofing attacks. In addition, we use the short-term spectral statistics information (STSSI) along with eCQCC feature to form another novel feature representation referred to as eCQCC-STSSI to have improved anti-spoofing countermeasure. We perform the studies with the proposed features for both synthetic and playback attacks using ASVspoof 2015 and ASVspoof 2017 version 2.0 corpus, respectively. The studies reveal that eCQCC outperforms the conventional CQCC feature as well most of the known systems showing importance of octave spectrum information. Further, the hybrid feature eCQCC-STSSI improve the performance of eCQCC feature due to the STSSI information combined with it. |
---|---|
ISSN: | 0003-682X 1872-910X |
DOI: | 10.1016/j.apacoust.2019.107017 |