Improving anti-spoofing with octave spectrum and short-term spectral statistics information

The long-term window based features have been found to be effective for spoofing attack detection. One such important countermeasure is constant-Q cepstral coefficients (CQCC) that is derived from constant-Q transform. During its extraction, the octave power spectrum is converted to the linear power...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied acoustics 2020-01, Vol.157, p.107017, Article 107017
Hauptverfasser: Yang, Jichen, Das, Rohan Kumar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The long-term window based features have been found to be effective for spoofing attack detection. One such important countermeasure is constant-Q cepstral coefficients (CQCC) that is derived from constant-Q transform. During its extraction, the octave power spectrum is converted to the linear power spectrum by performing uniform resampling. However, the information from the octave power spectrum is different from that carried by the linear power spectrum. We believe that the octave power spectrum can offer complementary information to the linear power spectrum for spoofing attack detection. In this regard, we propose to combine the coefficients generated using both linear and octave power spectrum. The combined feature is referred to as extended CQCC (eCQCC), which is hypothesized to have better discriminative information for detection of spoofing attacks. In addition, we use the short-term spectral statistics information (STSSI) along with eCQCC feature to form another novel feature representation referred to as eCQCC-STSSI to have improved anti-spoofing countermeasure. We perform the studies with the proposed features for both synthetic and playback attacks using ASVspoof 2015 and ASVspoof 2017 version 2.0 corpus, respectively. The studies reveal that eCQCC outperforms the conventional CQCC feature as well most of the known systems showing importance of octave spectrum information. Further, the hybrid feature eCQCC-STSSI improve the performance of eCQCC feature due to the STSSI information combined with it.
ISSN:0003-682X
1872-910X
DOI:10.1016/j.apacoust.2019.107017