Replay spoofing countermeasure using autoencoder and siamese networks on ASVspoof 2019 challenge

被引:18
作者
Adiban, Mohammad [1 ]
Sameti, Hossein [1 ]
Shehnepoor, Saeedreza [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
FEATURES;
D O I
10.1016/j.csl.2020.101105
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Automatic Speaker Verification (ASV) is authentication of individuals by analyzing their speech signals. Different synthetic approaches allow spoofing to deceive ASV systems (ASVs), whether using techniques to imitate a voice or reconstruct the features. Attackers beat up the ASVs using four general techniques; impersonation, speech synthesis, voice conversion, and replay. The last technique is considered as a common and high potential tool for spoofing purposes since replay attacks are more accessible and require no technical knowledge of adversaries. In this study, we introduce a novel replay spoofing countermeasure for ASVs. Accordingly, we use the Constant Q Cepstral Coefficient (CQCC) features fed into an autoencoder to attain more informative features and to consider the noise information of spoofed utterances for discrimination purpose. Finally, different configurations of the Siamese network are used for the first time in this context for classification. The experiments performed on ASVspoof challenge 2019 dataset using Equal Error Rate (EER) and Tandem Detection Cost Function (t-DCF) as evaluation metrics show that the proposed system improved the results over the baseline by 10.73% and 0.2344 in terms of EER and t-DCF, respectively. © 2020 Elsevier Ltd
引用
收藏
页数:13
相关论文
共 51 条
[1]
Adiban Mohammad, 2017, P 29 C COMP LING SPE, P264
[2]
Ahrabian K., 2018, Neural Computing, V31, P1
[3]
Alam M. J., 2018, Proc. Odyssey 2018 The Speaker and Language Recognition Workshop, P393
[4]
Alzantot M., 2019, ARXIV190700501
[5]
[Anonymous], 2015, ARXIV151102683
[6]
[Anonymous], 2016, SPEAK OD WORKSH BILB
[7]
[Anonymous], 2015, 16 ANN C INT SPEECH
[8]
[Anonymous], 2014, ABS14085601 CORR
[9]
[Anonymous], 2019, ARXIV190405576
[10]
[Anonymous], SPEAK OD 2018 SPEAK