Detection of voice conversion spoofing attacks using voiced speech

dc.contributor.authorSankar, M. S. Arun
dc.contributor.authorDe Leon, Phillip L.
dc.contributor.authorRoedig, Utz
dc.contributor.editorRiser, H. P.
dc.contributor.editorKyas, M.
dc.contributor.funderScience Foundation Irelanden
dc.date.accessioned2023-01-17T15:23:46Z
dc.date.available2023-01-17T15:23:46Z
dc.date.issued2022-11-11
dc.date.updated2023-01-17T15:10:30Z
dc.description.abstractSpeech consists of voiced and unvoiced segments that differ in their production process and exhibit different characteristics. In this paper, we investigate the spectral differences between bonafide and spoofed speech for voiced and unvoiced speech segments. We observe that the largest spectral differences lie in the 0–4 kHz band of voiced speech. Based on this observation, we propose a low-complexity, pre-processing stage which subsamples voiced frames prior to spoofing detection. The proposed pre-processing stage is applied to two systems, LFCC+GMM and IA/IF+KNN that differ entirely on the features and classifier used for spoofing detection. Our results show improvement with both systems in detection of the ASVspoof 2019 A17 voice conversion attack, which is recognized to have one of the highest spoofing capabilities. We also show improvements in the A18 and A19 voice conversion attacks for the IA/IF+KNN system. The resulting A17 EERs are lower than all reported systems where the A17 spoofing attack is the worst attack except the Capsule Network. Finally, we note that the proposed pre-processing stage reduces the speech date by more than 4× due to subsampling and using only voiced frames but at the same time maintaining similar pooled EER as that for the baseline systems, which may be advantageous for resource constrained spoofing detectors.en
dc.description.sponsorshipScience Foundation Ireland (Grant number 19/FFP/6775; 13/RC/2077 P2)en
dc.description.statusPeer revieweden
dc.description.versionAccepted Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationSankar, Arun M. S.., L. De Leon, P. and Roedig, U. (2022) ‘Detection of voice conversion spoofing attacks using voiced speech’, NordSec 2022, 27th Nordic Conference on Secure IT Systems, Reykjavik, Iceland, 30 Nov-02 Dec, in H.P. Reiser and M. Kyas (eds) Secure IT Systems, Lecture Notes in Computer Science, vol 13700, Cham: Springer International Publishing, pp. 159-175. https://doi.org/10.1007/978-3-031-22295-5_9en
dc.identifier.doi10.1007/978-3-031-22295-5_9en
dc.identifier.endpage175en
dc.identifier.isbn978-3-031-22294-8
dc.identifier.isbn978-3-031-22295-5
dc.identifier.journaltitleLecture Notes in Computer Scienceen
dc.identifier.startpage159en
dc.identifier.urihttps://hdl.handle.net/10468/14081
dc.identifier.volume13700en
dc.language.isoenen
dc.publisherSpringeren
dc.relation.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/13/RC/2077/IE/CONNECT: The Centre for Future Networks & Communications/en
dc.relation.urihttps://doi.org/10.1007/978-3-031-22295-5_9
dc.rights© 2022 The Author(s). This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Computer Science. The final authenticated version is available online at: https://doi.org/10.1007/978-3-031-22295-5_9en
dc.subjectSpoofing detectionen
dc.subjectSpeech processingen
dc.subjectComputer securityen
dc.subjectVoice bio-metricen
dc.titleDetection of voice conversion spoofing attacks using voiced speechen
dc.typeConference itemen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Detection_of_Voice_Conversion_Spoofing_Attacks_using_Voiced_Speech_pdf.pdf
Size:
369.59 KB
Format:
Adobe Portable Document Format
Description:
Accepted version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: