Low-complexity speech spoofing detection using instantaneous spectral features

Files
SpoofdetectionEMD.pdf (314.35 KB)
Accepted Version
Date
2022-05
Authors
Sankar, M. S. Arun
De Leon, Phillip L.
Sandoval, Steven
Roedig, Utz
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Over the last decade, various detection mechanisms for spoofed speech have been proposed. Thus far, development has focused on detection accuracy, largely ignoring secondary goals such as computational complexity and storage requirements. In this work, we use empirical mode decomposition to compute intrinsic mode functions, which are then demodulated to obtain features consisting of short-time statistics of instantaneous amplitude and instantaneous frequency. These features are then used with a simple k-nearest neighbours classifier. We further show that voiced segments from short speech signals can be used in the feature extraction, resulting in spoofing detection that is competitive with top-performing systems while requiring up to 103× less computation.
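The feature pipeline named in the abstract (demodulation into instantaneous amplitude and instantaneous frequency, short-time statistics, k-nearest neighbours) can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: the empirical mode decomposition step is skipped and the input is treated as a single, already extracted intrinsic mode function; demodulation uses a Hilbert-style analytic signal, which may differ from the paper's method; the per-frame statistics are taken to be mean and standard deviation; and all function names, frame lengths, and labels are hypothetical.

```python
import cmath
import math
from collections import Counter
from statistics import fmean, pstdev


def analytic_signal(x):
    """Analytic signal via a naive O(n^2) DFT (adequate for short frames)."""
    n = len(x)
    spectrum = [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
                for k in range(n)]
    for k in range(1, n):  # the DC bin (k = 0) is left unchanged
        if n % 2 == 0 and k == n // 2:
            continue  # the Nyquist bin is left unchanged
        # Double positive frequencies, zero negative frequencies.
        spectrum[k] *= 2 if k < n / 2 else 0
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def demodulate(x, fs):
    """Return instantaneous amplitude and instantaneous frequency (Hz)."""
    z = analytic_signal(x)
    inst_amp = [abs(v) for v in z]
    # Phase increment between consecutive samples, so no unwrapping is needed.
    inst_freq = [fs / (2 * math.pi) * cmath.phase(z[t + 1] * z[t].conjugate())
                 for t in range(len(z) - 1)]
    return inst_amp, inst_freq


def frame_features(x, fs, frame_len, hop):
    """Per-frame [mean IA, std IA, mean IF, std IF] (assumed statistics)."""
    inst_amp, inst_freq = demodulate(x, fs)
    inst_amp = inst_amp[:len(inst_freq)]  # align lengths (IF has one sample fewer)
    feats = []
    for start in range(0, len(inst_freq) - frame_len + 1, hop):
        a = inst_amp[start:start + frame_len]
        f = inst_freq[start:start + frame_len]
        feats.append([fmean(a), pstdev(a), fmean(f), pstdev(f)])
    return feats


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training vectors closest in Euclidean distance."""
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]
```

In use, `frame_features` would be applied to each intrinsic mode function of a voiced segment, and the resulting vectors passed to `knn_predict` together with labelled training vectors (for example, hypothetical labels such as "genuine" and "spoofed").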
Keywords
Computer security, Biometrics, Speaker recognition, Speech processing
Citation
Sankar, M. S. A., De Leon, P. L., Sandoval, S. and Roedig, U. (2022) 'Low-complexity speech spoofing detection using instantaneous spectral features', 2022 29th International Conference on Systems, Signals and Image Processing (IWSSIP), Sofia, Bulgaria, 1-3 June, pp. 1-4. doi: 10.1109/IWSSIP55020.2022.9854446