Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations
Sree Hari Krishnan Parthasarathi, H. Bourlard, D. Gatica-Perez
DOI: 10.1109/tasl.2012.2215588
Journal: IEEE Transactions on Audio Speech and Language Processing
A supervised framework using deep neural architecture for deriving privacy-sensitive audio features for speaker diarization in multiparty conversations is proposed and experiments show that the proposed approaches yield darization performance close to the MFCC features on the single distant microphone dataset.
ivySCI AI Smartly Parses PDF, Answers Researchers' Questions, and Helps You Understand Papers in Seconds
Journal Info
Journals:
ISSN 1558-7916