Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations

Sree Hari Krishnan Parthasarathi, H. Bourlard, D. Gatica-Perez

DOI: 10.1109/tasl.2012.2215588

Journal: IEEE Transactions on Audio Speech and Language Processing

A supervised framework using deep neural architecture for deriving privacy-sensitive audio features for speaker diarization in multiparty conversations is proposed and experiments show that the proposed approaches yield darization performance close to the MFCC features on the single distant microphone dataset.

ivySCI AI Smartly Parses PDF, Answers Researchers' Questions, and Helps You Understand Papers in Seconds

Download ivySCI

Journal Info

Journals:

ISSN 1558-7916

Built withby Ivy Science