Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations
Type of publication: Idiap-RR
Citation: Parthasarathi_Idiap-RR-12-2009
Number: Idiap-RR-12-2009
Year: 2009
Month: June
Institution: Idiap
Address: Idiap Research Institute, Martigny, Switzerland.
Abstract: We investigate four different privacy-sensitive features, namely energy, zero crossing rate, spectral flatness, and kurtosis, for speech detection in multiparty conversations. We liken this scenario to a meeting room and define our datasets and annotations accordingly. The temporal context of these features is modeled. With no temporal context, energy is the best performing single feature. But by modeling temporal context, kurtosis emerges as the most effective feature. Also, we combine the features. Besides yielding a gain in performance, certain combinations of features also reveal that a shorter temporal context is sufficient. We then benchmark other privacy-sensitive features utilized in previous studies. Our experiments show that the performance of all the privacy-sensitive features modeled with context is close to that of state-of-the-art spectral-based features, without extracting and using any features that can be used to reconstruct the speech signal.
Projects: Idiap; SNSF-MULTI; IM2
Authors: Parthasarathi, Sree Hari Krishnan; Magimai.-Doss, Mathew; Bourlard, Hervé; Gatica-Perez, Daniel
Crossref by: Parthasarathi_INTERSPEECH_2009
Attachments
  • Parthasarathi_Idiap-RR-12-2009.pdf (MD5: 177b85091fc8ee93acbdabd04cc650a9)