Who's speaking? Predicting speaker profession from speech - Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur Accéder directement au contenu
Communication Dans Un Congrès Année : 2023

Who's speaking? Predicting speaker profession from speech

Résumé

Variations in speech can reveal the gender, birth place, age, and socioeconomic level of the speaker. In this paper, we show that even the profession of the speaker can be recovered from a recording. For this purpose, we design a method that combines features from both the speech signal and the transcription. For the features from the transcription, we used pretrained language models. This allows us to train a model that predicts the speaker profession from both signals. Our empirical results show that our model can narrow down the profession of the speakers considerably.
Fichier principal
Vignette du fichier
icphs-2023.pdf (120.47 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04190126 , version 1 (29-08-2023)

Licence

Paternité

Identifiants

  • HAL Id : hal-04190126 , version 1

Citer

Yaru Wu, Lihu Chen, Benjamin Elie, Fabian M. Suchanek, Ioana Vasilescu, et al.. Who's speaking? Predicting speaker profession from speech. International Congress of Phonetic Sciences 2023, Aug 2023, Prague, Czech Republic. pp.3086-3090. ⟨hal-04190126⟩
148 Consultations
36 Téléchargements

Partager

Gmail Facebook X LinkedIn More