Skip to Main content Skip to Navigation
Theses

Secure, efficient automatic speaker verification for embedded applications

Abstract : This industrial CIFRE PhD thesis addresses automatic speaker verification (ASV) issues in the context of embedded applications. The first part of this thesis focuses on more traditional problems and topics. The first work investigates the minimum enrolment data requirements for a practical, text-dependent short-utterance ASV system. Contributions in part A of the thesis consist in a statistical analysis whose objective is to isolate text-dependent factors and prove they are consistent across different sets of speakers. For very short utterances, the influence of a specific text content on the system performance can be considered a speaker-independent factor. Part B of the thesis focuses on neural network-based solutions. While it was clear that neural networks and deep learning were becoming state-of-the-art in several machine learning domains, their use for embedded solutions was hindered by their complexity. Contributions described in the second part of the thesis comprise blue-sky, experimental research which tackles the substitution of hand-crafted, traditional speaker features in favour of operating directly upon the audio waveform and the search for optimal network architectures and weights by means of genetic algorithms. This work is the most fundamental contribution: lightweight, neuro-evolved network structures which are able to learn from the raw audio input.
Complete list of metadatas

Cited literature [292 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-03001286
Contributor : Abes Star :  Contact
Submitted on : Thursday, November 12, 2020 - 12:15:26 PM
Last modification on : Monday, November 16, 2020 - 10:21:28 AM

File

VALENTI_Giacomo_2019.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-03001286, version 1

Citation

Giacomo Valenti. Secure, efficient automatic speaker verification for embedded applications. Artificial Intelligence [cs.AI]. Sorbonne Université, 2019. English. ⟨NNT : 2019SORUS471⟩. ⟨tel-03001286⟩

Share

Metrics

Record views

60

Files downloads

33