Skip to Main content Skip to Navigation
Theses

Sélection de paramètres acoustiques pertinents pour la reconnaissance de la parole

Abstract : The objective of this thesis is to propose solutions and performance improvements to certain problems of relevant acoustic features selection in the framework of the speech recognition. Thus, our first contribution consists in proposing a new method of relevant feature selection based on an exact development of the redundancy between a feature and the feature previously selected using Forward search algorithm. The estimation problem of the higher order probability densities is solved by the truncation of the theoretical development of this redundancy up to acceptable orders. Moreover, we proposed a stopping criterion which allows fixing the number of features selected according to the mutual information approximated at the iteration J of the search algorithm. However, the mutual information estimation is difficult since its definition depends on the probability densities of the variables (features) in which the type of these distributions is unknown and their estimates are carried out on a finite sample set. An approach for the estimate of these distributions is based on the histogram method. This method requires a good choice of the bin number (cells of the histogram). Thus, we also proposed a new formula of computation of bin number that allows minimizing the estimator bias of the entropy and mutual information. This new estimator was validated on simulated data and speech data. More particularly, this estimator was applied in the selection of the static and dynamic MFCC parameters that were the most relevant for a recognition task of the connected words of the Aurora2 base.
Document type :
Theses
Complete list of metadatas

Cited literature [119 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00843652
Contributor : Abes Star :  Contact
Submitted on : Thursday, July 11, 2013 - 8:09:32 PM
Last modification on : Thursday, March 5, 2020 - 6:49:27 PM
Long-term archiving on: : Saturday, October 12, 2013 - 10:40:08 AM

File

abdenour.hacinegharbi_2151_vm....
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00843652, version 1

Citation

Abdenour Hacine-Gharbi. Sélection de paramètres acoustiques pertinents pour la reconnaissance de la parole. Autre. Université d'Orléans, 2012. Français. ⟨NNT : 2012ORLE2080⟩. ⟨tel-00843652⟩

Share

Metrics

Record views

1544

Files downloads

7557