Applications of large randommatrix to high dimensional statistical signal processing

Gia-Thuy Pham

Résumé

This thesis focuses on statistical problems involving a multivariate time series ${\bf y}_n$ of large dimension $M$ defined as the sum of gaussian white noise temporally and spatially and a useful signal defined as the output of an unknown finite impulse response single input multiple outputs system driven by a deterministic scalar nonobservable sequence. Supposing $({\bf y}_n)_{n=1,...,N}$ is available, a number of existing methods are based on the functionals of empirical covariance matrix $\hat{\bf R}_L$ of $ML$--dimensional vectors $({\bf y}^{(L)}_n)_{n=1,...,N}$ obtained by stacking the vectors $({\bf y}_k)_{k=n,...,n+L-1}$, where $L$ is a relevant parameter. In the case where the number of observations $N$ is much larger than $ML$ the dimension of vectors $({\bf y}^{(L)}_n)_{n=1,...,N}$, $\hat{\bf R}_L$ behaves as its mathematical expectation in the sense of spectral norm. This allows us to study the inference technique based on $\hat{\bf R}_L$ via classical techniques of asymptotic statistics. In this thesis, we interested in the case where $ML$ and $N$ have the same order of magnitude, we call this the asymptotic regimes in which $M$ and $N$ converge towards infinity, such that the ratio $\frac{ML}{N}$ converges towards a strictly positive constant, given that $L$ may scale with $M,N$. To solve the problems in this work, it is necessary to investigate the behaviour of the eigenvalues and eigenvectors of the random matrix $\hat{\bf R}_L$. Taking account of the particular structure of vectors $({\bf y}^{(L)}_n)_{n=1,...,N}$, $\hat{\bf R}_L$ coincides with the Gram matrix of a block-Hankel matrix ${\bf \Sigma }_L$, and this specificity requires the development of appropriate techniques. We interested to the case where the number of coefficients $P$ of the finite impulse response generated the useful signal and the parameter $L$ remain fixed when $M,N$ grow large. As a consequence, the matrix ${\bf \Sigma }_L$ is a finite rank perturbation of block-Hankel matrix ${\bf W}_L$ composed of additive noise. We prove that eigenvalues and eigenvectors of $\hat{\bf R}_L$ behave as if the entries of matrix ${\bf W}_L$ are independent and identically distributed. This allows us to construct detection tests of useful signal based on largest eigenvalues of $\hat{\bf R}_L$ and to develop new estimation strategies of the regularization parameter of the spatio-temporal Wiener filter estimated from a training sequence. This approach is characterized by the asymptotic behaviour of the resolvent of matrix $\hat{\bf R}_L$. We also prove that these results provide consistent subspace estimation methods for source localization using spatial-smoothing scheme. Motivated by the case where $P$ and $L$ may converge towards infinity, we move off somewhat the initial model. We suppose that the matrix ${\bf \Sigma }_L$ is the sum of the block-Hankel random matrix ${\bf W}_L$ with a deterministic matrix without particular structure. Using the approaches based on Stieltjes transform and tools adapted to gaussian noise, we prove that the empirical eigenvalue distribution of $\hat{\bf R}_L$ has deterministic behaviour which we shall describe. Provided $\frac{L^2}{MN}$ converges towards $0$, we do likewise for the elements of the resolvent of $\hat{\bf R}_L$. Finally, we return to the initial model, but we suppose that $P,L$ converge towards infinity with the same rate. In this context, matrix ${\bf \Sigma }_L$ is a matrix whose rank goes to infinity, and thus the techniques employed in chapter 2 are not applicable. Using the results obtained in chapter 3, we establish that when $\frac{L^2}{MN}$ goes to $0$, the elements of the resolvent of $\hat{\bf R}_L$ behave as the elements of a deterministic matrix which coincides with the deterministic equivalent of the resolvent of the information plus noise model in which entries of the noise matrix are independent and identically distributed. In the case where $\frac{L}{M}$ goes to $0$, this allows us to extend the results of chapter 2 related to the determination of the regularization parameter of the spatial-temporal Wiener filter estimated from a training sequence.

Cette thèse porte sur des problèmes de statistiques mettant en jeu une série temporelle multivariable ${\bf y}_n$ de grande dimension $M$ définie comme la somme d'un bruit gaussien blanc temporellement et spatialement et d'un signal utile généré comme la sortie d'un filtre $1$ entrée / $M$ sorties à réponse impulsionnelle finie excité par une séquence déterministe scalaire non observable. Si l'on suppose que ${\bf y}$ est observé entre les instants $1$ et $N$, un bon nombre de techniques existantes sont basées sur des fonctionnelles de la matrice de covariance empirique $\hat{{\bf R}}_L$ des vecteurs de dimensions $ML$ $({\bf y}_n^{(L)})_{n=1, \ldots, N}$ obtenus en empilant les vecteurs ${\bf y}_k$ entre les instants $n$ et $n+L-1$, où $L$ est un paramètre bien choisi. Lorsque l'on est en mesure de collecter un nombre d'observations très nettement plus grand que la dimension $ML$ des vecteurs $({\bf y}_n^{(L)})_{n=1, \ldots, N}$, $\hat{{\bf R}}_L$ a le même comportement en norme spectrale que son espérance mathématique, et cela permet d'étudier les techniques d'inférences basés sur $\hat{{\bf R}}_L$ par le biais de techniques classiques de statistique asymptotique. Dans cette thèse, nous nous intéressons au cas où $ML$ et $N$ sont du même ordre de grandeur, ce que nous modélisons par des régimes asymptotiques dans lesquels $M$ et $N$ tendent tous les deux vers l'infini, et où le rapport $ML/N$ converge vers une constante non nulle, $L$ pouvant aussi croître avec $M$ et $N$. Les problèmes que nous résolvons dans ce travail nécessitent d'étudier le comportement des éléments propres de la grande matrice aléatoire $\hat{{\bf R}}_L$. Compte tenu de la structure particulière des vecteurs $({\bf y}_n^{(L)})_{n=1, \ldots, N}$, $\hat{{\bf R}}_L$ co\"{\i}ncide avec la matrice de Gram d'une matrice Hankel par bloc $\boldsymbol{\Sigma}_L$, et cette spécificité nécessite le développement de techniques appropriées. Dans le chapitre 2, nous nous intéressons au cas où le nombre de coefficients $P$ de la réponse impulsionnelle générant le signal utile et le paramètre $L$ restent fixes quand $M$ et $N$ grandissent. La matrice $\boldsymbol{\Sigma}_L$ est alors une perturbation de rang fini de la matrice Hankel par bloc ${\bf W}_{L}$ constituée à partir du bruit additif. Nous montrons que les éléments propres de $\hat{{\bf R}}_L$ se comportent comme si la matrice ${\bf W}_{L}$ était à éléments indépendants et identiquement distribués. Cela nous permet d'aborder l'étude de tests de détection du signal utile portant sur les plus grandes valeurs propres de $\hat{{\bf R}}_L$ ainsi que la mise en évidence de nouvelles stratégies de détermination du paramètre de régularisation de filtres de Wiener spatio-temporels estimés à partir d'une séquence d'apprentissage. Techniquement, ce dernier point est abordé en caractérisant le comportement asymptotique des éléments de la résolvente de la matrice $\hat{{\bf R}}_L$. Nous montrons enfin également que ces résultats permettent d'analyser le comportement d'algorithmes sous-espace de localisation de sources bande étroite utilisant la technique du lissage spatial. Dans le chapitre 3, motivés par le cas où $P$ et $L$ peuvent tendre vers l'infini, nous nous écartons quelque peu du modèle initial, et supposons que la matrice $\boldsymbol{\Sigma}_{L}$ est la somme de la matrice aléatoire Hankel par bloc ${\bf W}_{L}$ avec une matrice déterministe sans structure particulière. En utilisant des approches basées sur la transformée de Stieltjes et des outils adaptés au caractère gaussien du bruit, nous montrons que la distribution empirique des valeurs propres de $\hat{{\bf R}}_L$ a un comportement déterministe que nous caractérisons. Sous réserve $L^{2}/MN$ tende vers $0$, nous faisons de même pour les éléments de la résolvante de $\hat{{\bf R}}_L$. Dans le chapitre 4, nous revenons au modèle initial, mais supposons que $P$ et $L$ tendent vers l'infini au même rythme. Dans ce contexte, la contribution du signal utile à la matrice $\boldsymbol{\Sigma}_{L}$ est une matrice dont le rang tend vers l'infini, et les techniques utilisées dans le chapitre 2 ne sont plus applicables. En utilisant les résultats du chapitre 3, nous établissons que si $L^{2}/MN$ tende vers $0$, les éléments de la résolvante de $\hat{{\bf R}}_L$ se comportent comme les éléments d'une matrice déterministe qui co\"{\i}ncide avec l'équivalent déterministe de la resolvente d'un modèle information plus bruit dans lequel les éléments de la matrice de bruit sont indépendants et identiquement distribués. Dans le cas où $L/M$ tend vers $0$, ceci nous permet d'étendre les résultats du chapitre 2 relatifs à la détermination du paramètre de régularisation des filtres de Wiener spatio-temporels estimés à partir d'une séquence d'apprentissage.

Applications of large randommatrix to high dimensional statistical signal processing

Applications des grandes matrices aléatoires au traitement du signal de grandes dimensions

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager