Méthodes régularisées pour l’analyse de données multivariées en grande dimension : théorie et applications.


Date
Jan 10, 2020 10:30 AM
Event
Séminiare MIAT
Location
INRA Toulouse, France

In this presentation we study general linear model (multivariate linear model) in high dimensional settings. We propose a novel variable selection approach in the framework of multivariate linear models taking into account the dependence that may exist between the responses. It consists in estimating beforehand the co- variance matrix of the responses and to plug this estimator in a Lasso criterion, in order to obtain a sparse estimator of the coefficient matrix. The properties of our ap- proach are investigated both from a theoretical and a numerical point of view. More precisely, we give general conditions that the estimators of the covariance matrix and its inverse have to satisfy in order to recover the positions of the zero and non-zero entries of the coefficient matrix when the number of responses is not fixed and can tend to infinity. We also propose novel, efficient and fully data-driven approaches for estimating Toeplitz and large block structured sparse covariance matrices in the case where the number of variables is much larger than the number of samples wi- thout limiting ourselves to block diagonal matrices. These approaches are applied to different biological issues in metabolomics, in proteomics and in immunology.

Avatar
Marie Perrot-Dockès
Maitre de conférence (Assistant professor)

My research interests include variable selection and post hoc bound on the number false discoveries in high dimensional frameworks.