Out-of-distribution Generalization in Deep Learning : Classification and Spatiotemporal Forecasting

Matthieu Kirchmeyer

Theses Year : 2023

Out-of-distribution Generalization in Deep Learning : Classification and Spatiotemporal Forecasting

Généralisation hors-distribution en apprentissage profond : classification et prévision spatiotemporelle

(1)

Matthieu Kirchmeyer

Function : Author
PersonId : 1114497
ORCID : 0000-0003-3750-4141
IdRef : 27039866X

Institut des Systèmes Intelligents et de Robotique

Abstract

Deep learning has emerged as a powerful approach for modelling static data like images and more recently for modelling dynamical systems like those underlying times series, videos or physical phenomena. Yet, neural networks were observed to not generalize well outside the training distribution, in other words out-of-distribution. This lack of generalization limits the deployment of deep learning in autonomous systems or online production pipelines, which are faced with constantly evolving data. In this thesis, we design new strategies for out-of-distribution generalization. These strategies handle the specific challenges posed by two main application tasks, classification of static data and spatiotemporal dynamics forecasting. The first two parts of this thesis consider the classification problem. We first investigate how we can efficiently leverage some observed training data from a target domain for adaptation. We then explore how to generalize to unobserved domains without access to such data. The last part of this thesis handles various generalization problems specific to spatiotemporal forecasting.

L’apprentissage profond a émergé comme une approche puissante pour la modélisation de données statiques comme les images et, plus récemment, pour la modélisation de systèmes dynamiques comme ceux sous-jacents aux séries temporelles, aux vidéos ou aux phénomènes physiques. Cependant, les réseaux neuronaux ne généralisent pas bien en dehors de la distribution d’apprentissage, en d’autres termes, hors-distribution. Ceci limite le déploiement de l’apprentissage profond dans les systèmes autonomes ou les systèmes de production en ligne, qui sont confrontés à des données en constante évolution. Dans cette thèse, nous concevons de nouvelles stratégies d’apprentissage pour la généralisation hors-distribution. Celles-ci tiennent compte des défis spécifiques posés par deux tâches d’application principales, la classification de données statiques et la prévision de dynamiques spatiotemporelles. Les deux premières parties de cette thèse étudient la classification. Nous présentons d’abord comment utiliser des données d’entraînement en quantité limitée d’un domaine cible pour l’adaptation. Nous explorons ensuite comment généraliser à des domaines non observés sans accès à de telles données. La dernière partie de cette thèse présente diverses tâches de généralisation, spécifiques à la prévision spatiotemporelle.

Keywords

Deep learning Out-of-distribution generalization Classification and spatiotemporal forecasting

Apprentissage profond Généralisation Classification Spatiotemporel Équations différentielles Physique

Domains

Computer Vision and Pattern Recognition [cs.CV] Artificial Intelligence [cs.AI] Machine Learning [cs.LG]

Fichier principal

KIRCHMEYER_Matthieu_these_2023.pdf (21.41 Mo)

Origin	Version validated by the jury (STAR)

ABES STAR : Contact

https://theses.hal.science/tel-04139066

Submitted on : Friday, June 23, 2023-1:41:24 PM

Last modification on : Friday, May 24, 2024-4:37:40 AM

Long-term archiving on: Sunday, September 24, 2023-7:53:47 PM

Dates and versions

tel-04139066 , version 1 (23-06-2023)

Identifiers

HAL Id : tel-04139066 , version 1

Cite

Matthieu Kirchmeyer. Out-of-distribution Generalization in Deep Learning : Classification and Spatiotemporal Forecasting. Computer Vision and Pattern Recognition [cs.CV]. Sorbonne Université, 2023. English. ⟨NNT : 2023SORUS080⟩. ⟨tel-04139066⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS STAR ISIR SORBONNE-UNIVERSITE THESES-SU SU-SCIENCES

134 View

60 Download

Out-of-distribution Generalization in Deep Learning : Classification and Spatiotemporal Forecasting

Généralisation hors-distribution en apprentissage profond : classification et prévision spatiotemporelle

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share