Reconnaissance visuelle robuste par réseaux de neurones dans des scénarios d'exploration robotique. Détecte-moi si tu peux !

Abstract : The main objective of this thesis is visual recognition for a mobile robot in difficult conditions. We are particularly interested in neural networks which present today the best performances in computer vision. We studied the concept of method selection for the classification of 2D images by using a neural network selector to choose the best available classifier given the observed situation. This strategy works when data can be easily partitioned with respect to available classifiers, which is the case when complementary modalities are used. We have therefore used RGB-D data (2.5D) in particular applied to people detection. We propose a combination of independent neural network detectors specific to each modality (color & depth map) based on the same architecture (Faster RCNN). We share intermediate results of the detectors to allow them to complement and improve overall performance in difficult situations (luminosity loss or acquisition noise of the depth map). We are establishing new state of the art scores in the field and propose a more complex and richer data set to the community (ONERA.ROOM). Finally, we made use of the 3D information contained in the RGB-D images through a multi-view method. We have defined a strategy for generating 2D virtual views that are consistent with the 3D structure. For a semantic segmentation task, this approach artificially increases the training data for each RGB-D image and accumulates different predictions during the test. We obtain new reference results on the SUNRGBD and NYUDv2 datasets. All these works allowed us to handle in an original way 2D, 2.5D and 3D robotic data with neural networks. Whether for classification, detection and semantic segmentation, we not only validated our approaches on difficult data sets, but also brought the state of the art to a new level of performance.
Complete list of metadatas

Cited literature [154 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01680372
Contributor : Abes Star <>
Submitted on : Wednesday, January 17, 2018 - 5:35:09 PM
Last modification on : Friday, May 10, 2019 - 1:47:00 PM
Long-term archiving on : Wednesday, May 23, 2018 - 9:30:44 PM

File

65202_GUERRY_2017_archivage.pd...
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01680372, version 2

Citation

Joris Guerry. Reconnaissance visuelle robuste par réseaux de neurones dans des scénarios d'exploration robotique. Détecte-moi si tu peux !. Vision par ordinateur et reconnaissance de formes [cs.CV]. Université Paris-Saclay, 2017. Français. ⟨NNT : 2017SACLX080⟩. ⟨tel-01680372v2⟩

Share

Metrics

Record views

694

Files downloads

433