Skip to Main content Skip to Navigation

Semantic segmentation of 3D medical images with deep learning

Abstract : Deep Learning has recently shown impressive results in computer vision. Especially with theConvolutional Neural Networks (ConvNets) which have redefined the state of the art in many applications such asmedical image segmentation. In this thesis we address problems in the task of abdominal organ segmentation usingdeep learning models. In the first part, we address the issue of training deep ConvNets on partially labeled data.Professionals often focus on specific anatomical regions leading to heterogeneous datasets with partially labeledimages. Training a model directly on such data leads to very poor results. Thus, we propose a training schemethat leverages all the labels without being affected by the missing ones. Moreover, an iterative scheme relabelsthe missing organs of the training set which further improves the segmentation model. The second part aims atusing spatial prior about the position of the organs to improve the detection of structures and reduce outliersin the segmentation. ConvNets by construction, does not capture absolute spatial information. However, medicalimages are very structured and there are conventions about the expected position of organs. Thus, we propose a 3Dspatial prior that captures the spatial position of organs and then explicitly biases the model through a prior-drivenactivation function. Finally, we propose to use Transformers to model long range dependencies between anatomicalstructures in a segmentation model used for organ segmentation. ConvNets do not capture such interactionsbecause of the receptive field which is often limited. Using dense attention introduced in Transformers allows toconnect every pixel with each other and thus to model complex interactions on different parts of the input image.We propose U-Transformer and show that it improves the quality of the segmentation on various datasets.
Document type :
Complete list of metadata
Contributor : ABES STAR :  Contact
Submitted on : Thursday, June 2, 2022 - 2:04:13 PM
Last modification on : Saturday, June 25, 2022 - 3:32:55 AM


Version validated by the jury (STAR)


  • HAL Id : tel-03685889, version 1



Olivier Petit. Semantic segmentation of 3D medical images with deep learning. Medical Imaging. HESAM Université, 2021. English. ⟨NNT : 2021HESAC042⟩. ⟨tel-03685889⟩



Record views


Files downloads