Contribution à l'analyse complexe de documents anciens, application aux lettrines

Abstract : As part of cultural heritage preservation campaigns, many digitization projects are under way in France and across Europe to preserve the contents of thousands of ancient documents. Historians use the resulting images to trace the history of books. This thesis was carried out within the Navidomass project (ANR-06-MDCA-012), which aims to promote the written heritage of the Renaissance by making it possible to identify the images it contains. We focus on graphical images, and more specifically on dropcaps. These graphical images, which appeared with the beginnings of printing, are complex images made of strokes that can be seen as a combination of several layers of information. To analyze them, we propose an ontological model for the complex analysis of images of old documents. This model integrates the domain knowledge of historians and the knowledge extracted by image processing into a single database. Because of the complex nature of these images, the usual methods of image analysis and automatic knowledge extraction perform poorly on them. We therefore propose a new approach for analyzing images of old documents that characterizes them on the basis of their features. The approach first simplifies each image by separating it into different layers of information (shapes and lines). For each layer, we then extract patterns that are used to describe the images. Images are thus described by bags of the most common patterns and by bags of strokes. For these two layers of information, we also extract region graphs, which capture more structural knowledge about the images. The resulting richer description is then inserted into the knowledge base to support complex queries. The purpose of this database is to let users query either by example or by specific features of the images.
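The bag-of-patterns description and the query-by-example use case mentioned in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the pattern labels (plain strings standing in for patterns already extracted from one layer), and the use of cosine similarity for ranking are hypothetical and are not taken from the thesis.

```python
from collections import Counter
from math import sqrt


def bag_of_patterns(patterns, vocabulary):
    """Count how often each vocabulary pattern occurs in one image layer.

    `patterns` is a list of pattern labels assumed to be already extracted;
    labels outside the vocabulary are ignored.
    """
    counts = Counter(p for p in patterns if p in vocabulary)
    return [counts[v] for v in vocabulary]


def cosine_similarity(a, b):
    """Cosine similarity between two count vectors (0.0 if either is empty)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def query_by_example(query_patterns, database, vocabulary):
    """Rank database images by similarity of their bags to the query's bag.

    `database` maps an image name to its list of pattern labels.
    Ties are broken alphabetically so the ranking is deterministic.
    """
    query_bag = bag_of_patterns(query_patterns, vocabulary)
    scored = [
        (cosine_similarity(query_bag, bag_of_patterns(p, vocabulary)), name)
        for name, p in database.items()
    ]
    return [name for score, name in sorted(scored, key=lambda t: (-t[0], t[1]))]


# Hypothetical vocabulary and images, for illustration only.
vocabulary = ["arc", "line", "dot"]
database = {
    "A": ["arc", "arc", "line"],
    "B": ["dot", "dot"],
    "C": ["line", "line", "arc"],
}
ranking = query_by_example(["arc", "arc", "line"], database, vocabulary)
```

In this sketch each image is reduced to a count vector over a fixed vocabulary, so two images can be compared without any structural information; the region graphs described in the abstract would add that structural information and are not modelled here.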

Cited literature : 170 references
Contributor : ABES STAR
Submitted on : Friday, April 27, 2012 - 12:47:35 PM
Last modification on : Friday, June 3, 2022 - 10:24:29 AM
Long-term archiving on : Saturday, July 28, 2012 - 3:10:25 AM


Version validated by the jury (STAR)


  • HAL Id : tel-00691922, version 1



Mickaël Coustaty. Contribution à l'analyse complexe de documents anciens, application aux lettrines. Other [cs.OH]. Université de La Rochelle, 2011. French. ⟨NNT : 2011LAROS333⟩. ⟨tel-00691922⟩


