Introduction de la vision perceptive pour la reconnaissance de la structure de documents

Aurélie Lemaitre Legargeant 1
1 IMADOC - Interprétation et Reconnaissance d’Images et de Documents
UR1 - Université de Rennes 1, INSA Rennes - Institut National des Sciences Appliquées - Rennes, CNRS - Centre National de la Recherche Scientifique : UMR6074
Abstract : Human perceptive vision combines several points of view in order to improve the interpretation of a scene. It is modeled by a physiological component, the perceptive cycle, guided by a psychological aspect, visual attention. This mechanism is the basis of our work on a generic method for document structure recognition. In this context, we propose the formalism of the perceptive layer, together with multiresolution tools, to simulate perceptive vision and visual attention. The result is the perceptive method DMOS-P, an improvement of the existing DMOS method. With this method, complex mechanisms of perceptive cooperation, adapted to each kind of document, can be specified easily, and they improve the recognition of the structure. We highlight a prediction/verification mechanism linked to perceptive vision: at low resolution, hypotheses about the contents are proposed, and these are then verified at a higher resolution. This mechanism simplifies and improves document recognition: for noisy documents, perceptive vision makes it possible to select only the relevant information, whereas for weakly structured documents, it helps to rebuild the structure. We validated this approach on various kinds of structured documents (incoming mail, archive registers, newspapers, etc.), at a large scale (more than 80,000 images), and through an industrial transfer to the company Evodia.
Document type :
Human-Computer Interaction [cs.HC]. INSA de Rennes, 2008. French
Contributor : Aurélie Lemaitre
Submitted on : Friday, December 3, 2010 - 9:32:24 AM
Last modification on : Friday, January 13, 2017 - 2:16:59 PM
Document(s) archived on : Friday, March 4, 2011 - 2:53:02 AM


  • HAL Id : tel-00542490, version 1



Aurélie Lemaitre Legargeant. Introduction de la vision perceptive pour la reconnaissance de la structure de documents. Interface homme-machine [cs.HC]. INSA de Rennes, 2008. Français. <tel-00542490>


