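Interprétation contextuelle et assistée de fonds d'archives numérisées : application à des registres de ventes du XVIIIe siècle (Contextual and assisted interpretation of digitized archival fonds: application to 18th-century sales registers)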

Joseph Chazalon 1
1 IntuiDoc - intuitive user interaction for document
IRISA-D6 - MEDIA ET INTERACTIONS
Abstract : Fonds, also called historical document collections, are large bodies of digitized documents that are difficult to interpret automatically: the usual approaches require considerable design effort yet still produce many errors, which must be corrected after processing. To cope with these limitations, our work aimed to improve the interpretation process by making use of information extracted from the fonds, or provided by human operators, while keeping page-by-page processing. We proposed a simple extension of a page description language that makes it possible to automatically generate information exchanges between the interpretation process and its environment. A global iterative mechanism progressively brings contextual information to the interpretation process and improves its results. Experiments applying these new tools to the processing of 18th-century documents showed that our proposals were easy to integrate into an existing system, that the system's design remained simple, and that fewer manual corrections were required.
Document type :
Theses

https://tel.archives-ouvertes.fr/tel-00903372
Contributor : Joseph Chazalon
Submitted on : Tuesday, March 5, 2013 - 8:17:32 PM
Last modification on : Friday, November 16, 2018 - 1:38:35 AM
Long-term archiving on : Thursday, June 6, 2013 - 4:01:53 AM

Identifiers

  • HAL Id : tel-00903372, version 2

Citation

Joseph Chazalon. Interprétation contextuelle et assistée de fonds d'archives numérisées : application à des registres de ventes du XVIIIe siècle. Other [cs.OH]. INSA de Rennes, 2013. French. ⟨NNT : 2013ISAR0001⟩. ⟨tel-00903372v2⟩
