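Aspects textuels de la procédure judiciaire exploitée en analyse criminelle et perspectives pour son traitement automatique (Textual aspects of the judicial procedure used in criminal analysis and perspectives for its automatic processing)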

Abstract : Criminal analysis is an investigation-support discipline practiced within the National Gendarmerie. It draws on the documents compiled in the judicial procedure file (witness interviews, search warrants, expert reports, phone and bank data, etc.) to synthesize the information collected and to propose a new understanding of the facts under examination. Although criminal analysis uses data visualization software (e.g. IBM Analyst's Notebook) to display the hypotheses formulated, the digital and textual handling of the file documents remains entirely manual. Yet criminal analysis relies on entities to formalize its practice. We first present the research context, detailing the practice of criminal analysis and the constitution of judicial procedure files as textual corpora. We then propose perspectives for adapting natural language processing (NLP) and information extraction methods to this case study, including a comparison between the concept of entity in criminal analysis and that of named entity in NLP. This comparison is carried out at both the conceptual and the linguistic levels. A first approach to detecting entities in witness interviews is presented. Finally, since textual genre must be taken into account when applying automatic processing to text, we propose a structuring of the 'legal' textual genre into discourse, genres, and sub-genres, based on a textometric study aimed at characterizing different types of texts (including witness interviews) produced in the field of justice.

Cited literature [185 references]
Contributor : Lucie Gianola
Submitted on : Friday, March 27, 2020 - 7:47:19 PM
Last modification on : Wednesday, October 5, 2022 - 11:12:07 AM
Long-term archiving on : Sunday, June 28, 2020 - 3:30:46 PM


Files produced by the author(s)


  • HAL Id : tel-02522680, version 1


Lucie Gianola. Aspects textuels de la procédure judiciaire exploitée en analyse criminelle et perspectives pour son traitement automatique. Linguistique. Université de Cergy-Pontoise, 2020. Français. ⟨NNT : ⟩. ⟨tel-02522680⟩


