Skip to Main content Skip to Navigation
Theses

Aspects textuels de la procédure judiciaire exploitée en analyse criminelle et perspectives pour son traitement automatique

Abstract : Criminal analysis is a discipline that supports investigations practiced within the National Gendarmerie. It is based on the use of the documents compiled in the judicial procedure file (witness interviews, search warrants, expert reports, phone and bank data, etc.) to synthesize the information collected and to propose a new understanding of the facts examined. While criminal analysis uses data visualization software (i. e. IBM Analyst's Notebook) to display the hypotheses formulated, the digital and textual management of the file documents is entirely manual. However, criminal analysis relies on entities to formalize its practice. The presentation of the research context details the practice of criminal analysis as well as the constitution of judicial procedure files as textual corpora. We then propose perspectives for the adaptation of natural language processing (NLP) and information extraction methods to the case study, including a comparison of the concepts of entity in criminal analysis and named entity in NLP. This comparison is done on the conceptual and linguistic plans. A first approach to the detection of entities in witness interviews is presented. Finally, since textual genre is a parameter to be taken into account when applying automatic processing to text, we develop a structure of the 'legal' textual genre into discourse, genres, and sub-genres through a textometric study aimed at characterizing different types of texts (including witness interviews) produced by the field of justice.
Complete list of metadatas

Cited literature [185 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-02522680
Contributor : Lucie Gianola <>
Submitted on : Friday, March 27, 2020 - 7:47:19 PM
Last modification on : Monday, October 19, 2020 - 11:12:01 AM
Long-term archiving on: : Sunday, June 28, 2020 - 3:30:46 PM

File

manuscrit_LucieGianola.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : tel-02522680, version 1

Collections

Citation

Lucie Gianola. Aspects textuels de la procédure judiciaire exploitée en analyse criminelle et perspectives pour son traitement automatique. Linguistique. Université de Cergy-Pontoise, 2020. Français. ⟨tel-02522680⟩

Share

Metrics

Record views

113

Files downloads

332