Mise au point d'un formalisme syntaxique de haut niveau pour le traitement automatique des langues

Abstract : The goal of computational linguistics is to provide a formal account of linguistic knowledge and to produce algorithmic tools for natural language processing. Often, this is done in a so-called generative framework, where grammars describe sets of valid sentences by iteratively applying a set of rewrite rules. Another approach, based on model theory, instead describes grammaticality as a set of logical well-formedness constraints, relying on deep links between logic and automata in order to produce efficient parsers. This thesis favors the latter approach. Making use of several existing results in theoretical computer science, we propose a tool for linguistic description that is both expressive and designed to facilitate grammar engineering. It first tackles the abstract structure of sentences, providing a logical language based on lexical properties of words in order to concisely describe the set of grammatically valid sentences. It then draws the link between these abstract structures and their representations (both in syntax and semantics), through the use of linearization rules that rely on logic and lambda-calculus. Then, in order to validate this proposal, we use it to model various linguistic phenomena, ending with a specific focus on languages that exhibit free word order (that is, sentences which allow the free reordering of some of their words or syntagms while keeping their meaning), and on their algorithmic complexity.
Document type :
Theses

https://tel.archives-ouvertes.fr/tel-01267716
Contributor : Abes Star
Submitted on : Thursday, February 4, 2016 - 5:54:06 PM
Last modification on : Thursday, January 11, 2018 - 6:20:17 AM
Long-term archiving on : Saturday, November 12, 2016 - 9:58:30 AM

File

KIRMAN_JEROME_2015.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01267716, version 1

Citation

Jerome Kirman. Mise au point d'un formalisme syntaxique de haut niveau pour le traitement automatique des langues. Computer Science. Université de Bordeaux, 2015. French. ⟨NNT : 2015BORD0330⟩. ⟨tel-01267716⟩
