
Extraction of Common Motifs from a Set of Sequences.
Application to the identification of protein-binding sites in primary DNA sequences.

Alban Mancheron 1
1 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : The extraction of biologically significant patterns, and in particular the identification of sites that regulate protein synthesis in primary DNA sequences, is one of the major open problems in bioinformatics today. Any anomaly in the regulation of protein synthesis can be detrimental to the organisms affected. Locating these sites helps us better understand how cells operate, and may even help prevent or cure disease.

The difficulty lies in the lack of information about the patterns to be extracted and in the large volume of data to mine. In this dissertation, we introduce two polynomial-time algorithms, one deterministic and one probabilistic, to address the pattern extraction problem. We introduce a new family of score functions and study their statistical properties. We also characterize the language recognized by the index structure named "Oracle", and we modify this structure to make it more efficient.
Document type :
Theses

https://tel.archives-ouvertes.fr/tel-00257587
Contributor : Alban Mancheron
Submitted on : Tuesday, February 19, 2008 - 4:57:06 PM
Last modification on : Thursday, May 24, 2018 - 3:59:22 PM
Long-term archiving on : Thursday, May 20, 2010 - 10:52:05 PM

Identifiers

  • HAL Id : tel-00257587, version 1

Citation

Alban Mancheron. Extraction de Motifs Communs dans un Ensemble de Séquences.
Application à l'identification de sites de liaison aux protéines dans les séquences primaires d'ADN. Other [cs.OH]. Université de Nantes, 2006. French. ⟨tel-00257587⟩
