Structured prediction for sequential data

Rémi Lajugie 1, 2
2 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique de l'École normale supérieure, ENS Paris - École normale supérieure - Paris, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract : In this manuscript, we consider structured machine learning problems and consider more precisely the ones involving sequential structure. In a first part, we consider the problem of similarity measure learning for two tasks where sequential structure is at stake: (i) the multivariate change-point detection and (ii) the time warping of pairs of time series. The methods generally used to solve these tasks rely on a similarity measure to compare timestamps. We propose to learn a similarity measure from fully labelled data, i.e., signals already segmented or pairs of signals for which the optimal time warping is known. Using standard structured prediction methods, we present algorithmically efficient ways for learning. We propose to use loss functions specifically designed for the tasks. We validate our approach on real-world data. In a second part, we focus on the problem of weak supervision, in which sequen tial data are not totally labeled. We conduct our study on the problem of aligning a time series to its symbolic representation, using as a leading example the problem of aligning an audio recording with the score. We consider the symbolic representation as two fold: (i) a complete information about the order of events or notes played and (ii) an approximate idea about the expected shape of the alignment. We propose to learn a classifier for each note using these two kinds of information. Our learning problem is based on the optimization of a convex function that takes advantage of the weak supervision and of the sequential structure of data. Our approach is validated through experiments on the task of audio-to-score on real musical data.
Document type :
Theses
Complete list of metadatas

Cited literature [245 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01203438
Contributor : Rémi Lajugie <>
Submitted on : Wednesday, September 23, 2015 - 9:38:14 AM
Last modification on : Friday, May 25, 2018 - 12:02:06 PM
Long-term archiving on: Tuesday, December 29, 2015 - 9:29:41 AM

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : tel-01203438, version 1

Citation

Rémi Lajugie. Structured prediction for sequential data. Machine Learning [cs.LG]. Ecole Normale Supérieure, 2015. English. ⟨tel-01203438v1⟩

Share

Metrics

Record views

1030

Files downloads

747