A. Abeillé, Guide des annotateurs - annotation fonctionnelle, 2004.

A. Abeillé, L. Clément, and F. Toussenel, Building a Treebank for French, 2003.
DOI : 10.1007/978-94-010-0201-1_10

A. Abeillé, F. Toussenel, and M. Chéradame, Corpus Le Monde - annotations en constituants - guide pour les correcteurs, 2004.

A. Abeillé and N. Barrier, Enriching a French treebank, LREC, 2004.

A. Abeillé and L. Clément, Annotation morpho-syntaxique, 2006.

C. Adam, Voisinage lexical pour l'analyse du discours, 2012.
URL : https://hal.archives-ouvertes.fr/tel-00784531

E. Alpaydin, Introduction to machine learning, 2004.

E. Henestroza Anguiano, Efficient Large-Context Dependency Parsing and Correction with Distributional Lexical Resources, 2013.
URL : https://hal.archives-ouvertes.fr/tel-00860720

M. Baroni, S. Bernardini, A. Ferraresi, and E. Zanchetta, The WaCky wide web: a collection of very large linguistically processed web-crawled corpora, Language Resources and Evaluation, pp.209-226, 2009.

S. Beinfeld and H. Bochner, Comprehensive Yiddish-English Dictionary, 2013.

R. Bisiani, Beam search, Encyclopedia of Artificial Intelligence, pp.1467-1468, 1987.

B. Bohnet and J. Nivre, A transition-based system for joint part-of-speech tagging and labeled non-projective dependency parsing, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp.1455-1465, 2012.

G. Bouma, Normalized (pointwise) mutual information in collocation extraction, Proceedings of the Biennial GSCL Conference, pp.31-40, 2009.

D. Bourigault, Upery: un outil d'analyse distributionnelle étendue pour la construction d'ontologies à partir de corpus, Actes de la 9ème conférence annuelle sur le Traitement Automatique des Langues, pp.75-84, 2002.

D. Bourigault, Un analyseur syntaxique opérationnel: SYNTEX. Habilitation à diriger des recherches en linguistique, 2007.

M. Bras, Le projet teloc: construction d'une base textuelle occitane. Langues et Cité: bulletin de l'observation des pratiques linguistiques, 2006.

P. F. Brown, P. V. deSouza, R. L. Mercer et al., Class-based n-gram models of natural language, Computational Linguistics, vol.18, issue.4, pp.467-479, 1992.

M. Candito and B. Crabbé, Improving generative statistical parsing with semisupervised word clustering, Proceedings of the 11th International Conference on Parsing Technologies, pp.138-141, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00495267

M. Candito and D. Seddah, Effectively long-distance dependencies in French: annotation and parsing evaluation, TLT 11-The 11th International Workshop on Treebanks and Linguistic Theories, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00769625

M. Candito, B. Crabbé, and P. Denis, Statistical French dependency parsing: treebank conversion and first results, Proceedings of the Seventh International Conference on Language Resources and Evaluation, pp.1840-1847, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00495196

M. Candito, J. Nivre, P. Denis, and E. Henestroza Anguiano, Benchmarking of statistical dependency parsers for French, Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp.108-116, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00514815

M. Candito, E. Henestroza Anguiano, and D. Seddah, A word clustering approach to domain adaptation: Effective parsing of biomedical texts, Proceedings of the 12th International Conference on Parsing Technologies, pp.37-42, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00659577

M. Candito, B. Crabbé, and M. Falco, Dépendances syntaxiques de surface pour le français, 2011.

M. Candito and D. Seddah, Le corpus Sequoia: annotation syntaxique et exploitation pour l'adaptation d'analyseur par pont lexical, TALN 2012-19e conférence sur le Traitement Automatique des Langues Naturelles, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00698938

J. Carletta, Assessing agreement on classification tasks: the kappa statistic, Computational linguistics, vol.22, issue.2, pp.249-254, 1996.

E. Charniak, A maximum-entropy-inspired parser, Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference, pp.132-139, 2000.

N. Chomsky, Syntactic structures. Mouton & co., The Hague, 1957.

Y. J. Chu and T. H. Liu, On the shortest arborescence of a directed graph, Science Sinica, vol.14, pp.1396-1400, 1965.

K. Church, A pendulum swung too far, Linguistic Issues in Language Technology, 2011.

M. Collins, Head-driven statistical methods for natural language parsing, 1999.

M. Collins, Discriminative training methods for hidden Markov models, Proceedings of the ACL-02 conference on Empirical methods in natural language processing, EMNLP '02, pp.1-8, 2002.
DOI : 10.3115/1118693.1118694

M. Collins, Head-Driven Statistical Models for Natural Language Parsing, Computational Linguistics, vol.29, issue.4, pp.589-637, 2003.
DOI : 10.1162/089120103322753356

M. Cori and J. Léon, La constitution du TAL : Étude historique des dénominations et des concepts, TAL, vol.43, issue.3, pp.21-55, 2002.

B. Crabbé and M. Candito, Expériences d'analyses syntaxique statistique du français, TALN 2008-conférence sur le Traitement Automatique des Langues Naturelles. ATALA, 2008.

J. N. Darroch and D. Ratcliff, Generalized iterative scaling for log-linear models, The Annals of Mathematical Statistics, pp.1470-1480, 1972.

É. de La Clergerie, B. Sagot, L. Nicolas, and M. Guénot, FRMG: évolutions d'un analyseur syntaxique TAG du français, Journée de l'ATALA sur: Quels analyseurs syntaxiques pour le français? ATALA, 2009.

P. Denis and B. Sagot, Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging, Language Resources and Evaluation, vol.46, issue.4, pp.721-736, 2012.
DOI : 10.1007/s10579-012-9193-0
URL : https://hal.archives-ouvertes.fr/inria-00614819

J. M. Eisner, Three new probabilistic models for dependency parsing: An exploration, Proceedings of the 16th conference on Computational linguistics, pp.340-345, 1996.

C. Fabre, Affinités syntaxiques et sémantiques entre mots: apports mutuels de la linguistique et du TAL. Habilitation à diriger des recherches en linguistique, 2010.

C. Fabre, J. Rebeyrolle, and L. Ho-Dac, Examen du statut des syntagmes prépositionnels à la lumière de données issues de corpus annotés, Congrès Mondial de Linguistique Française 2008, pp.2484-2494, 2008.
DOI : 10.1051/cmlf08227
URL : https://hal.archives-ouvertes.fr/hal-00559912/file/Fabre-et-alCMLF08.pdf

Y. Goldberg and J. Orwant, A dataset of syntactic-ngrams over time from a very large corpus of english books, Second Joint Conference on Lexical and Computational Semantics (*SEM) Proceedings of the Main Conference and the Shared Task, pp.241-247, 2013.

K. Hall, R. McDonald, J. Katz-Brown, and M. Ringgaard, Training dependency parsers by jointly optimizing multiple objectives, Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp.1489-1499, 2011.

C. Ho and C. Lin, Large-scale linear support vector regression, Journal of Machine Learning Research, vol.13, pp.3323-3348, 2012.

M. Jacques, Que : la valse des étiquettes, Actes de la 12ème conférence sur le Traitement Automatique des Langues Naturelles (TALN'2005), pp.133-142, 2005.

E. T. Jaynes, Information theory and statistical mechanics, p.620, 1957.

T. Joachims, Training linear SVMs in linear time, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '06, pp.217-226, 2006.
DOI : 10.1145/1150402.1150429

R. Johansson and P. Nugues, Investigating multilingual dependency parsing, Proceedings of the Tenth Conference on Computational Natural Language Learning, CoNLL-X '06, pp.206-210, 2006.
DOI : 10.3115/1596276.1596315

R. Johansson and P. Nugues, Incremental dependency parsing using online learning, Proceedings of the CoNLL/EMNLP, pp.1134-1138, 2007.

J. Judge, A. Cahill, and J. van Genabith, QuestionBank, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, ACL '06, pp.497-504, 2006.
DOI : 10.3115/1220175.1220238

D. Kawahara and K. Uchimoto, Learning reliability of parses for domain adaptation of dependency parsing, p.8, 2008.

S. Kübler, R. McDonald, and J. Nivre, Dependency Parsing, Synthesis Lectures on Human Language Technologies, vol.2, issue.1, 2009.
DOI : 10.2200/S00169ED1V01Y200901HLT002

D. Lin, An information-theoretic definition of similarity, ICML, pp.296-304, 1998.

E. N. Lorenz, Deterministic nonperiodic flow, Journal of the Atmospheric Sciences, vol.20, issue.2, pp.130-141, 1963.

D. McClosky, E. Charniak, and M. Johnson, Reranking and self-training for parser adaptation, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, ACL '06, pp.337-344, 2006.
DOI : 10.3115/1220175.1220218

D. McClosky, E. Charniak, and M. Johnson, When is self-training effective for parsing?, Proceedings of the 22nd International Conference on Computational Linguistics, COLING '08, pp.561-568, 2008.
DOI : 10.3115/1599081.1599152

R. McDonald and F. Pereira, Online learning of approximate dependency parsing algorithms, Proceedings of EACL, pp.81-88, 2006.

R. McDonald, F. Pereira, K. Ribarov, and J. Hajič, Non-projective dependency parsing using spanning tree algorithms, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT '05, pp.523-530, 2005.
DOI : 10.3115/1220575.1220641

Q. McNemar, Note on the sampling error of the difference between correlated proportions or percentages, Psychometrika, vol.12, issue.2, pp.153-157, 1947.
DOI : 10.1007/BF02295996

S. A. Mirroshandel, A. Nasr, and J. Le Roux, Semi-supervised dependency parsing using lexical affinities, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, pp.777-785, 2012.

S. A. Mirroshandel, A. Nasr, and B. Sagot, Enforcing subcategorization constraints in a parser using sub-parses recombining, Proceedings of NAACL-HLT, pp.239-247, 2013.

M. Mohri, A. Rostamizadeh, and A. Talwalkar, Foundations of machine learning, 2012.

F. Morlane-Hondère, Une approche linguistique de l'évaluation des ressources extraites par analyse distributionnelle automatique, 2013.

K. P. Murphy, Machine learning: a probabilistic perspective, 2012.

Y. Niborski and B. Vaisbrot, Yidish-frantseyzish verterbukh / Dictionnaire yiddish-français, Bibliothèque Medem, 2002.

J. Nivre, J. Hall, S. Kübler, R. T. McDonald, J. Nilsson et al., The CoNLL 2007 shared task on dependency parsing, EMNLP-CoNLL, pp.915-932, 2007.

J. Nivre, J. Hall, J. Nilsson, A. Chanev, G. Eryigit et al., MaltParser: A language-independent system for data-driven dependency parsing, Natural Language Engineering, vol.13, issue.2, p.95, 2007.
DOI : 10.1017/S1351324906004505

P. Paroubek, I. Robba, A. Vilnat, and C. Ayache, Data, annotations and measures in EASY, the evaluation campaign for parsers of French, Proceedings of the Fifth International Conference on Language Resources and Evaluation, pp.315-320, 2006.

S. Petrov, L. Barrett, R. Thibaux, and D. Klein, Learning accurate, compact, and interpretable tree annotation, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, ACL '06, pp.433-440, 2006.
DOI : 10.3115/1220175.1220230
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.87.6643

S. Petrov, P. Chang, M. Ringgaard, and H. Alshawi, Uptraining for accurate deterministic question parsing, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp.705-713, 2010.

A. Ratnaparkhi, Maximum entropy models for natural language ambiguity resolution, 1998.

F. Rosenblatt, The perceptron: A probabilistic model for information storage and organization in the brain, Psychological Review, vol.65, issue.6, p.386, 1958.
DOI : 10.1037/h0042519

K. Sagae, Self-training without reranking for parser domain adaptation and its impact on semantic role labeling, Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing, pp.37-44, 2010.

K. Sagae and A. Lavie, A best-first probabilistic shift-reduce parser, Proceedings of the COLING/ACL on Main conference poster sessions, pp.691-698, 2006.
DOI : 10.3115/1273073.1273162

K. Sagae and J. Tsujii, Dependency parsing and domain adaptation with LR models and parser ensembles, Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL, pp.1044-1050, 2007.

B. Sagot, The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French, 7th international conference on Language Resources and Evaluation, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00521242

B. Sagot, L. Clément, É. de La Clergerie, and P. Boullier, The Lefff 2 syntactic lexicon for French: architecture, acquisition, use, LREC, pp.1-4, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00413071

F. Sajous, N. Hathout, and B. Calderone, Glàff, un gros lexique à tout faire du français, Actes de la 20e conférence sur le Traitement Automatique des Langues Naturelles (TALN'2013), pp.285-298, 2013.

J. Sinclair, Chapter 1: Corpus and text - basic principles, Developing linguistic corpora: a guide to good practice, 2005.

P. Stenetorp, S. Pyysalo, G. Topić, T. Ohta, S. Ananiadou et al., Brat: a web-based tool for NLP-assisted text annotation, Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp.102-107, 2012.

L. Tanguy, Complexification des données et des techniques en linguistique: contributions du TAL aux solutions et aux problèmes. Habilitation à diriger des recherches en linguistique, 2012.

L. Tanguy and N. Hathout, Webaffix: un outil d'acquisition morphologique dérivationnelle à partir du web, 2002.

L. Tanguy, A. Urieli, B. Calderone, N. Hathout, and F. Sajous, A multitude of linguistically-rich features for authorship attribution, Notebook for PAN at CLEF 2011, 2011.

R. F. Tate, Correlation between a discrete and a continuous variable. Point-biserial correlation, The Annals of Mathematical Statistics, pp.603-607, 1954.

L. Tesnière, Eléments de syntaxe structurale, Editions Klincksieck, 1959.

I. Titov and J. Henderson, A latent variable model for generative dependency parsing, Trends in Parsing Technology, pp.35-55, 2010.

P. D. Turney and P. Pantel, From frequency to meaning: Vector space models of semantics, Journal of Artificial Intelligence Research, vol.37, issue.1, pp.141-188, 2010.

A. Urieli and L. Tanguy, L'apport du faisceau dans l'analyse syntaxique en dépendances par transitions : études de cas avec l'analyseur Talismane, Actes de la 20e conférence sur le Traitement Automatique des Langues Naturelles (TALN'2013), pp.188-201, 2013.

A. Urieli and M. Vergez-Couret, Jochre, océrisation par apprentissage automatique : étude comparée sur le yiddish et l'occitan, Actes de TALARE 2013 : Traitement Automatique des Langues Régionales de France et d'Europe, pp.221-234, 2013.

K. van den Eynde and P. Mertens, Le dictionnaire de valence Dicovalence: manuel d'utilisation, 2006.

V. Vapnik, The Nature of Statistical Learning Theory, 1995.

A. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions on Information Theory, vol.13, issue.2, pp.260-269, 1967.

E. Wehrli, Fips, a "deep" linguistic multilingual parser, Proceedings of the Workshop on Deep Linguistic Processing, DeepLP '07, pp.120-127, 2007.
DOI : 10.3115/1608912.1608931

K. Yoshida, Y. Tsuruoka, Y. Miyao, and J. Tsujii, Ambiguous part-of-speech tagging for improving accuracy and domain portability of syntactic parsers, Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, 2007.

D. H. Younger, Recognition and parsing of context-free languages in time n^3, Information and Control, vol.10, issue.2, pp.189-208, 1967.

Y. Zhang and S. Clark, A tale of two parsers, Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '08, pp.562-571, 2008.
DOI : 10.3115/1613715.1613784

Y. Zhang and S. Clark, Transition-based parsing of the Chinese treebank using a global discriminative model, Proceedings of the 11th International Conference on Parsing Technologies, IWPT '09, pp.162-171, 2009.
DOI : 10.3115/1697236.1697267

Y. Zhang and J. Nivre, Transition-based dependency parsing with rich non-local features, ACL (Short Papers), pp.188-193, 2011.

Y. Zhang and J. Nivre, Analyzing the effect of global learning and beam-search on transition-based dependency parsing, COLING (Posters), pp.1391-1400, 2012.