. Pour-une-telle-tâche-d-'alignement and . Beaufort, définissent le mot comme la plus longue séquence qui ne possède pas le même séparateur de part et d'autre de l'alignement, 2010.

I. Bayoudh, N. Béchet, and M. Roche, Blog Classification: Adding Linguistic Knowledge to Improve the K-NN Algorithm, Proceedings of Intelligent Information Processing (IIP), pp.68-77, 2008.
DOI : 10.1007/978-0-387-87685-6_10

URL : https://hal.archives-ouvertes.fr/lirmm-00336580

N. Béchet, J. Chauché, V. Prince, and M. Roche, Corpus and Web : Two Allies in Building and Automatically Expanding Conceptual Classes, Informatica, vol.34, issue.3, pp.279-286, 2010.

N. Béchet, M. Roche, and J. Chauché, How the ExpLSA approach impacts the document classification tasks, 2008 Third International Conference on Digital Information Management, pp.241-246, 2008.
DOI : 10.1109/ICDIM.2008.4746814

N. Béchet, M. Roche, and J. Chauché, Comment valider automatiquement des relations syntaxiques induites, Actes de la conférence Extraction et Gestion des Connaissances (EGC), article nominé parmi les meilleurs articles académiques d'EGC'09, pp.169-180, 2009.

N. Béchet, M. Roche, and J. Chauché, Towards the Selection of Induced Syntactic Relations, Proceedings of 31st European Conference on Information Retrieval (ECIR), pp.786-790, 2009.
DOI : 10.1007/3-540-44795-4_42

Z. Bellahsene, S. Benbernou, H. Jaudoin, F. Pinet, O. Pivert et al., FORUM, ACM SIGMOD Record, vol.39, issue.2, pp.11-18, 2010.
DOI : 10.1145/1893173.1893175

URL : https://hal.archives-ouvertes.fr/lirmm-00588571

S. Bringay, N. Béchet, F. Bouillot, P. Poncelet, M. Roche et al., Towards an On-Line Analysis of Tweets Processing, Proceedings of Database and Expert Systems Applications (DEXA), pp.154-161, 2011.
DOI : 10.1145/361219.361220

URL : https://hal.archives-ouvertes.fr/hal-00636285

F. Duchateau, Z. Bellahsene, and M. Roche, A context-based measure for discovering approximate semantic matching between schema elements, Proceedings of IEEE Research Challenges in Information Science (RCIS), pp.9-20, 2007.
URL : https://hal.archives-ouvertes.fr/lirmm-00113849

F. Duchateau, Z. Bellahsene, and M. Roche, Improving quality and performance of schema matching in large scale, Ingénierie des Systèmes d'Information (ISI), pp.59-82, 2008.
DOI : 10.3166/isi.13.5.59-82

URL : https://hal.archives-ouvertes.fr/lirmm-00343491

A. Harb, M. Plantié, M. Roche, G. Dray, F. Trousset et al., D??tection d'opinion. Comment d??terminer les adjectifs d'opinion d'un domaine donn??, Document num??rique, vol.11, issue.1-2, pp.37-61, 2008.
DOI : 10.3166/dn.11.1-2.37-61

R. Kessler, N. Béchet, J. Moreno, M. Roche, and M. El-bèze, Job Offer Management: How Improve the Ranking of Candidates, Proceedings of Foundations of Intelligent Systems (ISMIS), pp.431-441, 2009.
DOI : 10.1007/978-3-540-88875-8_86

URL : https://hal.archives-ouvertes.fr/lirmm-00537101

S. Laroum, N. Béchet, H. Hamza, and M. Roche, Classification automatique de documents bruités à faible contenu textuel. Numéro spécial de la revue RNTI, 2010.

D. Li, A. Laurent, P. Poncelet, and M. Roche, Extraction of unexpected sentences : A sentiment classification assessed approach, Intelligent Data Analysis, vol.14, issue.1, pp.31-46, 2010.
URL : https://hal.archives-ouvertes.fr/lirmm-00401363

C. Lopez, V. Prince, and M. Roche, Automatic titling of electronic documents with noun phrase extraction, 2010 International Conference of Soft Computing and Pattern Recognition, 2010.
DOI : 10.1109/SOCPAR.2010.5686088

URL : https://hal.archives-ouvertes.fr/lirmm-00563903

C. Lopez, V. Prince, and M. Roche, Automatic generation approach of short titles, Proceedings of Language and Technology Conference (LTC), 2011.
URL : https://hal.archives-ouvertes.fr/lirmm-00651571

C. Lopez, V. Prince, and M. Roche, Recherche documentaire par titrage automatique, Actes d'INFORSID, pp.217-232, 2011.
URL : https://hal.archives-ouvertes.fr/lirmm-00637968

C. Lopez and M. Roche, Approche de construction automatique de titres courts par des méthodes de fouille du web, Actes de Traitement Automatique des Langues Naturelles (TALN), pp.39-50, 2011.

A. Mela, M. Roche, and M. A. Bekhtaoui, Mixer les moyens pour extraire les gloses, Actes de la conférence Extraction et Gestion des Connaissances (EGC), pp.95-106, 2011.
URL : https://hal.archives-ouvertes.fr/lirmm-00588524

M. Plantié, M. Roche, G. Dray, and P. Poncelet, Is a Voting Approach Accurate for Opinion Mining?, Proceedings of Data Warehousing and Knowledge Discovery (DaWaK), pp.413-422, 2008.
DOI : 10.1007/978-3-540-85836-2_39

V. Prince and M. Roche, Information Retrieval in Biomedicine : Natural Language Processing for Knowledge Integration, Medical Information Science Reference, IGI Gobal, p.460, 2009.
DOI : 10.4018/978-1-60566-274-9

M. Roche, Intégration de la construction de la terminologie de domaines spécialisés dans un processus global de fouille de textes, 2004.

M. Roche, How statistical information from the web can help identify named entities, Proceedings of International Conference on Web Information Systems (WEBIST), Session Web and Text Mining, 2011.
URL : https://hal.archives-ouvertes.fr/lirmm-00588581

M. Roche and Y. Kodratoff, Pruning Terminology Extracted from a Specialized Corpus for CV Ontology Acquisition, Proceedings of onToContent Workshop - OTM'06, pp.1107-1116, 2006.
DOI : 10.1007/11915072_13

URL : https://hal.archives-ouvertes.fr/lirmm-00113165

M. Roche and Y. Kodratoff, Text and web mining approaches in order to build specialized ontologies, Journal of Digital Information, vol.10, issue.4, 2009.
URL : https://hal.archives-ouvertes.fr/lirmm-00424463

M. Roche and V. Prince, AcroDef: A Quality Measure for Discriminating Expansions of Ambiguous Acronyms, Proceedings of CONTEXT, LNCS, pp.411-424, 2007.
DOI : 10.1007/978-3-540-74255-5_31

URL : https://hal.archives-ouvertes.fr/lirmm-00168945

M. Roche and V. Prince, Managing the Acronym/Expansion Identification Process for Text-Mining Applications, International Journal of Software and Informatics, vol.2, issue.2, pp.163-179, 2008.
URL : https://hal.archives-ouvertes.fr/lirmm-00349235

M. Roche and V. Prince, A web-mining approach to disambiguate biomedical acronym expansions, Informatica, vol.34, issue.2, pp.243-253, 2010.
URL : https://hal.archives-ouvertes.fr/lirmm-00487536

B. Rosoor, L. Sebag, S. Bringay, and M. Roche, Quand un tweet détecte une catastrophe naturelle, Actes de la conférence Veille Stratégique Scientifique et Technologique (VSST), pp.283-286, 2010.

A. Sallaberry, N. Pecheur, S. Bringay, M. Roche, and M. Teisseire, Sequential patterns mining and gene sequence visualization to discover novelty from microarray data, Journal of Biomedical Informatics, vol.44, issue.5, pp.44760-774, 2011.
DOI : 10.1016/j.jbi.2011.04.002

URL : https://hal.archives-ouvertes.fr/hal-00625539

H. Saneifar, S. Bonniol, A. Laurent, P. Poncelet, and M. Roche, Recherche de passages pertinents dans les fichiers logs par enrichissement de requêtes, Actes des Journées Francophones sur les Ontologies, 2009.

H. Saneifar, S. Bonniol, A. Laurent, P. Poncelet, and M. Roche, Terminology Extraction from Log Files, Proceedings of Database and Expert Systems Applications (DEXA), pp.769-776, 2009.
DOI : 10.1007/978-3-540-24775-3_79

URL : https://hal.archives-ouvertes.fr/lirmm-00423940

H. Saneifar, S. Bonniol, A. Laurent, P. Poncelet, and M. Roche, Passage Retrieval in Log Files: An Approach Based on Query Enrichment, Proceedings of Advances in Natural Language Processing (IceTAL), pp.357-368, 2010.
DOI : 10.1007/978-3-642-14770-8_39

URL : https://hal.archives-ouvertes.fr/lirmm-00816291

H. Saneifar, S. Bonniol, A. Laurent, P. Poncelet, and M. Roche, Recherche de passages pertinents dans les fichiers logs par enrichissement de requêtes, Actes de la COnférence en Recherche d'Infomations et Applications (CORIA), pp.239-254, 2010.

H. Saneifar, S. Bonniol, P. Poncelet, and M. Roche, Identification des divisions logiques de fichiers logs, Actes des Rencontres de la Société Francophone de Classification (SFC), 2011.
URL : https://hal.archives-ouvertes.fr/lirmm-00723590

C. Serp, E. Cazal, A. Laurent, and M. Roche, Tervotiq : un système de vote pour l'extraction de la terminologie d'un corpus en français médiéval, Proceedings of Journées internationales d'Analyse statistique des Données Textuelles (JADT), pp.1069-1080, 2008.

. Bibliographie, S. Agrawal, R. Agrawal, and R. Srikant, Fast algorithms for mining association rules in large databases, p.94, 1994.

B. Andreevskaia, A. Andreevskaia, and S. Bergler, Semantic tag extraction from wordnet glosses, Proceedings of LREC-06, the 5th Conference on Language Resources and Evaluation, 2006.

L. Audibert, étude des critères de désambiguïsation sémantique automatique : résultats sur les cooccurrences, Actes de la conférence Traitement Automatique des Langues Naturelles (TALN), pp.33-44, 2003.

N. Aussenac-gilles and D. Bourigault, Construction d'ontologies à partir de textes, Actes de Traitement Automatique des Langues Naturelles (TALN), pp.27-47, 2003.

N. Aussenac-gilles and M. Jacques, Designing and Evaluating Patterns for Relation Acquisition from Texts with CAMELEON. Terminology, Pattern-Based approaches to Semantic Relations, pp.45-73, 2008.

J. Azé, Extraction de Connaissances dans des Données Numériques et Textuelles, Thèse de Doctorat, 2003.

B. Baxendale, Man-made index for technical literature -an experiment, IBM Journal of Research and Development, pp.354-361, 1958.

. Beaufort, A hybrid rule/model-based finite-state framework for normalizing sms messages, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pp.770-779, 2010.

D. Bourigault, Analyse syntaxique locale pour le repérage de termes complexes dans un texte, pp.105-118, 1993.

D. Bourigault, UPERY : un outil d'analyse distributionnelle étendue pour la construction d'ontologies à partir de corpus, Actes de Traitement Automatique des Langues Naturelles (TALN), pp.75-84, 2002.

. Bourigault, Construction de ressources terminologiques ou ontologiques ?? partir de textes Un cadre unificateur pour trois ??tudes de cas, Revue d'intelligence artificielle, vol.18, issue.1, pp.87-110, 2004.
DOI : 10.3166/ria.18.87-110

. Bourigault, . Fabre, D. Bourigault, and C. Fabre, Approche linguistique pour l'analyse syntaxique de corpus, pp.131-151, 2000.

. Bourigault, . Jacquemin, D. Bourigault, and C. Jacquemin, Term extraction + term clustering, Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics -, pp.15-22, 1999.
DOI : 10.3115/977035.977039

E. Brill, Some advances in transformation-based part of speech tagging, Conference on Artificial Intelligence (AAAI), pp.722-727, 1994.

D. Brody, S. Brody, and N. Diakopoulos, Using Word Lengthening to Detect Sentiment in Microblogs, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.562-570, 2011.

C. Brun and C. Hagège, Intertwining Deep Syntactic Processing and Named Entity Detection, Proceedings of Advances in Natural Language Processing, 4th International Conference (EsTAL), pp.195-206, 2004.
DOI : 10.1007/978-3-540-30228-5_18

. Cacheda, Performance evaluation of large-scale information retrieval systems scaling down, Proceedings of the International Workshop on Large-Scale and Distributed Systems for Information Retrieval ? SIGIR, 2010.

. Cambria, SenticNet, AAAI Fall Symposium Series, 2010.
DOI : 10.1007/978-3-319-23654-4_2

J. Chauché, Un outil multidimensionnel de l'analyse du discours, Proceedings of the International Conference on Computational Linguistics (COLING), pp.11-15, 1984.

K. Church and P. Hanks, Word association norms, mutual information, and lexicography, Proceedings of the 27th annual meeting on Association for Computational Linguistics -, pp.22-29, 1990.
DOI : 10.3115/981623.981633

URL : http://acl.ldc.upenn.edu/J/J90/J90-1003.pdf

. Cilibrasi, . Vitanyi, R. Cilibrasi, and P. M. Vitanyi, The Google Similarity Distance, IEEE Transactions on Knowledge and Data Engineering, vol.19, issue.3, pp.370-383, 2007.
DOI : 10.1109/TKDE.2007.48

A. Clas, Collocations et langues de sp??cialit??, Meta: Journal des traducteurs, vol.39, issue.4, pp.576-580, 1994.
DOI : 10.7202/002327ar

URL : http://www.erudit.org/revue/meta/1994/v39/n4/002327ar.pdf

V. Claveau and P. Sébillot, Apprentissage symbolique pour l'acquisition de ressources linguistiques, Actes de l'atelier "Acquisition, apprentissage et exploitation de connaissances sémantiques pour l'accès au contenu textuel, 2003.

J. Clech and D. Zighed, Data mining et analyse des cv : Une expérience et des perspectives, Proceedings of Extraction et Gestion de Connaissances (EGC), pp.189-200, 2003.

. Codd, Providing OLAP (on-line analytical processing) to user-analysts : An IT mandate, 1993.

B. Daille, Approche mixte pour l'extraction automatique de terminologie : statistiques lexicales et filtres linguistiques, 1994.

. Daille, Catégorisation des noms propres : une étude en corpus, pp.115-129, 2000.

P. David, S. David, and P. Plante, De la nécéssité d'une approche morpho syntaxique dans l'analyse de textes, Intelligence Artificielle et Sciences Cognitives au Quebec, pp.140-154, 1990.

. Davidiv, Enhanced sentiment learning using twitter hashtags and smileys, Proceedings of International Conference on Computational Linguistics (COLING), 2010.

. Doan, Learning to map between ontologies on the semantic web, Proceedings of the eleventh international conference on World Wide Web , WWW '02, 2002.
DOI : 10.1145/511446.511532

. Downey, Locating complex named entities in web text, Proceedings of theInternational Joint Conference on Artificial Intelligence (IJCAI), pp.2733-2739, 2007.

. Duan, An empirical study on learning to rank of tweets, Proceedings of International Conference on Computational Linguistics (COLING), 2010.

. Dulac-arnold, Text Classification: A Sequential Reading Approach, Proceedings of the European Conference on Information Retrieval (ECIR), pp.411-423, 2011.
DOI : 10.1016/j.patrec.2010.02.015

URL : https://hal.archives-ouvertes.fr/inria-00607185

. Eisenstein, A latent variable model for geographic lexical variation, Proceedings of Empirical Methods in Natural Language Processing (EMNLP), pp.1277-1287, 2010.

. Esuli, . Sebastiani, A. Esuli, and F. Sebastiani, SentiWordNet : A publicly available lexical resource for opinion mining, Proceedings of the 5th Conference on Language Resourcesand Evaluation (LREC), pp.417-422, 2006.

B. D. Eugenio and M. Glass, The Kappa Statistic: A Second Look, Computational Linguistics, vol.23, issue.1, pp.95-101, 2004.
DOI : 10.1086/266577

. Euzenat, State of the art on ontology matching, 2004.

S. Euzenat, J. Euzenat, and P. Shvaiko, Ontology Matching, 2007.
DOI : 10.1007/978-3-642-38721-0

URL : https://hal.archives-ouvertes.fr/hal-00918122

C. Enguehard, Extraction d'informations à partir de corpus dégradés, Proceedings of 9ème conference sur le Traitement Automatique des Langues Naturelles (TALN'02), pp.105-115, 2002.

L. Facca, F. M. Facca, and P. L. Lanzi, Mining interesting knowledge from weblogs: a survey, Data & Knowledge Engineering, vol.53, issue.3, pp.225-241, 2005.
DOI : 10.1016/j.datak.2004.08.001

. Fairon, C. Paumier-]-fairon, and S. Paumier, A translated corpus of 30,000 french SMS, Proceedings of Language Resources and Evaluation Conference (LREC), 2006.
URL : https://hal.archives-ouvertes.fr/hal-00621421

D. Faure, Conception de méthode d'apprentissage symbolique et automatique pour l'acquisition de cadres de sous-catégorisation de verbes et de connaissances sémantiques à partir de textes : le système ASIUM, 2000.

N. Faure, D. Faure, and C. Nédellec, Knowledge acquisition of predicate argument structures from technical texts using Machine Learning: the system Asium, Proc of the European Workshop, Knowledge Acquisition, Modelling and Management, LNAI, pp.329-334, 1999.
DOI : 10.1007/3-540-48775-1_22

. Ferri, Learning decision trees using the area under the ROC curve, Proceedings of 9th International Conference on Machine Learning (ICML), pp.139-146, 2002.

. Fort, Vers une méthodologie d'annotation des entités nommées en corpus, Actes de Traitement Automatique du Langage Naturel (TALN), 2009.

. Frantzi, Automatic recognition of multi-word terms:. the C-value/NC-value method, International Journal on Digital Libraries, vol.3, issue.2, pp.115-130, 2000.
DOI : 10.1007/s007999900023

. Ginsberg, Detecting influenza epidemics using search engine query data, Nature, vol.36, issue.7232, pp.1012-1014, 2009.
DOI : 10.1038/nature07634

T. Hamon and A. Nazarenko, Using general semantic information to help the terminology structuration, Proceedings of the First International Conference on Language Resources and Evaluation (LREC), pp.675-680, 1998.
URL : https://hal.archives-ouvertes.fr/hal-00090074

U. Heid, Towards a corpus-based dictionary of German noun-verb collocations, Proceedings of the Euralex International Congress, pp.301-312, 1998.

S. Heiden and C. Guillot, Capitalisation des savoirs par le web : une application de la tei pour l'encodage et l'exploitation des textes de la base de français médiéval, 2003.

. Ho-dac, Sur la fonction discursive des titres L'unité texte, pp.125-152, 2004.

L. Hu, M. Hu, and B. Liu, Mining and summarizing customer reviews, Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '04, 2004.
DOI : 10.1145/1014052.1014073

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.76.2378

G. Illouz, Méta étiqueteur adaptatif : Vers une utilisation pragmatique des ressources linguistiques, Actes de Traitement Automatique des Langues Naturelles (TALN), 1999.

. Jackiewicz, Opinions, sentiments et jugements d'évaluation, Traitement Automatique des Langues, vol.51, issue.3, 2010.

C. Jacquemin, Variation terminologique : Reconnaissance et acquisition automatiques de termes et de leurs variantes en corpus, Mémoire d'Habilitation à Diriger des Recherches en informatique fondamentale, 1997.

A. Jacques, M. Jacques, and N. Aussenac-gilles, Variabilité des performances des outils de tal et genre textuel, 2006.

J. , R. Jacques, M. Rebeyrolle, and J. , Titres et structuration des documents, Actes International Symposium : Discourse and Document, pp.125-152, 2004.

R. Jalam and J. Chauchat, Pourquoi les n-grammes permettent de classer des textes ? recherche de mots-clefs pertinents à l'aide des ngrammes caractéristiques, 6th International Conference on Textual Data Statistical Analysis, France, pp.381-390, 2002.

S. Szpakowicz, Roget's thesaurus and semantic similarity, Conference on Recent Advances in Natural Language Processing, pp.212-219, 2003.

. Joshi, A comparative study of supervised learning as applied to acronym expansion in clinical reports, Proceedings of the Annual Symposium of the American Medical Informatics Association, pp.399-403, 2006.

. Junker, . Hoch, M. Junker, and R. Hoch, Evaluating OCR and non-OCR text representations for learning document classifiers, Proceedings of the Fourth International Conference on Document Analysis and Recognition, pp.1060-1066, 1997.
DOI : 10.1109/ICDAR.1997.620671

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.6.6732

. Kamps, Using wordnet to measure semantic orientation of adjectives, Proceedings of LREC 2004, the 4th International Conference on Language Resources and Evaluation, pp.174-181, 2004.

. Kastner, . Monz, I. Kastner, and C. Monz, Automatic single-document key fact extraction from newswire articles, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics on, EACL '09, pp.415-423, 2009.
DOI : 10.3115/1609067.1609113

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.324.660

H. Kefi, Ontologies et aide à l'utilisateur pour l'interrogation de sources multiples et hétérogènes, 2006.

L. Keller, F. Keller, and M. Lapata, Using the Web to Obtain Frequencies for Unseen Bigrams, Computational Linguistics, vol.24, issue.2, pp.459-484, 2003.
DOI : 10.1093/ijl/3.4.235

. Lallich, . Teytaud, S. Lallich, and O. Teytaud, évaluation et validation des règles d'association. Numéro spécial "Mesures de qualité pour la fouille des données, Revue des Nouvelles Technologies de l'Information (RNTI), RNTI-E-1, pp.193-218, 2004.

. Landauer, . Dumais, T. Landauer, and S. Dumais, A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge., Psychological Review, vol.104, issue.2, pp.211-240, 1997.
DOI : 10.1037/0033-295X.104.2.211

. Larkey, Acrophile, Proceedings of the fifth ACM conference on Digital libraries , DL '00, pp.205-214, 2000.
DOI : 10.1145/336597.336664

T. Larousse, Thésaurus Larousse -des idées aux mots, des mots aux idées, 1992.

M. Laurens, La description des collocations et leur, 1999.

V. Levenshtein, Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Soviet Physics Doklady, vol.10, p.707, 1966.

. Lin, Text Cube: Computing IR Measures for Multidimensional Text Database Analysis, 2008 Eighth IEEE International Conference on Data Mining, pp.905-910, 2008.
DOI : 10.1109/ICDM.2008.135

D. Lin, An information-theoretic definition of similarity, Proceedings of 15th International Conf. on Machine Learning, pp.296-304, 1998.

Y. Lv and C. Zhai, Adaptive relevance feedback in information retrieval, Proceeding of the 18th ACM conference on Information and knowledge management, CIKM '09, pp.255-264, 2009.
DOI : 10.1145/1645953.1645988

S. Maedche, A. Maedche, and S. Staab, Ontology learning for the Semantic Web, IEEE Intelligent Systems, vol.16, issue.2, pp.72-79, 2001.
DOI : 10.1109/5254.920602

S. Maedche, A. Maedche, and S. Staab, Measuring Similarity between Ontologies, Proceedings of Knowledge Engineering and Knowledge Management (EKAW), pp.251-263, 2002.
DOI : 10.1007/3-540-45810-7_24

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.131.5761

. Màrquez, Improving tagging accuracy by using voting taggers, Proceedings of NLP+IA/TAL+AI ?98, 1999.

A. Mela, Linguistes et "talistes" peuvent coopérer : repérage et analyse des gloses. Revue Française de Linguistique AppliquéeLinguistique et informatique : nouveaux défis, 2004.

A. Mela, Le repérage automatique des gloses de nomination seconde. Langues et langageLes marqueurs de la glose, A. Steuckardt, 2005.

G. Miller, Wordnet : A lexical database for english, Communications of the ACM, 1995.

L. Monceaux, Adaptation du niveau d ?analyse des interventions dans un dialogue ? application à un système de question-réponse, BIBLIOGRAPHIE, 2002.

S. Nadeau, D. Nadeau, and S. Sekine, A survey of named entity recognition and classification, Lingvisticae Investigationes, pp.3-26, 2007.
DOI : 10.1075/bct.19.03nad

G. Navarro, A guided tour to approximate string matching, ACM Computing Surveys, vol.33, issue.1, 1999.
DOI : 10.1145/375360.375365

R. Navigli, Word sense disambiguation, ACM Computing Surveys, vol.41, issue.2, 2009.
DOI : 10.1145/1459352.1459355

. Nenadic, Terminology-driven mining of biomedical literature, Bioinformatics, vol.19, issue.8, pp.938-943, 2003.
DOI : 10.1093/bioinformatics/btg105

S. Nouvel, D. Nouvel, and A. Soulet, Annotation d'entités nommées par extraction de règles de transduction, Proceedings of Extraction et Gestion des Connaissances (EGC), pp.119-130, 2011.

. Nyberg, Document classification utilising ontologies and relations between documents, Proceedings of the Eighth Workshop on Mining and Learning with Graphs, MLG '10, 2010.
DOI : 10.1145/1830252.1830264

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.172.3983

. Okazaki, . Ananiadou, N. Okazaki, and S. Ananiadou, Building an abbreviation dictionary using a term recognition approach, Bioinformatics, vol.22, issue.24, pp.3089-3095, 2006.
DOI : 10.1093/bioinformatics/btl534

. Orasan, Anaphora resolution exercise : an overview, Proceedings of the International Conference on Language Resources and Evaluation (LREC), 2008.

. Paik, Categorizing and standardizing proper nouns for efficient information retrieval, Corpus Processing for Lexical Acquisition, 1994.

. Pattabhi, How to get the same news from different language news papers, Proceedings of Fourth International Workshop on Cross Lingual Information Access ? COLING Conference, pp.11-15, 2010.

. Pérez-martínez, Contextualizing data warehouses with documents, Decision Support Systems, vol.45, issue.1, pp.77-94, 2008.
DOI : 10.1016/j.dss.2006.12.005

. Petrovic, Comparison of collocation extraction measures for document indexing, Proceedings of Information Technology Interfaces (ITI), pp.451-456, 2006.

[. Vicea, Le titre est-il un désignateur rigide ? Dialnet, pp.251-258, 2003.

M. Porter, An algorithm for suffix stripping. Program, pp.130-137, 1980.

V. Prince and A. Labadié, Text Segmentation Based on Document Understanding for Information Retrieval, Natural Language Processing and Information Systems, pp.295-304, 2007.
DOI : 10.1007/978-3-540-73351-5_26

URL : https://hal.archives-ouvertes.fr/lirmm-00161996

. Bibliographie-[-pujolle, Fonctions d'agrégation pour l'analyse en ligne (OLAP) de données textuelles. fonctions top_kwk et avg_kw opérant sur des termes, pp.61-84, 2008.

A. Qamar and E. Gaussier, Online and Batch Learning of Generalized Cosine Similarities, 2009 Ninth IEEE International Conference on Data Mining, pp.926-931, 2009.
DOI : 10.1109/ICDM.2009.114

URL : https://hal.archives-ouvertes.fr/hal-00953853

B. Rahm, E. Rahm, and P. A. Bernstein, A survey of approaches to automatic schema matching, VLDB Journal : Very Large Data Bases, pp.334-350, 2001.
DOI : 10.1007/s007780100057

J. Rebeyrolles, Forme et fonction de la définition en discours, 2000.

. Rehder, Using latent semantic analysis to assess knowledge: Some technical considerations, Discourse Processes, pp.337-354, 1998.
DOI : 10.1080/01638539809545030

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.28.1532

. Robertson, Okapi at trec-3, Proceedings of TREC (Text REtrieval Confe- rence), 1994.

. Sakaki, Earthquake shakes Twitter users, Proceedings of the 19th international conference on World wide web, WWW '10, pp.851-860, 2010.
DOI : 10.1145/1772690.1772777

. Salton, G. Buckley-]-salton, and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, vol.24, issue.5, pp.513-523, 1988.
DOI : 10.1016/0306-4573(88)90021-0

. Salton, A vector space model for automatic indexing, Communications of the ACM, vol.18, issue.11, pp.613-620, 1975.
DOI : 10.1145/361219.361220

H. Schmid, Probabilistic part-of-speech tagging using decision trees, International Conference on New Methods in Language Processing, pp.44-49, 1994.

V. Sclano, F. Sclano, and P. Velardi, TermExtractor: a Web Application to Learn the Shared Terminology of Emergent Web Communities, Proceedings of the 3rd International Conference on Interoperability for Enterprise Software and Applications (I-ESA 2007), 2007.
DOI : 10.1007/978-1-84628-858-6_32

M. Scott, S. Scott, and S. Matwin, Feature engineering for text classification, Proceedings of the Sixteenth International Conference on Machine Learning (ICML), pp.379-388, 1999.

F. Sebastiani, Machine learning in automated text categorization, ACM Computing Surveys, vol.34, issue.1, pp.1-47, 2002.
DOI : 10.1145/505282.505283

J. Sjobergh, Combining pos-taggers for improved accuracy on swedish text, Proceedings of the Nordic Conference of Computational Linguistics (NoDaLiDa), 2003.

F. Smadja, Retrieving collocations from text : Xtract, Computational Linguistics, vol.19, issue.1, pp.143-177, 1993.

. Smadja, Translating collocations for bilingual lexicons : A statistical approach, Computational Linguistics, vol.22, issue.1, pp.1-38, 1996.

L. Sokolova, M. Sokolova, and G. Lapalme, Verbs Speak Loud: Verb Categories in Learning Polarity and Strength of Opinions, Proceedings of Conference of the Canadian Society for Computational Studies of Intelligence, pp.320-331, 2008.
DOI : 10.1007/978-3-540-68825-9_30

L. Sokolova, M. Sokolova, and G. Lapalme, Learning opinions in user-generated web content, Natural Language Engineering, vol.17, issue.04, pp.541-567, 2011.
DOI : 10.1109/TKDE.2003.1245283

A. Stein, M. Stevenson, Y. Guo, A. Alamri, and R. Gaizauskas, Part of speech tagging and lemmatisation of old french texts Disambiguation of biomedical abbreviations, Proceedings of the BioNLP 2009 Workshop, pp.71-79, 2003.

. Taboada, Creating semantic orientation dictionaries, Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC), 2006.

. Taboada, Lexicon-Based Methods for Sentiment Analysis, Computational Linguistics, vol.25, issue.3, pp.267-307, 2011.
DOI : 10.1007/s10579-005-7880-9

. Thanopoulos, Comparative Evaluation of Collocation Extraction Metrics, Proceedings of the International Conference on Language Resources and Evaluation (LREC), pp.620-625, 2002.

P. Turney, Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL, Proceedings of the 12th European Conference on Machine Learning (ECML), pp.2167491-502, 2001.
DOI : 10.1007/3-540-44795-4_42

P. Turney, Thumbs up or thumbs down ? semantic orientation applied to unsupervised classification of reviews, Proceedings of 40th Meeting of the Association for Computational Linguistics, pp.417-424, 2002.

. Turney, P. D. Turney, and P. Pantel, From frequency to meaning : Vector space models of semantics, Journal of Artificial Intelligence Research (JAIR), vol.37, pp.141-188, 2010.

M. Vinet, L'aspet et la copule vide dans la grammaire des titres, pp.83-101, 1993.
DOI : 10.3406/lfr.1993.5928

URL : http://www.persee.fr/docAsPDF/lfr_0023-8368_1993_num_100_1_5928.pdf

. Vivaldi, Improving Term Extraction by System Combination Using Boosting, Proceedings of the 12th European Conference on Machine Learning (ECML), pp.515-526, 2001.
DOI : 10.1007/3-540-44795-4_44

K. Voll and M. Taboada, Not All Words Are Created Equal: Extracting Semantic Orientation as a Function of Adjective Relevance, Proceedings of the 20th Australian Joint Conference on Artificial Intelligence, 2007.
DOI : 10.1007/978-3-540-76928-6_35

E. M. Voorhees, The trec-8 question answering track report, Proceedings of Text REtrieval Conference (TREC-8), pp.77-82, 1999.

. Wiegand, A survey on the role of negation in sentiment analysis, Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, 2010.

Y. Wilks, Language processing and the thesaurus, National Language Research Institute, 1998.

H. Xu, J. Xu, and Y. Huang, Using SVM to Extract Acronyms from Text, Soft Computing, vol.9, issue.Pt 1, pp.369-373, 2007.
DOI : 10.1007/s00500-006-0091-5

. Yamanishi, K. Maruyama-]-yamanishi, and Y. Maruyama, Dynamic syslog mining for network failure monitoring, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining , KDD '05, pp.499-508, 2005.
DOI : 10.1145/1081870.1081927

S. Yeates, Automatic extraction of acronyms from text, New Zealand Computer Science Research Students' Conference, pp.117-124, 1999.

. Zajic, Automatic headline generation for newspaper stories, Workshop on Text Summarization (ACL 2002 and DUC 2002 meeting on Text Summarization). Philadelphia, 2002.

. Zhang, Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases, Proceedings of the SIAM Int. Conference on Data Mining, pp.1123-1134, 2009.
DOI : 10.1137/1.9781611972795.96