A. Ferré, M. Ba, and R. Bossy, Improving at BLAH5 the CONTES Method for Normalizing Biomedical Text Entities with Concepts from an Ontology with (almost) no Training Data, Journal of Genomics & Informatics, 2019.

A. Ferré, L. Deléger, P. Zweigenbaum, and C. Nédellec, Combining rule-based and embedding-based approaches to normalize textual entities with an ontology, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), 2018.

H. Falentin, E. Chaix, S. Derozier, M. Weber, S. Buchin et al., Florilege: a database gathering microbial phenotypes of food interest, 4th International Conference on Microbial Diversity, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01651302

A. Ferré, P. Zweigenbaum, and C. Nédellec, Representation of complex terms in a vector space structured by an ontology for a normalization task, 2017.

A. Ferré, Représentation de termes complexes dans un espace vectoriel relié à une ontologie pour une tâche de catégorisation, Rencontres des Jeunes Chercheurs en Intelligence Artificielle, 2017.

A. Ferré, Normalisation de termes complexes par sémantique distributionnelle guidée par une ontologie, 19es REncontres jeunes Chercheurs en Informatique pour le TAL, 2017.

R. Bossy, E. Chaix, L. Deleger, and A. Ferré, OntoBiotope : une ontologie pour croiser les habitats microbiens avec les analyses de génomes, Les journées Bioinformatique de l'Inra, 2016.

L. Deléger, R. Bossy, E. Chaix, M. Ba, A. Ferré, P. Bessières, and C. Nédellec, Overview of the Bacteria Biotope task at BioNLP Shared Task 2016, 2016.

V. Henry, A. Goelzer, A. Ferré, S. Fischer, M. Dinh et al., The bacterial interlocked process ONtology (BiPON): a systemic multi-scale unified representation of biological processes in prokaryotes, Journal of biomedical semantics, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01652923

V. Henry, A. Ferré, C. Froidevaux, A. Goelzer, V. Fromion et al., Représentation systémique multi-échelle des processus biologiques de la bactérie. IC2016: Ingénierie des Connaissances, 2016.

Bioinformatics and science outreach

W. Briand, O. Dao, G. Garnier, R. Guegan, B. Marta et al., Dégradation d'un anticancéreux dans les eaux usées - une médaille d'or pour l'équipe GO Paris-Saclay, 2018.

N. Abdollahi, A. Albani, E. Anthony, A. Baud, M. Cardon et al., Meet-U: educating through research immersion, PLoS computational biology, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01722019

N. Allias, Bioinfo-fr.net : présentation du blog communautaire scientifique francophone par les Geekus biologicus, Journées Ouvertes de Biologie Informatique & Mathématiques, 2016.

M. Benony, M. Cardon, A. Ferré, J. Coquet, N. Foulquier et al., The smell of us - crowdsourcing human body odor evaluation, Human Computation - A Transdisciplinary Journal, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01672595

R. L. Ackoff, The systems revolution, Long Range Planning, vol.7, issue.6, pp.2-20, 1974.

S. Agarwal and H. Yu, Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion, Bioinformatics, vol.25, issue.23, pp.3174-3180, 2009.

R. Al-Rfou, B. Perozzi, and S. Skiena, Polyglot: Distributed Word Representations for Multilingual NLP, 2013.

S. Ananiadou, C. Friedman, and J. Tsujii, Introduction: named entity recognition in biomedicine, Journal of Biomedical Informatics, vol.37, issue.6, pp.393-395, 2004.

C. Arighi, L. Hirschman, T. Lemberger, S. Bayer, R. Liechti et al., Bio-ID track overview, Cell, vol.482, issue.7310, p.376, 2017.

A. R. Aronson, Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program, Proceedings of the AMIA Symposium, p.17, 2001.

A. R. Aronson and F. Lang, An overview of MetaMap: historical perspective and recent advances, Journal of the American Medical Informatics Association, vol.17, issue.3, pp.229-236, 2010.

M. Ashburner, C. A. Ball, J. A. Blake, D. Botstein, H. Butler et al., Gene Ontology: tool for the unification of biology, Nature genetics, vol.25, issue.1, pp.25-29, 2000.

S. Aubin and T. Hamon, Improving term extraction with terminological resources, International Conference on Natural Language Processing, pp.380-387, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00091444

N. Aussenac-Gilles, B. Biébow, and S. Szulman, Corpus Analysis for Conceptual Modelling, 2000.

M. Ba and R. Bossy, Interoperability of corpus processing workflow engines: the case of AlvisNLP/ML in OpenMinTeD, Meeting of working Group Medicago sativa, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01455853

F. Baader, D. L. McGuinness, D. Nardi, and P. Patel-Schneider, The description logic handbook: Theory, implementation and applications, p.510, 2003.

A. Bairoch, The Universal Protein Resource (UniProt), Nucleic Acids Research, vol.33, pp.154-159, 2004.

M. Baroni and A. Lenci, Distributional memory: A general framework for corpus-based semantics, Computational Linguistics, vol.36, issue.4, pp.673-721, 2010.

A. Birou, Vocabulaire pratique des sciences sociales. Editions Economie et humanisme edition, 1966.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching Word Vectors with Subword Information, 2016.

H. Borchani, G. Varando, C. Bielza, and P. Larrañaga, A survey on multi-output regression, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.5, issue.5, pp.216-233, 2015.

R. Bossy, E. Chaix, L. Deleger, and A. Ferré, OntoBiotope: une ontologie pour croiser les habitats microbiens avec les analyses de génomes, Les journées Bioinformatique de l'Inra, p.1, 2016.

R. Bossy, W. Golik, Z. Ratkovic, P. Bessières, and C. Nédellec, BioNLP Shared Task 2013 - An Overview of the Bacteria Biotope Task, Proceedings of the BioNLP Shared Task 2013 Workshop, pp.161-169, 2013.

R. Bossy, W. Golik, Z. Ratkovic, and D. Valsamou, Overview of the gene regulation network and the bacteria biotope tasks in bionlp'13 shared task, BMC bioinformatics, vol.16, issue.10, p.1, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01222411

R. Bossy, J. Jourde, A. Manine, P. Veber, E. Alphonse et al., BioNLP Shared Task-The Bacteria Track, BMC bioinformatics, vol.13, p.3, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01190775

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds et al., Learning to rank using gradient descent, Proceedings of the 22nd international conference on Machine learning - ICML '05, pp.89-96, 2005.

P. L. Buttigieg, N. Morrison, B. Smith, C. J. Mungall, S. E. Lewis et al., The environment ontology: contextualising biological and biomedical entities, Journal of biomedical semantics, vol.4, issue.1, p.43, 2013.

H. Cai, V. W. Zheng, and K. Chang, A Comprehensive Survey of Graph Embedding: Problems, Techniques and Applications, 2017.

N. Chinchor, D. D. Lewis, and L. Hirschman, Evaluating message understanding systems: an analysis of the third message understanding conference (MUC-3), Computational linguistics, vol.19, issue.3, pp.409-449, 1993.

T. Ching, D. S. Himmelstein, B. K. Beaulieu-Jones, A. A. Kalinin et al., Opportunities and obstacles for deep learning in biology and medicine, Journal of The Royal Society Interface, vol.15, issue.141, p.20170387, 2018.

B. Chiu, G. Crichton, A. Korhonen, and S. Pyysalo, How to train good word embeddings for biomedical NLP, Proceedings of BioNLP16, p.166, 2016.

C. W. Choo, The knowing organization: How organizations use information to construct meaning, create knowledge and make decisions, International Journal of Information Management, vol.16, issue.5, pp.329-340, 1996.

V. Claveau, IRISA participation to BioNLP-ST13: lazy-learning and information retrieval for information extraction tasks, Proceedings of the BioNLP Shared Task 2013 Workshop, pp.188-196, 2013.

A. M. Cohen, Unsupervised gene/protein named entity normalization using automatically extracted dictionaries, Proceedings of the ACL-ISMB workshop on linking biological literature, ontologies and databases: Mining biological semantics, pp.17-24, 2005.

T. Cohen and D. Widdows, Empirical distributional semantics: Methods and biomedical applications, Journal of Biomedical Informatics, vol.42, issue.2, pp.390-405, 2009.

W. W. Cohen and S. Sarawagi, Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods, p.10, 2004.

R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu et al., Natural Language Processing (Almost) from Scratch, p.45

F. M. Couto, M. J. Silva, and P. M. Coutinho, Finding genomic ontology terms in text using evidence content, BMC Bioinformatics, issue.6, p.21, 2005.

J. Cowie and Y. Wilks, Information extraction, Handbook of Natural Language Processing, vol.56, p.57, 2000.

K. Crammer and Y. Singer, Ultraconservative Online Algorithms for Multiclass Problems, Computational Learning Theory, vol.2111, pp.99-115, 2001.

A. P. Davis, T. C. Wiegers, M. C. Rosenstein, and C. J. Mattingly, MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database, Database, issue.0, pp.65-065, 2012.

F. de Saussure, Cours de linguistique générale: Édition critique, vol.1, 1989.

T. Declerck, C. Federmann, B. Kiefer, and H. Krieger, Ontology-based information extraction and reasoning for business intelligence applications, Annual Conference on Artificial Intelligence, pp.389-390, 2008.

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer et al., Indexing by latent semantic analysis, Journal of the American society for information science, vol.41, issue.6, pp.391-407, 1990.

L. Deléger, R. Bossy, E. Chaix, and M. Ba, Overview of the Bacteria Biotope task at BioNLP Shared Task 2016, Proceedings of the 4th BioNLP Shared Task Workshop, pp.12-22, 2016.

L. Derczynski, D. Maynard, G. Rizzo, M. van Erp, G. Gorrell et al., Analysis of Named Entity Recognition and Linking for Tweets, Information Processing & Management, vol.51, pp.32-49, 2015.

J. Devlin, M. Chang, K. Lee, and K. Toutanova, BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2018.

R. I. Doğan, R. Leaman, and Z. Lu, NCBI disease corpus: A resource for disease name recognition and concept normalization, Journal of Biomedical Informatics, vol.47, pp.1-10, 2014.

F. Doshi-Velez, B. C. Wallace, and R. Adams, Graph-Sparse LDA: A Topic Model with Structured Sparsity, p.7, 2009.

A. Drozd, A. Gladkova, and S. Matsuoka, Word embeddings, analogies, and machine learning: Beyond king-man+ woman= queen, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp.3519-3530, 2016.

J. D'Souza and V. Ng, Sieve-Based Entity Linking for the Biomedical Domain, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.2, pp.297-302, 2015.

M. Dupont, J. Vuillaume, B. Victorri, P. Enjalbert, Y. Mathet et al., Nouvelles perspectives en extraction d'information. Revue des Sciences et Technologies de l'Information-Série TSI, Technique et Science Informatiques, vol.1, issue.21, pp.37-63, 2002.
DOI : 10.3166/tsi.21.37-63

URL : https://hal.archives-ouvertes.fr/halshs-00009485

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning internal representations by back-propagating errors, 1986.
DOI : 10.1038/323533a0

J. L. Elman, Distributed representations, simple recurrent networks, and grammatical structure, Machine learning, vol.7, issue.2-3, pp.195-225, 1991.

J. Ermine, M. Moradi, and S. Brunel, Une chaîne de valeur de la connaissance, Management international, vol.16, p.29, 2012.
DOI : 10.7202/1012391ar

URL : https://hal.archives-ouvertes.fr/hal-00949464/file/Une_chaA_ne_de_valeur_de_la_connaissance_nominatif_V3.pdf

H. Falentin, E. Chaix, S. Derozier, M. Weber, S. Buchin et al., Florilege: a database gathering microbial phenotypes of food interest, 4th International Conference on Microbial Diversity, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01651302

M. Faruqui, J. Dodge, S. K. Jauhar, C. Dyer, E. Hovy et al., Retrofitting word vectors to semantic lexicons, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2015.

M. Faruqui, Y. Tsvetkov, P. Rastogi, and C. Dyer, Problems With Evaluation of Word Embeddings Using Word Similarity Tasks, 2016.

D. Faure and C. Nédellec, A Corpus-based Conceptual Clustering Method for Verb Frames and Ontology Acquisition, LREC workshop on adapting lexical and corpus resources to sublanguages and applications, pp.5-12, 1998.

S. Federhen, The NCBI taxonomy database, Nucleic acids research, vol.40, issue.D1, pp.136-143, 2011.

A. Ferré, M. Ba, and R. Bossy, Improving at BLAH5 the CONTES Method for Normalizing Biomedical Text Entities with Concepts from an Ontology with (almost) no Training Data, 2019.

A. Ferré, L. Deléger, P. Zweigenbaum, and C. Nédellec, Combining rule-based and embedding-based approaches to normalize textual entities with an ontology, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), 2018.

A. Ferré, P. Zweigenbaum, and C. Nédellec, Representation of complex terms in a vector space structured by an ontology for a normalization task, BioNLP, vol.2017, pp.99-106, 2017.

J. R. Finkel, T. Grenager, and C. Manning, Incorporating non-local information into information extraction systems by Gibbs sampling, Proceedings of the 43rd annual meeting on association for computational linguistics, pp.363-370, 2005.

J. R. Firth, The Technique of Semantics, Transactions of the philological society, vol.34, issue.1, pp.36-73, 1935.

L. Floridi, Information: A Very Short Introduction, 2010.

J. Fluck, H. T. Mevissen, and H. Dach, ProMiner: Recognition of Human Gene and Protein Names using regularly updated Dictionaries, p.3, 2007.

P. W. Foltz, W. Kintsch, and T. K. Landauer, The measurement of textual coherence with latent semantic analysis, Discourse processes, vol.25, pp.285-307, 1998.

C. Friedman, P. Kra, H. Yu, M. Krauthammer, and A. Rzhetsky, GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles, ISMB (supplement of bioinformatics), pp.74-82, 2001.

G. W. Furnas, T. K. Landauer, L. M. Gomez, and S. T. Dumais, The vocabulary problem in human-system communication, Communications of the ACM, vol.30, issue.11, pp.964-971, 1987.

H. Gardner, Frames of mind: The theory of multiple intelligences, 2011.

M. Gerlach and E. G. Altmann, Stochastic model for the vocabulary growth in natural languages, Physical Review X, vol.3, issue.2, p.21006, 2013.

M. Gerner, G. Nenadic, and C. M. Bergman, LINNAEUS: a species name identification system for biomedical literature, BMC bioinformatics, vol.11, issue.1, p.85, 2010.

O. Ghiasvand and R. Kate, UWM: Disorder Mention Extraction from Clinical Text Using CRFs and Normalization Using Learned Edit Distance Patterns, Proceedings of the 8th International Workshop on Semantic Evaluation, pp.828-832, 2014.

W. Golik, R. Bossy, Z. Ratkovic, and C. Nédellec, Improving term extraction with linguistic analysis in the biomedical domain, Research in Computing Science, vol.70, pp.157-172, 2013.

W. Golik, P. Warnier, and C. Nédellec, Corpus-based extension of termino-ontology by linguistic analysis: a use case in biomedical event extraction, WS 2 Workshop Extended Abstracts, 9th International Conference on Terminology and Artificial Intelligence, pp.37-39, 2011.

R. Grishman and B. Sundheim, Message Understanding Conference-6: A Brief History, The 16th International Conference on Computational Linguistics, vol.1, 1996.

C. Grouin, Identification of Mentions and Relations between Bacteria and Biotope from PubMed Abstracts, Proceedings of the 4th BioNLP Shared Task Workshop, p.64, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01831226

A. Grover and J. Leskovec, node2vec: Scalable feature learning for networks, Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining, pp.855-864, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01768501

B. Hachey, W. Radford, J. Nothman, M. Honnibal, and J. R. Curran, Evaluating Entity Linking with Wikipedia, Artificial Intelligence, vol.194, pp.130-150, 2013.

A. Hamosh, Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders, Nucleic Acids Research, vol.33, pp.514-517, 2004.

D. Hanisch, J. Fluck, H. Mevissen, and R. Zimmer, Playing biology's name game: identifying protein names in scientific text, Biocomputing 2003, pp.403-414, 2002.

D. Hanisch, K. Fundel, H. Mevissen, R. Zimmer, and J. Fluck, ProMiner: rule-based protein and gene entity recognition, BMC Bioinformatics, issue.6, p.14, 2005.

Z. S. Harris, Distributional Structure, Word, vol.10, issue.2-3, pp.146-162, 1954.

P. Heisig, European guide to good practice in knowledge management. IPK, 2002.

R. Herbrich, Large margin rank boundaries for ordinal regression, Advances in large margin classifiers, pp.115-132, 2000.

G. E. Hinton, D. E. Rumelhart, and J. L. McClelland, 1986.

G. E. Hinton and T. Shallice, Lesioning an attractor network: investigations of acquired dyslexia, Psychological review, vol.98, issue.1, p.74, 1991.

L. Hirschman, M. Colosimo, A. Morgan, and A. Yeh, Overview of BioCreAtIvE task 1B: normalized gene lists, BMC Bioinformatics, issue.6, p.11, 2005.

L. Hirschman, A. A. Morgan, and A. Yeh, Rutabaga by any other name: extracting biological names, Journal of Biomedical Informatics, vol.35, issue.4, pp.247-259, 2002.

L. Hirschman, A. Yeh, C. Blaschke, and A. Valencia, Overview of BioCreAtIvE: critical assessment of information extraction for biology, BMC Bioinformatics, 2005.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural computation, vol.9, issue.8, pp.1735-1780, 1997.

J. Hutchins, The history of machine translation in a nutshell, 2005.

C. Hwang, Incompletely and Imprecisely Speaking : Using Dynamic Ontologies for Representing and Retrieving Information, p.13, 1999.

N. Ivanova, G. Susannah, K. Tringe, W. Liolios, N. Liu et al., A call for standardized classification of metagenome projects, Environmental microbiology, vol.12, issue.7, pp.1803-1805, 2010.

H. Ji, X. Pan, B. Zhang, J. Nothman, J. Mayfield et al., Overview of TAC-KBP2017 13 Languages Entity Discovery and Linking, 2017.

J. Jiang, Information extraction from text, Mining text data, pp.11-41, 2012.

K. Spärck Jones, A statistical interpretation of term specificity and its application in retrieval, Journal of Documentation, vol.28, pp.11-21, 1972.

D. Jurafsky and J. H. Martin, Speech and language processing, vol.3, 2014.

H. Kamper, W. Wang, and K. Livescu, Deep convolutional acoustic word embeddings using word-pair side information, 2015.

N. Kang, B. Singh, Z. Afzal, E. M. van Mulligen, and J. A. Kors, Using rule-based natural language processing to improve disease normalization in biomedical text, Journal of the American Medical Informatics Association, vol.20, issue.5, pp.876-881, 2013.

J. Kazama and K. Torisawa, Exploiting Wikipedia as external knowledge for named entity recognition, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007.

D. Kiela, F. Hill, and S. Clark, Specializing word embeddings for similarity or relatedness, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.2044-2048, 2015.

S. Kim, M. Iglesias-Sucasas, and V. Viollier, The FAO Geopolitical Ontology: A Reference for Country-Based Information, Journal of Agricultural & Food Information, vol.14, issue.1, pp.50-65, 2013.

W. Kintsch, Predication, Cognitive Science, vol.25, issue.2, pp.173-202, 2001.

J. L. Klavans and S. Muresan, Evaluation of the DEFINDER system for fully automatic glossary construction, Proceedings of the AMIA Symposium, pp.324-328, 2001.

C. Kuo, M. H. Ling, K. Lin, and C. Hsu, BIOADI: a machine learning approach to identifying abbreviations and definitions in biological literature, BMC Bioinformatics, vol.10, issue.15, p.7, 2009.

G. Lame, Knowledge acquisition from texts towards an ontology of French law, 2000.

T. K. Landauer and S. T. Dumais, A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge, Psychological review, vol.104, issue.2, p.211, 1997.

H. Larochelle, D. Erhan, and Y. Bengio, Zero-data learning of new tasks, AAAI, vol.1, p.3, 2008.

R. Leaman, R. Islamaj Doğan, and Z. Lu, DNorm: disease name normalization with pairwise learning to rank, Bioinformatics, vol.29, issue.22, pp.2909-2917, 2013.
DOI : 10.1093/bioinformatics/btt474

URL : https://academic.oup.com/bioinformatics/article-pdf/29/22/2909/888873/btt474.pdf

R. Leaman and Z. Lu, TaggerOne: joint named entity recognition and normalization with semi-Markov Models, Bioinformatics, vol.32, issue.18, pp.2839-2846, 2016.
DOI : 10.1093/bioinformatics/btw343

URL : https://academic.oup.com/bioinformatics/article-pdf/32/18/2839/24406872/btw343.pdf

R. Leaman, C. Wei, and Z. Lu, tmChem: a high performance approach for chemical named entity recognition and normalization, Journal of Cheminformatics, issue.7, p.3, 2015.

H. Lee, Y. Hsu, and H. Kao, An enhanced CRF-based system for disease name entity recognition and normalization on BioCreative V DNER Task, 2015.

V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals, Soviet physics doklady, vol.10, pp.707-710, 1966.

O. Levy, Y. Goldberg, and I. Dagan, Improving Distributional Similarity with Lessons Learned from Word Embeddings, Transactions of the Association for Computational Linguistics, vol.3, issue.0, pp.211-225, 2015.
DOI : 10.1162/tacl_a_00134

URL : https://doi.org/10.1162/tacl_a_00134

D. D. Lewis, Data Extraction as Text Categorization: An Experiment With the MUC-3 Corpus, Third Message Understanding Conference, vol.3, 1991.

Y. Li and K. Bontcheva, Hierarchical, perceptron-like learning for ontology-based information extraction, Proceedings of the 16th international conference on World Wide Web -WWW '07, p.777, 2007.

N. Limsopatham and N. Collier, Adapting Phrase-based Machine Translation to Normalise Medical Terms in Social Media Messages, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015.

N. Limsopatham and N. Collier, Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1014-1023, 2016.

Y. Lin, W. Li, K. Chen, and Y. Liu, A Document Clustering and Ranking System for Exploring MEDLINE Citations, Journal of the American Medical Informatics Association, vol.14, issue.5, pp.651-661, 2007.

C. E. Lipscomb, Medical Subject Headings (MeSH), Bulletin of the Medical Library Association, vol.88, issue.3, pp.265-266, 2000.

Q. Liu, H. Jiang, S. Wei, Z. Ling, and Y. Hu, Learning semantic word embeddings based on ordinal knowledge constraints, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.1501-1511, 2015.

Z. Lu, H. Kao, C. Wei, M. Huang, J. Liu et al., The gene normalization task in BioCreative III, BMC bioinformatics, vol.12, issue.8, p.2, 2011.

L. van der Maaten and G. Hinton, Visualizing data using t-SNE, Journal of machine learning research, vol.9, pp.2579-2605, 2008.

D. Maglott, Entrez Gene: gene-centered information at NCBI, Nucleic Acids Research, vol.33, pp.54-58, 2004.

C. Manning, P. Raghavan, and H. Schuetze, Introduction to Information Retrieval, Natural Language Engineering, p.581, 2009.

A. Martinet, La description phonologique, avec application au parler franco-provençal d'Hauteville (Savoie), Librairie Droz, vol.56, 1956.

F. Mehryary, K. Hakala, S. Kaewphan, J. Björne, T. Salakoski et al., End-to-End System for Bacteria Habitat Extraction, BioNLP, p.80, 2017.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed Representations of Words and Phrases and their Compositionality, Advances in Neural Information Processing Systems, vol.26, pp.3111-3119, 2013.

T. Mikolov, W. Yih, and G. Zweig, Linguistic Regularities in Continuous Space Word Representations, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.746-751, 2013.

G. A. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.

J. Mitchell and M. Lapata, Composition in distributional models of semantics, Cognitive science, vol.34, issue.8, pp.1388-1429, 2010.

A. A. Morgan, Z. Lu, X. Wang, A. M. Cohen, J. Fluck et al., Overview of BioCreative II gene normalization, Genome Biology, vol.9, issue.2, p.3, 2008.

N. Mrkšić, D. Ó Séaghdha, B. Thomson, M. Gašić et al., Counter-fitting Word Vectors to Linguistic Constraints, 2016.

N. Mrkšić, I. Vulić, D. Ó Séaghdha, I. Leviant, R. Reichart et al., Semantic specialization of distributional word vector spaces using monolingual and cross-lingual constraints, Transactions of the Association for Computational Linguistics, vol.5, pp.309-324, 2017.

T. H. Muneeb, S. K. Sahu, and A. Anand, Evaluating distributed word representations for capturing semantics of biomedical concepts, Proceedings of ACL-IJCNLP, p.158, 2015.

V. Nair and G. E. Hinton, Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning (ICML-10), pp.807-814, 2010.

R. Navigli and S. P. Ponzetto, BabelNet: Building a Very Large Multilingual Semantic Network, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.216-225, 2010.

C. Nédellec, A. Nazarenko, and R. Bossy, Information extraction, Handbook on ontologies, pp.663-685, 2009.

G. Nenadic, I. Spasic, and S. Ananiadou, Mining Biomedical Abstracts: What's in a Term?, Natural Language Processing -IJCNLP, vol.3248, pp.797-806, 2004.

K. A. Nguyen, S. Schulte im Walde, and N. T. Vu, Integrating distributional lexical contrast into word embeddings for antonym-synonym distinction, 2016.

J. Nobécourt, A method to build formal ontologies from texts, EKAW-2000 Workshop on ontologies and text, 2000.

M. Ono, M. Miwa, and Y. Sasaki, Word Embedding-based Antonym Detection using Thesauri and Distributional Information, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.984-989, 2015.

D. Osborne, S. Narayan, and S. Cohen, Encoding prior knowledge with eigenword embeddings, Transactions of the Association for Computational Linguistics, vol.4, pp.417-430, 2016.

N. Peng, H. Poon, C. Quirk, K. Toutanova, and W. Yih, Cross-Sentence N-ary Relation Extraction with Graph LSTMs, Transactions of the Association for Computational Linguistics, vol.5, pp.101-115, 2017.

J. Pennington, R. Socher, and C. Manning, Glove: Global vectors for word representation, Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp.1532-1543, 2014.

Y. Pesqueux, La dualité "savoir-connaissance" en sciences des organisations, Séminaire de recherche en anthropologie de l'imaginaire, 2008.

M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark et al., Deep contextualized word representations, 2018.

M. Pignatelli, A. Moya, and J. Tamames, EnvDB, a database for describing the environmental distribution of prokaryotic taxa, Environmental Microbiology Reports, vol.1, issue.3, pp.191-197, 2009.

J. B. Pollack, Recursive Distributed Representations, Artificial Intelligence, vol.46, pp.77-105, 1990.

W. Pratt and M. Yetisgen-Yildiz, A Study of Biomedical Concept Identification: MetaMap vs. People, AMIA Annual Symposium Proceedings, pp.529-533, 2003.

R. Rada, H. Mili, E. Bicknell, and M. Blettner, Development and application of a metric on semantic nets, IEEE Transactions on Systems, Man, and Cybernetics, vol.19, issue.1, pp.17-30, 1989.

A. R. Pal and D. Saha, Word Sense Disambiguation: A Survey, International Journal of Control Theory and Computer Modeling, vol.5, issue.3, pp.1-16, 2015.

L. Ratinov and D. Roth, Design challenges and misconceptions in named entity recognition, Proceedings of the thirteenth conference on computational natural language learning, pp.147-155, 2009.

N. Reimers and I. Gurevych, Reporting score distributions makes a difference: Performance study of lstm-networks for sequence tagging, 2017.

K. Riesen, M. Neuhaus, and H. Bunke, Graph embedding in vector spaces by means of prototype selection, International Workshop on Graph-Based Representations in Pattern Recognition, pp.383-393, 2007.

J. Rowley, The wisdom hierarchy: representations of the DIKW hierarchy, Journal of information science, vol.33, issue.2, pp.163-180, 2007.

S. J. Russell and P. Norvig, Artificial intelligence: a modern approach, Malaysia; Pearson Education Limited, 2016.

P. Russom, BI search and text analytics, TDWI Best Practices Report, pp.9-11, 2007.

E. F. Tjong Kim Sang and F. De Meulder, Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition, 2003.

H. Schmid, Improvements in part-of-speech tagging with an application to German, Natural language processing using very large corpora, pp.13-25, 1999.

M. J. Schuemie, R. Jelier, and J. A. Kors, Peregrine: Lightweight gene name normalization by dictionary lookup, Proc of the Second BioCreative Challenge Evaluation Workshop, pp.131-133, 2007.

C. E. Shannon, A mathematical theory of communication, Bell system technical journal, vol.27, pp.379-423, 1948.

A. Singhal, Modern information retrieval: A brief overview, IEEE Data Eng. Bull, vol.24, issue.4, pp.35-43, 2001.

F. Z. Smaili, X. Gao, and R. Hoehndorf, Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations, Bioinformatics, vol.34, issue.13, pp.52-60, 2018.

E. E. Smith and D. L. Medin, Categories and concepts, vol.9, 1981.

N. A. Smith, Linguistic structure prediction, vol.4, pp.1-274, 2011.

R. Socher, J. Pennington, E. H. Huang, A. Y. Ng, and C. D. Manning, Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions, p.11

E. Spyromitros-Xioufis, G. Tsoumakas, W. Groves, and I. Vlahavas, Multi-label classification methods for multi-target regression, pp.1159-1168

M. Steyvers and J. B. Tenenbaum, The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth, Cognitive Science, vol.29, issue.1, pp.41-78, 2005.

J. L. Stocks, Cornford, F. M. - Plato's Theory of Knowledge, Mind, p.526, 1935.

M. Tiftikci, H. Sahin, and B. Büyüköz, Ontology-based Categorization of Bacteria and Habitat Entities using Information Retrieval Techniques, p.56, 2016.

Y. Tsuruoka, J. Mcnaught, J. Tsujii, and S. Ananiadou, Learning string similarity measures for gene/protein name dictionary look-up using logistic regression, Bioinformatics, vol.23, issue.20, pp.2768-2774, 2007.

J. Turian, L. Ratinov, and Y. Bengio, Word Representations: A Simple and General Method for Semi-Supervised Learning, Proceedings of the 48th annual meeting of the association for computational linguistics, p.11

M. Uschold and M. King, Towards a Methodology for Building Ontologies, p.15, 1995.

M. Vargas-Vera, E. Motta, J. Domingue, S. B. Shum, and M. Lanzoni, Knowledge Extraction by using an Ontology-based Annotation Tool, 2001.

J. Z. Wang, Z. Du, R. Payattakool, P. S. Yu, and C. Chen, A new method to measure the semantic similarity of GO terms, Bioinformatics, vol.23, issue.10, pp.1274-1281, 2007.

Z. Wang, J. Zhang, J. Feng, and Z. Chen, Knowledge Graph Embedding by Translating on Hyperplanes, AAAI, pp.1112-1119, 2014.

W. Weaver, Translation, Machine translation of languages, vol.14, pp.15-23, 1955.

C. Wei and H. Kao, Cross-species gene normalization by species inference, BMC Bioinformatics, vol.12, issue.8, p.5, 2011.

C. Wei, Y. Peng, R. Leaman, A. P. Davis, C. J. Mattingly et al., Overview of the BioCreative V Chemical Disease Relation (CDR) Task, Proceedings of the fifth BioCreative challenge evaluation workshop, p.14, 2015.

J. Wieting, M. Bansal, K. Gimpel, and K. Livescu, From paraphrase database to compositional paraphrase model and back, Transactions of the Association for Computational Linguistics, vol.3, pp.345-358, 2015.

D. C. Wimalasuriya and D. Dou, Ontology-based information extraction: An introduction and a survey of current approaches, Journal of Information Science, vol.36, issue.3, pp.306-323, 2010.

F. Xu, H. Uszkoreit, and H. Li, Automatic event and relation detection with seeds of varying complexity, Proceedings of the AAAI workshop event extraction and synthesis, pp.12-17, 2006.

V. H. Yngve, Random generation of English sentences, Massachusetts Inst. of Technology, 1961.

M. Yu and M. Dredze, Improving lexical embeddings with semantic knowledge, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.2, pp.545-550, 2014.

M. D. Zeiler, ADADELTA: An Adaptive Learning Rate Method, 2012.

Q. Zhang, M. Chen, and L. Liu, A review on entity relation extraction, 2017 Second International Conference on Mechanical, Control and Computer Engineering (ICMCCE), pp.178-183, 2017.

D. Zhou, D. Zhong, and Y. He, Biomedical Relation Extraction: From Binary to Complex, Computational and Mathematical Methods in Medicine, 2014.

P. Zweigenbaum and B. Habert, Faire se rencontrer les parallèles : regards croisés sur l'acquisition lexicale monolingue et multilingue, Revue de sociolinguistique en ligne GLOTTOPOL, vol.8, pp.22-44, 2006.