A. Graves, M. Abdel-rahman, and G. Hinton, Speech recognition with deep recurrent neural networks, 2013 IEEE international conference on acoustics, speech and signal processing, pp.6645-6649, 2013.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Advances in neural information processing systems, pp.3111-3119, 2013.

Y. Shengxian-wan, J. Lan, J. Guo, L. Xu, X. Pang et al., A deep architecture for semantic matching with multiple positional sentence representations, Thirtieth AAAI Conference on Artificial Intelligence, 2016.

Y. Yang, W. Yih, and C. Meek, WikiQA: A challenge dataset for open-domain question answering, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015.

Y. Fan, L. Pang, J. Hou, J. Guo, Y. Lan et al., A toolkit for deep text matching, 2017.

G. Salton, Information storage and retrieval. reports on analysis, search, and iterative retrieval, 1968.

M. Baziz, M. Boughanem, N. Aussenac-gilles, and C. Chrisment, Semantic cores for representing documents in ir, Proceedings of the 2005 ACM Symposium on Applied Computing, SAC '05, pp.1011-1017, 2005.

Q. Le and T. Mikolov, Distributed representations of sentences and documents, International conference on machine learning, pp.1188-1196, 2014.

J. Pennington, R. Socher, and C. Manning, Glove: Global vectors for word representation, Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp.1532-1543, 2014.

Y. Lecun, Y. Bengio, and G. Hinton, Deep learning. nature, vol.521, p.436, 2015.

J. Guo, Y. Fan, L. Pang, L. Yang, Q. Ai et al., A deep look into neural ranking models for information retrieval. Information Processing and Management, p.102067, 2019.

R. Kiros, Y. Zhu, R. Ruslan, R. Salakhutdinov, R. Zemel et al., Skip-thought vectors, Advances in Neural Information Processing Systems, vol.28, pp.3294-3302, 2015.

M. Kusner, Y. Sun, N. Kolkin, and K. Weinberger, From word embeddings to document distances, International Conference on Machine Learning, pp.957-966, 2015.

J. Guo, Y. Fan, A. Qingyao, and W. Bruce, Semantic matching by non-linear word transportation for information retrieval, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pp.701-710, 2016.

Y. Wang, S. Liu, N. Afzal, M. Rastegar-mojarad, L. Wang et al., A comparison of word embeddings for the biomedical natural language processing, Journal of biomedical informatics, vol.87, pp.12-20, 2018.

P. Huang, X. He, J. Gao, L. Deng, A. Acero et al., Learning deep structured semantic models for web search using clickthrough data, Proceedings of the 22nd ACM international conference on Information & Knowledge Management, pp.2333-2338, 2013.

B. Hu, Z. Lu, H. Li, and Q. Chen, Convolutional neural network architectures for matching natural language sentences, Advances in neural information processing systems, pp.2042-2050, 2014.

K. Cho, B. Van-merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares et al., Learning phrase representations using rnn encoder-decoder for statistical machine translation, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

L. Pang, Y. Lan, J. Guo, J. Xu, S. Wan et al., Text matching as image recognition, Thirtieth AAAI Conference on Artificial Intelligence, 2016.

T. Gaurav-singh-tomar, O. Duque, J. Täckström, D. Uszkoreit, and . Das, Neural paraphrase identification of questions with noisy pretraining, pp.142-147, 2017.

L. Pang, Y. Lan, J. Guo, J. Xu, J. Xu et al., Deeprank: A new deep architecture for relevance ranking in information retrieval, Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM '17, pp.257-266, 2017.

Y. Peng and B. Liu, Attention-based neural network for shorttext question answering, Proceedings of the 2018 2Nd International Conference on Deep Learning Technologies, ICDLT '18, pp.21-26, 2018.

J. Bromley, I. Guyon, Y. Lecun, E. Säckinger, and R. Shah, Signature verification using a" siamese" time delay neural network, Advances in neural information processing systems, pp.737-744, 1994.

Y. Song, V. Hu, and L. He, P-cnn: Enhancing text matching with positional convolutional neural network. Knowledge-Based Systems, vol.169, pp.67-79, 2019.

Z. Yang, D. Yang, and C. Dyer, Xiaodong He, Alex Smola, and Eduard Hovy. Hierarchical attention networks for document classification, Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp.1480-1489, 2016.

A. Parikh, O. Täckström, D. Das, and J. Uszkoreit, A decomposable attention model for natural language inference, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp.2249-2255, 2016.

Y. Fan, J. Guo, Y. Lan, J. Xu, C. Zhai et al., Modeling diverse relevance patterns in ad-hoc retrieval, The 41st International ACM SIGIR Conference on Research &#38

, Development in Information Retrieval, SIGIR '18, pp.375-384, 2018.

X. Liu and W. Croft, Passage retrieval based on language models, 11th international conf. CIKM'02, 2002.

M. Wang and L. Si, Discriminative probabilistic models for passage based retrieval, 31st international ACM SIGIR'08 conf, 2008.

Y. Lv and C. Zhai, Positional language models for information retrieval, 32nd international ACM SIGIR'09 conf, 2009.

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

C. Manning, P. Raghavan, and H. Schütze, Introduction to information retrieval, Natural Language Engineering, vol.16, issue.1, pp.100-103, 2010.

H. Peter-luhn, A statistical approach to mechanized encoding and searching of literary information, IBM Journal of research and development, vol.1, issue.4, pp.309-317, 1957.

R. Baeza-yates and B. Ribeiro-neto, Modern information retrieval, vol.463, 1999.

C. Zhai and J. Lafferty, Model-based feedback in the language modeling approach to information retrieval, Proceedings of the Tenth International Conference on Information and Knowledge Management, CIKM '01, pp.403-410, 2001.

T. Susan and . Dumais, Improving the retrieval of information from external sources, Behavior Research Methods, Instruments, & Computers, vol.23, issue.2, pp.229-236, 1991.

S. Briet, Qu'est-ce que la documentation?, Éditions documentaires, industrielles et techniques, vol.1, 1951.

K. Michael and . Buckland, What is a "document, Journal of the American society for information science, vol.48, pp.804-809, 1997.

M. David and . Levy, Fixed or fluid?: document stability and new media, Proceedings of the 1994 ACM European conference on Hypermedia technology, pp.24-31, 1994.

J. Bernard, A. Jansen, J. Spink, T. Bateman, and . Saracevic, Real life information retrieval: A study of user queries on the web, Acm sigir forum, vol.32, pp.5-17, 1998.

C. Silverstein, H. Marais, M. Henzinger, and M. Moricz, Analysis of a very large web search engine query log, ACm SIGIR Forum, vol.33, pp.6-12, 1999.

A. Carlos and . Cuadra, Experimental Studies of Relevance Judgments, System Development Corporation, 1967.

S. William and . Cooper, A definition of relevance for information retrieval. Information storage and retrieval, vol.7, pp.19-37, 1971.

S. Mizzaro, Relevance: The whole history, Journal of the American society for information science, vol.48, issue.9, pp.810-832, 1997.

J. Mao, Y. Liu, K. Zhou, J. Nie, J. Song et al., When does relevance mean usefulness and user satisfaction in web search?, Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '16, pp.463-472, 2016.

K. Sparck and J. , A statistical interpretation of term specificity and its application in retrieval, Journal of documentation, vol.28, issue.1, pp.11-21, 1972.

S. Deerwester, T. Susan, G. W. Dumais, . Furnas, K. Thomas et al., Journal of the American society for information science, vol.41, pp.391-407, 1990.

D. David and . Lewis, Text representation for intelligent text retrieval: A classification-oriented view. Text-based intelligent systems: current research and practice in information extraction and retrieval, pp.179-197, 1992.

D. Metzler and . Bruce, Combining the language model and inference network approaches to retrieval. Information processing & management, vol.40, pp.735-750, 2004.

M. W. Bilotti, P. Ogilvie, J. Callan, and E. Nyberg, Structured retrieval for question answering, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '07, pp.351-358, 2007.

Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin, A neural probabilistic language model, Journal of machine learning research, vol.3, pp.1137-1155, 2003.

T. Mikolov, Y. Wen-tau, and G. Zweig, Linguistic regularities in continuous space word representations, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.746-751, 2013.

K. Thomas, S. T. Landauer, and . Dumais, A solution to plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge, Psychological review, vol.104, issue.2, p.211, 1997.

K. Thomas, . Landauer, W. Peter, D. Foltz, and . Laham, An introduction to latent semantic analysis, Discourse processes, vol.25, pp.259-284, 1998.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

O. Levy and Y. Goldberg, Linguistic regularities in sparse and explicit word representations, Proceedings of the eighteenth conference on computational natural language learning, pp.171-180, 2014.

E. Stephen, S. Robertson, and . Walker, Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval, 17th international ACM SIGIR'94 conf, 1994.

D. Metzler, Generalized inverse document frequency, Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM '08, pp.399-408, 2008.

G. Salton, A. Wong, and C. S. Yang, A vector space model for automatic indexing, Commun. ACM, vol.18, issue.11, pp.613-620, 1975.

J. Michael-ponte and W. Croft, A language modeling approach to information retrieval, 1998.

T. Liu, Learning to rank for information retrieval, Foundations and Trends R in Information Retrieval, vol.3, issue.3, pp.225-331, 2009.

C. Charu, C. Aggarwal, and . Zhai, Mining text data, 2012.

Y. Zhang, A. Md-mustafizur-rahman, B. Braylan, H. Dang, H. Chang et al., Neural information retrieval: A literature review, 2016.

J. Bernard, A. Jansen, and . Spink, Analysis of document viewing patterns of web search engine users, Web mining: Applications and techniques, pp.339-354, 2005.

W. James, K. Perry, M. M. Allen, and . Berry, Machine literature searching x. machine language; factors underlying its design and development, vol.6, p.242, 1955.

L. A. Charles, M. Clarke, G. V. Kolla, O. Cormack, A. Vechtomova et al., Novelty and diversity in information retrieval evaluation, Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '08, pp.659-666, 2008.

W. Goffman, A searching procedure for information retrieval, vol.2, pp.73-78, 1964.

A. Turpin and F. Scholer, User performance versus precision measures for simple search tasks, Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '06, pp.11-18, 2006.

K. Järvelin and J. Kekäläinen, Cumulated gain-based evaluation of ir techniques, ACM Transactions on Information Systems (TOIS), vol.20, issue.4, pp.422-446, 2002.

P. Paroubek, S. Chaudiron, and L. Hirschman, Principles of evaluation in natural language processing, Traitement Automatique des Langues, vol.48, issue.1, pp.7-31, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00502700

T. Susan, G. W. Dumais, . Furnas, K. Thomas, S. Landauer et al., Using latent semantic analysis to improve access to textual information, Proceedings of the SIGCHI conference on Human factors in computing systems, pp.281-285, 1988.

J. Gonzalo, F. Verdejo, I. Chugur, and J. Cigarran, Indexing with wordnet synsets can improve text retrieval, 1998.

M. Sanderson, Word sense disambiguation and information retrieval, SIGIR'94, pp.142-151, 1994.

A. Arampatzis and J. Kamps, A study of query length, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, pp.811-812, 2008.

L. Deng and D. Yu, Deep learning: methods and applications. Foundations and Trends R in Signal Processing, vol.7, pp.197-387, 2014.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012.

G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed et al., Deep neural networks for acoustic modeling in speech recognition, IEEE Signal processing magazine, p.29, 2012.

I. Sutskever, O. Vinyals, and Q. Le, Sequence to sequence learning with neural networks, Advances in neural information processing systems, pp.3104-3112, 2014.

S. Warren, W. Mcculloch, and . Pitts, A logical calculus of the ideas immanent in nervous activity, The bulletin of mathematical biophysics, vol.5, issue.4, pp.115-133, 1943.

M. Christopher and . Bishop, Neural networks for pattern recognition, 1995.

M. Boughanem, T. Dkaki, J. Mothe, and C. Soule-dupuy, Mercure at trec7. In TREC, vol.1998, pp.355-360, 1998.

J. Nickolls, I. Buck, and M. Garland, Scalable parallel programming, IEEE Hot Chips 20 Symposium (HCS), pp.40-53, 2008.

H. R. Jerome-y-lettvin, . Maturana, S. Warren, . Mcculloch, H. Walter et al., What the frog's eye tells the frog's brain, Proceedings of the IRE, vol.47, issue.11, pp.1940-1951, 1959.

G. Alcantara, Empirical analysis of non-linear activation functions for deep neural networks in classification tasks, 2017.

M. Minsky, A. Seymour, and . Papert, Perceptrons: An introduction to computational geometry, 1969.

D. Svozil, V. Kvasnicka, and J. Pospichal, Introduction to multi-layer feed-forward neural networks. Chemometrics and intelligent laboratory systems, vol.39, pp.43-62, 1997.

. Richard-p-lippmann, Pattern classification using neural networks, IEEE communications magazine, vol.27, issue.11, pp.47-50, 1989.

F. Donald and . Specht, A general regression neural network, IEEE transactions on neural networks, vol.2, pp.568-576, 1991.

. John-s-denker, H. P. Gardner, D. Graf, R. E. Henderson, W. Howard et al., Neural network recognizer for hand-written zip code digits, Advances in neural information processing systems, pp.323-331, 1989.

Y. Lecun, B. Boser, S. John, D. Denker, R. E. Henderson et al., Backpropagation applied to handwritten zip code recognition, Neural computation, vol.1, issue.4, pp.541-551, 1989.

Y. Lecun and Y. Bengio, Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, vol.3361, 1995.

M. Egmont-petersen, D. De-ridder, and H. Handels, Image processing with neural networks-a review, Pattern recognition, vol.35, issue.10, pp.2279-2301, 2002.

F. Milletari, N. Navab, and S. Ahmadi, V-net: Fully convolutional neural networks for volumetric medical image segmentation, 2016 Fourth International Conference on 3D Vision (3DV), pp.565-571, 2016.

G. E. David-e-rumelhart, R. J. Hinton, and . Williams, Learning representations by back-propagating errors, Cognitive modeling, vol.5, issue.3, p.1, 1988.

A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, Proceedings of the 23rd international conference on Machine learning, pp.369-376, 2006.

K. Kawakami, Supervised sequence labelling with recurrent neural networks, 2008.

I. Sutskever, J. Martens, and G. E. Hinton, Generating text with recurrent neural networks, Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp.1017-1024, 2011.

P. Neculoiu, M. Versteegh, and M. Rotaru, Learning text similarity with siamese recurrent networks, Proceedings of the 1st Workshop on Representation Learning for NLP, pp.148-157, 2016.

M. Schuster, K. Kuldip, and . Paliwal, Bidirectional recurrent neural networks, IEEE Transactions on Signal Processing, vol.45, issue.11, pp.2673-2681, 1997.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural computation, vol.9, issue.8, pp.1735-1780, 1997.

A. Graves and J. Schmidhuber, Framewise phoneme classification with bidirectional lstm and other neural network architectures, Neural Networks, vol.18, issue.5-6, pp.602-610, 2005.

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones et al., Attention is all you need, Advances in neural information processing systems, pp.5998-6008, 2017.

A. Radford, J. Wu, D. Amodei, D. Amodei, J. Clark et al., Better language models and their implications. OpenAI, 2018.

J. Devlin, M. Chang, K. Lee, and K. Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.4171-4186, 2019.

O. Chapelle, B. Scholkopf, and A. Zien, Semisupervised learning (chapelle, IEEE Transactions on Neural Networks, vol.20, issue.3, pp.542-542, 2006.

R. Hoffmann, C. Zhang, X. Ling, L. Zettlemoyer, and D. S. Weld, Knowledge-based weak supervision for information extraction of overlapping relations, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.541-550, 2011.

N. Rasiwasia and N. Vasconcelos, Scene classification with lowdimensional semantic spaces and weak supervision, IEEE Conference on Computer Vision and Pattern Recognition, pp.1-6, 2008.

S. Shangxuan-tian, C. Lu, and . Li, Wetext: Scene text detection under weak supervision, Proceedings of the IEEE International Conference on Computer Vision, pp.1492-1500, 2017.

M. Dehghani, A. Severyn, S. Rothe, and J. Kamps, Learning to learn from weak supervision by full supervision, 2017.

M. Dehghani, H. Zamani, A. Severyn, J. Kamps, and W. B. Croft, Neural ranking models with weak supervision, Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '17, pp.65-74, 2017.

Z. Zhou, A brief introduction to weakly supervised learning, National Science Review, vol.5, issue.1, pp.44-53, 2017.

T. J. Geoffrey-e-hinton, . Sejnowski, and . Poggio, Unsupervised learning: foundations of neural computation, 1999.

Y. Lecun, B. E. Boser, J. S. Denker, D. Henderson, R. E. Howard et al., Handwritten digit recognition with a back-propagation network, Advances in neural information processing systems, pp.396-404, 1990.

T. C. Anthony and . Goh, Back-propagation neural networks for modeling complex systems, Artificial Intelligence in Engineering, vol.9, issue.3, pp.143-151, 1995.

R. Hecht-nielsen, Theory of the backpropagation neural network, Neural networks for perception, pp.65-93, 1992.

E. John and . Moody, The effective number of parameters: An analysis of generalization and regularization in nonlinear learning systems, Advances in neural information processing systems, pp.847-854, 1992.

R. Reed, J. Robert, and . Marksii, Neural smithing: supervised learning in feedforward artificial neural networks, 1999.

Y. Jin, T. Okabe, and B. Sendhoff, Neural network regularization and ensembling using multi-objective evolutionary algorithms, Proceedings of the 2004 Congress on Evolutionary Computation, vol.1, pp.1-8, 2004.

Y. Yao, L. Rosasco, and A. Caponnetto, On early stopping in gradient descent learning, Constructive Approximation, vol.26, pp.289-315, 2007.

G. Raskutti, J. Martin, B. Wainwright, and . Yu, Early stopping and non-parametric regression: an optimal data-dependent stopping rule, The Journal of Machine Learning Research, vol.15, issue.1, pp.335-366, 2014.

N. Srivastava, G. Hinton, A. Krizhevsky, R. Ilya, and . Salakhutdinov, Dropout: a simple way to prevent neural networks from overfitting, The Journal of Machine Learning Research, vol.15, issue.1, pp.1929-1958, 2014.

N. Geoffrey-e-hinton, A. Srivastava, . Krizhevsky, . Ilya, R. Ruslan et al., Improving neural networks by preventing coadaptation of feature detectors, 2012.

S. Ioffe and C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, International Conference on Machine Learning, pp.448-456, 2015.

M. Hanna and . Wallach, Topic modeling: Beyond bag-of-words, Proceedings of the 23rd International Conference on Machine Learning, ICML '06, pp.977-984, 2006.

A. Kao and S. R. Poteet, Natural language processing and text mining, 2007.

B. Croft, D. Metzler, and T. Strohman, Search engines: Information retrieval in practice, vol.520, 2010.

R. Krovetz and . Bruce-croft, Lexical ambiguity and information retrieval, ACM Transactions on Information Systems (TOIS), vol.10, issue.2, pp.115-141, 1992.

B. Burghard and . Rieger, On distributed representation in word semantics, ternational Computer Science Institute, 1991.

A. Bookstein and . Don-r-swanson, Probabilistic models for automatic indexing, Journal of the American Society for Information science, vol.25, issue.5, pp.312-316, 1974.

P. Stephen and . Harter, A probabilistic approach to automatic keyword indexing, 1974.

M. Baroni, G. Dinu, and G. Kruszewski, Don't count, predict! a systematic comparison of context-counting vs. contextpredicting semantic vectors, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.238-247, 2014.

K. Thomas, S. Landauer, and . Dumais, Latent semantic analysis, Scholarpedia, vol.3, issue.11, p.4356, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00190471

G. Nguyen, L. Tamine, L. Soulier, and N. Souf, Learning concept-driven document embeddings for medical information search, Conference on Artificial Intelligence in Medicine in Europe, pp.160-170, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01517094

Y. Ni, K. Qiong, F. Xu, Y. Cao, D. Mass et al., Semantic documents relatedness using concept graph representation, Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, pp.635-644, 2016.

A. Bordes, N. Usunier, A. Garcia-duran, J. Weston, and O. Yakhnenko, Translating embeddings for modeling multirelational data, Advances in neural information processing systems, pp.2787-2795, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00920777

G. Nguyen, Modèles neuronaux pour la recherche d'information: approches dirigées par les ressources sémantiques, 2018.

A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov, Bag of tricks for efficient text classification, Proceedings of the 15th Conference of the European Chapter, vol.2, pp.427-431, 2017.

J. Wieting, M. Bansal, K. Gimpel, and K. Livescu, Charagram: Embedding words and sentences via character n-grams, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp.1504-1515, 2016.

G. E. Forsythe and P. Henrici, The cyclic jacobi method for computing the principal values of a complex matrix, Transactions of the American Mathematical Society, vol.94, issue.1, pp.1-23, 1960.

H. Gene, C. Golub, and . Reinsch, Singular value decomposition and least squares solutions, Linear Algebra, pp.134-151, 1971.

W. David and . Miller, Computer solution of linear algebraic systems, 1968.

D. S. Thomas-k-landauer, S. Mcnamara, W. Dennis, and . Kintsch, Handbook of latent semantic analysis, 2013.

T. Hofmann, Probabilistic latent semantic indexing, ACM SIGIR Forum, vol.51, pp.211-218, 2017.

K. Lund and C. Burgess, Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior research methods, instruments, & computers, vol.28, pp.203-208, 1996.

L. M. Douglas-lt-rohde, D. C. Gonnerman, and . Plaut, An improved model of semantic similarity based on lexical co-occurrence, Communications of the ACM, vol.8, p.116, 2006.

R. Lebret and R. Collobert, Word emdeddings through hellinger pca, 2013.

A. Field, Discovering statistics using SPSS, 2009.

R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu et al., Natural language processing (almost) from scratch, Journal of machine learning research, vol.12, pp.2493-2537, 2011.

, Xin Rong. word2vec parameter learning explained, 2014.

I. Vuli? and M. Moens, Monolingual and cross-lingual information retrieval models based on (bilingual) word embeddings, Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp.363-372, 2015.

J. Coulmance, J. Marty, G. Wenzek, and A. Benhalloum, Trans-gram, 2016.

S. Upadhyay, M. Faruqui, C. Dyer, and D. Roth, Crosslingual models of word embeddings: An empirical comparison, 2016.

E. Nalisnick, B. Mitra, N. Craswell, and R. Caruana, Improving document ranking with dual word embeddings, Proceedings of the 25th International Conference Companion on World Wide Web, WWW '16 Companion, pp.83-84, 2016.

H. Zamani and . Bruce-croft, Relevance-based word embedding, Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.505-514, 2017.

G. Zhou, T. He, J. Zhao, and P. Hu, Learning continuous word embedding with metadata for question retrieval in community question answering, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.250-259, 2015.

J. Lu, J. Yang, D. Batra, and D. Parikh, Hierarchical question-image co-attention for visual question answering, Advances In Neural Information Processing Systems, pp.289-297, 2016.

L. Yang, Q. Ai, J. Guo, and W. Croft, anmm: Ranking short answer texts with attention-based neural matching model, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pp.287-296, 2016.

G. Zheng and J. Callan, Learning to reweight terms with distributed representations, Proceedings of the 38th International ACM SI-GIR Conference on Research and Development in Information Retrieval, SIGIR '15, pp.575-584, 2015.

D. Ganguly, D. Roy, M. Mitra, J. F. Gareth, and . Jones, Word embedding based generalized language model for information retrieval, Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '15, pp.795-798, 2015.

F. Diaz, M. Bhaskar, and N. Craswell, Query expansion with locally-trained word embeddings, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.367-377, 2016.

T. Kenter, . Borisov, K. De-rijke, N. A. Erk, and . Smith, Siamese cbow: Optimizing word embeddings for sentence representations, 2016.

F. Hill, K. Cho, and A. Korhonen, Learning distributed representations of sentences from unlabelled data, Proceedings of NAACL-HLT, pp.1367-1377, 2016.

S. Arora, Y. Liang, and T. Ma, A simple but tough-tobeat baseline for sentence embeddings, 2016.

C. D. Boom, S. Van-canneyt, T. Demeester, and B. Dhoedt, Representation learning for very short texts using weighted word embedding aggregation, Pattern Recognition Letters, vol.80, pp.150-156, 2016.

H. Zamani and W. Croft, Estimating embedding vectors for queries, Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval, ICTIR '16, pp.123-132, 2016.

T. Zhao, K. Lee, and M. Eskenazi, Unsupervised discrete sentence representation learning for interpretable neural dialog generation, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1098-1107, 2018.

L. Logeswaran and H. Lee, An efficient framework for learning sentence representations, 2018.

W. Yin and H. Schütze, Convolutional neural network for paraphrase identification, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.901-911, 2015.

E. Geoffrey, . Hinton, R. Ruslan, and . Salakhutdinov, Reducing the dimensionality of data with neural networks. science, vol.313, pp.504-507, 2006.

K. Pearson and F. R. Liii, on lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine, Journal of Science, vol.2, issue.11, pp.559-572, 1901.

J. Francis and . Pelletier, The principle of semantic compositionality, vol.13, pp.11-24, 1994.

Q. Ai, L. Yang, J. Guo, and W. B. Croft, Improving language estimation with the paragraph vector model for ad-hoc retrieval, Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '16, pp.869-872, 2016.

L. Wu, I. En-hsu-yen, K. Xu, F. Xu, A. Balakrishnan et al., Word mover's embedding: From word2vec to document embedding, pp.4524-4534, 2018.

G. Huang, C. Guo, J. Matt, Y. Kusner, F. Sun et al., Supervised word mover's distance, Advances in Neural Information Processing Systems, pp.4862-4870, 2016.

L. Eunjeong, S. Park, P. Cho, and . Kang, Supervised paragraph vector: distributed representations of words, documents and class labels, IEEE Access, 2019.

T. Kenter and . Maarten-de-rijke, Short text similarity with word embeddings, Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM '15, pp.1411-1420, 2015.

Q. Ai, L. Yang, J. Guo, and W. B. Croft, Analysis of the paragraph vector model for information retrieval, Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval, ICTIR '16, pp.133-142, 2016.

J. Guo, Y. Fan, A. Qingyao, and W. Croft, A deep relevance matching model for ad-hoc retrieval, 25th ACM International Conf. CIKM'16, 2016.

B. Mitra, F. Diaz, and N. Craswell, Learning to match using local and distributed representations of text for web search, Proceedings of the 26th International Conference on World Wide Web, WWW '17, pp.1291-1299, 2017.

C. Xiong, Z. Dai, J. Callan, Z. Liu, and R. Power, End-to-end neural ad-hoc ranking with kernel pooling, Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '17, pp.55-64, 2017.

G. Zuccon, B. Koopman, P. Bruza, and L. Azzopardi, Integrating and evaluating neural word embeddings in information retrieval, Proceedings of the 20th Australasian document computing symposium, p.12, 2015.

A. Berger and J. Lafferty, Information retrieval as statistical translation, SIGIR Forum, vol.51, pp.219-226, 2017.

G. Zheng and J. Callan, Learning to reweight terms with distributed representations, Proceedings of the 38th International ACM SI-GIR Conference on Research and Development in Information Retrieval, SIGIR '15, pp.575-584, 2015.

E. Bhaskar-mitra, N. Nalisnick, R. Craswell, and . Caruana, A dual embedding space model for document ranking, 2016.

Y. Shen, X. He, J. Gao, L. Deng, and G. Mesnil, Learning semantic representations using convolutional neural networks for web search, 23rd International Conference on World Wide Web'14, 2014.

M. Peters, M. Neumann, L. Zettlemoyer, and W. Yih, Dissecting contextual word embeddings: Architecture and representation, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp.1499-1509, 2018.

M. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark et al., Deep contextualized word representations, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.2227-2237, 2018.

K. Al, -. Sabahi, and Z. Zuping, Document summarization using sentence-level semantic based on word embeddings, International Journal of Software Engineering and Knowledge Engineering, vol.29, issue.02, pp.177-196, 2019.

M. E. Maron and J. Kuhns, On relevance, probabilistic indexing and information retrieval, Journal of the ACM (JACM), vol.7, issue.3, pp.216-244, 1960.

D. Roy, D. Paul, M. Mitra, and U. Garain, Using word embeddings for automatic query expansion, 2016.

S. Kuzi, A. Shtok, and O. Kurland, Query expansion using word embeddings, Proceedings of the 25th ACM international on conference on information and knowledge management, pp.1929-1932, 2016.

N. Rekabsaz, M. Lupu, A. Hanbury, and H. Zamani, Word embedding causes topic shifting; exploit global context!, Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '17, pp.1105-1108, 2017.

A. Mnih and Y. Teh, A fast and simple algorithm for training neural probabilistic language models, 2012.

N. Rekabsaz, B. Mitra, M. Lupu, and A. Hanbury, Toward incorporation of relevant documents in word2vec, ArXiv, 2017.

K. Taghipour and H. Ng, Semi-supervised word sense disambiguation using word embeddings in general and specific domains, Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: human language technologies, pp.314-323, 2015.

O. Avraham and Y. Goldberg, The interplay of semantics and morphology in word embeddings, p.422, 2017.

I. Iacobacci, M. T. Pilehvar, and R. Navigli, Sensembed: Learning sense embeddings for word and relational similarity, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.95-105, 2015.

Y. Yaghoobzadeh and H. Schütze, Intrinsic subspace evaluation of word embedding representations, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.236-246, 2016.

N. Mrk?ic, D. Oséaghdha, B. Thomson, M. Ga?ic, L. Rojas-barahona et al., Counter-fitting word vectors to linguistic constraints, Proceedings of NAACL-HLT, pp.142-148, 2016.

S. Macavaney, A. Yates, A. Cohan, and N. Goharian, Contextualized word representations for document re-ranking, 2019.

Z. Dai and J. Callan, Deeper text understanding for ir with contextual neural language modeling, 2019.

I. Tenney, P. Xia, B. Chen, A. Wang, A. Poliak et al., What do you learn from context? probing for sentence structure in contextualized word representations, 2019.

Y. Qiao, C. Xiong, Z. Liu, and Z. Liu, Understanding the behaviors of bert in ranking, 2019.

D. Roy, D. Ganguly, S. Bhatia, S. Bedathur, and M. Mitra, Using word embeddings for information retrieval: How collection and term normalization choices affect performance, Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp.1835-1838, 2018.

H. Li, Learning to rank for information retrieval and natural language processing, Synthesis Lectures on Human Language Technologies, vol.4, issue.1, pp.1-113, 2011.

S. Cunningham, J. Littin, and I. H. Witten, Applications of machine learning in information retrieval, 1997.

T. Liu, Learning to rank for information retrieval, 2011.

K. Duh and K. Kirchhoff, Learning to rank with partially-labeled data, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, pp.251-258, 2008.

Z. Cao, T. Qin, T. Liu, M. Tsai, and H. Li, Learning to rank: from pairwise approach to listwise approach, Proceedings of the 24th international conference on Machine learning, pp.129-136, 2007.

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds et al., Learning to rank using gradient descent, Proceedings of the 22nd International Conference on Machine learning (ICML-05), pp.89-96, 2005.

W. Chen, T. Liu, Y. Lan, Z. Ma, and H. Li, Ranking measures and loss functions in learning to rank, Advances in Neural Information Processing Systems, pp.315-323, 2009.

M. Szummer and E. Yilmaz, Semi-supervised learning to rank with preference regularization, Proceedings of the 20th ACM international conference on Information and knowledge management, pp.269-278, 2011.

T. Qin and T. Liu, , 2013.

W. Chu and Z. Ghahramani, Gaussian processes for ordinal regression, Journal of machine learning research, vol.6, pp.1019-1041, 2005.

W. Chu and . Sathiya-keerthi, New approaches to support vector ordinal regression, Proceedings of the 22nd international conference on Machine learning, pp.145-152, 2005.

S. William, F. C. Cooper, D. Gey, and . Dabney, Probabilistic retrieval based on staged logistic regression, Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, pp.198-210, 1992.

B. Bartell, W. Garrison, R. Cottrell, and . Belew, Learning to retrieve information, Proceedings of the Swedish Conference on Connectionism, p.27, 1995.

Y. Cao, J. Xu, T. Liu, H. Li, Y. Huang et al., Adapting ranking svm to document retrieval, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp.186-193, 2006.

T. Qin, T. Liu, M. Tsai, X. Zhang, and H. Li, Learning to search web pages with query-level loss functions, vol.156, 2006.

T. Qin, X. Zhang, M. Tsai, D. Wang, T. Liu et al., Query-level loss functions for information retrieval. Information Processing & Management, vol.44, pp.838-855, 2008.

X. Geng, T. Liu, T. Qin, and H. Li, Feature selection for ranking, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pp.407-414, 2007.

H. Wei and S. Billings, Feature subset selection and ranking for data dimensionality reduction, IEEE transactions on pattern analysis and machine intelligence, vol.29, pp.162-166, 2006.

Y. Goldberg, Neural network methods for natural language processing, Synthesis Lectures on Human Language Technologies, vol.10, issue.1, pp.1-309, 2017.

N. Bhaskar-mitra and . Craswell, An introduction to neural information retrieval. Foundations and Trends R in Information Retrieval, vol.13, pp.1-126, 2018.

N. Craswell, B. Croft, J. Guo, M. Bhaskar, and M. De-rijke, Report on the sigir 2016 workshop on neural information retrieval (neu-ir). In ACM Sigir forum, vol.50, pp.96-103, 2017.

B. Mitra, F. Diaz, and N. Craswell, Learning to match using local and distributed representations of text for web search, Proceedings of the 26th International Conference on World Wide Web, pp.1291-1299, 2017.

H. Li and Z. Lu, Deep learning for information retrieval, Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '16, pp.1203-1206, 2016.

Y. Kezban-dilek-onal, I. Zhang, M. Sengor-altingovde, P. Mustafizur-rahman, A. Karagoz et al., Neural information retrieval: at the end of the early years, Information Retrieval Journal, vol.21, issue.2, pp.111-182, 2018.

R. Schneider, S. Arnold, T. Oberhauser, T. Klatt, T. Steffek et al., Smart-md: Neural paragraph retrieval of medical topics, International World Wide Web Conferences Steering Committee, pp.203-206, 2018.

L. Yu, K. M. Hermann, P. Blunsom, and S. Pulman, Deep learning for answer sentence selection, 2014.

J. Rao, H. He, and J. Lin, Noise-contrastive estimation for answer selection with deep neural networks, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM '16, pp.1913-1916, 2016.

B. Wang, K. Liu, and J. Zhao, Inner attention based recurrent neural networks for answer selection, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1288-1297, 2016.

X. Qiu and X. Huang, Convolutional neural tensor network architecture for community-based question answering, Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015.

X. Zhou, B. Hu, Q. Chen, B. Tang, and X. Wang, Answer sequence learning with neural networks for answer selection in community question answering, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.2, pp.713-718, 2015.

S. Lai, L. Xu, K. Liu, and J. Zhao, Recurrent convolutional neural networks for text classification, Twenty-ninth AAAI conference on artificial intelligence, 2015.

W. Yin, H. Schütze, B. Xiang, and B. Zhou, Abcnn: Attention-based convolutional neural network for modeling sentence pairs, Transactions of the Association for Computational Linguistics, vol.4, pp.259-272, 2016.

Y. Shen, X. He, J. Gao, L. Deng, and G. Mesnil, A latent semantic model with convolutional-pooling structure for information retrieval, Proceedings of the 23rd ACM international conference on conference on information and knowledge management, pp.101-110, 2014.

A. Severyn and A. Moschitti, Learning to rank short text pairs with convolutional deep neural networks, Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp.373-382, 2015.

H. Palangi, L. Deng, Y. Shen, J. Gao, X. He et al., Deep sentence embedding using long short-term memory networks: Analysis and application to information retrieval, IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), vol.24, issue.4, pp.694-707, 2016.

P. Chen, W. Guo, Z. Chen, J. Sun, and L. You, Gated convolutional neural network for sentence matching, Proc, pp.2853-2857, 2018.

S. Kim, I. Kang, and N. Kwak, Semantic sentence matching with densely-connected recurrent and co-attentive information, Proceedings of the AAAI Conference on Artificial Intelligence, vol.33, pp.6586-6593, 2019.

Z. Lu and H. Li, A deep architecture for matching short texts, Advances in neural information processing systems, pp.1367-1375, 2013.

S. Kamath, B. Grau, and Y. Ma, Predicting and Integrating Expected Answer Types into a Simple Recurrent Neural Network Model for Answer Sentence Selection, 20th International Conference on Computational Linguistics and Intelligent Text Processing, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02104488

X. Zhou, B. Hu, Q. Chen, and X. Wang, Recurrent convolutional neural network for answer selection in community question answering, Neurocomputing, vol.274, pp.8-18, 2018.

C. Xiong, Z. Dai, J. Callan, Z. Liu, and R. Power, End-to-end neural ad-hoc ranking with kernel pooling, Proceedings of the 40th International ACM SIGIR conference on research and development in information retrieval, pp.55-64, 2017.

Y. Shengxian-wan, J. Lan, J. Xu, and . Guo, Liang Pang, and Xueqi Cheng. Match-srnn: Modeling the recursive matching structure with spatial rnn, 2016.

K. Hui, A. Yates, K. Berberich, and G. De-melo, Positionaware representations for relevance matching in neural information retrieval, Proceedings of the 26th International Conference on World Wide Web Companion, WWW '17 Companion, pp.799-800, 2017.

N. Kalchbrenner, E. Grefenstette, and P. Blunsom, A convolutional neural network for modelling sentences, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.655-665, 2014.

D. Cohen, Q. Ai, and W. Croft, Adaptability of neural networks on varying granularity ir tasks, 2016.

Q. Liu-tieyan, . Tao, and . Xu-jun, Benchmark dataset for research on learning to rank for information retrieval, Proceedings of the Workshop on Learning to Rank for Information Retrieval, pp.137-145, 2007.

H. Zamani, B. Mitra, X. Song, N. Craswell, and S. Tiwary, Neural ranking models with multiple document fields, Proceedings of the eleventh ACM international conference on web search and data mining, pp.700-708, 2018.

T. Belkacem, G. Jose, T. Moreno, M. Dkaki, and . Boughanem, Asymmetry sensitive architecture for neural text matching, European Conference on Information Retrieval, pp.62-69, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02435348

S. Stephen-e-robertson, S. Walker, M. M. Jones, M. Hancock-beaulieu, and . Gatford, Okapi at trec-3, Nist Special Publication Sp, vol.109, p.109, 1995.

T. Strohman, D. Metzler, H. Turtle, and W. Croft, Indri: A language model-based search engine for complex queries, Proceedings of the International Conference on Intelligent Analysis, vol.2, pp.2-6, 2005.

C. Van-gysel, E. Kanoulas, and M. De-rijke, Pyndri: a python interface to the indri search engine, European Conference on Information Retrieval, pp.744-748, 2017.

C. Buckley, Available from ftp, 1975.

H. Zamani and . Bruce-croft, Embedding-based query language models, Proceedings of the 2016 ACM international conference on the theory of information retrieval, pp.147-156, 2016.

Y. Lv and C. Zhai, Lower-bounding term frequency normalization, Proceedings of the 20th ACM international conference on Information and knowledge management, pp.7-16, 2011.

T. Belkacem, T. Dkaki, J. G. Moreno, and M. Boughanem, Impact de la présence/absence des termes de la requête dans le document sur le processus d'appariement document-requête en utilisant word2vec, COnférence en Recherche d'Informations et Applications -CORIA 2018, 15th French Information Retrieval Conference, 2018.

D. Roy, D. Ganguly, M. Mitra, . Gareth, and . Jones, Representing documents and queries as sets of word embedded vectors for information retrieval, 2016.

N. Rekabsaz, M. Lupu, and A. Hanbury, Exploration of a threshold for similarity based on uncertainty in word embedding, Advances in Information Retrieval, pp.396-409, 2017.

G. Zweig, J. C. Platt, C. Meek, J. C. Christopher, A. Burges et al., Computational approaches to sentence completion, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol.1, pp.601-610, 2012.

Y. Kim, Convolutional neural networks for sentence classification, pp.1746-1751, 2014.

K. Abishek, C. Basuthkar-rajaram-hariharan, and . Valliyammai, An enhanced deep learning model for duplicate question pairs recognition, Soft Computing in Data Analytics, pp.769-777, 2019.

O. Ido-dagan, B. Glickman, and . Magnini, The pascal recognising textual entailment challenge, Machine Learning Challenges Workshop, pp.177-190, 2005.

T. Haug, . Octavian-eugen, P. Ganea, and . Grnarova, Neural multistep reasoning for question answering on semi-structured tables, European Conference on Information Retrieval, pp.611-617, 2018.

P. Zhou, Z. Qi, S. Zheng, J. Xu, H. Bao et al., Text classification improved by integrating bidirectional lstm with two-dimensional max pooling, 2016.

K. Tymoshenko and A. Moschitti, Cross-pair text representations for answer sentence selection, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp.2162-2173, 2018.

J. Allan, J. Aslam, N. Belkin, C. Buckley, J. Callan et al., Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, university of massachusetts amherst, ACM SIGIR Forum, vol.37, pp.31-47, 2002.

A. Zell, Simulation neuronaler netze, vol.1, 1994.

P. Diederik, J. Kingma, and . Ba, Adam: A method for stochastic optimization, 2014.

T. Belkacem, T. Dkaki, G. Jose, M. Moreno, and . Boughanem, amv-lstm: an attention-based model with multiple positional text matching, Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, pp.788-795, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02441990

P. Rajpurkar, J. Zhang, K. Lopyrev, and P. Liang, Squad: 100,000+ questions for machine comprehension of text, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp.2383-2392, 2016.

T. Nguyen, M. Rosenberg, X. Song, J. Gao, S. Tiwary et al., Ms marco: A human-generated machine reading comprehension dataset, 2016.