A. Automata, Modular, Prior Knowledge Sources Contents 6.1 Introduction, 100 6.2 Background on Grammars and Automata . . . . . . 102 6

]. L. Androutsopoulos, Natural language interfaces to databases ??? an introduction, Natural Language Engineering, vol.14, issue.01, p.2981, 1995.
DOI : 10.1145/319983.319986

URL : http://arxiv.org/pdf/cmp-lg/9503016

Z. Artzi, Y. Artzi, and L. Zettlemoyer, Weakly supervised learning of semantic parsers for mapping instructions to actions, Transactions of the Association for Computational Linguistics, vol.1, issue.1, p.4962, 2013.

. Aziz, Exact Decoding for Phrase-Based Statistical Machine Translation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014.
DOI : 10.3115/v1/D14-1131

URL : http://emnlp2014.org/papers/pdf/EMNLP2014131.pdf

. Bar, On formal properties of simple phrase structure grammars, Zeitschrift für Phonetik, Sprachwissenschaft und Kommunicationsforschung, vol.14, p.143172, 1961.

. Baydin, Atilim Gunes Baydin, Barak A. Pearlmutter, and Alexey Andreyevich Radul Automatic dierentiation in machine learning: a survey, 2015.

. Berant, J. Liang, P. Berant, and . Liang, Semantic Parsing via Paraphrasing, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2014.
DOI : 10.3115/v1/P14-1133

URL : http://www-nlp.stanford.edu/joberant/homepage_files/publications/ACL14.pdf

. Berant, Semantic parsing on freebase from question-answer pairs, Empirical Methods in Natural Language Processing (EMNLP), 2013.

. Bergstra, Theano: a CPU and GPU math expression compiler, Proceedings of the Python for Scientic Computing Conference (SciPy), 2010.

L. Billot, B. Billot, and . Lang, The structure of shared forests in ambiguous parsing, Proceedings of the 27th annual meeting on Association for Computational Linguistics -, p.143151, 1989.
DOI : 10.3115/981623.981641

URL : https://hal.archives-ouvertes.fr/inria-00075520

. Bollacker, Freebase, Proceedings of the 2008 ACM SIGMOD international conference on Management of data , SIGMOD '08, p.12471250, 2008.
DOI : 10.1145/1376616.1376746

. Bordes, Question Answering with Subgraph Embeddings, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), p.615620, 2014.
DOI : 10.3115/v1/D14-1067

URL : http://arxiv.org/pdf/1406.3676

. Bordes, Open Question Answering with Weakly Supervised Embedding Models, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD), 2014.
DOI : 10.1007/978-3-662-44848-9_11

URL : https://hal.archives-ouvertes.fr/hal-01344007

]. Bottou, Une Approche théorique de l'Apprentissage Connexionniste: Applications à la Reconnaissance de la Parole, 1991.

]. Bottou, Large-scale machine learning with stochastic gradient descent, COMPSTAT, 2010.
DOI : 10.1201/b11429-4

. Cai, . Yates, A. Cai, and . Yates, Large-scale semantic parsing via schema matching and lexicon extension, Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2013.

. Calder, Unication categorial grammar: A concise, extendable grammar for natural language processing, Proceedings of the 12th conference on Computational linguistics, p.8386, 1988.
DOI : 10.3115/991635.991653

]. Chiang, A hierarchical phrase-based model for statistical machine translation, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics , ACL '05, p.263270, 2005.
DOI : 10.3115/1219840.1219873

URL : http://acl.ldc.upenn.edu/P/P05/P05-1033.pdf

C. , C. Clark, and J. R. Curran, Wide-coverage ecient statistical parsing with CCG and log-linear models, Computational Linguistics, vol.33, issue.4, p.493552, 2007.
DOI : 10.1162/coli.2007.33.4.493

URL : http://doi.org/10.1162/coli.2007.33.4.493

. Cuong, Conditional random eld with high-order dependencies for sequence labeling and segmentation, J. Mach. Learn. Res, vol.15, issue.1, p.9811009, 2014.

]. Cybenko, Approximation by superpositions of a sigmoidal function, Mathematics of Control, Signals, and Systems (MCSS), p.303314, 1989.

. Dahl, Expanding the scope of the ATIS task, Proceedings of the workshop on Human Language Technology , HLT '94, p.4348, 1994.
DOI : 10.3115/1075812.1075823

]. Dalrymple, Lexical-Functional Grammar (Syntax and Semantics, Syntax and Semantics, 2001.

M. Daumé, . Daumé, D. Iii, and . Marcu, Learning as search optimization, Proceedings of the 22nd international conference on Machine learning , ICML '05, p.169176, 2005.
DOI : 10.1145/1102351.1102373

. Deerwester, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, p.41391407, 1990.
DOI : 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

URL : http://www.cs.bham.ac.uk/~pxt/IDA/lsa_ind.pdf

. Deoras, Variational approximation of long-span language models for lvcsr, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.55325535, 2011.
DOI : 10.1109/ICASSP.2011.5947612

L. Dong, L. Dong, and M. Lapata, Language to Logical Form with Neural Attention, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016.
DOI : 10.18653/v1/P16-1004

URL : http://arxiv.org/pdf/1601.01280

. Duchi, Adaptive subgradient methods for online learning and stochastic optimization, Journal of Machine Learning Research, vol.12, p.21212159, 2011.

]. Dyer, A Formal Model of Ambiguity and its Applications in Machine Translation, 2010.

X. Dymetman, M. Dymetman, and C. Xiao, Log-linear rnns: Towards recurrent neural networks with exible prior knowledge, 1607.

L. Jerey and . Elman, Finding structure in time, COGNITIVE SCIENCE, vol.14, issue.2, p.179211, 1990.

. Fader, Identifying relations for open information extraction, Proceedings of the Conference of Empirical Methods in Natural Language Processing (EMNLP '11), 2011.

. Faruqui, Retrotting word vectors to semantic lexicons, 1411.
DOI : 10.3115/v1/n15-1184

URL : http://arxiv.org/pdf/1411.4166.pdf

. Ganitkevitch, Ppdb: The paraphrase database, North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), p.758764, 2013.

. Goodfellow, Deep Learning, 2016.

. Goyal, Natural language generation through character-based rnns with nite-state prior knowledge, COLING, p.10831092, 2016.

. Graham, An Improved Context-Free Recognizer, ACM Transactions on Programming Languages and Systems, vol.2, issue.3, p.415462, 1980.
DOI : 10.1145/357103.357112

. Guu, From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2017.
DOI : 10.18653/v1/P17-1097

URL : http://arxiv.org/pdf/1704.07926

S. Young, Semantic processing using the hidden vector state model, Computer Speech and Language, vol.19, issue.1, p.85106, 2005.

. Hendrix, Developing a natural language interface to complex data, ACM Trans. Database Syst, vol.3, issue.2, p.105147, 1978.
DOI : 10.1007/978-0-585-35958-8_5

URL : http://www.dtic.mil/cgi-bin/GetTRDoc?AD=ADA157892&Location=U2&doc=GetTRDoc.pdf

. Hirschman, Multisite data collection and evaluation in spoken language understanding, Proceedings of the Workshop on Human Language Technology, HLT '93, p.1924, 1993.
DOI : 10.3115/1075671.1075676

]. S. Hochreiter, Untersuchungen zu dynamischen neuronalen Netzen, 1991.

K. Hornik, Approximation capabilities of multilayer feedforward networks, Neural Networks, vol.4, issue.2, p.251257, 1991.
DOI : 10.1016/0893-6080(91)90009-T

E. P. Xing, Harnessing deep neural networks with logic rules, p.2016

L. Jia, R. Jia, and P. Liang, Data Recombination for Neural Semantic Parsing, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016.
DOI : 10.18653/v1/P16-1002

URL : http://arxiv.org/pdf/1606.03622

I. Michael and . Jordan, Serial order: A parallel, distributed processing approach, 1986.

V. Joshi, K. Aravind, K. Joshi, and . Vijay-shanker, Compositional semantics with lexicalized tree-adjoining grammar (ltag): How much underspecication is necessary? In Computing Meaning, p.147163, 2001.
DOI : 10.1007/978-94-010-0572-2_9

. Kate, J. Mooney-]-rohit, R. J. Kate, and . Mooney, Using stringkernels for learning semantic parsers, ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.913920, 2006.
DOI : 10.3115/1220175.1220290

URL : http://www.cs.utexas.edu/~ml/papers/krisp-acl-06.pdf

. Kate, Learning to transform natural to formal languages, Proceedings, The Twentieth National Conference on Articial Intelligence and the Seventeenth Innovative Applications of Articial Intelligence Conference, p.10621068, 2005.

. Kate, Learning to transform natural to formal languages, Proceedings of the 20th National Conference on Articial Intelligence -Volume 3, AAAI'05, p.10621068, 2005.

J. W. Shavlik, Guiding a reinforcement learner with natural language advice: Initial results in robocup soccer, The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, 2004.

D. Kuhn, R. Mori, R. D. Kuhn, and . Mori, The application of semantic classication trees to natural language understanding, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.17, p.449460, 1995.

. Kwiatkowski, Lexical generalization in ccg grammar induction for semantic parsing, Proceedings of the Conference on Empirical Methods in Natural Language Processing, p.15121523, 2011.

. Kwiatkowski, Scaling semantic parsers with on-the-y ontology matching, Empirical Methods in Natural Language Processing, p.15451556, 2013.

. Pereira, Conditional random elds: Probabilistic models for segmenting and labeling sequence data, Proceedings of the Eighteenth International Conference on Machine Learning, ICML '01, p.282289, 2001.

C. Lebret, R. Lebret, and . Collobert, Rehabilitation of Count-Based Models for Word Vector Representations, Computational Linguistics and Intelligent Text Processing -16th International Conference Proceedings, Part I, p.417429, 2015.
DOI : 10.1007/978-3-319-18111-0_31

. Liang, Learning Dependency-Based Compositional Semantics, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, p.590599, 2011.
DOI : 10.3115/1220835.1220891

URL : https://doi.org/10.1162/coli_a_00127

. Liang, Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1611.
DOI : 10.18653/v1/P17-1003

URL : http://arxiv.org/pdf/1611.00020

]. Liang, Learning executable semantic parsers for natural language understanding, Communications of the ACM, vol.59, issue.9, p.6876, 2016.
DOI : 10.1145/2866568

URL : http://arxiv.org/pdf/1603.06677

. Lodhi, Text classication using string kernels, 2002.

. Macherey, Natural language understanding using statistical machine translation, European Conf. on Speech Communication and Technology, p.22052208, 2001.

. Mikolov, Ecient estimation of word representations in vector space, 1301.

. Mikolov, Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting, p.31113119, 2013.

. Miller, A fully statistical approach to natural language interfaces, Proceedings of the 34th annual meeting on Association for Computational Linguistics -, p.5561, 1996.
DOI : 10.3115/981863.981871

URL : http://www.aclweb.org/anthology/P96-1008

]. Mohri, Weighted Automata Algorithms, p.213254, 2009.
DOI : 10.1007/978-3-642-01492-5_6

URL : http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/35076.pdf

. Mou, Coupling distributed and symbolic execution for natural language queries, 1612.

C. Michael and . Mozer, The induction of multiscale temporal structure, 1992.

]. Nederhof and G. Satta, Probabilistic parsing as intersection, 8th International Workshop on Parsing Technologies, p.137148, 2003.

S. Nederhof, G. Nederhof, and . Satta, Probabilistic Parsing, New Developments in Formal Languages and Applications, pp.229-258, 2008.
DOI : 10.1007/978-3-540-78291-9_7

URL : https://research-repository.st-andrews.ac.uk/bitstream/10023/1660/1/2008a.pdf

. Neelakantan, Learning a natural language interface with neural programmer, 1611.

. Papineni, Featurebased language understanding, Fifth European Conference on Speech Communication and Technology, 1997.

L. Pasupat, Panupong Pasupat and Percy Liang. Compositional semantic parsing on semi-structured tables, p.2015, 2015.

W. Pereira, C. N. Fernando, D. H. Pereira, and . Warren, Definite clause grammars for language analysis???A survey of the formalism and a comparison with augmented transition networks, Artificial Intelligence, vol.13, issue.3, pp.231-278, 1980.
DOI : 10.1016/0004-3702(80)90003-X

I. A. Pollard, Carl Pollard and Sag Ivan A. Head-driven phrase structure grammar, 1994.

. Quirk, Language to Code: Learning Semantic Parsers for If-This-Then-That Recipes, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), p.878888, 2015.
DOI : 10.3115/v1/P15-1085

R. , R. Raymond, and G. Riccardi, Generative and discriminative algorithms for spoken language understanding, Inter- Speech, p.16051608, 2007.

. Reddy, Large-scale semantic parsing without question-answer pairs, Transactions of the Association for Computational Linguistics, vol.2, p.377392, 2014.

. Williams, In Neurocomputing: Foundations of Research, chapter Learning Representations by Back-propagating Errors, p.696699, 1988.

J. Stuart, P. Russell, and . Norvig, Articial Intelligence: A Modern Approach, 2003.

M. Sahlgren, An introduction to random indexing, Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering, 2005.

. Salakhutdinov, Learning with Hierarchical-Deep Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.8, p.19581971, 2013.
DOI : 10.1109/TPAMI.2012.269

URL : http://www.cs.toronto.edu/~rsalakhu/papers/HD_PAMI.pdf

. Schwartz, Language understanding using hidden understanding models, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, p.9971000, 1996.
DOI : 10.1109/ICSLP.1996.607771

URL : http://www.asel.udel.edu/icslp/cdrom/vol2/665/a665.pdf

M. Stuart and . Shieber, Bimorphisms and synchronous grammars, J. Language Modelling, vol.2, issue.1, p.51104, 2014.

S. Siegelmann, T. Hava, E. D. Siegelmann, and . Sontag, Turing computability with neural nets, Applied Mathematics Letters, vol.4, issue.6, p.7780, 1991.
DOI : 10.1016/0893-9659(91)90080-F

URL : https://doi.org/10.1016/0893-9659(91)90080-f

. Srivastava, Dropout: A simple way to prevent neural networks from overtting, Journal of Machine Learning Research, vol.15, issue.1, p.19291958, 2014.

]. Steedman, Surface structure and interpretation. Linguistic inquiry monographs, 1996.

. Sutskever, Ilya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks, Advances in Neural Information Processing Systems (NIPS), p.31043112, 2014.

. Tang, R. Mooney-]-lappoon, R. J. Tang, and . Mooney, Using Multiple Clause Constructors in Inductive Logic Programming for Semantic Parsing, Proceedings of the 12th European Conference on Machine Learning, p.466477, 2001.
DOI : 10.1007/3-540-44795-4_40

. Tieleman, . T. Hinton, G. E. Tieleman, and . Hinton, Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude, 2012.

. Vinyals, Grammar as a foreign language, Neural Information Processing Systems (NIPS), 2014.

. Wang, Building a Semantic Parser Overnight, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), p.13321342, 2015.
DOI : 10.3115/v1/P15-1129

P. Warren, H. D. David, F. C. Warren, and . Pereira, An ecient easily adaptable system for interpreting natural language queries, Comput. Linguist, vol.8, pp.3-4110122, 1982.

. Wen, Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015.
DOI : 10.18653/v1/D15-1199

URL : http://arxiv.org/pdf/1508.01745

M. Wong, R. J. Wong, and . Mooney, Learning for semantic parsing with statistical machine translation, Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics -, p.439446, 2006.
DOI : 10.3115/1220835.1220891

URL : http://acl.ldc.upenn.edu/N/N06/N06-1056.pdf

M. Wong, R. Wong, and . Mooney, Learning synchronous grammars for semantic parsing with lambda calculus, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, p.960967, 2007.

C. Gardent, Orthogonality regularizer for question answering, Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics, *SEM@ACL 2016, pp.11-12, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01623819

M. Chang, S-MART: novel treebased structured learning algorithms applied to tweet entity linking, 1609.

. Yao, Probabilistic text modeling with orthogonalized topics, Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, SIGIR '14, p.907910, 2014.
DOI : 10.1145/2600428.2609471

. Yih, Semantic Parsing for Single-Relation Question Answering, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), p.643648, 2014.
DOI : 10.3115/v1/P14-2105

URL : http://aclweb.org/anthology/P/P14/P14-2105.pdf

. Yih, Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2015.
DOI : 10.3115/v1/P15-1128

M. Zelle, M. Zelle, and R. J. Mooney, Learning to parse database queries using inductive logic programming, AAAI/IAAI, pp.1050-1055, 1996.

C. Zettlemoyer, S. Luke, M. Zettlemoyer, and . Collins, Learning to map sentences to logical form: Structured classication with probabilistic categorial grammars, UAI '05, Proceedings of the 21st Conference in Uncertainty in Articial Intelligence, p.658666, 2005.

C. Zettlemoyer, S. Luke, M. Zettlemoyer, and . Collins, Online learning of relaxed ccg grammars for parsing to logical form, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL-2007, pp.678-687, 2007.