Optimizing Debt Collections Using Constrained Reinforcement Learning, Conference of the Special Interest Group on Knowledge Discovery and Data Mining, 2010. ,
Constrained Policy Optimization, International Conference on Machine Learning, p.86, 2017. ,
, Natural Language Understanding, vol.20, 1995.
Constrained Markov Decision Processes, vol.70, 1999. ,
URL : https://hal.archives-ouvertes.fr/inria-00074109
An automated measure of MDP similarity for transfer in reinforcement learning, Workshops at the Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, 2014. ,
The Machine That Won the War. The Magazine of Fantasy and Science Fiction, 1961. ,
, Continuous transfer in Deep Q-learning
Finite-time analysis of the multiarmed bandit problem, Machine learning, vol.57, p.55, 2002. ,
How to do things with words, p.28, 1962. ,
Sur les opérations dans les ensembles abstraits et leur application aux équations intégrales, Fundamenta Mathematicae, p.37, 1922. ,
Training Dialogue Systems With Human Advice, International Conference on Autonomous Agents and Multiagent Systems, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01945831
Human-Machine Dialogue as a Stochastic Game, Conference of the Special Interest Group on Discourse and Dialogue, vol.55, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01225848
, Human-machine dialogue as a stochastic game, Conference of the Special Interest Group on Discourse and Dialogue, p.97, 2015.
Transfer in deep reinforcement learning using successor features and generalised policy improvement, 2018. ,
Dynamic programming and Lagrange multipliers, National Academy of Sciences of the USA, p.37, 1956. ,
, A Markovian decision process, Journal of Mathematics and Mechanics, vol.37, p.33, 1957.
Functional Approximations and Dynamic Programming, Mathematics of Computation, vol.38, 1959. ,
Word Recognition Computer Program, 1966. ,
Constrained Optimization and Lagrange Multiplier Methods (Optimization and Neural Computation Series), Athena Scientific, p.39, 1996. ,
Optimal policies for controlled Markov chains with a constraint, Journal of Mathematical Analysis and Applications, vol.71, 1985. ,
The composition of messages in speech-graphics interactive systems, International Symposium on Spoken Dialogue, 1996. ,
High Fidelity Speech Synthesis with Adversarial Networks, p.19, 2019. ,
Natural Language Input for a Computer Problem Solving System, vol.20, 1964. ,
GUS, a Frame-driven Dialog System, Artificial Intelligence, 1977. ,
Budget Allocation using Weakly Coupled, Constrained Markov Decision Processes, Conference on Uncertainty in Artificial Intelligence, vol.86, 2016. ,
Hello, It's GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems, p.97, 2019. ,
A Fitted-Q Algorithm for Budgeted MDPs, Workshop on Safety, Risk and Uncertainty in Reinforcement Learning, Conference on Uncertainty in Artificial Intelligence, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01928092
, A Fitted-Q Algorithm for Budgeted MDPs, European Workshop on Reinforcement Learning, 2018.
, Safe transfer learning for dialogue applications, International Conference on Statistical Language and Speech Processing, vol.50, 2018.
Online learning and transfer for user adaptation in dialogue systems, Joint special session on negotiation dialog, Workshop on the Semantics and Pragmatics of Dialogue-Conference of the Special Interest Group on Discourse and Dialogue, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01557775
Budgeted Reinforcement Learning in Continuous State Space, Workshop on Safety Risk and Uncertainty in Reinforcement Learning at Conference on Uncertainty in Artificial Intelligence, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02375727
Budgeted Reinforcement Learning in Continuous State Space, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02375727
Knowledge transfer between speakers for personalised dialogue management, Conference of the Special Interest Group on Discourse and Dialogue, pp.49-51, 2015. ,
, Knowledge transfer between speakers for personalised dialogue management, Conference of the Special Interest Group on Discourse and Dialogue, p.89, 2015.
Clustering behaviors of spoken dialogue systems users, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00685009
, Co-adaptation in spoken dialogue systems, Natural interaction with robots, knowbots and smartphones, p.97, 2014.
Optimizing Spoken Dialogue Management with Fitted Value Iteration, Conference of the International Speech Communication Association, vol.80, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00553184
Transfer deep reinforcement learning in 3D environments: An empirical study, Deep Reinforcement Learning Workshop at Conference on Neural Information Processing Systems, p.116, 2016. ,
, A Survey on Dialogue Systems: Recent Advances and New Frontiers, Exploration Newsletter, vol.27, p.22, 2017.
Policy Adaptation for Deep Reinforcement Learning-Based Dialogue Management, IEEE International Conference on Acoustics, Speech and Signal Processing, vol.51, p.48, 2018. ,
, A.3 Conclusion 121
Towards better decoding and language model integration in sequence to sequence models, Conference of the International Speech Communication Association, p.27, 2017. ,
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria, Journal of Machine Learning Research, vol.86, p.70, 2018. ,
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach, Conference on Neural Information Processing Systems, 2015. ,
2001: A Space Odyssey, p.18, 1968. ,
Artificial Paranoia: A Computer Simulation of Paranoid Processes, p.21, 1975. ,
, The Digital Hand: Volume II: How Computers Changed the Work of American Financial, Telecommunications, Media, and Entertainment Industries, 2005.
Strategic Dialogue Management via Deep Reinforcement Learning, Workshop on Deep Reinforcement Learning, Conference on Neural Information Processing Systems, 2015. ,
, The Routledge pragmatics encyclopedia. Routledge, p.21, 2010.
Policy Certificates: Towards Accountable Reinforcement Learning, International Conference on Machine Learning, vol.70, 2019. ,
Automatic recognition of spoken digits, Journal of the Acoustical Society of America, 1952. ,
Improving generalization for temporal difference learning: The successor representation, Neural Computation, p.116, 1993. ,
PILCO: A Model-Based and Data-Efficient Approach to Policy Search, International Conference on Machine Learning, p.116, 2011. ,
Use of kernel deep convex networks and end-to-end learning for spoken language understanding, IEEE Spoken Language Technology Workshop, p.28, 2012. ,
How Klattalk became DECtalk: An Academic's Experiences in the Business World, The official proceedings of Speech Technology, 1987. ,
Deep belief network based semantic taggers for spoken language understanding, Conference of the International Speech Communication Association, 2013. ,
User modeling for spoken dialogue system evaluation, IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, vol.49, 1997. ,
Rationalization: A Neural Machine Translation Approach to Generating Natural Language Explanations, Conference on AI, Ethics, and Society, Association for Computing Machinery-Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, 2018. ,
Learning the Parameters of Reinforcement Learning from Data for Adaptive Spoken Dialogue Systems, vol.34, 2016. ,
URL : https://hal.archives-ouvertes.fr/tel-01809184
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems, Conference of the International Speech Communication Association, p.49, 2016. ,
Ordinal regression for interaction quality prediction, IEEE International Conference on Acoustics, Speech and Signal Processing, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01107499
Will my Spoken Dialogue System be a Slow Learner, Conference of the Special Interest Group on Discourse and Dialogue, vol.42, 2013. ,
Reward Function Learning for Dialogue Management, Frontiers in Artificial Intelligence and Applications, p.51, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00749430
, Task Completion Transfer Learning for Reward Inference, Workshop, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, vol.51, p.50, 2014.
Reinforcement learning with Gaussian processes, International Conference on Machine Learning, vol.48, 2006. ,
Tree-Based Batch Mode Reinforcement Learning, Journal of Machine Learning Research, vol.55, p.40, 2005. ,
Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems, 2009. ,
Hyperbolic discounting and learning over multiple horizons, p.36, 2019. ,
Proto-transfer learning in Markov decision processes using spectral methods, Computer Science Department Faculty Publication Series, vol.46, 2006. ,
Transfer of task representation in reinforcement learning using policy-based proto-value functions, International Conference on Autonomous Agents and Multiagent Systems, vol.46, 2008. ,
Social Signal and User Adaptation in Reinforcement Learning-based Dialogue Management, Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication, p.41, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01315527
Building Watson: An Overview of the DeepQA Project, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, vol.20, 2010. ,
Model-agnostic meta-learning for fast adaptation of deep networks, International Conference on Machine Learning, p.116, 2017. ,
Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position, Biological cybernetics, vol.36, 1980. ,
Neural approaches to conversational AI, Foundations and Trends in Information Retrieval, vol.30, p.21, 2019. ,
A Comprehensive Survey on Safe Reinforcement Learning, Journal of Machine Learning Research, 2015. ,
POMDP-based dialogue manager adaptation to extended domains, Conference of the Special Interest Group on Discourse and Dialogue, pp.48-51, 2013. ,
Gaussian processes for POMDP-based dialogue manager optimization, Speech, and Language Processing, 2013. ,
Risk-sensitive reinforcement learning applied to control under constraints, Journal of Artificial Intelligence Research, p.86, 2005. ,
Transfer Learning for User Adaptation in Spoken Dialogue Systems, International Conference on Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems, vol.50, p.89, 2016. ,
Reinforcement Learning of Argumentation Dialogue Policies in Negotiation, Conference of the International Speech Communication Association, 2011. ,
Multilingual spoken-language understanding in the MIT Voyager system, Speech Communication, 1995. ,
A form-based dialogue manager for spoken language applications, International Conference on Spoken Language Processing, 1996. ,
Using Natural-Language Processing to Produce Weather Forecasts, IEEE Expert: Intelligent Systems and Their Applications, 1994. ,
Towards End-to-end Speech Recognition with Recurrent Neural Networks, International Conference on Machine Learning, vol.27, p.19, 2014. ,
Query Intent Detection using Convolutional Neural Networks, Workshop on Query Understanding, International Conference on Web Search and Data Mining, 2016. ,
Machine Learning for Dialog State Tracking: A Review, International Workshop on Machine Learning in Spoken Language Processing, 2015. ,
Deep Neural Network Approach for the Dialog State Tracking Challenge, Conference of the Special Interest Group on Discourse and Dialogue, 2013. ,
Long Short-term Memory, Neural computation, p.19, 1997. ,
A Synthetic Speaker, Journal of Franklin Institute, p.18, 1939. ,
Dynamic Programming and Markov Processes, p.37, 1960. ,
Learning Deep Structured Semantic Models for Web Search Using Clickthrough Data, ACM International Conference on Information & Knowledge Management, 2013. ,
Goal-Oriented Chatbot Dialog Management Bootstrapping with Transfer Learning, International Joint Conference on Artificial Intelligence, vol.51, p.48, 2018. ,
Ex machina, vol.20, 2015. ,
Robust Dynamic Programming, Mathematics of Operations Research, vol.70, 2005. ,
Q-learning, Machine Learning, p.89, 1992. ,
Adaptive referring expression generation in spoken dialogue systems: Evaluation with real users, Conference of the Special Interest Group on Discourse and Dialogue, vol.55, 2010. ,
, Continuous Speech Recognition by Statistical Methods, Proceedings of the IEEE, vol.64, p.19, 1976.
Her, p.21, 2013. ,
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, p.27, 2000. ,
Generalization through simulation: Integrating simulated and real data into deep reinforcement learning for vision-based autonomous flight, vol.116, p.97, 2019. ,
Optimal control of Markov processes with incomplete state information, Journal of Mathematical Analysis and Applications, 1965. ,
Clustering by Means of Medoids, Data Analysis based on the L1-Norm and Related Methods, 1987. ,
Parameter estimation for agenda-based user simulation, Conference of the Special Interest Group on Discourse and Dialogue, p.49, 2010. ,
The MaDrIgAL Project: Multi-Dimensional Interaction Management and Adaptive Learning, International Workshop on Domain Adaptation for Dialog Agents, 2016. ,
, Towards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling, Workshop on the Semantics and Pragmatics of Dialogue, vol.51, p.48, 2018.
Optimising turn-taking strategies with reinforcement learning, Conference of the Special Interest Group on Discourse and Dialogue, vol.58, 2015. ,
, Incremental human-machine dialogue simulation, Dialogues with Social Robots, p.49, 2017.
Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition, Conference of the International Speech Communication Association, p.27, 2017. ,
Towards Language-Universal End-to-End Speech Recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, p.27, 2018. ,
Least-squares Policy Iteration, Journal of Machine Learning Research, vol.40, 2003. ,
Transfer of knowledge in cognitive systems, workshop on Structural Knowledge Transfer for Machine Learning at International Conference on Machine Learning, p.47, 2006. ,
The complex negotiation dialogue game, Workshop on the Semantics and Pragmatics of Dialogue, vol.96, p.49, 2017. ,
Enhanced monitoring tools and online dialogue optimisation merged into a new spoken dialogue system design experience, Conference of the International Speech Communication Association, 2010. ,
The negotiation dialogue game, Dialogues with Social Robots, vol.55, p.23, 2017. ,
Optimising a handcrafted dialogue system design, Conference of the International Speech Communication Association, 2010. ,
Hybridisation of expertise and reinforcement learning in dialogue systems, Conference of the International Speech Communication Association, 2009. ,
Safe Policy Improvement with Baseline Bootstrapping, vol.115, p.86, 2019. ,
Knight Rider, National Broadcasting Company, p.18, 1986. ,
Customizable Descriptions of Object-Oriented Models, Advances in Natural Language Processing, 1977. ,
Knowledge transfer in reinforcement learning, vol.46, 2008. ,
, Transfer in Reinforcement Learning: a Framework and a Survey, Reinforcement Learning - State of the art, vol.89, pp.45-47, 2012.
Transfer of samples in batch reinforcement learning, International Conference on Machine Learning, vol.57, p.55, 2008. ,
Batch Policy Learning under Constraints, International Conference on Machine Learning, p.86, 2019. ,
Object Recognition with Gradient-Based Learning, Shape, Contour and Grouping in Computer Vision, 1999. ,
Accelerating Recurrent Neural Network Language Model Based Online Speech Recognition System, IEEE International Conference on Acoustics, Speech and Signal Processing, p.27, 2018. ,
Learning what to say and how to say it: Joint optimisation of spoken dialogue management and natural language generation, Computer Speech & Language, 2011. ,
Dialogue policy learning for combinations of noise and user simulation: transfer results, Conference of the Special Interest Group on Discourse and Dialogue, p.49, 2007. ,
Data-driven methods for adaptive spoken dialogue systems: Computational learning for conversational interfaces, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00756740
Approximate Robust Control of Uncertain Dynamical Systems, Workshop on Machine Learning for Intelligent Transportation Systems, Conference on Neural Information Processing Systems, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01931744
A stochastic model of computer-human interaction for learning dialogue strategies, European Conference on Speech Communication and Technology, vol.49, 1997. ,
A stochastic model of human-machine interaction for learning dialog strategies, IEEE Transactions on speech and audio processing, 2000. ,
Deep Reinforcement Learning for Dialogue Generation, Conference on Empirical Methods in Natural Language Processing, p.29, 2016. ,
Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection, Conference of the International Speech Communication Association, vol.80, p.55, 2009. ,
End-to-end task-completion neural dialogue systems, 2017. ,
ROUGE: A Package for Automatic Evaluation of Summaries, Workshop on Text Summarization Branches Out, Annual Meeting of the Association for Computational Linguistics, 2004. ,
Model-based Bayesian Reinforcement Learning for Dialogue Management, Conference of the International Speech Communication Association, vol.40, 2013. ,
Multiobjective Reinforcement Learning: A Comprehensive Overview, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2014. ,
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses, Annual Meeting of the Association for Computational Linguistics, 2017. ,
The ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems, p.39, 2015. ,
Investment science, 2013. ,
Some methods for classification and analysis of multivariate observations, Berkeley symposium on mathematical statistics and probability, vol.55, 1967. ,
Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes, Journal of Machine Learning Research, vol.46, 2007. ,
Clustering Markov decision processes for continual transfer, vol.65, p.55, 2013. ,
Beyond VaR: from measuring risk to managing risk, IEEE Conference on Computational Intelligence for Financial Engineering, 2003. ,
Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding, Conference of the International Speech Communication Association, 2013. ,
Human-level control through deep reinforcement learning, Nature, vol.77, p.43, 2015. ,
Personalizing a Dialogue System with Transfer Reinforcement Learning, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, vol.51, p.50, 2018. ,
Imitation Game, vol.20, 2014. ,
Safe Policy Improvement with Soft Baseline Bootstrapping, European Conference on Machine Learning, p.86, 2019. ,
Mobile speech and advanced natural language solutions, 2013. ,
Robust Control of Markov Decision Processes with Uncertain Transition Matrices, Mathematics of Operations Research, 2005. ,
Stochastic language generation for spoken dialogue systems, Advances in Natural Language Processing -NAACL, 2000. ,
, Parallel WaveNet: Fast High-Fidelity Speech Synthesis, p.29, 2018.
OpenAI Five, p.96, 2018. ,
An Open Source and Open Hardware Deep Learning-powered Visual Navigation Engine for Autonomous Nano-UAVs, p.97, 2019. ,
Bleu: a Method for Automatic Evaluation of Machine Translation, Annual Meeting of the Association for Computational Linguistics, 2002. ,
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning, Annual Meeting of the Association for Computational Linguistics, 2018. ,
Recent Advances in Natural Language Generation: A Survey and Classification of the Empirical Literature, Computing and Informatics, 2017. ,
Microsoft silences its new A.I. bot Tay, after Twitter users teach it racism, p.97, 2016. ,
Safe policy improvement by minimizing robust baseline regret, Conference on Neural Information Processing Systems, p.86, 2016. ,
A Framework for Unsupervised Learning of Dialogue Strategies, Presses Universitaires de Louvain, vol.49, p.21, 2004. ,
Sample-efficient batch reinforcement learning for dialogue management optimization, ACM Transactions on Speech and Language Processing (TSLP), vol.55, p.40, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00617517
Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, p.86, 2015. ,
Language Models are Unsupervised Multitask Learners, vol.97, p.22, 2019. ,
Natural Language Generation in Dialog Systems, International Conference on Human Language Technology Research, 2001. ,
Gaussian processes in machine learning, vol.48, 2003. ,
Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method, European Conference on Machine Learning, vol.42, 2005. ,
Counseling and psychotherapy, p.21, 1942. ,
A Survey of Multi-Objective Sequential Decision-Making, Journal of Artificial Intelligence Research, vol.71, p.70, 2013. ,
The Perceptron: A Probabilistic Model for Information Storage and Organization in The Brain, Psychological Review, 1958. ,
Spoken dialogue management using probabilistic reasoning, Annual Meeting of the Association for Computational Linguistics, 2000. ,
, The Pattern Playback, 2019.
, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol.1, 1986.
On-line Q-learning using connectionist systems, 1994. ,
ARTIMIS: Natural dialogue meets rational agency, vol.28, p.21, 1997. ,
Dialogues for negotiation: agent varieties and dialogue sequences, International Workshop on Agent Theories, Architectures, and Languages, vol.55, 2001. ,
RightClick.io Uses AI-Powered Chatbot to Create a Website, 2019. ,
Deep belief nets for natural language call-routing, IEEE International Conference on Acoustics, Speech and Signal Processing, 2011. ,
A Conceptual Dependency Parser for Natural Language, Conference on Computational Linguistics, vol.20, 1969. ,
Statistical User and Error Modelling for Spoken Dialogue Systems, p.49, 2008. ,
Effects of the user model on simulation-based learning of dialogue strategies, IEEE Workshop on Automatic Speech Recognition and Understanding, p.49, 2005. ,
Blade Runner, vol.20, 1982. ,
Speech acts: An essay in the philosophy of language, p.28, 1969. ,
A survey of available corpora for building data-driven dialogue systems, 2015. ,
Building end-to-end dialogue systems using generative hierarchical neural network models, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, vol.97, p.30, 2016. ,
Learning Semantic Representations Using Convolutional Neural Networks for Web Search, International Conference on World Wide Web, 2014. ,
Improving action selection in MDP's via knowledge transfer, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, vol.46, 2005. ,
Mastering the game of Go with deep neural networks and tree search, 2016. ,
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system, Journal of Artificial Intelligence Research, 2002. ,
Sur la division des corps matériels en partie, Bulletin de l'academie polonaise des sciences, vol.55, 1957. ,
Model transfer for Markov decision tasks via parameter matching, Workshop of the UK Planning and Scheduling Special Interest Group, vol.46, 2006. ,
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning, Artificial intelligence, vol.46, 1999. ,
Policy Gradients with Variance Related Risk Criteria, International Conference on Machine Learning, vol.70, 2012. ,
Transferring instances for model-based reinforcement learning, Joint European Conference on Machine Learning and Knowledge Discovery in Databases, vol.46, 2008. ,
Transfer learning for reinforcement learning domains: A survey, Journal of Machine Learning Research, vol.89, p.45, 2009. ,
Transfer via inter-task mappings in policy search reinforcement learning, International Conference on Autonomous Agents and Multiagent Systems, vol.46, 2007. ,
Baidu's Melody - AI Powered Conversational Bot for Doctors and Patients - The Digital Insurer, 2019. ,
High confidence policy improvement, International Conference on Machine Learning, p.86, 2015. ,
Statistical Methods for Spoken Dialogue Management, 2013. ,
Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems, Computer Speech & Language, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00621617
Regularization of incorrectly posed problems, Doklady Akademii Nauk SSSR, vol.64, 1963. ,
Using advice to transfer knowledge acquired in one reinforcement learning task to another, European Conference on Machine Learning, vol.46, 2005. ,
Towards deeper understanding: Deep convex networks for semantic utterance classification, p.135, 2012. ,
, IEEE International Conference on Acoustics, Speech and Signal Processing, p.28
On Computable Numbers, with an Application to the Entscheidungsproblem, vol.20, 1936. ,
, Computing machinery and intelligence, Mind, vol.20, 1950.
Quality-adaptive spoken dialogue initiative selection and implications on reward modelling, Conference of the Special Interest Group on Discourse and Dialogue, vol.55, 2015. ,
PyDial: A Multi-domain Statistical Dialogue System Toolkit, Annual Meeting of the Association for Computational Linguistics, p.22, 2017. ,
Function Approximation for Continuous Constrained MDPs, p.86, 2010. ,
WaveNet: A generative model for raw audio, Speech Synthesis Workshop, vol.29, p.19, 2016. ,
Attention is all you need, Conference on Neural Information Processing Systems, 2017. ,
, AlphaStar: Mastering the Real-Time Strategy Game StarCraft II, p.97, 2019.
GuessWhat?! Visual Object Discovery through Multimodal Dialogue, IEEE Conference on Computer Vision and Pattern Recognition, vol.43, p.29, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01549641
Results Obtained from a Vowel Recognition Computer Program, The Journal of the Acoustical Society of America, 1959. ,
Informational redundancy and resource bounds in dialogue, The Institute for Research in Cognitive Science, 1993. ,
Learning optimal dialogue strategies: A case study of a spoken dialogue agent for email, Annual Meeting of the Association for Computational Linguistics, 1998. ,
Tacotron: Towards End-to-End Speech Synthesis, Conference of the International Speech Communication Association, 2017. ,
Sample efficient actor-critic with experience replay, p.43, 2016. ,
Sample Efficient Deep Reinforcement Learning for Dialogue Systems With Large Action Spaces, Speech and Language Processing, 2018. ,
ELIZA - a computer program for the study of natural language communication between man and machine, Communications of the Association for Computing Machinery, 1966. ,
Latent Intention Dialogue Models, International Conference on Machine Learning, 2017. ,
Semantically conditioned LSTM-based natural language generation for spoken dialogue systems, p.29, 2015. ,
Learning recognition and segmentation of 3-D objects from 2-D images, International Conference on Computer Vision, 1993. ,
Robust Markov Decision Processes, Mathematics of Operations Research, vol.70, 2013. ,
Partially observable Markov decision processes with continuous observations for dialogue management, Recent Trends in Discourse and Dialogue, vol.42, 2008. ,
The Dialog State Tracking Challenge, Conference of the Special Interest Group on Discourse and Dialogue, vol.29, 2013. ,
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, Machine Learning, p.43, 1992. ,
"Chitty-Chitty-Chat Bot": Deep Learning for Conversational AI, International Joint Conference on Artificial Intelligence, vol.30, 2018. ,
Building Task-Oriented Dialogue Systems for Online Shopping, Conference on Artificial Intelligence of the Association for the Advancement of Artificial Intelligence, 2017. ,
Zero-shot learning and clustering for semantic utterance classification using deep learning, International Conference on Learning Representations, 2014. ,
Spoken language understanding using long short-term memory neural networks, IEEE Spoken Language Technology Workshop, p.28, 2014. ,
Recurrent neural networks for language understanding, Conference of the International Speech Communication Association, 2013. ,
POMDP-Based Statistical Spoken Dialog Systems: A Review, p.21, 2013. ,
The Hidden Information State Model: a practical framework for POMDP-based spoken dialogue management, Computer Speech and Language, p.29, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00598186
Deep reinforcement learning with successor features for navigation across similar environments, International Conference on Intelligent Robots and Systems, p.116, 2017. ,
A Joint Model of Intent Determination and Slot Filling for Spoken Language Understanding, International Joint Conference on Artificial Intelligence, 2016. ,
Jupiter: A Telephone-Based Conversational Interface for Weather Information, IEEE Transactions on Speech and Audio Processing, 2000. ,