Bibliography

P. Abbeel, Lecture notes in Advanced Robotics, p.147, 2011.

P. Abbeel and A. Y. Ng, Apprenticeship learning via inverse reinforcement learning, Machine Learning, Proceedings of the Twenty-first International Conference (ICML 2004), 2004.

P. Abbeel, D. Dolgov, A. Y. Ng, and S. Thrun, Apprenticeship learning for motion planning with application to parking lot navigation, IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.1083-1090, 2008.

G. Agamennoni, J. I. Nieto, and E. M. Nebot, Estimation of Multivehicle Dynamics by Considering Contextual Information, IEEE Transactions on Robotics, vol.28, pp.855-870, 2012.

M. Althoff, Reachability Analysis and Its Application to the Safety Assessment of Autonomous Cars, p.140, 2010.

M. Althoff and R. Lösch, Can automated road vehicles harmonize with traffic flow while guaranteeing a safe distance?, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), p.140, 2016.

S. Ammoun and F. Nashashibi, Real time trajectory prediction for collision risk estimation between vehicles, 2009 IEEE 5th International Conference on Intelligent Computer Communication and Processing, vol.45, p.41, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00438624

G. Aoude, J. Joseph, N. Roy, and J. How, Mobile Agent Trajectory Prediction using Bayesian Nonparametric Reachability Trees, American Institute of Aeronautics and Astronautics Infotech at Aerospace Conference, p.45, 2011.

M. Ardelt, C. Coester, and N. Kaempchen, Highly Automated Driving on Freeways in Real Traffic Using a Probabilistic Framework, IEEE Trans. Intelligent Transportation Systems, vol.13, pp.1576-1585, 2012.

B. Argall, S. Chernova, M. M. Veloso, and B. Browning, A survey of robot learning from demonstration, Robotics and Autonomous Systems, vol.57, p.32, 2009.

S. Atev, G. Miller, and N. P. Papanikolopoulos, Clustering of Vehicle Trajectories, IEEE Transactions on Intelligent Transportation Systems, vol.11, issue.3, p.44, 2010.

P. Auer, N. Cesa-Bianchi, and P. Fischer, Finite-time Analysis of the Multiarmed Bandit Problem, Machine Learning, vol.47, issue.2, p.30, 2002.

M. Bahram, C. Hubmann, A. Lawitzky, M. Aeberhard, and D. Wollherr, A Combined Model- and Learning-Based Framework for Interaction-Aware Maneuver Prediction, IEEE Transactions on Intelligent Transportation Systems, vol.17, pp.1538-1550, 2016.

H. Bai, D. Hsu, W. S. Lee, and V. A. Ngo, Monte Carlo Value Iteration for Continuous-State POMDPs, Algorithmic Foundations of Robotics IX: Selected Contributions of the Ninth International Workshop on the Algorithmic Foundations of Robotics, vol.32, p.31, 2011.

T. Bandyopadhyay, K. S. Won, and E. Frazzoli, Intention-Aware Motion Planning, Algorithmic Foundations of Robotics X, vol.56, p.55, 2013.

Y. Bar-Shalom and X. Li, Estimation with Applications to Tracking and Navigation, p.95, 2001.

D. Barber, Bayesian Reasoning and Machine Learning, 2012.

D. Barber, Expectation Correction for Smoothed Inference in Switching Linear Dynamical Systems, Journal of Machine Learning Research, vol.7, pp.2515-2540, 2006.

R. Bellman, The theory of dynamic programming, Bull. Amer. Math. Soc, vol.60, issue.6, p.25, 1954.

R. Bellman, A Markovian Decision Process, Journal of Mathematics and Mechanics, vol.6, p.24, 1957.

D. P. Bertsekas, Dynamic Programming and Optimal Control, 2nd ed., Athena Scientific, p.25, 2000.

D. P. Bertsekas and J. N. Tsitsiklis, Introduction to Probability, Athena Scientific, p.17, 2002.

C. M. Bishop, Pattern Recognition and Machine Learning, vol.17, p.145, 2006.

J. P. Bliss and S. A. Acton, Alarm mistrust in automobiles: how collision alarm reliability affects driving, Applied Ergonomics, vol.34, issue.6, pp.499-509, 2003.

A. Boularias, J. Kober, and J. Peters, Relative Entropy Inverse Reinforcement Learning, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, p.35, 2011.

M. Bouton, A. Cosgun, and M. J. Kochenderfer, Belief state planning for autonomously navigating urban intersections, 2017 IEEE Intelligent Vehicles Symposium (IV), vol.55, pp.104-106, 2017.

S. Brechtel, T. Gindele, and R. Dillmann, Probabilistic MDP-behavior planning for cars, 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp.1537-1542, 2011.

S. Brechtel, T. Gindele, and R. Dillmann, Probabilistic decision-making under uncertainty for autonomous driving using continuous POMDPs, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp.392-399, 2014.

S. Brechtel, Dynamic Decision-making in Continuous Partially Observable Domains: A Novel Method and its Application for Autonomous Driving, vol.51, p.32, 2015.

S. Brechtel, T. Gindele, and R. Dillmann, Solving Continuous POMDPs: Value Iteration with Incremental Learning of an Efficient Space Representation, Proceedings of the 30th International Conference on Machine Learning, vol.54, p.32, 2013.

A. Broadhurst, S. Baker, and T. Kanade, Monte Carlo road safety reasoning, IEEE Proceedings. Intelligent Vehicles Symposium, p.42, 2005.

A. Brooks, A. Makarenko, S. B. Williams, and H. F. Durrant-Whyte, Parametric POMDPs for planning in continuous state spaces, Robotics and Autonomous Systems, vol.54, p.31, 2006.

C. B. Browne, E. Powley, and D. Whitehouse, A Survey of Monte Carlo Tree Search Methods, IEEE Transactions on Computational Intelligence and AI in Games, vol.117, p.103, 2012.

M. Buehler, K. Iagnemma, and S. Singh, The 2005 DARPA Grand Challenge: The Great Robot Race, 1st ed., p.50, 2007.

M. Buehler, K. Iagnemma, and S. Singh, The DARPA Urban Challenge: Autonomous Vehicles in City Traffic, 1st ed., p.51, 2009.

A. Byravan, M. Monfort, B. D. Ziebart, B. Boots, and D. Fox, Graph-Based Inverse Optimal Control for Robot Manipulation, Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2015, pp.1874-1880, 2015.

, Report of traffic accident involving an autonomous vehicle, p.3, 2016.

A. Cassandra, M. L. Littman, and N. L. Zhang, Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes, Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence. UAI'97, p.27, 1997.

A. Couëtoux, J. Hoock, N. Sokolovska, O. Teytaud, and N. Bonnard, Continuous Upper Confidence Trees, Learning and Intelligent Optimization, p.117, 2011.

I. Dagli and D. Reichardt, Motivation-based approach to behavior prediction, Intelligent Vehicle Symposium, vol.1, p.139, 2002.

I. Dagli, M. Brost, and G. Breuel, Action Recognition and Prediction for Driver Assistance Systems Using Dynamic Belief Networks, Agent Technologies, Infrastructures, Tools, and Applications for E-Services, p.139, 2003.

E. D. Dickmanns, R. Behringer, and D. Dickmanns, The seeing passenger car 'VaMoRs-P', Proceedings of the Intelligent Vehicles '94 Symposium, vol.50, p.49, 1994.

J. Erdmann, SUMO's Lane-Changing Model, Modeling Mobility with Open Data, p.120, 2015.

C. Finn, S. Levine, and P. Abbeel, Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, Proceedings of the 33rd International Conference on Machine Learning, vol.36, p.35, 2016.

L. Fletcher, S. Teller, and E. Olson, The MIT-Cornell collision and why it happened, Journal of Field Robotics, vol.25, issue.10, pp.775-807, 2008.

T. Fraichard and H. Asama, Inevitable collision states - a step towards safer robots?, Advanced Robotics, vol.18, p.140, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00182082

T. Fraichard and T. M. Howard, Iterative Motion Planning and Safety Issue, Handbook of Intelligent Vehicles, p.36, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00768956

C. Fulgenzi, C. Tay, A. Spalanzani, and C. Laugier, Probabilistic navigation in dynamic environment using Rapidly-exploring Random Trees and Gaussian processes, IEEE/RSJ International Conference on Intelligent Robots and Systems, p.45, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00332595

E. Galceran, A. G. Cunningham, R. M. Eustice, and E. Olson, Multipolicy Decision-making for Autonomous Driving via Changepoint-based Behavior Prediction: Theory and Experiment, Auton. Robots, vol.41, p.139, 2017.

M. Garzón and A. Spalanzani, An hybrid simulation tool for autonomous cars in very high traffic scenarios, ICARCV 2018 - 15th International Conference on Control, Automation, Robotics and Vision, p.118, 2018.

A. Gill, Introduction to the Theory of Finite-state Machines. McGraw-Hill Electronic Sciences series, p.51, 1962.

T. Gindele, S. Brechtel, and R. Dillmann, A probabilistic model for estimating driver behaviors and vehicle trajectories in traffic environments, 13th International IEEE Conference on Intelligent Transportation Systems, pp.1625-1631, 2010.

T. Gindele, S. Brechtel, and R. Dillmann, Learning context sensitive behavior models from observations for predicting traffic situations, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013), pp.1764-1771, 2013.

GM Cruise, 2017.

A. Goldhoorn, A. Garrell, R. Alquézar, and A. Sanfeliu, Continuous real time POMCP to find-and-follow people by a humanoid service robot, 2014 IEEE-RAS International Conference on Humanoid Robots, pp.741-747, 2014.

I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning, vol.18, p.17, 2016.

C. Grover, I. Knight, and F. Okoro, Automated Emergency Braking Systems: Technical requirements, costs and benefits, 2008.

C. Guestrin and D. Ormoneit, Robust Combination of Local Controllers, UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, p.64, 2001.

E. A. Hansen, An Improved Policy Iteration Algorithm for Partially Observable MDPs, Proceedings of the 1997 Conference on Advances in Neural Information Processing Systems 10. NIPS '97, p.29, 1998.

E. A. Hansen, Solving POMDPs by Searching in Policy Space, Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence. UAI'98, p.29, 1998.

D. Helbing, L. Buzna, A. Johansson, and T. Werner, Self-Organized Pedestrian Crowd Dynamics: Experiments, Simulations, and Design Solutions, Transportation Science, vol.39, p.55, 2005.

P. Henry, C. Vollmer, B. Ferris, and D. Fox, Learning to navigate through crowded environments, Proceedings - IEEE International Conference on Robotics and Automation, vol.70, p.40, 2010.

T. M. Howard, C. J. Green, A. Kelly, and D. Ferguson, State space sampling of feasible motions for high-performance mobile robot navigation in complex environments, J. Field Robotics, vol.25, p.61, 2008.

C. Hubmann, M. Aeberhard, and C. Stiller, A generic driving strategy for urban environments, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp.1010-1016, 2016.

S. Ji, R. Parr, H. Li, X. Liao, and L. Carin, Point-based Policy Iteration, Proceedings of the 22nd National Conference on Artificial Intelligence, vol.2, p.29, 2007.

K. Jo, M. Lee, J. Kim, and M. Sunwoo, Tracking and Behavior Reasoning of Moving Vehicles Based on Roadway Geometry Constraints, IEEE Transactions on Intelligent Transportation Systems, vol.PP, issue.99, p.42, 2016.

J. Joseph, F. Doshi-Velez, A. S. Huang, and N. Roy, A Bayesian nonparametric approach to modeling motion patterns, Autonomous Robots, vol.31, issue.4, p.44, 2011.

J. Joseph, F. Doshi-Velez, and N. Roy, A Bayesian Nonparametric Approach to Modeling Mobility Patterns, Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2010, 2010.

L. P. Kaelbling, M. L. Littman, and A. R. Cassandra, Planning and acting in partially observable stochastic domains, Artificial Intelligence, vol.101, issue.1, pp.99-134, 1998.

N. Kaempchen, K. Weiss, M. Schaefer, and K. C. Dietmayer, IMM object tracking for high dynamic driving maneuvers, IEEE Intelligent Vehicles Symposium, pp.825-830, 2004.

N. Kaempchen, B. Schiele, and K. Dietmayer, Situation Assessment of an Autonomous Emergency Brake for Arbitrary Vehicle-to-Vehicle Collision Scenarios, IEEE Transactions on Intelligent Transportation Systems, vol.10, p.42, 2009.

R. E. Kalman, A New Approach to Linear Filtering And Prediction Problems, ASME Journal of Basic Engineering, p.22, 1960.

C. Katrakazas, M. Quddus, W. Chen, and L. Deka, Real-time motion planning methods for autonomous on-road driving: State-of-the-art and future research directions, Transportation Research Part C: Emerging Technologies, vol.60, p.36, 2015.

A. Kelly and B. Nagy, Reactive Nonholonomic Trajectory Generation via Parametric Optimal Control, I. J. Robotics Res., vol.22, issue.7-8, p.64, 2003.

A. Kesting, M. Treiber, and D. Helbing, Enhanced intelligent driver model to access the impact of driving strategies on traffic capacity, Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, vol.368, issue.1928, p.54, 2010.

K. M. Kitani, B. D. Ziebart, J. A. Bagnell, and M. Hebert, Activity Forecasting, Computer Vision - ECCV 2012 - 12th European Conference on Computer Vision, pp.201-214, 2012.

S. Klingelschmitt, M. Platho, H. Groß, V. Willert, and J. Eggert, Combining behavior and situation information for reliably estimating multiple intentions, 2014 IEEE Intelligent Vehicles Symposium Proceedings, vol.46, pp.388-393, 2014.

D. E. Knuth, Big Omicron and Big Omega and Big Theta, p.143, 1976.

J. Kober, J. A. Bagnell, and J. Peters, Reinforcement Learning in Robotics: A Survey, p.32, 2013.

L. Kocsis and C. Szepesvári, Bandit Based Monte-Carlo Planning, Machine Learning: ECML 2006, p.116, 2006.

N. Koenig and A. Howard, Design and use paradigms for Gazebo, an open-source multi-robot simulator, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), vol.3, p.119, 2004.

D. Koller and N. Friedman, Probabilistic Graphical Models: Principles and Techniques, p.17, 2009.

D. Krajzewicz, J. Erdmann, M. Behrisch, and L. Bieker, Recent Development and Applications of SUMO - Simulation of Urban MObility, International Journal On Advances in Systems and Measurements, vol.5, p.119, 2012.

S. Krauß, Microscopic Modeling of Traffic Flow: Investigation of Collision Free Vehicle Dynamics, vol.122, p.120, 1998.

H. Kretzschmar, M. Spies, C. Sprunk, and W. Burgard, Socially compliant mobile robot navigation via inverse reinforcement learning, The International Journal of Robotics Research, vol.35, p.36, 2016.

M. Kuderer, S. Gulati, and W. Burgard, Learning driving styles for autonomous vehicles from demonstration, IEEE International Conference on Robotics and Automation, ICRA 2015, vol.73, pp.2641-2646, 2015.

P. Kumar, M. Perrollaz, S. Lefèvre, and C. Laugier, Learning-based approach for online lane change intention prediction, 2013 IEEE Intelligent Vehicles Symposium (IV), vol.43, pp.797-802, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00821309

H. Kurniawati and V. Yadav, An Online POMDP Solver for Uncertainty Planning in Dynamic Environment, Robotics Research: The 16th International Symposium ISRR, p.31, 2016.

H. Kurniawati, D. Hsu, and W. Lee, SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces, Robotics: Science and Systems IV, Eidgenössische Technische Hochschule Zürich, p.55, 2008.

A. Lawitzky, D. Althoff, and C. F. Passenberg, Interactive scene prediction for automotive applications, 2013 IEEE Intelligent Vehicles Symposium (IV), pp.1028-1033, 2013.

S. H. Lee and S. W. Seo, A learning-based framework for handling dilemmas in urban automated driving, 2017 IEEE International Conference on Robotics and Automation (ICRA), pp.1436-1442, 2017.

S. Lefèvre, C. Laugier, and J. Ibañez-Guzmán, Evaluating risk at road intersections by detecting conflicting intentions, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012, p.48, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00678482

S. Lefèvre, D. Vasquez, and C. Laugier, A survey on motion prediction and risk assessment for intelligent vehicles, ROBOMECH Journal, vol.1, issue.1, pp.1-14, 2014.

U. N. Lerner, Hybrid Bayesian Networks for Reasoning about Complex Systems, p.10, 2012.

S. Levine and P. Abbeel, Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics, Neural Information Processing Systems (NIPS), p.36, 2014.

S. Levine and V. Koltun, Continuous Inverse Optimal Control with Locally Optimal Examples, ICML '12: Proceedings of the 29th International Conference on Machine Learning, p.61, 2012.

S. Levine, Z. Popovic, and V. Koltun, Nonlinear Inverse Reinforcement Learning with Gaussian Processes, Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems, pp.19-27, 2011.

W. Liu, S. W. Kim, S. Pendleton, and M. H. Ang, Situation-aware decision making for autonomous driving on urban road using online POMDP, 2015 IEEE Intelligent Vehicles Symposium (IV), vol.56, p.55, 2015.

W. S. Lovejoy, Computationally Feasible Bounds for Partially Observed Markov Decision Processes, Operations Research, vol.39, p.28, 1991.

M. McNaughton, C. Urmson, J. M. Dolan, and J. Lee, Motion planning for autonomous driving with a conformal spatiotemporal lattice, IEEE International Conference on Robotics and Automation, ICRA 2011, pp.4889-4895, 2011.

D. Meyer-Delius, C. Plagemann, and W. Burgard, Probabilistic situation recognition for vehicular traffic scenarios, 2009 IEEE International Conference on Robotics and Automation, p.45, 2009.

I. Miller, M. Campbell, and D. Huttenlocher, Team Cornell's Skynet: Robust perception and planning in an urban environment, Journal of Field Robotics, vol.25, issue.8, p.52, 2008.

M. Montemerlo, J. Becker, and S. Bhat, Junior: The Stanford Entry in the Urban Challenge, The DARPA Urban Challenge: Autonomous Vehicles in City Traffic, pp.91-123, 2009.

J. Morton and M. J. Kochenderfer, Simultaneous policy learning and latent state inference for imitating driver behavior, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), p.139, 2017.

K. P. Murphy, Dynamic Bayesian Networks: Representation, Inference and Learning, AAI3082340, vol.48, p.22, 2002.

F. Naser, D. Dorhout, and S. Proulx, A parallel autonomy research platform, 2017 IEEE Intelligent Vehicles Symposium (IV), pp.933-940, 2017.

, Analysis of Lane Change Crashes, National Highway Traffic Safety Administration, vol.4, 2013.

, Critical Reasons for Crashes Investigated in the National Motor Vehicle Crash Causation Survey, 2015.

, Accident Report: Collision Between a Car Operating With Automated Vehicle Control Systems and a Tractor-Semitrailer Truck Near, National Transportation Safety, 2016.

, Preliminary report: highway HWY18MH010. Accident in Tempe, p.3, 2018.

A. Nègre, L. Rummelhard, and C. Laugier, Hybrid Sampling Bayesian Occupancy Filter, IEEE Intelligent Vehicles Symposium (IV). Dearborn, United States, p.41, 2014.

G. Neumann, M. Pfeiffer, and W. Maass, Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs, Machine Learning: ECML 2007: 18th European Conference on Machine Learning, p.64, 2007.

A. Y. Ng and S. J. Russell, Algorithms for Inverse Reinforcement Learning, Proceedings of the Seventeenth International Conference on Machine Learning, vol.32, p.33, 2000.

A. Y. Ng, D. Harada, and S. J. Russell, Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping, Proceedings of the Sixteenth International Conference on Machine Learning, p.32, 1999.

J. Nilsson, J. Fredriksson, and E. Coelingh, Rule-Based Highway Maneuver Intention Recognition, 2015 IEEE 18th International Conference on Intelligent Transportation Systems, pp.950-955, 2015.

B. Okal and K. O. Arras, Learning socially normative robot navigation behaviors with Bayesian inverse reinforcement learning, 2016 IEEE International Conference on Robotics and Automation, ICRA 2016, pp.2889-2895, 2016.

N. Oliver and A. P. Pentland, Graphical models for driver behavior recognition in a SmartCar, Proceedings of the IEEE Intelligent Vehicles Symposium, vol.46, p.44, 2000.

S. C. W. Ong, S. W. Png, D. Hsu, and W. S. Lee, Planning under Uncertainty for Robotic Tasks with Mixed Observability, I. J. Robotics Res., vol.29, issue.8, p.55, 2010.

B. Paden, M. Čáp, S. Z. Yong, D. Yershov, and E. Frazzoli, A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles, IEEE Transactions on Intelligent Vehicles, vol.1, issue.1, p.36, 2016.

S. Paquet, L. Tobin, and B. Chaib-draa, An Online POMDP Algorithm for Complex Multiagent Environments, Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems. AAMAS '05. The Netherlands, vol.54, p.30, 2005.

S. Paquet, B. Chaib-draa, and S. Ross, Hybrid POMDP algorithms, Proceedings of The Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM-2006), p.30, 2006.

N. Pérez-Higueras, F. Caballero, and L. Merino, Learning Human-Aware Path Planning with Fully Convolutional Networks, ICRA 2018 - Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018.

N. Pérez-Higueras, F. Caballero, and L. Merino, Teaching Robot Navigation Behaviors to Optimal RRT Planners, International Journal of Social Robotics, vol.10, p.61, 2018.

J. Pineau, G. Gordon, and S. Thrun, Point-based Value Iteration: An Anytime Algorithm for POMDPs, Proceedings of the 18th International Joint Conference on Artificial Intelligence. IJCAI'03, p.28, 2003.

J. Pineau, G. J. Gordon, and S. Thrun, Anytime Point-Based Approximations for Large POMDPs, Journal of Artificial Intelligence Research, vol.27, p.27, 2006.

M. Pivtoraiko and A. Kelly, Efficient constrained path planning via search in state lattices, The 8th International Symposium on Artificial Intelligence, Robotics and Automation in Space, p.64, 2005.

D. A. Pomerleau, Efficient Training of Artificial Neural Networks for Autonomous Navigation, Neural Computation, vol.3, issue.1, pp.88-97, 1991.

J. M. Porta, N. A. Vlassis, M. T. J. Spaan, and P. Poupart, Point-Based Value Iteration for Continuous POMDPs, Journal of Machine Learning Research, vol.7, p.31, 2006.

P. Poupart and C. Boutilier, Bounded Finite State Controllers, Advances in Neural Information Processing Systems 16, p.29, 2004.

W. B. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality (Wiley Series in Probability and Statistics), p.25, 2007.

S. J. Prince, Computer Vision: Models Learning and Inference, vol.145, p.17, 2012.

M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1st ed., p.25, 1994.

J. Randløv and P. Alstrøm, Learning to Drive a Bicycle Using Reinforcement Learning and Shaping, Proceedings of the Fifteenth International Conference on Machine Learning, p.32, 1998.

C. E. Rasmussen and C. Williams, Gaussian Processes for Machine Learning. Adaptive Computation and Machine Learning, p.44, 2006.

N. Ratliff, J. A. Bagnell, and M. Zinkevich, Maximum Margin Planning, International Conference on Machine Learning, 2006.

N. D. Ratliff, D. M. Bradley, J. A. Bagnell, and J. E. Chestnutt, Boosting Structured Prediction for Imitation Learning, Advances in Neural Information Processing Systems, vol.19, pp.1153-1160, 2006.

M. Riedmiller and H. Braun, A direct adaptive method for faster backpropagation learning: the RPROP algorithm, IEEE International Conference on Neural Networks, vol.1, p.40, 1993.

X. R. Li and V. P. Jilkov, Survey of maneuvering target tracking: dynamic models, vol.4048, p.41, 2000.

S. Ross and B. Chaib-draa, AEMS: An Anytime Online Search Algorithm for Approximate Policy Refinement in Large POMDPs, IJCAI 2007, Proceedings of the 20th International Joint Conference on Artificial Intelligence, p.30, 2007.

S. Ross, J. Pineau, S. Paquet, and B. Chaib-draa, Online Planning Algorithms for POMDPs, J. Artif. Intell. Res, vol.32, pp.663-704, 2008.

T. Roughgarden, Algorithms Illuminated, Part 1: The Basics, Soundlikeyourself Publishing, LLC, p.143, 2017.

L. Rummelhard, A. Nègre, and C. Laugier, Conditional Monte Carlo Dense Occupancy Tracker, 2015 IEEE 18th International Conference on Intelligent Transportation Systems, pp.2485-2490, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01205298

L. Rummelhard, A. Nègre, M. Perrollaz, and C. Laugier, Probabilistic Grid-based Collision Risk Prediction for Driving Application, p.41, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01011808

S. J. Russell, Learning Agents for Uncertain Environments (Extended Abstract), Proceedings of the Eleventh Annual Conference on Computational Learning Theory, vol.32, pp.101-103, 1998.

D. Sadigh, S. Sastry, S. A. Seshia, and A. D. Dragan, Planning for Autonomous Cars that Leverages Effects on Human Actions, Proceedings of the Robotics: Science and Systems Conference (RSS), p.61, 2016.

SAE International, Taxonomy and Definitions for Terms Related to Driving Automation Systems for On-Road Motor Vehicles, 2016.

J. K. Satia and R. E. Lave, Markovian Decision Processes with Uncertain Transition Probabilities, p.30, 1973.

R. Schubert, Evaluating the Utility of Driving: Toward Automated Decision Making Under Uncertainty, IEEE Transactions on Intelligent Transportation Systems, vol.13, issue.1, p.53, 2012.

R. Schubert, E. Richter, and G. Wanielik, Comparison and evaluation of advanced motion models for vehicle tracking, 11th International Conference on Information Fusion, p.41, 2008.

W. Schwarting and P. Pascheka, Recursive conflict resolution for cooperative motion planning in dynamic highway traffic, Intelligent Transportation Systems (ITSC), pp.1039-1044, 2014.

W. Schwarting, J. Alonso-Mora, L. Pauli, S. Karaman, and D. Rus, Parallel autonomy in automated vehicles: Safe motion generation with minimal intervention, 2017 IEEE International Conference on Robotics and Automation (ICRA), pp.1928-1935, 2017.

G. Shani, J. Pineau, and R. Kaplow, A survey of point-based POMDP solvers, Autonomous Agents and Multi-Agent Systems, vol.27, issue.1, p.29, 2013.

A. Shariff, J. Bonnefon, and I. Rahwan, Psychological roadblocks to the adoption of self-driving vehicles, Nature Human Behaviour, vol.1, issue.10, p.12, 2017.

K. Shiarlis, J. Messias, and S. Whiteson, Rapidly exploring learning trees, 2017 IEEE International Conference on Robotics and Automation (ICRA), p.61, 2017.

M. Shimosaka, T. Kaneko, and K. Nishi, Modeling risk anticipation and defensive driving on residential roads with inverse reinforcement learning, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp.1694-1700, 2014.

M. Shimosaka, J. Sato, K. Takenaka, and K. Hitomi, Fast Inverse Reinforcement Learning with Interval Consistent Graph for Driving Behavior Prediction, Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, vol.69, p.61, 2017.

D. Silver and G. Tesauro, Monte-Carlo simulation balancing, Proceedings of the 26th Annual International Conference on Machine Learning, pp.945-952, 2009.

D. Silver and J. Veness, Monte-Carlo Planning in Large POMDPs, Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems, pp.2164-2172, 2010.

D. Silver, J. A. Bagnell, and A. Stentz, High Performance Outdoor Navigation from Overhead Data using Imitation Learning, Robotics: Science and Systems IV, Eidgenössische Technische Hochschule Zürich, 2008.

R. D. Smallwood and E. J. Sondik, The Optimal Control of Partially Observable Markov Processes over a Finite Horizon, Operations Research, vol.21, issue.5, p.27, 1973.

T. Smith and R. G. Simmons, Heuristic Search Value Iteration for POMDPs, UAI '04, Proceedings of the 20th Conference in Uncertainty in Artificial Intelligence, p.29, 2004.

T. Smith and R. G. Simmons, Point-Based POMDP Algorithms: Improved Analysis and Implementation, UAI '05, Proceedings of the 21st Conference in Uncertainty in Artificial Intelligence, p.29, 2005.

A. Somani, N. Ye, D. Hsu, and W. Lee, DESPOT: Online POMDP Planning with Regularization, Advances in Neural Information Processing Systems 26, p.31, 2013.

E. J. Sondik, The Optimal Control of Partially Observable Markov Processes, p.27, 1971.

E. J. Sondik, The Optimal Control of Partially Observable Markov Processes Over the Infinite Horizon: Discounted Costs, Operations Research, vol.26, issue.2, p.27, 1978.

E. Sonu, Z. Sunberg, and M. J. Kochenderfer, Exploiting Hierarchy for Scalable Decision Making in Autonomous Driving, Intelligent Vehicles Symposium, Changshu, vol.56, p.54, 2018.

J. Sorstedt, L. Svensson, F. Sandblom, and L. Hammarstrand, A New Vehicle Motion Model for Improved Predictions and Situation Assessment, IEEE Transactions on Intelligent Transportation Systems, vol.12, pp.1209-1219, 2011.

M. T. Spaan and N. A. Vlassis, Perseus: Randomized Point-based Value Iteration for POMDPs, J. Artif. Intell. Res, vol.24, p.28, 2005.

Z. N. Sunberg, C. J. Ho, and M. J. Kochenderfer, The value of inferring the internal state of traffic participants for autonomous freeway driving, 2017 American Control Conference (ACC), pp.3004-3010, 2017.

Z. N. Sunberg and M. J. Kochenderfer, Online Algorithms for POMDPs with Continuous State, Action, and Observation Spaces, International Conference on Automated Planning and Scheduling (ICAPS), Delft (cit. on pp. 32, 105, 113, 117), 2018.

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction. Adaptive computation and machine learning, vol.24, p.17, 1998.

C. Tay, Analysis of Dynamic Scenes: Application to Driving Assistance, Theses, Institut National Polytechnique de Grenoble - INPG (cit. on pp. 7, 9, 44-46), 2009.

Tesla, Tesla 2016 Disengagement Report, 2016.

Tesla, Autopilot, pp.26-32

S. Thrun, Monte Carlo POMDPs, Advances in Neural Information Processing Systems, vol.12, p.31, 1999.

S. Thrun, W. Burgard, and D. Fox, Probabilistic Robotics, vol.17, p.24, 2005.

S. Thrun, M. Montemerlo, and H. Dahlkamp, Stanley: The robot that won the DARPA Grand Challenge, J. Field Robotics, vol.23, pp.661-692, 2006.

R. Toledo-Moreo and M. A. Zamora-Izquierdo, IMM-Based Lane-Change Prediction in Highways With Low-Cost GPS/INS, IEEE Transactions on Intelligent Transportation Systems, vol.10, issue.1, pp.180-185, 2009.

M. Treiber and A. Kesting, Modeling Lane-Changing Decisions with MOBIL, Traffic and Granular Flow '07, p.54, 2009.

M. Treiber, A. Hennecke, and D. Helbing, Congested traffic states in empirical observations and microscopic simulations, Phys. Rev. E, vol.62, issue.2, pp.1805-1824, 2000.

S. Ulbrich and M. Maurer, Probabilistic online POMDP decision making for lane changes in fully automated driving, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013), vol.56, p.54, 2013.

United Nations Economic Commission for Europe, The 2015 bulletin on statistics of road traffic accidents in Europe and North America, p.1, 2015.

C. Urmson, J. Anhalt, and D. Bagnell, Autonomous Driving in Urban Environments: Boss and the Urban Challenge, The DARPA Urban Challenge: Autonomous Vehicles in City Traffic, pp.1-59, 2009.

, NGSIM: Next Generation Simulation, pp.2-07, 2006.

D. Vasquez, Novel planning-based algorithms for human motion prediction, 2016 IEEE International Conference on Robotics and Automation (ICRA), p.75, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01256516

D. Vasquez, T. Fraichard, and C. Laugier, Incremental Learning of Statistical Motion Patterns With Growing Hidden Markov Models, IEEE Transactions on Intelligent Transportation Systems, vol.10, p.44, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00584320

D. Vasquez, B. Okal, and K. O. Arras, Inverse Reinforcement Learning algorithms and features for robot navigation in crowds: An experimental comparison, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.1341-1346, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01105265

Verband der Automobilindustrie, From Driver Assistance Systems to Automated Driving, 2015.

K. Vogel, A comparison of headway and time to collision as safety indicators, Accident Analysis & Prevention, vol.35, p.78, 2003.

R. Washington, BI-POMDP: Bounded, Incremental, Partially-Observable Markov-Model Planning, Recent Advances in AI Planning, 4th European Conference on Planning, ECP'97, p.30, 1997.

Waymo, Disengagement Report, Tech. rep., 2017.

J. Wei and J. M. Dolan, A robust autonomous freeway driving algorithm, 2009 IEEE Intelligent Vehicles Symposium, pp.1015-1020, 2009.

K. Weiss, N. Kaempchen, and A. Kirchner, Multiple-model tracking for the detection of lane change maneuvers, IEEE Intelligent Vehicles Symposium, vol.85, p.43, 2004.

M. T. Wolf and J. W. Burdick, Artificial potential functions for highway driving with collision avoidance, 2008 IEEE International Conference on Robotics and Automation, pp.3731-3736, 2008.

, Global Status Report on Road Safety 2015. Tech. rep. (cit. on p. 1), 2015.

M. Wulfmeier, D. Z. Wang, and I. Posner, Watch this: Scalable cost-function learning for path planning in urban environments, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), p.69, 2016.

M. Wulfmeier, P. Ondruska, and I. Posner, Maximum Entropy Deep Inverse Reinforcement Learning, Neural Information Processing Systems Conference, Deep Reinforcement Learning Workshop, vol.35, p.34, 2015.

M. Wulfmeier, D. Rao, D. Z. Wang, P. Ondruska, and I. Posner, Large-scale cost function learning for path planning using deep inverse reinforcement learning, The International Journal of Robotics Research, vol.36, issue.10, pp.1073-1087, 2017.

B. D. Ziebart, A. L. Maas, J. A. Bagnell, and A. K. Dey, Maximum Entropy Inverse Reinforcement Learning, Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, AAAI, vol.63, pp.33-35, 2008.

B. D. Ziebart, N. D. Ratliff, G. Gallagher et al., Planning-based prediction for pedestrians, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, p.75, 2009.

J. Ziegler, P. Bender, and H. Lategahn, Kartengestütztes automatisiertes Fahren auf der Bertha-Benz-Route von Mannheim nach Pforzheim, Workshop Fahrerassistenzsysteme (FAS2014), pp.79-94, 2014.

J. Ziegler, P. Bender, and M. Schreiber, Making Bertha Drive: An Autonomous Journey on a Historic Route, IEEE Intelligent Transportation Systems Magazine, vol.6, issue.2, p.51, 2014.

A. Zyner, S. Worrall, and E. Nebot, A Recurrent Neural Network Solution for Predicting Driver Intention at Unsignalized Intersections, IEEE Robotics and Automation Letters, vol.3, issue.3, p.43, 2018.

K. Åström, Optimal control of Markov processes with incomplete state information, Journal of Mathematical Analysis and Applications, vol.10, pp.174-205, 1965.