.. Scalability:-the-Élysée-palace-environment, 145 8.4.1 Evaluating the exploration efficiency, p.148

.. Analysis-of-the-exploration-efficiency, 152 8.5.2 Analysis of the communication policy, p.152

.. House-environment, 156 9.1.1 Evaluating the exploration efficiency, Medium communication cost Contents 9.1 Simple configuration: the, p.160

.. Scalability:-the-Élysée-palace-environment, 167 9.4.1 Evaluating the exploration efficiency, p.167

.. Scalability:-the-Élysée-palace-environment, 180 10.4.1 Evaluating the exploration efficiency, p.182

N. Agmon, On events in multi-robot patrol in adversarial environments, Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2010.

. Agmon, Multi-robot adversarial patrolling facing a full-knowledge opponent, Journal of Artificial Intelligence, pp.887-916, 2011.

. Agmon, Multi-robot perimeter patrol in adversarial settings, 2008 IEEE International Conference on Robotics and Automation, 2008.
DOI : 10.1109/ROBOT.2008.4543563

. Agmon, The impact of adversarial knowledge on adversarial planning in perimeter patrol, Proceedings of the 7th international joint conference on Autonomous Agents and Multiagent Systems (AAMAS), 2008.

A. , D. Allen, D. Darwiche, and A. , New advances in inference by recursive conditioning, Proceedings of the 19th conference on Uncertainty in Artificial Intelligence, 2002.

. Amato, Solving POMDPs using quadratically constrained linear programs, Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems , AAMAS '06, 2006.
DOI : 10.1145/1160633.1160694

. Amigoni, How Much Worth Is Coordination of Mobile Robots for Exploration in Search and Rescue?, RoboCup 2012: Robot Soccer World Cup XVI, 2012.
DOI : 10.1109/JPROC.2006.876927

. Araya-lopez, A pomdp extension with belief-dependent rewards, Advances in Neural Information Processing Systems (NIPS), 2010.
URL : https://hal.archives-ouvertes.fr/inria-00535560

. Aulinas, The slam problem: A survey, Proceedings of the 11th International Conference of the Catalan Association for Artificial Intelligence, 2008.

F. Aurenhammer, Voronoi diagrams---a survey of a fundamental geometric data structure, ACM Computing Surveys, vol.23, issue.3, pp.345-405, 1991.
DOI : 10.1145/116873.116880

. Bachrach, Autonomous Flight in Unknown Indoor Environments, International Journal of Micro Air Vehicles, vol.3, issue.1, pp.217-228, 2009.
DOI : 10.1260/175682909790291492

. Bagnell, Policy search by dynamic programming, Advances in neural information processing systems, 2003.

. Bai, Monte Carlo Value Iteration for Continuous-State POMDPs, Algorithmic foundations of robotics IX, 2011.
DOI : 10.1007/978-3-642-17452-0_11

R. Bajcsy, Active perception, Proceedings of the IEEE, 1988.
DOI : 10.1109/5.5968

. Barry, Deth*: Approximate hierarchical solution of large markov decision processes, Proceedings of the 25th Conference on Artificial intelligence (AAAI), 2011.

. Basilico, Leader-follower strategies for robotic patrolling in environments with arbitrary topologies, Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2009.

. Basilico, Patrolling security games: Definition and algorithms for solving large instances with single patroller and single intruder, Artificial Intelligence, vol.184, issue.185, pp.78-123, 2012.
DOI : 10.1016/j.artint.2012.03.003

. Basilico, A gametheoretical model applied to an active patrolling camera, Proceedings of the International Conference on Emerging Security Technologies (EST), 2010.

. Bautin, Towards a communication free coordination for multi-robot exploration, Proceedings of the 8th National Conference on Control Architectures of Robots, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00599605

R. Becker, Solving transition independent decentralized markov decision processes. Computer Science Department Faculty Publication Series, pp.423-455, 2004.
DOI : 10.1145/860575.860583

URL : http://anytime.cs.umass.edu/shlomo/papers/aamas03a.pdf

A. Bellenger, Semantic Decision Support for Information Fusion Applications, Institut National des Sciences Appliquées, 2013.
URL : https://hal.archives-ouvertes.fr/tel-00845918

R. Bellman, A Markovian Decision Process, Indiana University Mathematics Journal, vol.6, issue.4, 1957.
DOI : 10.1512/iumj.1957.6.56038

D. Bellman, R. Bellman, and S. Dreyfus, Applied dynamic programming, 1962.
DOI : 10.1515/9781400874651

. Bernstein, The Complexity of Decentralized Control of Markov Decision Processes, Mathematics of Operations Research, vol.27, issue.4, pp.819-840, 2002.
DOI : 10.1287/moor.27.4.819.297

. Bertoli, Planning in nondeterministic domains under partial observability via symbolic model checking, Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), 2001.

. Beynier, . Mouaddib, A. Beynier, and A. Mouaddib, A polynomial algorithm for decentralized Markov decision processes with temporal constraints, Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems , AAMAS '05, 2005.
DOI : 10.1145/1082473.1082619

URL : https://hal.archives-ouvertes.fr/hal-01344441

F. Blum, A. Blum, and M. Furst, Fast planning through planning graph analysis, Artificial Intelligence, vol.90, issue.1-2, pp.281-300, 1997.
DOI : 10.1016/S0004-3702(96)00047-1

URL : http://doi.org/10.1016/s0004-3702(96)00047-1

P. Borlund, The concept of relevance in IR, Journal of the American Society for Information Science and Technology, vol.9, issue.10, pp.913-925, 2003.
DOI : 10.1002/asi.10286

. Boutilier, Decision-theoretic planning: Structural assumptions and computational leverage, Journal of Artificial Intelligence Research, pp.1-94, 1999.

. Boutilier, Exploiting structure in policy construction, Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI), 1995.

. Boutilier, Stochastic dynamic programming with factored representations, Artificial Intelligence, vol.121, issue.1-2, pp.49-107, 2000.
DOI : 10.1016/S0004-3702(00)00033-3

P. Boutilier, C. Boutilier, and D. Poole, Computing optimal policies for partially observable decision processes using compact representations, Proceedings of the National Conference on Artificial Intelligence, 1996.

. Burgard, Sonarbased mapping with mobile robots using em, Proceedings of the International Conference on Machine Learning, 1999.

. Burgard, Collaborative Exploration of Unknown Environments with Teams of Mobile Robots, Advances in plan-based control of robotic agents, 2002.
DOI : 10.1007/3-540-37724-7_4

. Burgard, Coordinated multi-robot exploration, IEEE Transactions on Robotics, vol.21, issue.3, pp.376-386, 2005.
DOI : 10.1109/TRO.2004.839232

J. Béziau, What is many-valued logic?, Proceedings 1997 27th International Symposium on Multiple- Valued Logic, 1997.
DOI : 10.1109/ISMVL.1997.601384

O. Cappé, Online sequential Monte Carlo EM algorithm, 2009 IEEE/SP 15th Workshop on Statistical Signal Processing, 2009.
DOI : 10.1109/SSP.2009.5278646

. Carillo, On the comparison of uncertainty criteria for active SLAM, 2012 IEEE International Conference on Robotics and Automation, 2012.
DOI : 10.1109/ICRA.2012.6224890

. Carlone, An application of Kullback-Leibler divergence to active SLAM and exploration with Particle Filters, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010.
DOI : 10.1109/IROS.2010.5652164

. Carlone, Active SLAM and Exploration with Particle Filters Using Kullback-Leibler Divergence, Journal of Intelligent & Robotic Systems, vol.25, issue.3, pp.291-311, 2014.
DOI : 10.1007/s10846-013-9981-9

D. Chapman, Planning for conjunctive goals, Artificial Intelligence, vol.32, issue.3, pp.333-377, 1987.
DOI : 10.1016/0004-3702(87)90092-0

A. Darwiche, Compiling bayesian networks with local structure, Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI), 2005.

. Cheng, J. Cheng, and M. J. Druzdzel, Ais-bn: An adaptive importance sampling algorithm for evidential reasoning in large bayesian networks, Journal of Artificial Intelligence Research, pp.155-188, 2000.

. Conitzer, . Sandholm, V. Conitzer, and T. Sandholm, Computing the optimal strategy to commit to, Proceedings of the 7th ACM conference on Electronic commerce , EC '06, 2006.
DOI : 10.1145/1134707.1134717

G. F. Cooper, The computational complexity of probabilistic inference using bayesian belief networks, Artificial Intelligence, vol.42, issue.2-3, pp.393-405, 1990.
DOI : 10.1016/0004-3702(90)90060-D

. Corff, Online Expectation Maximization algorithm to solve the SLAM problem, 2011 IEEE Statistical Signal Processing Workshop (SSP), 2011.
DOI : 10.1109/SSP.2011.5967666

A. Darwiche, Recursive conditioning, Artificial Intelligence, vol.126, issue.1-2, pp.5-41, 2001.
DOI : 10.1016/S0004-3702(00)00069-2

URL : http://doi.org/10.1016/s0004-3702(00)00069-2

A. Darwiche-]-darwiche, Bayesian Networks, pp.467-509, 2008.

M. Davison, A. J. Davison, and D. W. Murray, Simultaneous localization and map-building using active vision, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.7, pp.865-880, 2002.
DOI : 10.1109/TPAMI.2002.1017615

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.135.7602

K. Dean, T. Dean, and K. Kanazawa, A model for reasoning about persistence and causation, Computational Intelligence, vol.4, issue.2, pp.142-150, 1989.
DOI : 10.1016/0004-3702(87)90012-9

R. Dechter, Bucket elimination: A unifying framework for reasoning, Artificial Intelligence, vol.113, issue.1-2, pp.41-85, 1999.
DOI : 10.1016/S0004-3702(99)00059-4

A. P. Dempster, Upper and lower probabilities induced by a multivalued mapping, The annals of Mathematical statistics, pp.325-339, 1967.

. Doshi, . Gmytrasiewicz, P. Doshi, and P. J. Gmytrasiewicz, Monte carlo sampling methods for approximating interactive pomdps, Journal of Artificial Intelligence Research, pp.297-337, 2009.

. Doshi, . Perez, P. Doshi, and D. Perez, Generalized point based value iteration for interactive pomdps, Proceedings of the 23rd Conference on Artificial intelligence (AAAI), 2008.

. Doshi, Graphical models for interactive POMDPs: representations and solutions, Autonomous Agents and Multi-Agent Systems, vol.20, issue.2, pp.376-416, 2009.
DOI : 10.1007/s10458-008-9064-7

D. Dubois, Uncertainty theories: a unified view, Proceedings of the IEEE Cybernetic Systems Conference, 2007.

P. Dubois, D. Dubois, and H. Prade, Possibility theory: qualitative and quantitative aspects Quantified representation of uncertainty and imprecision, pp.169-226, 1998.

R. Eidenberger and J. Scharinger, Active perception and scene modeling by planning with probabilistic 6D object poses, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010.
DOI : 10.1109/IROS.2010.5651927

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.302.6541

. Elmaliach, Multi-robot area patrol under frequency constraints, Annals of Mathematics and Artificial Intelligence, pp.293-320, 2009.
DOI : 10.1007/s10472-010-9193-y

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.116.4842

. Elmaliach, A realistic model of frequency-based multi-robot polyline patrolling, Proceedings of the 7th international Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2008.

M. R. Endsley, Toward a Theory of Situation Awareness in Dynamic Systems, Human Factors: The Journal of the Human Factors and Ergonomics Society, vol.37, issue.1, pp.32-64, 1995.
DOI : 10.1518/001872095779049543

. Fagin, Reasoning about knowledge, 1995.

. Feder, Adaptive Mobile Robot Navigation and Mapping, The International Journal of Robotics Research, vol.18, issue.7, pp.650-668, 1999.
DOI : 10.1177/02783649922066484

URL : http://albacore.mit.edu/~jleonard/pubs/ijrr_preprint.pdf

V. V. Fedorov, Theory of optimal experiments, 1972.

N. Fikes, R. E. Fikes, and N. J. Nilsson, Strips: A new approach to the application of theorem proving to problem solving, Artificial Intelligence, vol.2, issue.3-4, pp.189-208, 1972.
DOI : 10.1016/0004-3702(71)90010-5

M. Floreano, D. Floreano, and F. Mondada, Active perception, navigation, homing, and grasping: an autonomous perspective, Proceedings of PerAc '94. From Perception to Action, 1994.
DOI : 10.1109/FPA.1994.636089

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.115.8166

L. Floridi, Understanding epistemic relevance, Erkenntnis, pp.69-92, 2008.
DOI : 10.1007/s10670-007-9087-5

URL : http://philsci-archive.pitt.edu/4075/1/uer.pdf

H. Friedman, N. Friedman, and J. Y. Halpern, Plausibility measures: a user's guide, Proceedings of the 11th conference on Uncertainty in artificial intelligence, 1995.

H. Friedman, N. Friedman, and J. Y. Halpern, Plausibility measures and default reasoning, Journal of the ACM, vol.48, issue.4, pp.648-685, 2001.
DOI : 10.1145/502090.502092

URL : http://arxiv.org/abs/cs/9808007

T. Fudenberg, D. Fudenberg, and J. Tirole, Game Theory, 1991.

N. Gatti, Game theoretical insights in strategic patrolling: Model and algorithm in normal-form, Proceedings of the 18th European Conference on Artificial Intelligence (ECAI), 2008.

. Geffner, . Bonet, H. Geffner, and B. Bonet, Solving large pomdps using real time dynamic programming, Proceedings of the 15th AAAI Fall Symposium on POMDPs, 1998.

. Ghallab, Automated planning: theory & practice, 2004.

. Glad, Theoretical study of ant-based algorithms for multi-agent patrolling, Proceedings of the 18th European Conference on Artificial Intelligence including Prestigious Applications of Intelligent Systems (PAIS), 2008.
URL : https://hal.archives-ouvertes.fr/inria-00326963

. Gmytrasiewicz, . Doshi, P. J. Gmytrasiewicz, and P. Doshi, A framework for sequential planning in multi-agent settings, Journal of Artificial Intelligence Research, pp.49-79, 2005.

Z. Goldman, C. V. Goldman, and S. Zilberstein, Optimizing information exchange in cooperative multi-agent systems, Proceedings of the second international joint conference on Autonomous agents and multiagent systems , AAMAS '03, 2003.
DOI : 10.1145/860575.860598

Z. Goldman, C. V. Goldman, and S. Zilberstein, Decentralized control of cooperative systems: Categorization and complexity analysis, Journal of Artificial Intelligence Research, pp.143-174, 2004.

H. P. Grice, Syntax and semantics, chapter Logic and Conversation, pp.41-58, 1975.

. Guestrin, Efficient solution algorithms for factored mdps, Journal of Artificial Intelligence Research, pp.399-468, 2003.

A. Guo, Decision-theoretic active sensing for autonomous agents, Proceedings of the second international joint conference on Autonomous agents and multiagent systems , AAMAS '03, 2003.
DOI : 10.1145/860575.860766

Q. Guo and Z. Qu, Coverage control for a mobile robot patrolling a dynamic and uncertain environment, Proceedings of the 5th World Congress on Intelligent Control and Automation (WCICA), 2004.

J. Y. Halpern, Reasoning about Uncertainty, 2003.

J. Y. Halpern, Reasoning about Uncertainty, chapter Belief Revision, pp.97-104, 2003.

. Harmelen, Handbook of Knowledge Representation (Foundations of Artificial Intelligence), 2008.

J. V. Heijenoort, From Frege to Gödel: a source book in mathematical logic, 1967.

J. Hintikka, Knowledge and belief, 1962.

. Holmes, An o(n2) square root unscented kalman filter for visual simultaneous localization and mapping, IEEE Transactions on Pattern Analysis and Machine Intelligence, pp.1251-1263, 2009.

R. A. Howard, Dynamic Programming and Markov Processes, 1960.

. Hwang, Cooperative patrol planning of multi-robot systems by a competitive auction system, Proceedings of the International Joint Conference ICCAS-SICE, 2009.

. Izadi, . Precup, M. T. Izadi, and D. Precup, Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes, Machine Learning: ECML 2005, pp.593-600, 2005.
DOI : 10.1007/11564096_58

. Jensen, Bayesian updating in recursive graphical models by local computation, Computational Statistics Quarterly, pp.269-282, 1990.

. Jensfelt, A framework for vision based bearing only 3D SLAM, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006., 2006.
DOI : 10.1109/ROBOT.2006.1641990

C. Ji, S. Ji, and L. Carin, Cost-sensitive feature acquisition and classification, Pattern Recognition, vol.40, issue.5, pp.1474-1485, 2007.
DOI : 10.1016/j.patcog.2006.11.008

. Jøsang, A survey of trust and reputation systems for online service provision. Decision support system, pp.618-644, 2007.

J. Kiefer, General equivalence theory for optimum designs (approximate theory) The annals of Statistics, pp.849-879, 1974.

. Ko, A practical, decision-theoretic approach to multi-robot mapping and exploration, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453), 2003.
DOI : 10.1109/IROS.2003.1249654

R. Kollar, T. Kollar, and N. Roy, Trajectory Optimization using Reinforcement Learning for Map Exploration, The International Journal of Robotics Research, vol.19, issue.3, pp.175-196, 2008.
DOI : 10.1007/978-1-4613-8997-2_14

. Koller, . Parr, D. Koller, and R. Parr, Policy iteration for factored mdps, Proceedings of the 16th conference on Uncertainty in artificial intelligence, 2000.

. Kontitsis, Multi-robot active SLAM with relative entropy optimization, 2013 American Control Conference, 2013.
DOI : 10.1109/ACC.2013.6580252

H. W. Kuhn, The hungarian method for the assignment problem, Naval research logistics quarterly, pp.83-97, 1955.

. Kullback, . Leibler, S. Kullback, and R. A. Leibler, On information and sufficiency . The annals of mathematical statistics, pp.79-86, 1951.
DOI : 10.1214/aoms/1177729694

. Kurniawati, SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces, Robotics: Science and Systems IV, pp.65-72, 2008.
DOI : 10.15607/RSS.2008.IV.009

D. Larkin and R. Dechter, Bayesian inference in the presence of determinism, Proceedings of the 9+th International Workshop on Artificial Intelligence and Statistics, 2003.

. Lauritzen, S. L. Spiegelhalter-]-lauritzen, and D. J. Spiegelhalter, Local computations with probabilities on graphical structures and their application to expert systems, Journal of the Royal Statistical Society, pp.157-224, 1988.

L. Laverny, N. Laverny, and J. Lang, From knowledge-based programs to graded belief-based programs, part i: On-line reasoning*. Synthese, pp.277-321, 2005.
DOI : 10.1007/1-4020-4631-6_8

L. Laverny, N. Laverny, and J. Lang, From knowledge-based programs to graded belief-based programs, part ii: off-line reasoning, Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI), 2005.

. Leung, Active SLAM using Model Predictive Control and Attractor based Exploration, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006.
DOI : 10.1109/IROS.2006.282530

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.89.7906

. Lewis, Applications suitable for unmanned and autonomous missions utilizing the tactical amphibious ground support (tags) platform. Defense and Security, pp.508-519, 2004.

. Lilienthal, A rao-blackwellisation approach to gdm-slam ? integrating slam and gas distribution mapping, Proceedings of the European Conference on Mobile Robots (ECMR), 2007.

. Littman and M. L. Littman, Memoryless policies: Theoretical limitations and practical results, From Animals to Animats 3: Proceedings of the third international conference on simulation of adaptive behavior, 1994.

. Littman and M. L. Littman, The witness algorithm: Solving partially observable markov decision processes, 1994.

. Littman, Stochastic boolean satisfiability, Journal of Automated Reasoning, pp.251-296, 2001.

. Loutfi, Gas distribution mapping of multiple odour sources using a mobile robot, Robotica, vol.4, issue.02, pp.311-319, 2009.
DOI : 10.1016/S0925-4005(99)00415-3

W. S. Lovejoy, Computationally Feasible Bounds for Partially Observed Markov Decision Processes, Operations Research, vol.39, issue.1, pp.162-175, 1991.
DOI : 10.1287/opre.39.1.162

. Machado, Multiagent patrolling: An empirical analysis of alternative architectures. Multi-Agent-Based Simulation II, pp.155-170, 2003.

A. Markov, The Theory of Algorithms, 1960.

M. Martins-filho, L. S. Martins-filho, and E. E. Macau, Patrol Mobile Robots and Chaotic Trajectories, Mathematical Problems in Engineering, vol.38, issue.9, 2007.
DOI : 10.1109/21.59969

URL : http://doi.org/10.1155/2007/61543

. Matignon, Distributed value functions for multi-robot exploration, 2012 IEEE International Conference on Robotics and Automation, 2012.
DOI : 10.1109/ICRA.2012.6224937

URL : https://hal.archives-ouvertes.fr/hal-00966784

. Menezes, Negotiator Agents for the Patrolling Task, Advances in Artificial Intelligence IBERAMIA-SBIA, 2006.
DOI : 10.1007/11874850_9

. Mihaylova, A comparison of decision making criteria and optimization methods for active robotic sensing. Numerical Methods and Applications, pp.316-324, 2003.

. Mihaylova, Active sensing for robotics -a survey, Proceedings of the 5 th International Conference On Numerical Methods and Applications, 2002.

R. V. Mises, Probability, statistics, and truth, 1957.

J. Moeschler, Language and Speech Engineering, chapter ntroduction to pragmatics, pp.51-68, 2007.

S. Thrun, FastSLAM: A Scalable Method for the Simultaneous Localization and Mapping Problem in Robotics, 2007.

. Montemerlo, Fastslam: A factored solution to the simultaneous localization and mapping problem, Proceedings of the 18th National Conference on Artificial intelligence (AAAI), 2002.

. Morari, MODEL PREDICTIVE CONTROL: THEORY AND PRACTICE, Proceedings of the Workshop on Model Based Process Control, 2014.
DOI : 10.1016/B978-0-08-035735-5.50006-1

E. Moravec, H. P. Moravec, and A. Elfes, High resolution maps from wide angle sonar, Proceedings. 1985 IEEE International Conference on Robotics and Automation, 1985.
DOI : 10.1109/ROBOT.1985.1087316

R. E. Neapolitan, Probabilistic Reasoning in Expert Systems: Theory and Algorithms, 1990.

. Newell, Report on a general problem-solving program, IFIP Congress, 1959.

F. A. Oliehoek, Reinforcement Learning: State of the Art, chapter Decentralized POMDPs, pp.471-503, 2012.

. Oliehoek, Exploiting locality of interaction in factored dec-pomdps, Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems, 2008.

. Oxford and . Oxford, Patrol. The Oxford online dictionary. Accessed, pp.2015-2016

G. Palacios, H. Palacios, and H. Geffner, Compiling uncertainty away in conformant planning problems with bounded width, Journal of Artificial Intelligence Research, pp.623-675, 2009.

. Paruchuri, Playing games for security: an efficient exact algorithm for solving bayesian stackelberg games, Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems, 2008.

. Paruchuri, Coordinating Randomized Policies for Increasing the Security of Agent Systems, Information Technology and Management, pp.67-79, 2009.
DOI : 10.1017/CBO9780511973031.007

. Paruchuri, An efficient heuristic approach for security against multiple adversaries, Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems , AAMAS '07, 2007.
DOI : 10.1145/1329125.1329344

. Paruchuri, Security in multiagent systems by policy randomization, Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems , AAMAS '06, 2006.
DOI : 10.1145/1160633.1160681

J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, 1988.

W. Penberthy, J. S. Penberthy, and D. S. Weld, Ucpop: A sound, complete , partial order planner for adl, Proceedings of the 3rd International Conference on the Principles of Knowledge Representation and Reasoning (KR), 1992.

B. Petrick, R. P. Petrick, and F. Bacchus, A knowledge-based approach to planning with incomplete information and sensing, Artificial Intelligence Planning Sys- tems, 2002.

B. Petrick, R. P. Petrick, and F. Bacchus, Extending the knowledgebased approach to planning with incomplete information and sensing, Proceedings of the 9th International Conference on the Principles of Knowledge Representation and Reasoning (KR), 2004.

. Pineau, Point-based value iteration: An anytime algorithm for pomdps, Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI), 2003.

. Pistore, Automated composition of web services by planning at the knowledge level, Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI), 2005.

Z. Poole, D. Poole, and N. L. Zhang, Exploiting contextual independence in probabilistic inference, Journal of Artificial Intelligence Research, pp.263-313, 2003.

M. Poole, D. L. Poole, and A. K. Mackworth, Artificial Intelligence: foundations of computational agents, 2010.
DOI : 10.1017/CBO9780511794797

R. Portugal, D. Portugal, and R. Rocha, MSP algorithm, Proceedings of the 2010 ACM Symposium on Applied Computing, SAC '10, 2010.
DOI : 10.1145/1774088.1774360

R. Portugal, D. Portugal, and R. Rocha, A survey on multi-robot patrolling algorithms. Technological Innovation for Sustainability, pp.139-146, 2011.

P. Poupart, Exploiting structure to efficiently solve large scale partially observable Markov decision processes, 2005.

. Poupart, . Boutilier, P. Poupart, and C. Boutilier, Bounded finite state controllers, Advances in Neural Information Processing Systems (NIPS), 2003.

. Poupart, Closing the gap: Improved bounds on optimal pomdpsolutions, Proceedings of the 21st International Conference on Automated Planning and Scheduling (ICAPS), 2011.

M. L. Puterman, Markov decision processes: discrete stochastic dynamic programming, 1994.
DOI : 10.1002/9780470316887

D. V. Pynadath, The communicative multiagent team decision problem: Analyzing teamwork theories and models, Journal of Artificial Intelligence Research, pp.389-423, 2002.

. Rathnasabapathy, Exact solutions of interactive POMDPs using behavioral equivalence, Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems , AAMAS '06, 2006.
DOI : 10.1145/1160633.1160816

C. Ross, S. Ross, and B. Chaib-draa, Aems: An anytime online search algorithm for approximate policy refinement in large pomdps, Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), 2007.

S. Roussel, Apports de la logique mathématique pour la modélisation de l'information échangée dans les systèmes multi-agents interactifs, 2010.

S. Roussel and L. Cholvy, Cooperative interpersonal communication and relevant information, Proceedings of the ESSLLI Workshop on Logical Methods for Social Concepts, 2009.

. Ruan, Patrolling in a stochastic environment, 2005.

S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, 2009.

E. D. Sacerdoti, The nonlinear nature of plans, Proceedings of the 4th International Joint Conference on Artificial Intelligence (IJCAI), 1975.

. Sak, Probabilistic Multiagent Patrolling, Advances in Artificial Intelligence (SBIA), 2008.
DOI : 10.1137/S1064827595287997

T. Saracevic, Relevance reconsidered, Proceedings of the 2nd Conference on Conceptions of Library and Information Science (CoLIS 2), 1996.

. Satsangi, Exploiting submodular value functions for faster dynamic sensor selection: Extended version, 2014.

. Satsangi, Exploiting submodular value functions for faster dynamic sensor selection, Proceedings of the 29th Conference on Artificial intelligence (AAAI), 2015.

L. J. Savage, The foundations of statistics, 1972.

A. Sayyareh, A new upper bound for kullback-leibler divergence, Applied Mathematical Sciences, pp.3303-3317, 2011.

Z. Seuken, S. Seuken, and S. Zilberstein, Formal models and algorithms for decentralized decision making under uncertainty, Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2008.
DOI : 10.1007/s10458-007-9026-5

. Seymour, . Peterson, R. Seymour, and G. L. Peterson, A Trust-Based Multiagent System, 2009 International Conference on Computational Science and Engineering, 2009.
DOI : 10.1109/CSE.2009.297

P. Shachter, R. D. Shachter, and M. A. Peot, Simulation Approaches to General Probabilistic Inference on Belief Networks, Proceedings of the 5th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 1989.
DOI : 10.1016/B978-0-444-88738-2.50024-5

G. Shafer and . Shani, A mathematical theory of evidence. Princeton university press A survey of point-based pomdp solvers, Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 1976.

. Shenoy, . Shafer, P. P. Shenoy, and G. Shafer, Propagating Belief Functions with Local Computations, IEEE Expert, vol.1, issue.3, pp.43-52, 1986.
DOI : 10.1109/MEX.1986.4306979

O. Sigaud and O. Buffet, Markov decision processes in artificial intelligence, 2010.
DOI : 10.1002/9781118557426

URL : https://hal.archives-ouvertes.fr/inria-00432735

D. Silver and J. Veness, Monte-carlo planning in large pomdps, Advances in Neural Information Processing Systems, 2010.

R. Sim, R. Sim, and N. Roy, Global A-Optimal Robot Exploration in SLAM, Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005.
DOI : 10.1109/ROBOT.2005.1570193

. Simmons, Coordination for multi-robot exploration and mapping, Proceedings of the 17th National Conference on Artificial intelligence (AAAI), 2000.

. Singh, Efficient informative sensing using multiple robots, Journal of Artificial Intelligence Research, pp.707-755, 2009.

S. Smallwood, R. D. Smallwood, and E. J. Sondik, The Optimal Control of Partially Observable Markov Processes over a Finite Horizon, Operations Research, vol.21, issue.5, pp.1071-1088, 1973.
DOI : 10.1287/opre.21.5.1071

D. E. Smith and D. S. Weld, Conformant graphplan, Proceedings of the 15th National Conference on Artificial intelligence (AAAI), 1998.

T. Smyth, Rules for tower of hanoï, pp.2014-2024, 2014.

E. J. Sondik, The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs, Operations Research, vol.26, issue.2, pp.282-304, 1978.
DOI : 10.1287/opre.26.2.282

M. T. Spaan, Cooperative active perception using pomdps, Proceedings of the Workshop on advancements in POMDP solvers (Workshop of AAAI), 2008.

. Spaan, Active cooperative perception in network robot systems using POMDPs, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010.
DOI : 10.1109/IROS.2010.5648856

V. Spaan, M. T. Spaan, and N. A. Vlassis, Perseus: Randomized pointbased value iteration for pomdps, Journal of Artificial Intelligence Research, pp.195-220, 2005.

W. Spohn, Ordinal conditional functions: A dynamic theory of epistemic states, Proceedings of the Irvine Conference on Probability and Causation, 1988.

. Stachniss, Information gainbased exploration using rao-blackwellized particle filters, Robotics: Science and Systems, 2005.

M. Stefik, Planning with constraints, Artificial Intelligence, pp.111-139, 1981.

. Sunderhauf, Using the Unscented Kalman Filter in Mono-SLAM with Inverse Depth Parametrization for Autonomous Airship Control, 2007 IEEE International Workshop on Safety, Security and Rescue Robotics, 2007.
DOI : 10.1109/SSRR.2007.4381265

G. J. Sussman, The virtuous nature of bugs, pp.111-117, 1974.

. Van-lambalgen, Random Sequences, 1987.

L. Van, . Wilson, J. H. Van-lint, and R. M. Wilson, A course in combinatorics, 2001.

. Wang, Multi-robot simultaneous localization and mapping using D-SLAM framework, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information, 2007.
DOI : 10.1109/ISSNIP.2007.4496863

. Weyns, TOWARDS ACTIVE PERCEPTION IN SITUATED MULTI-AGENT SYSTEMS, Applied Artificial Intelligence, vol.5, issue.9-10, pp.867-883, 2004.
DOI : 10.1145/958961.958963

. Wilson, D. Sperber-]-wilson, and D. Sperber, Handbook of pragmatics, chapter Relevance theory, pp.606-632, 2002.

G. H. Wright, An essay in modal logic, 1951.

. Wurm, Coordinated multirobot exploration using a segmentation of the environment, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2008.

B. Yamauchi, A frontier-based approach for autonomous exploration, Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. 'Towards New Computational Principles for Robotics and Automation', 1997.
DOI : 10.1109/CIRA.1997.613851

B. Yamauchi, Frontier-based exploration using multiple robots, Proceedings of the second international conference on Autonomous agents , AGENTS '98, 1998.
DOI : 10.1145/280765.280773

. Yedidia, Constructing freeenergy approximations and generalized belief propagation algorithms, IEEE Transactions on Information Theory, pp.1182-2312, 2005.
DOI : 10.1109/tit.2005.850085

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.118.9766

L. A. Zadeh, Fuzzy logic, Computer, pp.83-93, 1988.

L. A. Zadeh, Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets and Systems, pp.9-34, 1999.

R. Zhou, X. S. Zhou, and S. I. Roumeliotis, Multi-robot SLAM with Unknown Initial Correspondence: The Robot Rendezvous Case, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006.
DOI : 10.1109/IROS.2006.282219

. Zlot, Multi-robot exploration controlled by a market economy, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292), 2002.
DOI : 10.1109/ROBOT.2002.1013690