Représentations relationnelles et apprentissage interactif pour l'apprentissage efficace du comportement coopératif

Thibaut Munzer 1
1 Flowers - Flowing Epigenetic Robots and Systems
Inria Bordeaux - Sud-Ouest, U2IS - Unité d'Informatique et d'Ingénierie des Systèmes
Abstract : This thesis presents new approaches toward efficient and intuitive high-level plan learning for cooperative robots. More specifically this work study Learning from Demonstration algorithm for relational domains. Using relational representation to model the world, simplify representing concurrentand cooperative behavior.We have first developed and studied the first algorithm for Inverse ReinforcementLearning in relational domains. We have then presented how one can use the RAP formalism to represent Cooperative Tasks involving a robot and a human operator. RAP is an extension of the Relational MDP framework that allows modeling concurrent activities. Using RAP allow us to represent both the human and the robot in the same process but also to model concurrent robot activities. Under this formalism, we have demonstrated that it is possible to learn behavior, as policy and as reward, of a cooperative team. Prior knowledge about the task can also be used to only learn preferences of the operator.We have shown that, using relational representation, it is possible to learn cooperative behaviors from a small number of demonstration. That these behaviors are robust to noise, can generalize to new states and can transfer to different domain (for example adding objects). We have also introduced an interactive training architecture that allows the system to make fewer mistakes while requiring less effort from the human operator. By estimating its confidence the robot is able to ask for instructions when the correct activity to dois unsure. Lastly, we have implemented these approaches on a real robot and showed their potential impact on an ecological scenario.
Document type :
Complete list of metadatas

Cited literature [77 references]  Display  Hide  Download
Contributor : Abes Star <>
Submitted on : Tuesday, May 23, 2017 - 4:45:08 PM
Last modification on : Wednesday, July 3, 2019 - 10:48:04 AM
Long-term archiving on : Friday, August 25, 2017 - 12:45:27 AM


Version validated by the jury (STAR)


  • HAL Id : tel-01526955, version 1


Thibaut Munzer. Représentations relationnelles et apprentissage interactif pour l'apprentissage efficace du comportement coopératif. Autre [cs.OH]. Université de Bordeaux, 2017. Français. ⟨NNT : 2017BORD0574⟩. ⟨tel-01526955⟩



Record views


Files downloads