Skip to Main content Skip to Navigation

Le transfert adaptatif en apprentissage par renforcement : application à la simulation de schéma de jeux tactiques

Aydano Pamponet Machado 1
1 SMA - Systèmes Multi-Agents
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : A possible way to accelerate reinforcement learning process is to guide the exploration process using prior domain knowledge. This called knowledge transfer, and most transfer algorithms are based on an implicit assumption. They suppose that prior knowledge has a good quality for the current task. If this condition is not true, the learning process will be worse than standard reinforcement learning algorithm (negative transfer). This thesis put forwards some transfer algorithms to avoid this problem, whose can adapts learning process to prior knowledge quality. More precisely, we introduce a parameter called transfer rate, which controls how much prior knowledge will be used. In addition, we propose to optimize the transfer rate in order to make the best use of this policy. Thus, the proposed algorithms provide some robustness, working for all prior knowledge quality level, which was not the case with previous approaches. These algorithms are evaluated in two different problems: a toy problem (the gridworld), and a real complex one (a coach assistant tool). The latter application offers a coach to seize tactical patterns with a graphical interface, and then allows agents to view players doing the same patterns. To meet within a reasonable time, the request of the coach. The reinforcement learning alone is not enough, and transfer our algorithms have been applied to this area with success
Document type :
Complete list of metadatas

Cited literature [70 references]  Display  Hide  Download
Contributor : Theses Bupmc <>
Submitted on : Tuesday, April 16, 2013 - 4:41:39 PM
Last modification on : Friday, January 8, 2021 - 5:50:03 PM
Long-term archiving on: : Wednesday, July 17, 2013 - 4:06:37 AM


  • HAL Id : tel-00814207, version 1


Aydano Pamponet Machado. Le transfert adaptatif en apprentissage par renforcement : application à la simulation de schéma de jeux tactiques. Modélisation et simulation. Université Pierre et Marie Curie - Paris VI, 2009. Français. ⟨NNT : 2009PA066209⟩. ⟨tel-00814207⟩



Record views


Files downloads