Using "social actions" and RL algorithms to build policies in Dec-POMDP - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Communication Dans Un Congrès Année : 2009

Using "social actions" and RL algorithms to build policies in Dec-POMDP

Vincent Thomas

Résumé

Building individual behaviors to solve collective problems is a major stake whose applications are found in several domains. DecPOMDP has been proposed as formalism for describing multi-agent problems. However, solving a Dec POMDP turned out to be a NEXP problem. In this study, we introduced the original concept of social action to get round the inherent complexity of DecPOMDP and we proposed three decentralized reinforcement learning algorithms which approximate the optimal policy in DecPOMDP. This article analyses the results obtained and argues that this new approach seems promising for automatic top-down collective behavior computation.
Fichier non déposé

Dates et versions

inria-00399400 , version 1 (26-06-2009)

Identifiants

  • HAL Id : inria-00399400 , version 1
  • PRODINRA : 248808

Citer

Vincent Thomas, Mahuna Akplogan. Using "social actions" and RL algorithms to build policies in Dec-POMDP. IADIS International Conference on Intelligent Systems and Agents 2009 - IADIS ISA 2009, Jun 2009, Lagoa, Portugal. ⟨inria-00399400⟩
191 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More