Using "social actions" and RL algorithms to build policies in Dec-POMDP

Vincent Thomas; Mahuna Akplogan

Communication Dans Un Congrès Année : 2009

Using "social actions" and RL algorithms to build policies in Dec-POMDP

(1) , (2)

1
2

Vincent Thomas

Fonction : Auteur
PersonId : 16368
IdHAL : vincent-thomas
ORCID : 0000-0003-3401-4649

Autonomous intelligent machine

Mahuna Akplogan

Fonction : Auteur
PersonId : 861454

Institut National de la Recherche Agronomique

Résumé

Building individual behaviors to solve collective problems is a major stake whose applications are found in several domains. DecPOMDP has been proposed as formalism for describing multi-agent problems. However, solving a Dec POMDP turned out to be a NEXP problem. In this study, we introduced the original concept of social action to get round the inherent complexity of DecPOMDP and we proposed three decentralized reinforcement learning algorithms which approximate the optimal policy in DecPOMDP. This article analyses the results obtained and argues that this new approach seems promising for automatic top-down collective behavior computation.

Mots clés

Multi-agent systems Markov decision processes reinforcement learning interaction. interaction

Domaines

Système multi-agents [cs.MA]

Vincent Thomas : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00399400

Soumis le : vendredi 26 juin 2009-12:01:09

Dernière modification le : vendredi 24 mars 2023-14:52:52

Dates et versions

inria-00399400 , version 1 (26-06-2009)

Identifiants

HAL Id : inria-00399400 , version 1
PRODINRA : 248808

Citer

Vincent Thomas, Mahuna Akplogan. Using "social actions" and RL algorithms to build policies in Dec-POMDP. IADIS International Conference on Intelligent Systems and Agents 2009 - IADIS ISA 2009, Jun 2009, Lagoa, Portugal. ⟨inria-00399400⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRA UNIV-LORRAINE INRIA2 LORIA INRAE

191 Consultations

0 Téléchargements

Using "social actions" and RL algorithms to build policies in Dec-POMDP

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager