Finding good stochastic factored policies for factored Markov decision processes

Julia J. Radoszycki; Nathalie Dubois Peyrard Peyrard; Régis Sabbadin

Communication Dans Un Congrès Année : 2014

Finding good stochastic factored policies for factored Markov decision processes

(1) , (1) , (1)

Julia J. Radoszycki

Fonction : Auteur

Unité de Mathématiques et Informatique Appliquées de Toulouse

Nathalie Dubois Peyrard Peyrard

Fonction : Auteur
PersonId : 736895
IdHAL : nathalie-peyrard
ORCID : 0000-0002-0356-1255
IdRef : 060568283

Unité de Mathématiques et Informatique Appliquées de Toulouse

Régis Sabbadin

Fonction : Auteur
PersonId : 736236
IdHAL : regis-sabbadin
ORCID : 0000-0002-6286-1821
IdRef : 133261395

Unité de Mathématiques et Informatique Appliquées de Toulouse

Résumé

We propose a framework for approximate resolution of MDPs with factored state space, factored action space and additive reward, based on (i) considering stochastic factored policies (SFPs) with a given structure, (ii) using variational approximations to estimate SFP values and (iii) using local continuous optimization algorithms to compute “good” SFPs.We have implemented and tested an algorithm (CA-LBP), involving a loopy belief propagation algorithm and a coordinate ascent procedure. Experiments show that CA-LBP performs as well as a state-of-the-art algorithm dedicated to a specific sub-class of FA-FMDPs, and that CA-LBP can be applied to general FA-FMDPs with up to 100 binary state variables and 100 binary action variables.

Domaines

Mathématiques [math] Informatique [cs]

Migration ProdInra : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-02742738

Soumis le : mercredi 3 juin 2020-04:44:08

Dernière modification le : jeudi 14 mars 2024-03:13:37

Dates et versions

hal-02742738 , version 1 (03-06-2020)

Identifiants

HAL Id : hal-02742738 , version 1
PRODINRA : 262087
WOS : 000349444700225

Citer

Julia J. Radoszycki, Nathalie Dubois Peyrard Peyrard, Régis Sabbadin. Finding good stochastic factored policies for factored Markov decision processes. 21st European Conference on Artificial Intelligence, Aug 2014, prague, Czech Republic. 2 p. ⟨hal-02742738⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRA INRAE INRAEOCCITANIETOULOUSE MATHNUM MIAT

3 Consultations

0 Téléchargements

Finding good stochastic factored policies for factored Markov decision processes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager