Finding good stochastic factored policies for factored Markov decision processes - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Finding good stochastic factored policies for factored Markov decision processes

Résumé

We propose a framework for approximate resolution of MDPs with factored state space, factored action space and additive reward, based on (i) considering stochastic factored policies (SFPs) with a given structure, (ii) using variational approximations to estimate SFP values and (iii) using local continuous optimization algorithms to compute “good” SFPs.We have implemented and tested an algorithm (CA-LBP), involving a loopy belief propagation algorithm and a coordinate ascent procedure. Experiments show that CA-LBP performs as well as a state-of-the-art algorithm dedicated to a specific sub-class of FA-FMDPs, and that CA-LBP can be applied to general FA-FMDPs with up to 100 binary state variables and 100 binary action variables.
Fichier non déposé

Dates et versions

hal-02742738 , version 1 (03-06-2020)

Identifiants

  • HAL Id : hal-02742738 , version 1
  • PRODINRA : 262087
  • WOS : 000349444700225

Citer

Julia J. Radoszycki, Nathalie Dubois Peyrard Peyrard, Régis Sabbadin. Finding good stochastic factored policies for factored Markov decision processes. 21st European Conference on Artificial Intelligence, Aug 2014, prague, Czech Republic. 2 p. ⟨hal-02742738⟩
3 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More