A Distributed Actor-Critic Algorithm and Applications to Mobile Sensor Network Coordination Problems

Pennesi P; Paschalidis IC

IEEE Transactions on Automatic Control, Vol.55, No.2, 492-497, 2010

DOI: 10.1109/TAC.2009.2037462

We introduce a distributed actor-critic method for coordinating multiple agents that solve a general class of Markov decision problems, and we establish its convergence. The method builds on the centralized single-agent actor-critic algorithm of [1] and uses a consensus-like algorithm to update the agents' policy parameters. To validate the approach, we apply it to a reward collection problem, an instance of multi-agent coordination in a partially known environment that is subject to dynamic changes and communication constraints.

Keywords: Actor-critic methods; consensus; Markov decision processes (MDP); multi-agent coordination; sensor networks