화학공학소재연구정보센터
IEEE Transactions on Automatic Control, Vol.58, No.3, 639-653, 2013
Nash, Social and Centralized Solutions to Consensus Problems via Mean Field Control Theory
The purpose of this paper is to synthesize initial mean consensus behavior of a set of agents from the fundamental optimization principles of i) stochastic dynamic games, and ii) optimal control. In the stochastic dynamic game model each agent seeks to minimize its individual quadratic discounted cost function involving the mean of the states of all other agents. In this formulation we derive the limiting infinite population mean field equation system and explicitly compute its unique solution. The resulting mean field (MF) control strategies drive each agent to track the overall population's initial state distribution mean, and by applying these decentralized strategies, any finite population system reaches mean consensus asymptotically as time goes to infinity. Furthermore, these control laws possess an epsilon(N)-Nash equilibrium property where epsilon(N) goes to zero as the population size goes to infinity. Finally, the analysis is extended to the case of random mean field couplings. In the social cooperative formulation the basic objective is to minimize a social cost as the sum of the individual cost functions containing mean field coupling. In this formulation we show that for any individual agent the decentralized mean field social (MF Social) control strategy is the same as the mean field Nash (MF Nash) equilibrium strategy. Hence MF - Nash Controls U-Nash(N) = MF - Social Controls U-Soc(N). On the other hand, the solution to the centralized LQR optimal control formulation yields the Standard Consensus (SC) algorithm whenever the graph representing the corresponding topology of the network is Completely Connected (CC). Hence Cent. LQR Controls U-Cent(N) = SC - CC Conrols U-SC(N). Moreover, a system with centralized control laws reaches consensus on the initial state distributionmean as time and population size N go to infinity. Hence, asymptotically in time MF - Nash Controls U-Nash(N) = MF - Social Controls U-Soc(N) = Cent. LQR Controls U-Cent(infinity) = SC - CC Controls U-SC(infinity). Finally, the analysis is extended to the long time average (LTA) cost functions case.