Dr. Howard M. Schwartz: Publication Abstract

Abstract: We extend the potential-based shaping method from Markov decision processes to multi-player general-sum stochastic games. We prove that the Nash equilibrium of the stochastic game remains unchanged after potential-based shaping is applied to the environment. The property of policy invariance provides a possible way of speeding convergence when learning to play a stochastic game. PDF
Keywords: game theory, machine learning, multiagent systems, reinforcement learning

Department of Systems and Computer Engineering
Ottawa, Canada

Dr. Howard Schwartz: Publication Abstract

Department of Systems and Computer Engineering Ottawa, Canada

Dr. Howard Schwartz: Publication Abstract

Department of Systems and Computer Engineering
Ottawa, Canada