Data-based Reinforcement Learning Algorithm with Experience Replay for Solving Constrained Nonzero-sum Differential Games

21st International Symposium on Mathematical Theory of Networks and Systems , 2014-07-07

Title : ( Data-based Reinforcement Learning Algorithm with Experience Replay for Solving Constrained Nonzero-sum Differential Games )

Authors: Sholeh Yasini , Ali Karimpour , Mohammad Bagher Naghibi Sistani ,

File:

دانلود فایل برای اعضای دانشگاه

Access to full-text not allowed by authors

Citation: BibTeX | EndNote

Abstract

Abstract—In this paper a partially model-free reinforcement learning (RL) algorithm based on experience replay is developed for finding online the Nash equilibrium solution of the multi-player nonzero-sum (NZS) differential games. In order to avoid the performance degradation or even system instability, the amplitude limitation on the control inputs is considered in the design procedure. The proposed algorithm is implemented on actor-critic structure for every player in the game, where both actor and critic networks are tuned at the same time. The game players learn online the solution of the constrained coupled Hamilton-Jacobi (HJ) equations, without using any knowledge on the internal system dynamics. The idea of experience replay is used to relax the requirement for checking the restrictive persistence of excitation (PE) condition which is difficult to verify or implement online. The closed-loop stability is analyzed and the convergence to the Nash equilibrium of the game is shown. A simulation study example is provided showing the effectiveness of the proposed approach.

Keywords

, Coupled Hamilton-Jacob Equations, Nonzero-sum Differential Game, Reinforcement Learning

برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

BibTeX
EndNote

@inproceedings{paperid:1044409,
author = {Yasini, Sholeh and Karimpour, Ali and Naghibi Sistani, Mohammad Bagher},
title = {Data-based Reinforcement Learning Algorithm with Experience Replay for Solving Constrained Nonzero-sum Differential Games},
booktitle = {21st International Symposium on Mathematical Theory of Networks and Systems},
year = {2014},
location = {IRAN},
keywords = {Coupled Hamilton-Jacob Equations; Nonzero-sum Differential Game; Reinforcement Learning},
}

[Download]

%0 Conference Proceedings
%T Data-based Reinforcement Learning Algorithm with Experience Replay for Solving Constrained Nonzero-sum Differential Games
%A Yasini, Sholeh
%A Karimpour, Ali
%A Naghibi Sistani, Mohammad Bagher
%J 21st International Symposium on Mathematical Theory of Networks and Systems
%D 2014

[Download]