Investigating the Effect of Reducing State-Space Resolution and Considering Uncertainty in the Timing Control of an Isolated Intersection Using Q-Learning

Sixth International Conference on Traffic Management and Safety, 2024-05-21

Title: Investigating the Effect of Reducing State-Space Resolution and Considering Uncertainty in the Timing Control of an Isolated Intersection Using Q-Learning

Authors: Naser Pariz

Abstract

In this research, Q-learning, a reinforcement learning method that interacts with the environment and learns in real time, is used to improve the signal timing of an intersection. The evaluation criterion is the cumulative waiting time of vehicles on the approaches to the intersection. The study critically analyzes the performance and shortcomings of conventional discrete-state reinforcement learning methods in problems with a very large number of states. Its main contribution is the definition of a new state space for traffic signal control problems, together with an examination of how reducing the resolution of the state space and accounting for uncertainty in action selection affect the quality of reinforcement learning. It is further shown that reduced resolution improves agent performance even though a discrete-state method is used. To evaluate the proposed method, its results are compared with those of continuous-state Q-learning and fixed-time control reported in [1, 2]. The control algorithms were implemented, and the traffic data generated, using the SUMO traffic simulator. The results indicate a reduction in both total waiting time and computation time for the proposed method relative to the methods it is compared with.
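The idea of a reduced-resolution discrete state space combined with non-deterministic action selection can be illustrated with a minimal tabular Q-learning sketch. Everything specific here is an assumption for illustration, not the paper's method: the state is taken to be per-approach queue lengths binned coarsely (`discretize`, `bin_size`, `max_bin` are hypothetical), "uncertainty in action selection" is interpreted as epsilon-greedy exploration, and the reward is assumed to be the negative waiting time.

```python
import random
from collections import defaultdict


def discretize(queue_lengths, bin_size=5, max_bin=3):
    """Map raw queue lengths to coarse bins (hypothetical reduced-resolution state)."""
    return tuple(min(q // bin_size, max_bin) for q in queue_lengths)


def choose_action(q_table, state, n_actions, epsilon, rng):
    """Epsilon-greedy selection: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q_table[(state, a)] for a in range(n_actions)]
    return values.index(max(values))


def q_update(q_table, state, action, reward, next_state, n_actions,
             alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on a tabular Q function."""
    best_next = max(q_table[(next_state, a)] for a in range(n_actions))
    td_target = reward + gamma * best_next
    q_table[(state, action)] += alpha * (td_target - q_table[(state, action)])


# Toy usage with made-up numbers: four approaches, two signal phases.
rng = random.Random(0)
q_table = defaultdict(float)  # unseen (state, action) pairs default to 0.0
state = discretize([12, 3, 0, 27])
next_state = discretize([8, 4, 1, 20])
# Assumed reward: negative waiting time accumulated during the step.
q_update(q_table, state, action=1, reward=-10.0,
         next_state=next_state, n_actions=2)
greedy = choose_action(q_table, state, n_actions=2, epsilon=0.0, rng=rng)
```

With a bin size of 5 and four bins per approach, the table has at most 4^4 = 256 states per action, whereas raw queue counts would give a far larger table; this is the kind of trade-off between resolution and learnability that the abstract refers to.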

Keywords

Reinforcement Learning, Traffic Signal Control, Q-Learning, Isolated Intersection, Machine Learning

BibTeX
EndNote

@inproceedings{paperid:1102836,
author = {Pariz, Naser},
title = {Investigating the Effect of Reducing State-Space Resolution and Considering Uncertainty in the Timing Control of an Isolated Intersection Using {Q}-Learning},
booktitle = {Sixth International Conference on Traffic Management and Safety},
year = {2024},
location = {Tehran, Iran},
keywords = {Reinforcement Learning; Traffic Signal Control; Q Learning; Isolated Intersection; Machine Learning},
}

%0 Conference Proceedings
%T Investigating the Effect of Reducing State-Space Resolution and Considering Uncertainty in the Timing Control of an Isolated Intersection Using Q-Learning
%A Pariz, Naser
%J Sixth International Conference on Traffic Management and Safety
%D 2024
