0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Better Exploration In Single-Agent Q-Learning Using Controlled Linear Perturbation
نویسندگان :
Sadredin Hokmi
1
Mohammad Haeri
2
1- Sharif university of technology
2- Sharif university of technology
کلمات کلیدی :
Q-learning،Exploration،Controlled Linear perturbation،Convergence rate،Maze،Cart-Pole
چکیده :
Reinforcement learning algorithms, especially model-free algorithms like Q-learning, have shown reliable results in finding optimal solutions for many real-time applications. However, challenges such as exploration in real-time and the convergence rate need to be addressed, and many researches have proposed algorithms to tackle these challenges. Algorithms like speedy Q-learning, Zap Q-learning, algorithms based on adding a regularization term, noise injection, and many others have been introduced. In this paper, an algorithm based on controlled linear perturbation is presented, which, according to the numerical results, can significantly reduce unnecessary explorations that are risky in real-time. Additionally, the proposed algorithm does not depend on the learning rate \mathbit{\alpha}, \mathbit{\gamma}, or changes in coefficients. However, to be effective, the parameters of the algorithm should be chosen within the correct range. The results of applying the proposed algorithm have been compared with three reliable algorithms: standard Q-learning, speedy Q-learning, and noise injection. These comparisons were conducted in a 9x9 maze scenario and in the cart-pole environment.
لیست مقالات
لیست مقالات بایگانی شده
Depth Estimation in Monocular Images of Inside the Sewer Pipes for Navigation of Inspection Robots
Zeinab Maroufi - Alireza Hadi Hosseinabadi - Reza Askari moghadam
Controllable UWB THz Absorber Using a New Single-layer Graphene-based Grating
Mahdieh Bozorgi - Mahmood Rafaei Booket - Mohammad Amin Zolghadr
تشخیص و مکان یابی خطاها در آرایه های فتوولتائیک متصل به شبکه
سعید انصاری - حیدر صامت - تیمور قنبری
Design and Modeling of Graphene Based Electro-absorption Modulator Integrated with Hybrid Plasmonic Waveguides
Hadi Soofi - Shima Karkon Bagheri - Hamid Vahed
A New Approach to Solve MDVRP in Lower Computation Time
Reza Rahimi Baghbadorani - Mohammad Amin Zajkani - Mohammad Haeri
Kernel-Based Band Selection for Hyperspectral Image Classification
Mehdi Kamandar
Optimal Design of a Synchronous Reluctance Motor Using BioGeography-Based Optimization
Tohid Sharifi - Mojtaba Mirsalim
مقایسهگر پویا با قابلیت کار در شرایط زیر آستانه بر اساس منطق Pseudo-NMOS
سید سعید حسینی دولت آبادی - محسن جلالی
A Transformer less Quadratic Boost DC-DC Converter with Continuous Input Current and a Few Number of Components, Based on Classical Boost and Cuk Converter Suitable for Renewable Applications
Saeed Mahdizadeh - Reza Sharifi Shahrivar - Hossein Gholizadeh - Ebrahim Afjei
A reinforcement learning-based control approach for tracking problem of a class of nonlinear systems: Applied to a Single-Link Manipulator
Farshad Rahimi - Sepideh Ziaei - Reza Mahboobi Esfanjani
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2