32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper studies environment coverage by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Sarsa updates its value estimates on-policy, while Q-learning updates them off-policy. We consider two environments to investigate cooperative coverage path planning with these algorithms. Our goal is to cover the environment through cooperation between the agents while reducing the energy they consume. We first consider an environment without an obstacle and then add an obstacle to it. The proposed algorithm lets the agents share experience when covering the environment with an obstacle, and we compare its convergence speed with the case in which no experience is shared. The performance of the proposed algorithm is evaluated through various simulations.
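To make the on-policy versus off-policy distinction concrete, the following is a minimal sketch of the two textbook tabular update rules. It is not the authors' implementation: the grid cells, the action set, the step reward of -1 (used here as a stand-in for an energy cost), and the values of `alpha` and `gamma` are all illustrative assumptions.

```python
from collections import defaultdict

# Hypothetical action set for a grid world (an assumption, not from the paper).
ACTIONS = ("up", "down", "left", "right")


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the greedy action in s_next."""
    best_next = max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually taken in s_next."""
    Q[(s, a)] += alpha * (r + gamma * Q[(s_next, a_next)] - Q[(s, a)])


def demo():
    """Apply one update of each kind to identical tables and transitions."""
    Q_ql = defaultdict(float)  # unseen (state, action) pairs default to 0.0
    Q_sa = defaultdict(float)

    # One illustrative transition: move right from cell (0, 0) to (0, 1),
    # pay an assumed step cost of -1, then choose "up" as the next action.
    s, a, r, s_next, a_next = (0, 0), "right", -1.0, (0, 1), "up"

    # In the next state, "right" looks better than the chosen action "up".
    for Q in (Q_ql, Q_sa):
        Q[(s_next, "up")] = 0.0
        Q[(s_next, "right")] = 2.0

    q_learning_update(Q_ql, s, a, r, s_next)
    sarsa_update(Q_sa, s, a, r, s_next, a_next)
    return Q_ql[(s, a)], Q_sa[(s, a)]
```

With the same transition and the same starting table, Q-learning moves the estimate toward the value of the best next action, while Sarsa moves it toward the value of the action that was actually selected, so the two estimates diverge whenever the behaviour policy explores.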