0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
نویسندگان :
Alireza Nezamzadeh
1
Hamed Jalaly Bidgoly
2
Marzieh Kamali
3
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
کلمات کلیدی :
Multi-Agent system،Reinforcement Learning،Cooperative Control،Coverage Environment
چکیده :
This paper presents the coverage environment of two agents using reinforcement learning. When we talk about reinforcement learning, two popular algorithms, Q-learning and Sarsa, come up. One of the differences between these algorithms refers to the on-policy and off-policy methods for updating the rules of the algorithms. In this paper, we consider two environments in order to investigate cooperative coverage path planning by using these algorithms. Our goal is coverage environment with the cooperation of agents based on reducing the energy of the agents. At first, we consider the environment without an obstacle, then is considered an obstacle for that. The proposed algorithm expresses sharing experience between agents when they want to cover the environment with an obstacle and compares that in convergence speed with when there was no sharing experience between agents. The performance of the proposed algorithm is evaluated by various simulations.
لیست مقالات
لیست مقالات بایگانی شده
CatBoost Classifier For DDoS Detection In SDN Using Ryu Controller
Yazdan etdali Mohamadreza Noorifard
Fixed-time consensus of unknown nonlinear multi-agent systems
Mohammad Hadi Rezaei - Ali Abooee
Supercapacitor Active Balancing and Control Circuit for Harvesting Energy from Vehicle’s Tire
Mostafa Noohi - Ali Mirvakili
Mountain Gazelle Optimized PID Controller for a MIMO System with External Disturbance
Siavash Shirali - Hamoun Maleki - Hadi Delavari
BLSTM-Convolutional Neural Networks for Respiratory Disease Diagnosis
Mohammad Hassan Khamechian - Mohammad Reza Akbarzadeh Tootoonchi
Modeling of a low-noise amplifier with a recurrent neural network
Mostafa Noohi - Fatemeh Charoosaei - Ali Mirvakili - Sayed Alireza Sadrossadat
Wide-band Cloaking of Finite Length PEC Cylindrical Objects under Oblique Incidence using Multi-Layer Mantle Cloak
Alireza Moosaei - Mohammad Hasan Neshati
ZnO-based Acoustofluidics: Droplet-based Particle Manipulation
Sara Abbasi - Behdad Barahimi - Sara Darbari - Mohammad Kazem Moravvej-Farshi - Mohammad Zabetian
A New Low Noise 4-Gb/s Serial CMOS MPPM Modulator
Erfan Alasvand Andekah - Noushin Ghaderi - Mostafa Pour Sayahi
Reliability Evaluation of Distribution System Considering a Modified Electric Bus as a Mobile Energy Storage (Tehran E-Bus as a Case study)
Ali Kamali - Amir Soleimani - Seyed Vahid Nourbakhsh - Hassan Nehzati - Vahid Esfahanian - Mahmoud Oukati Sadegh
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0