32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa. One difference between them lies in their update rules: Sarsa is an on-policy method, whereas Q-learning is off-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. The goal is for the agents to cover the environment cooperatively while reducing their energy consumption. We first consider an environment without an obstacle and then add an obstacle to it. In the environment with an obstacle, the proposed algorithm lets the agents share their experience, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
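The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard textbook update rules. The sketch below is not the paper's implementation: the table layout `Q[state][action]`, the function names, and the default values of the learning rate `alpha` and discount factor `gamma` are all assumptions made for illustration.

```python
from typing import List

# Hypothetical tabular representation: Q[state][action] -> estimated return.
QTable = List[List[float]]


def q_learning_update(Q: QTable, s: int, a: int, r: float, s_next: int,
                      alpha: float = 0.1, gamma: float = 0.9) -> None:
    """Off-policy update: bootstrap from the best action in the next state,
    regardless of which action the agent will actually take there."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_update(Q: QTable, s: int, a: int, r: float, s_next: int, a_next: int,
                 alpha: float = 0.1, gamma: float = 0.9) -> None:
    """On-policy update: bootstrap from the action a_next that the agent's
    current behaviour policy actually selected in the next state."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])
```

The only difference between the two functions is the bootstrap term: `max(Q[s_next])` for Q-learning versus `Q[s_next][a_next]` for Sarsa. When the behaviour policy explores (for example, epsilon-greedy action selection), the two targets generally differ, which is one reason the algorithms can converge at different speeds.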