32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses environment coverage by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa. One difference between them lies in their update rules: Sarsa is an on-policy method, whereas Q-learning is off-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. Our goal is for the agents to cover the environment cooperatively while reducing their energy consumption. We first consider an environment without an obstacle and then add an obstacle to it. The proposed algorithm lets the agents share experience when covering the environment with an obstacle, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
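To make the on-policy versus off-policy distinction concrete, the following is a minimal sketch of the standard tabular update rules for Q-learning and Sarsa. It is not the paper's implementation: the function names, the toy Q-table, and the values of the learning rate `alpha` and discount factor `gamma` are assumptions chosen only for illustration.

```python
# Illustrative sketch only; names, table values, alpha and gamma are assumed.

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action value in s_next."""
    td_target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (td_target - Q[s][a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action a_next actually chosen."""
    td_target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (td_target - Q[s][a])


# Toy Q-table with 2 states and 2 actions (hypothetical values).
Q_q = [[0.0, 0.0], [1.0, 5.0]]
Q_s = [row[:] for row in Q_q]

# Same transition for both: state 0, action 0, reward -1, next state 1.
q_learning_update(Q_q, s=0, a=0, r=-1.0, s_next=1)
# Sarsa additionally needs the next action; here an exploratory one (action 0).
sarsa_update(Q_s, s=0, a=0, r=-1.0, s_next=1, a_next=0)

print(Q_q[0][0], Q_s[0][0])
```

With identical experience, the two tables diverge whenever the next action is not the greedy one, because Q-learning always bootstraps from the maximum value while Sarsa uses the value of the action the agent actually takes next.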