0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
نویسندگان :
Alireza Nezamzadeh
1
Hamed Jalaly Bidgoly
2
Marzieh Kamali
3
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
کلمات کلیدی :
Multi-Agent system،Reinforcement Learning،Cooperative Control،Coverage Environment
چکیده :
This paper presents the coverage environment of two agents using reinforcement learning. When we talk about reinforcement learning, two popular algorithms, Q-learning and Sarsa, come up. One of the differences between these algorithms refers to the on-policy and off-policy methods for updating the rules of the algorithms. In this paper, we consider two environments in order to investigate cooperative coverage path planning by using these algorithms. Our goal is coverage environment with the cooperation of agents based on reducing the energy of the agents. At first, we consider the environment without an obstacle, then is considered an obstacle for that. The proposed algorithm expresses sharing experience between agents when they want to cover the environment with an obstacle and compares that in convergence speed with when there was no sharing experience between agents. The performance of the proposed algorithm is evaluated by various simulations.
لیست مقالات
لیست مقالات بایگانی شده
Investigating Validity and Reliability of The Features Extracted by a 5R Vertical Robot for Arm Motion and Learning Assessment
Sarvenaz Bourbour - Fariba Bahrami Boodelalou - Ghorban Taghizadeh
An Enhanced SLAM Method Using ICP Algorithm for Autonomous Mobile Robots Navigation
Hasan Enami Eraghi - Mohammad Reza Taban - Sayed Farzad Bahreinian - Mohammad Reza Jabbari
Combination of Classifiers to Detecting Grade of Gliblastoma using MRS
Roqaie Moqadam - Nazila Loghmani - Meysam Siyahmansoori - Armin Allahverdy
A Novel Estimation Law for Impedance-Controlled Bilateral Teleoperation to Enhance Human-Environment Interaction
Mobina Kameli - Mohammad Motaharifar - Negin Sayyaf
Proposing an indirect distributed approach to apply SSSEP vibrational stimulation
SAHAR SADEGHI - Ali Maleki
Design and Implementation of CAN Bus Monitoring Module for Lithium Battery Management System
Shakila Kazempourdizaji - Amir Mohammad Moazami Goudarzi - Majid Shalchian
بررسی تاثیر اعمال پوشش مش متال در مقاومت حرارتی و خوردگی سیم فولادی استحکام بالا بعنوان مغزی هادی های پرظرفیت ACSS
فائزه راد - مهرنوش طاهرخانی - ناصر میرشاه ولایتی - عبداله جواهری
طراحی و بررسی یک اینورتر چند سطحی جدید با کاهش تعداد ادوات قدرت به کار گرفته شده
حسین جعفری - داریوش نظرپور - سجاد گلشن نواز - ابراهیم بابائی
A Single-Fed Circularly-Polarized Elliptical Slot Antenna for S-Band applications
Sina Rezaee - Mahdi Janforooz - Behnam Rasoulpour
الگوریتم تشخیصی برای طبقه بندی سرطان خون لوسمی لنفوسیتی حاد با استفاده از شبکه های عصبی عمیق در یادگیری آنلاین
رضا گودرزی - علی جلالی - امید هاشمی پورتفرشی
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.3