0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
نویسندگان :
Alireza Nezamzadeh
1
Hamed Jalaly Bidgoly
2
Marzieh Kamali
3
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
کلمات کلیدی :
Multi-Agent system،Reinforcement Learning،Cooperative Control،Coverage Environment
چکیده :
This paper presents the coverage environment of two agents using reinforcement learning. When we talk about reinforcement learning, two popular algorithms, Q-learning and Sarsa, come up. One of the differences between these algorithms refers to the on-policy and off-policy methods for updating the rules of the algorithms. In this paper, we consider two environments in order to investigate cooperative coverage path planning by using these algorithms. Our goal is coverage environment with the cooperation of agents based on reducing the energy of the agents. At first, we consider the environment without an obstacle, then is considered an obstacle for that. The proposed algorithm expresses sharing experience between agents when they want to cover the environment with an obstacle and compares that in convergence speed with when there was no sharing experience between agents. The performance of the proposed algorithm is evaluated by various simulations.
لیست مقالات
لیست مقالات بایگانی شده
خلاصه سازی ویدیوهای کپسول آندوسکوپی با رویکرد یادگیری انتقالی
محدثه امیریان چایجان - رضا آقائی زاده ظروفی - مسعود رضا سهرابی
طراحی و مدلسازی امولاتور دریچه گاز الکترونیکی برای کاربرد در خودرو
محمدرضا درزی - مجید شالچیان
{High performance detector for massive MIMO systems using an adaptive filering approach
Masoud Tahmasbi Fard - Mojtaba Amiri - Ali Olfat
Analysis and Simulation of the Formation and dimensions of Gate-Defined Double Quantum Dots
Mahya Mostafavi - Majid Shalchian
Melanoma Detection Using Multi-Color LBP-FPl and Optimized VGG16
Vida Esmaeili - Mahmood Mohassel Feghhi
Design and Simulation of a Flight Control System for a Quadcopter using Fuzzy-PID Controller
Seyedeh Mahsa Zakipour Bahambari - Mojtaba Mohsen Haghighi - Saeed Khankalantary
Techno-Economic Dispatch of Distributed Energy Resources for Optimal Grid-Connected Operation of a Microgrid
Selma Cheshmeh khavar - Arya Abdolahi
A novel clustering-based over-sampling technique for imbalanced data sets
Behzad Mirzaei - Hossein Nezamabadi-pour - Javad Mahmoodi
کنترل پیش بین مقاوم توزیع شده برای سیستم های خطی چند عامله
علی سلمانپور - حامد کبریائی
An Ensemble Model for Sleep Stages Classification
Sahar Hassanzadeh Mostafaei - Jafar Tanha - Amir Sharafkhaneh - Zohair Hassanzadeh Mostafaei - Mohammed Hussein Ali Al-jaf - Alireza Fakhim babaei
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.3.1