32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1), Hamed Jalaly Bidgoly (2), Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa. One difference between them lies in how they update their value estimates: Q-learning is an off-policy method, whereas Sarsa is an on-policy method. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. Our goal is to cover the environment through cooperation between the agents while reducing the energy the agents consume. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share experience while covering the environment with an obstacle, and we compare its convergence speed with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
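The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard tabular update rules. The following is a minimal sketch and not the authors' implementation: the action set, the grid-cell states, the reward of -1.0 per move, and the values of `alpha` and `gamma` are all assumptions chosen for illustration.

```python
from collections import defaultdict

# Hypothetical action set for a grid world (an assumption, not from the paper).
ACTIONS = ("up", "down", "left", "right")


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action in the next state,
    regardless of which action the agent actually takes next."""
    best_next = max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action a_next that the agent's
    current behaviour policy actually selected in the next state."""
    q[(s, a)] += alpha * (r + gamma * q[(s_next, a_next)] - q[(s, a)])


# Two separate tables so the two rules can be compared on the same transition.
q_off = defaultdict(float)
q_on = defaultdict(float)
for table in (q_off, q_on):
    table[((0, 1), "up")] = 1.0  # pretend "up" already looks good in cell (0, 1)

# Same transition: move right from (0, 0) to (0, 1) with an assumed cost of -1.0.
q_learning_update(q_off, (0, 0), "right", -1.0, (0, 1))
# Sarsa additionally needs the next action; here the agent explores with "down".
sarsa_update(q_on, (0, 0), "right", -1.0, (0, 1), "down")
```

On this transition the two rules disagree because the exploratory next action is not the greedy one: Q-learning bootstraps from the value of "up", while Sarsa bootstraps from the value of "down". One simple way to model experience sharing between two agents, again only as an assumption, is to pass the same table object to the updates of both agents.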