32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Q-learning updates its value estimates off-policy, whereas Sarsa updates them on-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. Our goal is to cover the environment through cooperation between the agents while reducing the energy they consume. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share experience while covering the environment with an obstacle, and we compare its convergence speed with the case in which no experience is shared. The performance of the proposed algorithm is evaluated through various simulations.
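To make the on-policy versus off-policy distinction concrete, the following is a minimal sketch of the two standard tabular update rules, not the authors' implementation. The grid-cell states, the four-action set, the reward of -0.1, and the values of `alpha` and `gamma` are all illustrative assumptions that do not come from the paper.

```python
from collections import defaultdict

# Hypothetical grid-world action set (an assumption, not from the paper).
ACTIONS = ("up", "down", "left", "right")


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy: bootstrap from the greedy (max-value) action in s_next."""
    best_next = max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy: bootstrap from the action a_next the agent actually takes."""
    Q[(s, a)] += alpha * (r + gamma * Q[(s_next, a_next)] - Q[(s, a)])


def demo():
    # Same transition and same starting values; only the update rule differs.
    q_table = defaultdict(float)
    sarsa_table = defaultdict(float)
    for table in (q_table, sarsa_table):
        table[((0, 1), "right")] = 1.0   # highest-valued action in the next cell
        table[((0, 1), "down")] = -1.0   # exploratory action actually taken next
    q_learning_update(q_table, (0, 0), "right", -0.1, (0, 1))
    sarsa_update(sarsa_table, (0, 0), "right", -0.1, (0, 1), "down")
    return q_table[((0, 0), "right")], sarsa_table[((0, 0), "right")]
```

In this toy transition, Q-learning bootstraps from the best next action while Sarsa bootstraps from the exploratory action that is actually taken, so the two rules move the same table entry in opposite directions.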