32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Q-learning updates its value estimates off-policy, whereas Sarsa updates them on-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. Our goal is for the agents to cover the environment cooperatively while reducing their energy consumption. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share experience while covering the environment with an obstacle, and we compare its convergence speed with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
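The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard tabular update rules. The following is a minimal sketch, not the authors' implementation: the table sizes, the learning rate `alpha`, the discount `gamma`, and the step reward of -1.0 (standing in for an energy cost) are all assumptions chosen for illustration.

```python
import numpy as np


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action available in s_next."""
    target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually chosen in s_next."""
    target = r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (target - Q[s, a])


# Hypothetical toy table with 2 states and 2 actions.
Q_q = np.zeros((2, 2))
Q_q[1] = [1.0, 3.0]   # assumed current estimates for state 1
Q_s = Q_q.copy()

# Same transition for both learners: state 0, action 0, reward -1.0, next state 1.
q_learning_update(Q_q, s=0, a=0, r=-1.0, s_next=1)
# Sarsa additionally needs the next action; here the agent is assumed to
# explore and pick the non-greedy action 0 in state 1.
sarsa_update(Q_s, s=0, a=0, r=-1.0, s_next=1, a_next=0)

print(Q_q[0, 0], Q_s[0, 0])
```

For the same transition, Q-learning moves its estimate toward the value of the greedy next action, while Sarsa moves toward the value of the action the agent actually took, so the two tables diverge whenever the behaviour policy explores.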