32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the cooperative coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Q-learning updates its value estimates off-policy, whereas Sarsa updates them on-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. The goal is for the agents to cover the environment cooperatively while reducing the energy they consume. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share experience while covering the environment with an obstacle, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
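The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard tabular update rules. The following is a minimal sketch, not the authors' implementation: the grid states, the action names, the reward of -1.0, and the values of `ALPHA` and `GAMMA` are all assumptions chosen for illustration only.

```python
from collections import defaultdict

ALPHA = 0.5  # learning rate (illustrative value, not from the paper)
GAMMA = 0.9  # discount factor (illustrative value, not from the paper)


def q_learning_update(q, state, action, reward, next_state, actions):
    """Off-policy update: bootstrap from the best-valued action in next_state."""
    best_next = max(q[(next_state, a)] for a in actions)
    td_target = reward + GAMMA * best_next
    q[(state, action)] += ALPHA * (td_target - q[(state, action)])


def sarsa_update(q, state, action, reward, next_state, next_action):
    """On-policy update: bootstrap from the action the agent actually takes next."""
    td_target = reward + GAMMA * q[(next_state, next_action)]
    q[(state, action)] += ALPHA * (td_target - q[(state, action)])


# Hypothetical grid-world actions and two identical starting Q-tables.
actions = ["up", "down", "left", "right"]
q_offpolicy = defaultdict(float)
q_onpolicy = defaultdict(float)
for table in (q_offpolicy, q_onpolicy):
    table[((0, 1), "right")] = 1.0  # the greedy action in cell (0, 1)
    table[((0, 1), "down")] = 0.2   # an exploratory action in cell (0, 1)

# Same transition for both learners: move right from (0, 0) to (0, 1), reward -1.
q_learning_update(q_offpolicy, (0, 0), "right", -1.0, (0, 1), actions)
# Sarsa assumes the agent then explores with "down" instead of the greedy action.
sarsa_update(q_onpolicy, (0, 0), "right", -1.0, (0, 1), "down")

print(q_offpolicy[((0, 0), "right")], q_onpolicy[((0, 0), "right")])
```

With identical starting tables, the two learners end up with different values for the same transition, because Q-learning assumes the greedy action follows while Sarsa uses the exploratory action that was actually chosen.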