32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Q-learning updates its value estimates off-policy, whereas Sarsa updates them on-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. Our goal is to cover the environment through cooperation between the agents while reducing the agents' energy consumption. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share experience when covering the environment with an obstacle, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
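The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard tabular update rules. The following is a minimal sketch, not the authors' implementation: the action set, the grid-cell state encoding, the reward value, and the parameters `alpha`, `gamma`, and `epsilon` are all assumptions chosen for illustration.

```python
import random
from collections import defaultdict

# Hypothetical action set for a grid-world coverage task (an assumption).
ACTIONS = ("up", "down", "left", "right")


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy: bootstrap from the best action in s_next,
    regardless of which action the agent takes next."""
    best_next = max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy: bootstrap from a_next, the action the behaviour
    policy actually selected in s_next."""
    q[(s, a)] += alpha * (r + gamma * q[(s_next, a_next)] - q[(s, a)])


if __name__ == "__main__":
    rng = random.Random(0)
    q = defaultdict(float)          # Q-table keyed by (state, action)
    s, s_next = (0, 0), (0, 1)      # hypothetical grid cells
    a = epsilon_greedy(q, s, 0.1, rng)
    a_next = epsilon_greedy(q, s_next, 0.1, rng)
    q_learning_update(q, s, a, -1.0, s_next)
    sarsa_update(q, s, a, -1.0, s_next, a_next)
    print(q[(s, a)])
```

The only difference between the two functions is the bootstrap term: Q-learning uses the maximum over next actions, while Sarsa uses the value of the action actually chosen. The abstract does not specify how experience is shared between agents; letting both agents read and write one common table `q` would be one possible reading, stated here only as an assumption.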
List of papers
Coverage Probability Analysis of User Association in NOMA-Based Full-Duplex Systems
Shaghayegh Asadollahi dehkordi - Mohammadali Mohammadi - Zahra Mobini - Sepideh Haghgoy
Robust Neuro-Adaptive Fuzzy Sliding Mode Control for a Remotely Operated Underwater Vehicle Manipulator
Mahdi Armoon - Marzie Lafouti - Babak Tavassoli - Hamid D. Taghirad
Defense Against Spectrum Sensing Data Falsification Attack in Cognitive Radio Networks Using Machine Learning
Nazanin Parhizgar - Ali Jamshidi - Peyman Setoodeh
Design and Implementation of a Variable-Time Finite State Machine for Computing the Inverse Tangent Trigonometric Function Based on a Backward Taylor Series, Using Two DSP48-E Multiplier Units on AMD-XILINX FPGA Chips
میثم هارونی - پیام سنائی
An active learning approach for classification of several arrhythmias in ECG signal
Nastaran Darbani - Danial Katoozian - Hossein Hosseini-Nejad
A Novel Analytical Tuning Method for Designing of Composite Nonlinear Feedback Control Law in Continuous-time Dynamical Systems
Ali Vazani - Valiollah Ghaffari
A New Method on Failure Detection of Fixed and Moving Contacts of Circuit Breakers
Hassan Hamidi - Ali Asghar Razi Kazemi
The Comparison of MXene and Graphene-Based Antennas for 5G/6G Communications
Javad Shokri Seyyedi - Gholamreza Moradi - Reza Sarraf Shirazi - Sepehr Sahab - Abolfazl Ebrahimpour
Synergizing ISAC and OTFS in a Non-GB-OMA Downlink Framework
Ghasem Saeidi - Hamid Saeedi-sourck
Performance analysis under the Independent Fluctuating Two-Ray (IFTR) Fading in RIS-Assisted Millimeter Wave Communications
Maryam Olyaee - Hadi Hashemi - Juan Manuel Romero Jerez