32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1), Hamed Jalaly Bidgoly (2), Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses environment coverage by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Q-learning updates its value estimates off-policy, whereas Sarsa updates them on-policy. We consider two environments to investigate cooperative coverage path planning with these algorithms. Our goal is to cover the environment through cooperation between the agents while reducing the energy the agents consume. We first consider an environment without an obstacle and then add an obstacle to it. The proposed algorithm shares experience between the agents when they cover the environment with an obstacle, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
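The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard tabular update rules. The following is a minimal sketch and not the authors' implementation: the grid-cell states, the action names, the reward of -1.0, and the values of `ALPHA` and `GAMMA` are all assumptions chosen for illustration.

```python
from collections import defaultdict

ALPHA = 0.5  # learning rate (assumed value, not from the paper)
GAMMA = 0.9  # discount factor (assumed value, not from the paper)
ACTIONS = ["up", "down", "left", "right"]  # hypothetical grid moves


def q_learning_update(q, state, action, reward, next_state, actions):
    """Off-policy: bootstrap from the best action value in next_state."""
    target = reward + GAMMA * max(q[(next_state, a)] for a in actions)
    q[(state, action)] += ALPHA * (target - q[(state, action)])


def sarsa_update(q, state, action, reward, next_state, next_action):
    """On-policy: bootstrap from the action the agent actually takes next."""
    target = reward + GAMMA * q[(next_state, next_action)]
    q[(state, action)] += ALPHA * (target - q[(state, action)])


def make_q():
    """Build a small Q-table with two made-up values for cell (0, 1)."""
    q = defaultdict(float)  # unseen (state, action) pairs default to 0.0
    q[((0, 1), "right")] = 1.0
    q[((0, 1), "down")] = 0.2
    return q


# Same transition for both rules: move right from (0, 0) to (0, 1), reward -1.
q_off = make_q()
q_learning_update(q_off, (0, 0), "right", -1.0, (0, 1), ACTIONS)

# Sarsa additionally needs the next action; here the agent explores "down".
q_on = make_q()
sarsa_update(q_on, (0, 0), "right", -1.0, (0, 1), "down")
```

With these made-up numbers, Q-learning bootstraps from the greedy value 1.0 in the next cell, while Sarsa bootstraps from the value 0.2 of the exploratory action that was actually chosen. The two rules therefore produce different estimates for the same transition.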