32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1)
Hamed Jalaly Bidgoly (2)
Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two widely used reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is that Q-learning updates its value estimates off-policy, whereas Sarsa updates them on-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. The goal is for the agents to cover the environment cooperatively while reducing the energy they consume. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share their experience while covering the environment with an obstacle, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
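The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard tabular update rules. The sketch below is not the authors' implementation: the action set, the dictionary-based Q-table, the grid-cell states, and the values of `alpha`, `gamma`, and the reward are all assumptions chosen for illustration.

```python
import random

# Hypothetical action set for a grid-world coverage task (an assumption).
ACTIONS = ["up", "down", "left", "right"]


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise a greedy one."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action available in s_next,
    regardless of which action the agent takes next."""
    best_next = max(q.get((s_next, b), 0.0) for b in ACTIONS)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * best_next - old)


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from a_next, the action the agent's own
    behaviour policy actually selected in s_next."""
    old = q.get((s, a), 0.0)
    next_value = q.get((s_next, a_next), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * next_value - old)
```

The only difference between the two functions is the bootstrap term: Q-learning takes the maximum over next actions, while Sarsa uses the value of the action actually chosen. Passing the same `q` dictionary to both agents would be one simple way to model experience sharing, although the abstract does not specify the sharing mechanism used in the paper.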