32nd International Conference on Electrical Engineering
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Authors:
Alireza Nezamzadeh (1), Hamed Jalaly Bidgoly (2), Marzieh Kamali (3)
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
Keywords:
Multi-Agent System, Reinforcement Learning, Cooperative Control, Coverage Environment
Abstract:
This paper addresses the coverage of an environment by two agents using reinforcement learning. Two popular reinforcement learning algorithms are Q-learning and Sarsa; one difference between them is how they update their value estimates: Q-learning is off-policy, whereas Sarsa is on-policy. We consider two environments in order to investigate cooperative coverage path planning with these algorithms. Our goal is to cover the environment through cooperation between the agents while reducing the energy used by the agents. We first consider an environment without an obstacle and then add an obstacle to it. In the proposed algorithm, the agents share experience when covering the environment with an obstacle, and its convergence speed is compared with the case in which the agents do not share experience. The performance of the proposed algorithm is evaluated through various simulations.
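The on-policy versus off-policy distinction mentioned in the abstract can be illustrated with the standard textbook update rules. The sketch below is not taken from the paper: the tabular Q representation, the state and action labels, and the values of the learning rate `ALPHA` and discount factor `GAMMA` are all assumptions chosen for illustration.

```python
from collections import defaultdict

ALPHA = 0.1  # learning rate (assumed value, not from the paper)
GAMMA = 0.9  # discount factor (assumed value, not from the paper)


def q_learning_update(Q, s, a, r, s_next, actions, alpha=ALPHA, gamma=GAMMA):
    """Off-policy update: bootstrap from the best action in the next state,
    regardless of which action the agent will actually take there."""
    best_next = max(Q[(s_next, a2)] for a2 in actions)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=ALPHA, gamma=GAMMA):
    """On-policy update: bootstrap from the action a_next that the agent's
    current behaviour policy actually selected in the next state."""
    Q[(s, a)] += alpha * (r + gamma * Q[(s_next, a_next)] - Q[(s, a)])


# Hypothetical two-action example: the same transition, updated both ways.
actions = ["left", "right"]

Q_ql = defaultdict(float)
Q_ql[("s1", "left")] = 1.0
Q_ql[("s1", "right")] = 2.0
q_learning_update(Q_ql, "s0", "right", -1.0, "s1", actions)

Q_sarsa = defaultdict(float)
Q_sarsa[("s1", "left")] = 1.0
Q_sarsa[("s1", "right")] = 2.0
# Suppose the behaviour policy explored and picked "left" in s1.
sarsa_update(Q_sarsa, "s0", "right", -1.0, "s1", "left")
```

For the same transition the two rules produce different estimates whenever the action actually taken in the next state is not the greedy one, which is the case during exploration.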