0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
نویسندگان :
Alireza Nezamzadeh
1
Hamed Jalaly Bidgoly
2
Marzieh Kamali
3
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
کلمات کلیدی :
Multi-Agent system،Reinforcement Learning،Cooperative Control،Coverage Environment
چکیده :
This paper presents the coverage environment of two agents using reinforcement learning. When we talk about reinforcement learning, two popular algorithms, Q-learning and Sarsa, come up. One of the differences between these algorithms refers to the on-policy and off-policy methods for updating the rules of the algorithms. In this paper, we consider two environments in order to investigate cooperative coverage path planning by using these algorithms. Our goal is coverage environment with the cooperation of agents based on reducing the energy of the agents. At first, we consider the environment without an obstacle, then is considered an obstacle for that. The proposed algorithm expresses sharing experience between agents when they want to cover the environment with an obstacle and compares that in convergence speed with when there was no sharing experience between agents. The performance of the proposed algorithm is evaluated by various simulations.
لیست مقالات
لیست مقالات بایگانی شده
ارزیابی کیفیت و موفقیت های پیوند کلیه
علی رفیعی پور - بهزاد خلجی امامزاده عباسی - زینب زالی - مسعودرضا هاشمی
A New High gain Transformerless DC-DC Converter with Low Voltage Stress on Power Switches
Amirreza Bahadori - Ali Nadermohammadi - Mohammad Maalandish - Seyed Hossein Hosseini - Mehran Sabahi
Stability Analysis of Distributed-Order Systems: a Lyapunov Scheme
Vahid Badri
An Integrated Technical Analysis and Machine Learning Trading Model for Noisy and Volatile Financial Markets
Arvin Esfandiari - Ali Doustmohammadi
Estimation of the Arc Model Parameters Using Heuristic Optimization Methods
Sadegh Ghavami - Ali A Razi-kazemi
Hardware Implementation of a Chaos Based Image Encryption Using High-Level Synthesis
Saeed Sharifian.m.m - Vahid Rashtchi - Ali Azarpeyvand
A novel CMRR Enhancement technique in fully-differential Class-AB OTAs
Amirhossein Sabour - Mahsa Ramezan Pour - Mohammad Yavari
Learning-Based Routing Policy For Wireless Sensor Networks
Najim Halloum - Yousef Darmani - Ali Ahmadi
Dominant Control Set Selection in Clustered Complex Brain Network
Sana Motallebi - Mohammad Javad Yazdanpanah - Abdol-Hossein Vahabie
Formation Control of Bicycle Model of Mobile Robots with Disturbance Using PID Controller
Amirhossein Rahmankhanloo - Saeed Khankalantary - Ali Akbar Vahedi
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2