0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
نویسندگان :
Alireza Nezamzadeh
1
Hamed Jalaly Bidgoly
2
Marzieh Kamali
3
1- Isfahan University of Technology
2- Isfahan University of Technology
3- Isfahan University of Technology
کلمات کلیدی :
Multi-Agent system،Reinforcement Learning،Cooperative Control،Coverage Environment
چکیده :
This paper presents the coverage environment of two agents using reinforcement learning. When we talk about reinforcement learning, two popular algorithms, Q-learning and Sarsa, come up. One of the differences between these algorithms refers to the on-policy and off-policy methods for updating the rules of the algorithms. In this paper, we consider two environments in order to investigate cooperative coverage path planning by using these algorithms. Our goal is coverage environment with the cooperation of agents based on reducing the energy of the agents. At first, we consider the environment without an obstacle, then is considered an obstacle for that. The proposed algorithm expresses sharing experience between agents when they want to cover the environment with an obstacle and compares that in convergence speed with when there was no sharing experience between agents. The performance of the proposed algorithm is evaluated by various simulations.
لیست مقالات
لیست مقالات بایگانی شده
A novel approach for recommender systems based on query likelihood and sentiment analysis
Mohammadreza Soltaninezhad - Alireza Basiri
Unsupervised Change Detection in SAR Images Using a Six-Branch CNN and Adaptive Window Approach
Abbas Kakoolvand - Maryam Imani - Hassan Ghassemian
Kernel-Based Embedded Feature Selection for Motor Imagery Based BCI
Mehdi Kamandar
Noniterative Solution of Inverse Scattering Problems Using A Priori Information
Leila Ahmadi - Amir Ahmad Shishegar
Power Transformer Vibration Study and its Application in Winding Deformation Detection
Amir Esmaeili Nezhad - Mohammad Hamed Samimi
Improving Artificial Neural Network Performance Using Hybrid Activation Function
Morteza Taheri - Sajad Haghzad Klidbary
An Improved Hybrid Recommender System: Integrating Document Context-Based and Behavior-Based Methods
Meysam Varasteh - Mehdi Soleiman Nejad - Hadi Moradi - Mohammad Amin Sadeghi - Ahmad Kalhor
بهینه سازی استفاده از منابع شبکه های نوری با گرومینگ ترافیک در لایهی MPLS
محمدعلی سالک قادری - آرش رضایی - لطف اله بیگی
Electronic properties of 2D perovskites NMA2PbBr4 and NEA2PbBr4 for PeLED applications: first principle approach
Samad Shokouhi - Seyedeh bita Saadatmand - Vahid Ahmadi
Multinomial Emoji Prediction Using Deep Bidirectional Transformers and Topic Modeling
Zahra Ebrahimian - Ramin Toosi - Mohammad Ali Akhaee
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2