0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Semi-supervised Deep Reinforcement Learning in Decentralized Multi-Agent Collision Avoidance and Path Planning in a Complex Environment
نویسندگان :
Marzie Parooei
1
Mehdi Tale Masouleh
2
Ahmad Kalhor
3
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
کلمات کلیدی :
Decentralized،Multi Agent،Collision Avoidance،Deep Reinforcement Learning
چکیده :
The problem of path planning and collision avoidance in complex and natural environments is one of the basic requirements of the robotic world, enabling robots to enter social environments. This paper aims to provide a decentralized path planning and collision avoidance method in multi-agent environments. In this method, each agent is a decision-making unit that decides independently from other agents and based on what is in its field of view. In the present paper, classical methods have been used to generate data for training purposes. Models were trained offline by imitating classical methods then semi-supervised methods were used for feature extraction. The results obtained from this method were compared with the Optimal Reciprocal Collision Avoidance (ORCA) method in three environments with different densities and three different indices. The proposed method performed relatively optimally and successfully increased the interaction index while decreasing the computation time. On the other hand, due to the scalable potential of this method, the number of agents could be increased without affecting the computation time.
لیست مقالات
لیست مقالات بایگانی شده
A Dual-Band LPDA Antenna Based on MXene for High-Band 5G Application
Javad Shokri seyyedi - Reza Sarraf Shirazi - Gholamreza Moradi
بررسی عملکرد الگوریتم یادگیری تقلیدی در آموزش شبکه عصبی کاملا متصل برای حل مسئله مسیریابی در محیطهای چندعامله
محمد روغنی - سمانه حسینی سمنانی
40Hz Auditory Entrainment Promotes Synchronization Between Frontal and Parietal Regions of the Brain
Mojtaba Lahijanian - Hamid Aghajan
An Autonomous Multi Agent Q-Learning Approach for Resource Allocation in D2D-Enabled Heterogeneous Networks
Pouya Akhoundzadeh - Ghasem Mirjalily - Mohammad taghi Saadeghi
بررسی توان و افزایش بازدهی در فرستنده سوئیچینگ لورن
عادل رضائیان - احمد عفیفی - جمشید ده پهلوانی
Entanglement Witness Derived By Using Kolmogorov-Arnold Networks
Fatemeh Lajevardi - Azam Mani - Ali Fahim
Joint Request Aggregation and Content Caching at the Edge via Named Data Networking
Parisa Bakhtou - Siavash Khorsandi
Temporal Green's function of an RLC resonator with arbitrary time-varying capacitance using differential transition matrix
Somayeh Boshgazi - Khashayar Mehrany - Mohammad Memarian
Realization of a high-resolution plasmonic refractive index sensor based on double-nanodisk shaped resonators
Leila Hajshahvaladi - Hassan Kaatuzian - Mohammad Danaie - Ghazaleh Nourbakhsh
Modeling of Photo-thermoelectric Current Effects in Phase Change Material based Optical Nano Dipole Antenna Energy Transducer
Daniyal Khosh Maram - Seyed Asad Amirhosseini
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0