31st International Conference on Electrical Engineering
Semi-supervised Deep Reinforcement Learning in Decentralized Multi-Agent Collision Avoidance and Path Planning in a Complex Environment
Authors:
Marzie Parooei (1)
Mehdi Tale Masouleh (2)
Ahmad Kalhor (3)
1- University of Tehran
2- University of Tehran
3- University of Tehran
Keywords:
Decentralized, Multi-Agent, Collision Avoidance, Deep Reinforcement Learning
Abstract:
Path planning and collision avoidance in complex, natural environments are basic requirements for robots that are to operate in social environments. This paper presents a decentralized path planning and collision avoidance method for multi-agent environments. In this method, each agent is a decision-making unit that decides independently of the other agents, based on what is in its field of view. Classical methods were used to generate the training data: models were trained offline by imitating the classical methods, and semi-supervised methods were then used for feature extraction. The proposed method was compared with the Optimal Reciprocal Collision Avoidance (ORCA) method in three environments of different densities, using three different indices. It performed near-optimally, increasing the interaction index while decreasing the computation time. Moreover, because the method is scalable, the number of agents can be increased without affecting the computation time.
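The idea that each agent decides from what lies in its own field of view can be illustrated with a minimal sketch. This is not the authors' implementation: the `Agent` class, the cone-shaped field of view, and the `max_range` and `half_angle` parameters are assumptions chosen for illustration, and the abstract does not specify how the observation is encoded.

```python
import math
from dataclasses import dataclass


@dataclass
class Agent:
    x: float
    y: float
    heading: float  # radians; direction the agent is facing (assumed)


def visible_neighbors(agent, others, max_range=5.0, half_angle=math.pi / 2):
    """Return the agents inside a hypothetical viewing cone of `agent`."""
    seen = []
    for other in others:
        if other is agent:
            continue
        dx, dy = other.x - agent.x, other.y - agent.y
        if math.hypot(dx, dy) > max_range:
            continue
        # Bearing of the neighbour relative to the heading, wrapped to [-pi, pi].
        bearing = math.atan2(dy, dx) - agent.heading
        bearing = math.atan2(math.sin(bearing), math.cos(bearing))
        if abs(bearing) <= half_angle:
            seen.append(other)
    return seen


def local_observation(agent, others, **fov):
    """Relative positions of visible neighbours, nearest first."""
    rel = [(o.x - agent.x, o.y - agent.y)
           for o in visible_neighbors(agent, others, **fov)]
    return sorted(rel, key=lambda p: math.hypot(*p))
```

In a decentralized setting, each agent would build such a local observation on its own and pass it to its own policy, so the per-agent cost depends on the number of visible neighbours rather than on the total number of agents.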