0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Semi-supervised Deep Reinforcement Learning in Decentralized Multi-Agent Collision Avoidance and Path Planning in a Complex Environment
نویسندگان :
Marzie Parooei
1
Mehdi Tale Masouleh
2
Ahmad Kalhor
3
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
کلمات کلیدی :
Decentralized،Multi Agent،Collision Avoidance،Deep Reinforcement Learning
چکیده :
The problem of path planning and collision avoidance in complex and natural environments is one of the basic requirements of the robotic world, enabling robots to enter social environments. This paper aims to provide a decentralized path planning and collision avoidance method in multi-agent environments. In this method, each agent is a decision-making unit that decides independently from other agents and based on what is in its field of view. In the present paper, classical methods have been used to generate data for training purposes. Models were trained offline by imitating classical methods then semi-supervised methods were used for feature extraction. The results obtained from this method were compared with the Optimal Reciprocal Collision Avoidance (ORCA) method in three environments with different densities and three different indices. The proposed method performed relatively optimally and successfully increased the interaction index while decreasing the computation time. On the other hand, due to the scalable potential of this method, the number of agents could be increased without affecting the computation time.
لیست مقالات
لیست مقالات بایگانی شده
Integration of P2G and Renewables in Stochastic Day-ahead Electricity-Gas Scheduling
Mojtaba Choghaei - Mohammad Kazem Sheikh-El-Eslami
Vibration Analysis of a High-Speed Switched Reluctance Motor Considering Fast Demagnetization Voltage
Nasrin Majlesi - Amir Rashidi - Morteza Saghaian Nejad
Cooperative Coverage Path Planning Using Q-Learning and Sarsa in Two Environments
Alireza Nezamzadeh - Hamed Jalaly Bidgoly - Marzieh Kamali
Design of a Retinal Prosthesis Circuit With In-pixel Edge Detection Capability
Zahra Bonesbordi - Sayed Masoud Sayedi
اثر پایلوتهای متعامد بر تخمین کانال مایمو انبوه تقسیم فرکانسی مبتنی بر رگرسیون خطی
سید طالب ساداتی لمردی - کمال محامدپور
تعیین محل خطا با استفاده از اطلاعات حاصل شده از خطا در حضور جبرانساز سری خازنی کنترل تریستوری (TCSC) به روش آفلاین.
حامد حیدری - سعید غنیمتی
Robust H∞ Control Design for Variable-Speed Wind Turbines Using Bilinear Matrix Inequalities
Hamidreza Javanmardi - Alireza Hamedi - Mahya Rahimzadeh
مدل سازی و شبیه سازی جداکننده پرتو کوانتومی و تداخل گر ماخ زندر کوانتومی
محمد جواد شریفی
Design of Dual-Band Triangular Microstrip Antenna Using Fractal Structure for Wi-Max and Wi-Fi Applications
Arian Mianji - Mohammad Bemani - Saeid Nikmehr - Ahmad Atashpaz Gargari
Optimization of 915nm laser diode asymmetric structure: experimental and theoretical studies in tandem
Seyed peyman Abbasi - Maryam Lajvardi - Arash Hodaei
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 41.7.4