0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Batch(offline) Reinforcement Learning for recommender system
نویسندگان :
Mohammad Amir Rezaei Gazik
1
Mehdy Roayaei
2
1- دانشگاه تربیت مدرس
2- دانشگاه تربیت مدرس تهران
کلمات کلیدی :
offline reinforcement learning،recommender systems،big data
چکیده :
The explosive spread of the Internet in recent years has increased the types and amounts of big data, making it difficult for users to search for the data they need. With the continued growth of business on the Internet, e-learning, increased communication and sharing among users, and the advent of social networking, there is an undeniable need to design and implement systems that make it easier for people to search. A recommender system provides the ability to provide the most appropriate and accurate suggestions to users by checking user-related information from relevant datasets. In other words, it extracts user preferences and interests from data and makes suggestions. In this paper, we study the problem of learning recommendation systems with large datasets. We propose an offline RL framework for recommender systems to achieve high accuracy and perform recommendations quickly. Specifically, we propose a framework called Ofrec that first transforms the problem into a Markov Decision Process (MDP) and provides highly accurate and time-saving recommendations. We conduct extensive experiments on a large dataset of CIKM 2019 EComm AI and show that the proposed approach outperforms supervised learning and reinforcement learning algorithms.
لیست مقالات
لیست مقالات بایگانی شده
Two Mixed Logical Dynamical Real-Time Receding Horizon Control Schemes for Microgrids Operation Optimization
Seyed Shahab Kheradmand - Reyhaneh Haghpanah - Malihe Maghfouri Farsangi - Mojtaba Barkhordary
تشخیص ناهنجاری گفتاری با استفاده از مدلسازی جاذبهای صوتی در فضای بازسازی شده فاز
عاطفه کردکاری خسروشاهی - یاسر شکفته
High Step up DC/DC Converter with Low Input Current Ripple and Low Voltage Stress on Semiconductors
Saed Mahmoud Alilou - Mohammad Maalandish - Soheil Nouri - Seyed Hossein Hosseini
Robust Object Detection Against Adversarial Perturbations with Gabor Filter
Mohammad Parsa Karimi - Abdollah Amirkhani - Shahriar B. Shokouhi
A Hybrid Approach for Multimodal Biometric Recognition based on Feature Level Fusion in Reproducing Kernel Hilbert Space
Mohammad Hassan Safavipour - Mohammad Ali Doostari - Hamed Sadjedi
طراحی و شبیه سازی یک تقویت کننده کم نویز پهن باند در باند K (18 تا 27 گیگاهرتز)
نوید نصیری - حسین شمسی
A Low Power Wideband 0.6-5.4 GHz CG-CS LNA with pMOS-nMOS Configuration and Resistive Feedback
Sajjad Shojaei Baghini - Seyed Ali Samareh TaheriNasab - Samad Sheikhaei
High efficiency Continuous class J/B power amplifier design with 130% Fractional Bandwidth
Sara Aghajani - Mahmoud Kamarei - Marzieh Chegini
Physiotherapy Algorithms on FUM-Physio Robot
Keyvan Tayaranian Marvian - Amir Hossein Nazari - Seyed Mohammad Tahamipour Zarandi - Mohammad Reza Akbarzadeh totonchi - Zahra Soltani - Alireza Akbarzadeh totonchi
Vision Transformer and Parallel Convolutional Neural Network for Speech Emotion Recognition
Saber Hashemi - Mohammad Asgari
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.3.1