0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Batch(offline) Reinforcement Learning for recommender system
نویسندگان :
Mohammad Amir Rezaei Gazik
1
Mehdy Roayaei
2
1- دانشگاه تربیت مدرس
2- دانشگاه تربیت مدرس تهران
کلمات کلیدی :
offline reinforcement learning،recommender systems،big data
چکیده :
The explosive spread of the Internet in recent years has increased the types and amounts of big data, making it difficult for users to search for the data they need. With the continued growth of business on the Internet, e-learning, increased communication and sharing among users, and the advent of social networking, there is an undeniable need to design and implement systems that make it easier for people to search. A recommender system provides the ability to provide the most appropriate and accurate suggestions to users by checking user-related information from relevant datasets. In other words, it extracts user preferences and interests from data and makes suggestions. In this paper, we study the problem of learning recommendation systems with large datasets. We propose an offline RL framework for recommender systems to achieve high accuracy and perform recommendations quickly. Specifically, we propose a framework called Ofrec that first transforms the problem into a Markov Decision Process (MDP) and provides highly accurate and time-saving recommendations. We conduct extensive experiments on a large dataset of CIKM 2019 EComm AI and show that the proposed approach outperforms supervised learning and reinforcement learning algorithms.
لیست مقالات
لیست مقالات بایگانی شده
طراحی تزویجگر پهن باند سه استابی فشرده میکرواستریپ برای استفاده در ترکیب کننده توان
صادق حیدری کاهکش - اکرم شیخی
A new LDO regulator with adaptive PSR improvement under wide load current range and fast load transient response
Mohammad Ahmadi - Emad Ebrahimi
Modeling and control of two PPR cooperative manipulations with a passive joint
Hassan Khosravi - Farhad Fani Saberi - Rasul Fesharakifard
طراحی و شبیه سازی جاذب شفاف فراماده تک لایه پهن باند مبتنی بر الگوی فرکتالی با پایداری زاویه ای بالا و غیر حساس به قطبش
سحر سوقی - حمید حیدر - محمدرضا هراتی - وحید نیری
Heterogeneous Coverage Path Planning For Multi- Agent systems with ACO and GA
Mohammad Hasan Jalili Bahabadi - ََAmir Mahdavi - Saeed Khankalantary
Application of Max Flow- Min Cut Theory to find the best placement Of Electronic-based DC-PFCs for enhancing static security in MT-HVDC Meshed Grids
Mir Hamed Pour Mir Asghariyan - Jafar Milimonfared - Seyed Saeid Heidari Yazdi - Ali Haji Ali Biglo - Kumars Rouzbehi
Stochastic model predictive control based on online learning for a class of nonlinear constrained systems
Mahdi Mansoury - Mohammad Ali Badamchizadeh - Hamed Kharrati
بررسی حفظ همراستایی در سامانههای مخابرات نوری فضای آزاد
مهدی زندی آتشبار - اصغر غلامی - فروغالسادات طباطبا
Net Load Forecasting of Household Prosumers Considering Deep Reinforcement Learning
Behzad Motallebi Azar - Rasool Kazemzadeh - Morteza Zare Oskouei - Behnam Mohammadi-Ivatloo
Design and Analysis of Three-Step Cyclic Vernier Time-to-Digital Converter
ُSara Mansouri - Hamidreza Rezaee-Dehsorkh - Nassim Ravanshad
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4