0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Batch(offline) Reinforcement Learning for recommender system
نویسندگان :
Mohammad Amir Rezaei Gazik
1
Mehdy Roayaei
2
1- دانشگاه تربیت مدرس
2- دانشگاه تربیت مدرس تهران
کلمات کلیدی :
offline reinforcement learning،recommender systems،big data
چکیده :
The explosive spread of the Internet in recent years has increased the types and amounts of big data, making it difficult for users to search for the data they need. With the continued growth of business on the Internet, e-learning, increased communication and sharing among users, and the advent of social networking, there is an undeniable need to design and implement systems that make it easier for people to search. A recommender system provides the ability to provide the most appropriate and accurate suggestions to users by checking user-related information from relevant datasets. In other words, it extracts user preferences and interests from data and makes suggestions. In this paper, we study the problem of learning recommendation systems with large datasets. We propose an offline RL framework for recommender systems to achieve high accuracy and perform recommendations quickly. Specifically, we propose a framework called Ofrec that first transforms the problem into a Markov Decision Process (MDP) and provides highly accurate and time-saving recommendations. We conduct extensive experiments on a large dataset of CIKM 2019 EComm AI and show that the proposed approach outperforms supervised learning and reinforcement learning algorithms.
لیست مقالات
لیست مقالات بایگانی شده
بازسازی تصاویر رادار دهانه مصنوعی با استفاده از نمایش تنک مبتنی بر گروه
محبوبه خدرزاده - صادق صمدی
طبقهبندی تصاویر سلولی پاپ اسمیر مبتنی بر الگوریتمهای ترتیبی یادگیری جمعی و شبکههای عمیق استخراج ویژگی
زهرا کمالی - محمدصادق هل فروش - کامران کاظمی - مژگان اکبرزاده
Parkinson’s Disease Classification Using Continuous Wavelet Transform and Ensemble Convolutional Neural Networks on EEG Signals
Seyed Pedram Monazami - Raheleh Davoodi
Robust Object Detection Against Adversarial Perturbations with Gabor Filter
Mohammad Parsa Karimi - Abdollah Amirkhani - Shahriar B. Shokouhi
Classification of Schizophrenia Patients by Nonlinear Analysis of EEG
Amirhossein Tajik - Hoda Jalalkamali - Hossein Nezamabadipour
Angular Misalignment Effect on the Performance of Underwater MIMO OCC Systems
Ehsan Hamidnejad - Asghar Gholami
A Novel Interpretation of Coding in Time-Modulated Arrays
Mehdi Gholami - Mohammad Neshat
Formation Control of Bicycle Model of Mobile Robots with Disturbance Using PID Controller
Amirhossein Rahmankhanloo - Saeed Khankalantary - Ali Akbar Vahedi
Integrated expansion planning of the distribution network and distributed generations considering energy storage systems, electric vehicles charging stations, and daily load modeling
Ahmad Mohammadi Pour - Mehrdad Setayesh Nazar
Uneven Illumination Correction in Whole Slide Imaging using Pix2Pix
Sama Nemati - Hasti Shabani
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2