31st International Conference on Electrical Engineering
Batch(offline) Reinforcement Learning for recommender system
Authors:
Mohammad Amir Rezaei Gazik (1), Mehdy Roayaei (2)
1- Tarbiat Modares University
2- Tarbiat Modares University, Tehran
Keywords:
offline reinforcement learning, recommender systems, big data
Abstract:
The rapid spread of the Internet in recent years has increased both the variety and the volume of data, making it difficult for users to find the information they need. With the continued growth of online business, e-learning, communication and sharing among users, and social networking, there is a clear need for systems that make searching easier. A recommender system offers users the most appropriate and accurate suggestions by examining user-related information in relevant datasets; in other words, it extracts user preferences and interests from data and makes suggestions accordingly. In this paper, we study the problem of learning recommender systems from large datasets. We propose an offline reinforcement learning (RL) framework for recommender systems that aims for high accuracy and fast recommendation. Specifically, the framework, called Ofrec, first transforms the recommendation problem into a Markov Decision Process (MDP) and then provides highly accurate and time-saving recommendations. We conduct extensive experiments on the large CIKM 2019 EComm AI dataset and show that the proposed approach outperforms supervised learning and reinforcement learning baselines.
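The abstract does not say how Ofrec defines its MDP or trains its policy, so the following is only a minimal sketch of the general idea of offline RL for recommendation, not the authors' method. It assumes a hypothetical setup in which the state is the tuple of the user's most recent items, the action is the recommended item, and the reward is a logged feedback signal. The function names (`sessions_to_transitions`, `batch_q_learning`, `recommend`) and all parameters are illustrative, and tabular Q-learning over a fixed log stands in for whatever learner the paper actually uses.

```python
from collections import defaultdict

def sessions_to_transitions(sessions, window=2):
    """Turn logged sessions into MDP transitions (s, a, r, s', done).

    Assumption: each session is a time-ordered list of (item_id, reward)
    pairs, and the state is the tuple of the last `window` items seen.
    """
    transitions = []
    for session in sessions:
        history = ()
        for step, (item, reward) in enumerate(session):
            next_history = (history + (item,))[-window:]
            done = step == len(session) - 1
            transitions.append((history, item, float(reward), next_history, done))
            history = next_history
    return transitions

def batch_q_learning(transitions, gamma=0.9, lr=0.5, epochs=50):
    """Tabular Q-learning that only replays a fixed log (no new interaction)."""
    q = defaultdict(float)
    actions_in = defaultdict(set)  # actions actually observed in each state
    for s, a, _, _, _ in transitions:
        actions_in[s].add(a)
    for _ in range(epochs):
        for s, a, r, s2, done in transitions:
            # Bootstrap only from actions seen in the log for s2, so the
            # target never relies on state-action pairs without data.
            best_next = 0.0 if done else max(
                (q[(s2, a2)] for a2 in actions_in[s2]), default=0.0)
            q[(s, a)] += lr * (r + gamma * best_next - q[(s, a)])
    return q, actions_in

def recommend(q, actions_in, state):
    """Return the logged action with the highest Q-value, or None if unseen."""
    candidates = actions_in.get(state)
    if not candidates:
        return None
    return max(candidates, key=lambda a: q[(state, a)])

# Toy log: two sessions that both start with item 1; only item 2 was rewarded.
sessions = [[(1, 0), (2, 1)], [(1, 0), (3, 0)]]
transitions = sessions_to_transitions(sessions)
q, actions_in = batch_q_learning(transitions)
best = recommend(q, actions_in, (1,))
```

Restricting the bootstrap target to actions observed in the log is one simple way to limit value estimates for unseen state-action pairs, a common concern in offline RL. A dataset at the scale of CIKM 2019 EComm AI would need function approximation rather than a table.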