RecSys 2020 Tutorial: Introduction to Bandits in Recommender Systems
Introduction to Bandits in Recommender Systems by Andrea Barraza-Urbina (NUI Galway) and Dorota Glowacka (University of Helsinki) The multi-armed bandit problem models an agent that simultaneously attempts to acquire new knowledge (exploration) and optimize his decisions based on existing knowledge (exploitation). The agent attempts to balance these competing tasks in order to maximize his total value over the period of time considered. There are many practical applications of the bandit model, such as clinical trials, adaptive routing or portfolio design. Over the last decade there has been an increased interest in developing bandit algorithms for specific problems in recommender systems, such as news and ad recommendation, the cold start problem in recommendation, personalization, collaborative filtering with bandits, or combining social networks with bandits to improve product recommendation. The aim of this tutorial is to provide participants with the basic knowledge of the following concepts: (a) the exploration-exploitation dilemma and its connection to learning through interaction; (b) framing of the recommender systems problem as an interactive sequential decision-making task that needs to balance exploration and exploitation; (c) basic fundamentals behind bandit approaches that address the exploration-exploitation dilemma; and (d) a general picture of the state-of-the-art of bandit-based recommender systems. With this tutorial we hope to enable participants to start working on bandit-based recommender systems and to provide a framework that would empower them to develop more advanced approaches. The tutorial is divided into three sections focused on: (1) general motivation and introduction to classic bandit approaches; (2) hands-on session where a simple synthetic recommendation task representing a bandit problem with linear rewards will be used; and (3) overview of a variety of applications of bandit algorithms in recommendation systems summarizing the current state and an outline of challenges applying bandit algorithms in recommendation systems. This introductory tutorial is aimed at an audience with background in computer science, information retrieval or recommender systems who have a general interest in the application of machine learning techniques in recommender systems. The prerequisite knowledge is basic familiarity with machine learning and basic knowledge of statistics and probability theory. The tutorial will provide practical examples based on Python code and Jupyter Notebooks.

Workshop on Context-Aware Recommender Systems

Workshop on Bandit and Reinforcement Learning from User Interactions

Tutorial 2B Hands On Reinforcement Learning for recommender systems

Multi-Armed Bandits: A Cartoon Introduction - DCBA #1

The Contextual Bandits Problem: A New, Fast, and Simple Algorithm

Netflix's Recommendation Evolution by Mark Hsiao | VideoRecSys 2023 Workshop | ACM RecSys

A/B Testing vs. Multi-Armed Bandits: What You Need To Know (Outperform Podcast)

Data Analytics for Beginners | Data Analytics Training | Data Analytics Course | Intellipaat

Recent Advances in Generative Conversational Recommender Systems

Contextual Bandits : Data Science Concepts

RecSys 2020 Tutorial: Conversational Recommender Systems

Recommender Systems: Basics, Types, and Design Consideration | Machine Learning | Community Webinar

The Contextual Bandits Problem

Counterfactual Learning and Evaluation for Recommender Systems:

Deep Learning for Personalized Search and Recommender Systems part 1

Bandit Algorithms - 1

CS885 Lecture 8a: Multi-armed bandits

47th #ebaytechtalk: Deep Learning for Recommender Systems

Building an AI Dark Factory: A Codebase That Writes Its Own Code, Live

