AAAI Publications, Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence

Thompson Sampling for Combinatorial Bandits and Its Application to Online Feature Selection
Audrey Durand, Christian Gagné

Last modified: 2014-06-18

Abstract


In this work, we address the combinatorial optimization problem in the stochastic bandit setting with bandit feedback. We propose using the seminal Thompson Sampling algorithm under an assumption on the reward expectations. More specifically, we tackle the online feature selection problem, where our results show that Thompson Sampling performs well. Additionally, we discuss the challenges associated with online feature selection and highlight relevant directions for future work.
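
To make the setting concrete, here is a minimal sketch of Thompson Sampling applied to choosing a subset of k arms (for example, k features) when only one scalar reward per round is observed. It is not the authors' implementation: the Beta priors, the top-k selection rule, the crediting of the single observed reward to every chosen arm, and the names `thompson_top_k` and `pull` are all assumptions made for illustration, since the abstract does not state the exact assumption on reward expectations.

```python
import random


def thompson_top_k(n_arms, k, pull, rounds, seed=0):
    """Illustrative Thompson Sampling over subsets of k arms.

    `pull(chosen)` is a hypothetical environment callback that returns one
    scalar reward in [0, 1] for the whole subset (bandit feedback).
    """
    rng = random.Random(seed)
    # Assumed Beta(1, 1) prior on each arm's expected contribution.
    alpha = [1.0] * n_arms
    beta = [1.0] * n_arms
    for _ in range(rounds):
        # Draw one posterior sample per arm.
        samples = [rng.betavariate(alpha[i], beta[i]) for i in range(n_arms)]
        # Play the k arms with the largest sampled values.
        chosen = sorted(range(n_arms), key=lambda i: samples[i], reverse=True)[:k]
        reward = pull(chosen)
        # Assumption: the single observed reward is credited to every chosen arm.
        for i in chosen:
            alpha[i] += reward
            beta[i] += 1.0 - reward
    return alpha, beta
```

In a feature selection use, `pull` might train or evaluate a model restricted to the chosen features and return a score rescaled to [0, 1]; that mapping is likewise an assumption rather than something taken from the abstract.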

Keywords


Online Feature Selection; Bandit; Thompson Sampling; Combinatorial
