Publications

Contextual bandit algorithms with supervised learning guarantees