Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators
- Zaiwei Chen
- Siva Theja Maguluri
- et al.
- 2021
- NeurIPS 2021
This is our catalog of publications authored by IBM researchers, in collaboration with the global research community. It’s an ever-growing body of work that shows why IBM is one of the most important contributors to modern computing.