Skip to yearly menu bar Skip to main content


Poster

The Statistical Complexity of Offline Decision-Making

Thanh Nguyen-Tang · Raman Arora


Abstract:

We study the statistical complexity of offline decision-making with function approximation, establishing (near) minimax-optimal rates for stochastic contextual bandits and Markov decision processes. The performance limits are captured by the pseudo-dimension of the (value) function class and a new characterization of the behavior policy that strictly subsumes all the previous notions of data coverage in the offline decision-making literature. In addition, we consider the value of offline data for online decision-making and show nearly minimax-optimal rates in a wide range of regimes.

Live content is unavailable. Log in and register to view live content