← 返回大厅
arXiv (CS.LG) 2026-06-16 12:00 DOI: arXiv:2602.08026

Sharp analysis of linear ensemble sampling

摘要 / Abstract

arXiv:2602.08026v2 Announce Type: replace Abstract: We analyse linear ensemble sampling (ES) with standard Gaussian perturbations in stochastic linear bandits. We show that for ensemble size $m=\Theta(d\log n)$, ES attains $\tilde O(d^{3/2}\sqrt n)$ high-probability regret, closing the gap to the Thompson sampling benchmark while keeping computation comparable. The proof brings a new perspective on randomized exploration in linear bandits by reducing the analysis to a time-uniform exceedance problem for $m$ independent Brownian motions. This continuous-time lens appears particularly natural here: it yields an exact representation of the relevant discrete-time processes, and we do not know another route to a sharp ES bound.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。