机器学习与数据科学博士生系列论坛(第八十七期)—— A Regularized Online Newton Method for Stochastic Convex Bandits with Linear Vanishing Noise
Holder: Jingxin Zhan(Peking University)
Time:2025-04-17 16:00-17:00
Location:Tencent Conference 531-8098-3912
Abstract:
Bandit convex optimization is an online version of the zeroth-order optimization problem, a fundamental issue in optimization with many applications in operations research and other fields. Recently, Lumbreras and Tomamichel [2024] proposed a vanishing noise model in which the subgaussian noise parameter is assumed to decrease linearly as the learner selects actions closer to the minimizer of the convex loss function.
In this talk, we introduce a Regularized Online Newton Method (RONM), which is based on the Online Newton Method (ONM) of Fokkema et al. [2024]. The method reaches a polylogarithmic regret in the time horizon n when the loss function grows quadratically in the constraint set, which recovers the results of Lumbreras and Tomamichel [2024] in linear bandits.
About the Speaker:
论坛每次邀请一位博士生就某个前沿课题做较为系统深入的介绍,主题包括但不限于机器学习、高维统计学、运筹优化和理论计算机科学。
Your participation is warmly welcomed!

欢迎扫码关注北大统计科学中心公众号,了解更多讲座信息!