Contextual Linear Optimization under Full and Partial Feedback

毛小介清华大学
Time: TBD Location: TBD

Abstract

This talk is about Contextual Linear Optimization (CLO) across two feedback regimes, where we study the traditional two-stage Estimate-Then-Optimize (ETO) approach and the new integrated Induced Empirical Risk Minimization (IERM) framework. In the full-feedback setting, we theoretically demonstrate that under model correct specification, ETO can surprisingly achieve faster regret convergence rates than IERM by leveraging problem-specific geometric properties. In partial-feedback settings (bandit and semi-bandit), we propose a unified offline IERM framework and establish novel fast-rate guarantees. Numerical experiments on shortest path problems validate our theoretical findings across different regimes.

Biography

毛小介,清华大学经济管理学院管理科学与工程系副教授,2016年获得武汉大学经济学学士学位,2021年获得美国康奈尔大学统计与数据科学专业博士学位。研究兴趣主要包括因果推断、数据驱动的决策方法、统计机器学习等。