Publications

(2025). Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions. arXiv preprint.

PDF

(2024). Task Diversity Shortens the ICL Plateau. Arxiv preprint.

PDF Cite

(2024). Optimal Acceleration for Minimax and Fixed-Point Problems is Not Unique. ICML, 2024 (Spotlight).

PDF Cite

(2023). Mirror Duality in Convex Optimization. arXiv preprint.

PDF Cite