Publications

100 results at NeurIPS 2025

BenchmarkCards: Standardized Documentation for Large Language Model Benchmarks
- - Anna Sokol
  - Elizabeth Daly
  - et al.
- 2025
- NeurIPS 2025
Optimal Estimation of the Best Mean in Multi-Armed Bandits
- - Takayuki Osogami
  - Junya Honda
  - et al.
- 2025
- NeurIPS 2025
Causal LLM Routing: End-to-End Regret Minimization from Observational Data
- - Asterios Tsiourvas
  - Wei Sun
  - et al.
- 2025
- NeurIPS 2025
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
- - Dennis Wei
  - Inkit Padhi
  - et al.
- 2025
- NeurIPS 2025
Adaptive Distraction: Probing LLM Contextual Robustness with Automated Tree Search
- - Yanbo Wang
  - Zixiang Xu
  - et al.
- 2025
- NeurIPS 2025
Shape it Up! Restoring LLM Safety during Finetuning
- - Shengyun Peng
  - Pin-Yu Chen
  - et al.
- 2025
- NeurIPS 2025
Transformers Learn Faster with Semantic Focus
- - Parikshit Ram
  - Kenneth Clarkson
  - et al.
- 2025
- NeurIPS 2025
Optimality and NP-Hardness of Transformers in Learning Markovian Dynamical Functions
- - Yanna Ding
  - Songtao Lu
  - et al.
- 2025
- NeurIPS 2025
Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models
- - Aleksandar Terzic
  - Nicolas Menet
  - et al.
- 2025
- NeurIPS 2025
Objective Soups: Multilingual Multi-Task Modeling for Speech Processing
- - A Saif
  - Lisha Chen
  - et al.
- 2025
- NeurIPS 2025