Latent Principle Discovery for Language Model Self-Improvement. Keshav Ramji, Tahira Naseem, et al. NeurIPS 2025.
Modeling Human Behavior Without Humans: Prospect Theoretic Multi-Agent Reinforcement Learning. Sheyan Lalmohammed, Khush Gupta, et al. ICML 2025.
Conformal Language Model Reasoning with Coherent Factuality. Maxon Rubin-toles, Maya Gambhir, et al. ICLR 2025.