Final-Model-Only Data Attribution with a Unifying View of Gradient-Based MethodsDennis WeiInkit Padhiet al.2025NeurIPS 2025
Specifying exact circuit algorithms in universal transformersTaku ItoRuchir Puriet al.2025NeurIPS 2025
MEAL: A Multi-dimensional Evaluation of Alignment Techniques for LLMsMuneeza AzmatMomin Abbaset al.2025NeurIPS 2025
Representation Similarity Reveals Implicit Layer Grouping in Neural NetworksTian GaoAmit Dhurandharet al.2025NeurIPS 2025
Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous EvaluationJung koo Kang2025NeurIPS 2025
Toward a Coherent Virtual Cell Model: Probing Biological World-Model Coherence in Transcriptomic Foundation ModelsNoa MorielYishai Shimoniet al.2025NeurIPS 2025
SafeCOMM: Investigating Safety Degradation in Fine-Tuned Telecom Large Language ModelsAladin DjuheraSwanand Ravindra Kadheet al.2025NeurIPS 2025
Quantifying policy uncertainty in generative flow networks with uncertain rewardsRamon Nartallo-kaluarachchiRobert Manson Sawkoet al.2025NeurIPS 2025
The Shepherd Test: How Will Superintelligent Agents Balance Care and Control in Asymmetric Relationships?Djallel BouneffoufMatthew Riemeret al.2025NeurIPS 2025