MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation. Basel Shbita, Farhan Ahmed, et al. 2025. NeurIPS 2025.
Specifying exact circuit algorithms in universal transformers. Taku Ito, Ruchir Puri, et al. 2025. NeurIPS 2025.
SafeCOMM: Investigating Safety Degradation in Fine-Tuned Telecom Large Language Models. Aladin Djuhera, Swanand Ravindra Kadhe, et al. 2025. NeurIPS 2025.
ConstrainedSQL: Training LLMs for Text2SQL via Constrained Reinforcement Learning. Weiqin Chen, Nhan Pham, et al. 2025. NeurIPS 2025.
SemCLIP: A Semantic Memory-Aligned Vision Language Model. Tanveer Syeda-Mahmood, Niharika DSouza, et al. 2025. NeurIPS 2025.
STRIDE: A Systematic Framework for Selecting AI Modalities—Agentic AI, AI Assistants, or LLM Calls. Shubhi Asthana, Ruchi Mahindru, et al. 2025. NeurIPS 2025.
Dynamic Features Adaptation in Networking: Toward Flexible training and Explainable inference. Yannis Belkhiter, Seshu Tirupathi, et al. 2025. NeurIPS 2025.
Harnessing biomedical foundation models for genomic feature engineering to investigate patient drug response. Laura Gardiner, Jennifer Kelly, et al. 2025. NeurIPS 2025.