Publications

910 results for Trustworthy AI

Advances in Emulating Earth System Models
- - Björn Lütjens
  - Kalyn Dorheim
  - et al.
- 2025
- AGU 2025
APILOT: Improving the Security and Usability of LLM Code Suggestions via Outdated API Mitigation
- - Weiheng Bai
  - Keyang Xuan
  - et al.
- 2025
- ACSAC 2025
Evaluation of partitioning algorithms for trustworthy out-of-distribution evaluation of machine learning models in biochemistry
- - Raúl Fernández Díaz
  - Lam Thanh Hoang
  - et al.
- 2025
- VIBE 2025
Secure and Safe AI Agents for Big Data Infrastructures
- - Bhavya Bhavya
  - Sai Sree Laya Chukkapalli
- 2025
- Big Data 2025
Cross-Process Defect Attribution using Potential Loss Analysis
- - Ide-San Ide
  - Kohei Miyaguchi
- 2025
- WSC 2025
BenchmarkCards: Standardized Documentation for Large Language Model Benchmarks
- - Anna Sokol
  - Elizabeth Daly
  - et al.
- 2025
- NeurIPS 2025
Deferring Concept Bottleneck Models: Learning to Defer Interventions to Inaccurate Experts
- - Andrea Pugnana
  - Riccardo Massidda
  - et al.
- 2025
- NeurIPS 2025
Shape it Up! Restoring LLM Safety during Finetuning
- - Shengyun Peng
  - Pin-Yu Chen
  - et al.
- 2025
- NeurIPS 2025
Adaptive Distraction: Probing LLM Contextual Robustness with Automated Tree Search
- - Yanbo Wang
  - Zixiang Xu
  - et al.
- 2025
- NeurIPS 2025
Causally Reliable Concept Bottleneck Models
- - Giovanni De Felice
  - Arianna Casanova Flores
  - et al.
- 2025
- NeurIPS 2025