Towards Trustworthy and Socially Responsible Generative Foundation ModelsYue HuangZhenhong Zhouet al.2026AAAI 2026
BenchmarkCards: Standardized Documentation for Large Language Model BenchmarksAnna SokolElizabeth Dalyet al.2025NeurIPS 2025
Adaptive Distraction: Probing LLM Contextual Robustness with Automated Tree SearchYanbo WangZixiang Xuet al.2025NeurIPS 2025