Multi-Sense Embeddings for Language Models and Knowledge DistillationQitong WangMohammed Zakiet al.2025ACL 2025
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM EvaluationEliya HabbaOfir Arvivet al.2025ACL 2025
Query-driven Document-level Scientific Evidence Extraction from Biomedical StudiesMassimiliano PronestiJoao Bettencourt-Silvaet al.2025ACL 2025
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation SystemsYannis KatsisSara Rosenthalet al.2025ACL 2025
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific LanguagesMehant KammakomatiSameer Pimparkhedeet al.2025ACL 2025
EpMAN: Episodic Memory AttentioN for Generalizing to Longer ContextsSUBHAJIT CHAUDHURYPayel Daset al.2025ACL 2025
Defensive Prompt Patch: A Robust and Generalizable Defense of Large Language Models against Jailbreak AttacksChen XiongXiangyu Qiet al.2025ACL 2025
ZeroNER: Fueling Zero-Shot Named Entity Recognition via Entity Type DescriptionsAlessio CocchieriMarcos Martínez Galindoet al.2025ACL 2025