Exploring Straightforward Methods for Automatic Conversational Red-Teaming. George Kour, Naama Zwerdling, et al. 2025. NAACL 2025.
Systematic Knowledge Injection into Large Language Models via Diverse Augmentation for Domain-Specific RAG. Kushagra Bhushan, Yatin Nandwani, et al. 2025. NAACL 2025.
InspectorRAGet: An Introspection Platform for RAG Evaluation. Benjamin Sznajder, Kshitij Fadnis, et al. 2025. NAACL 2025.
The Literary Canons of Large-Language Models: An Exploration of the Frequency of Novel and Author Generations Across Gender, Race and Ethnicity, and Nationality. Paulina Toro Isaza, Nalani Kopp. 2025. NAACL 2025.
DAMAGeR: Deploying Automatic and Manual Approaches to GenAI Red-teaming. Manish Nagireddy, Michael Feffer, et al. 2025. NAACL 2025.
ASTER: Natural and Multi-language Unit Test Generation with LLMs. Rangeet Pan, Myeongsoo Kim, et al. 2025. ICSE 2025.
Can LLMs Replace Manual Annotation of Software Engineering Artifacts? Toufique Ahmed, Premkumar Devanbu, et al. 2025. MSR 2025.
Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation. Jessica He, Stephanie Houde, et al. 2025. CHI 2025.