Sequence-Aware Inline Measurement Attribution for Good-Bad Wafer DiagnosisKohei MiyaguchiMasao Jokoet al.2025ASMC 2025
Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You InItay NakashGeorge Kouret al.2025NAACL 2025
Exploring Straightforward Methods for Automatic Conversational Red-TeamingGeorge KourNaama Zwerdlinget al.2025NAACL 2025
Comprehensive Layer-Wise Analysis of SSL Models for Audio Deepfake DetectionYassine ElkheirYounes Samihet al.2025NAACL 2025
The Literary Canons of Large-Language Models: An Exploration of the Frequency of Novel and Author Generations Across Gender, Race and Ethnicity, and NationalityPaulina Toro IsazaNalani Kopp2025NAACL 2025
DAMAGeR: Deploying Automatic and Manual Approaches to GenAI Red-teamingManish NagireddyMichael Fefferet al.2025NAACL 2025