Exploring Straightforward Methods for Automatic Conversational Red-Teaming. George Kour, Naama Zwerdling, et al. 2025. NAACL 2025.
Unveiling Safety Vulnerabilities of Large Language Models. George Kour, Marcel Zalmanovici, et al. 2023. EMNLP 2023.
Balancing via Generation for Multi-Class Text Classification Improvement. Naama Tepper, Esther Goldbraich, et al. 2020. EMNLP 2020.
Claims on demand – an initial demonstration of a system for automatic detection and polarity identification of context dependent claims in massive corpora. Ehud Aharoni, Carlos Alzate, et al. 2014. COLING 2014.