Parameterized Abstract Interpretation for Transformer Verification. Pei Huang, Dennis Wei, et al. 2026. AAAI 2026.
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports. Razi Mahmood, Diego Machado Reyes, et al. 2025. MICCAI 2025.
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents. Ivoline Ngong, Swanand Ravindra Kadhe, et al. 2025. ACL 2025.
In-Context Bias Propagation in LLM-Based Tabular Data Generation. Pol Garcia Recasens, Alberto Gutierrez-torre, et al. 2025. ICML 2025.
MAD-MAX: Modular And Diverse Malicious Attack MiXtures for Automated LLM Red Teaming. Stefan Schoepf, Muhammad Zaid Hameed, et al. 2025. ICML 2025.
Retention Score: Quantifying Jailbreak Risks for Vision Language Models. Zaitang Li, Pin-Yu Chen, et al. 2025. AAAI 2025.
PEEL the Layers and Find Yourself: Revisiting Inference-time Data Leakage for Residual Neural Networks. Huzaifa Arif, Keerthiram Murugesan, et al. 2025. IEEE SaTML 2025.