1st Workshop on Data Integrity and Secure Cloud Computing (DISCC-2022)Pradip BoseJennifer Dworaket al.2022MICRO 2022
Your Fairness May Vary: Pretrained Language Model Fairness in Toxic Text ClassificationIoana Baldini SoaresDennis Weiet al.2022ACL 2022
The Good, the Bad, and the Outliers: A Testing Framework for Decision Optimization Model LearningOrit DavidovichGheorghe-Teodor Berceaet al.2022KDD 2022
LaSO: Label-Set Operations networks for multi-label few-shot learningAmit AlfassyLeonid Karlinskyet al.2019CVPR 2019
Exploring Vulnerabilities in LLMs: A Red Teaming Approach to Evaluate Social BiasYuya Jeremy OngJay Pankaj Galaet al.2024IEEE CISOSE 2024
DetAIL: A Tool to Automatically Detect and Analyze Drift In LanguageNishtha MadaanAdithya Manjunathaet al.2023IAAI 2023