SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language ModelsManish NagireddyLamogha Chiazoret al.2024AAAI 2024
On the Safety of Interpretable Machine Learning: A Maximum Deviation ApproachDennis WeiRahul Nairet al.2022NeurIPS 2022
Ground-Truth, Whose Truth? - Examining the Challenges with Annotating Toxic Text DatasetsKofi ArhinIoana Baldini Soareset al.2021NeurIPS 2021
Bias in Clinical Risk Prediction Models: Challenges in Application to Observational Health DataYoonyoung ParkMoninder Singhet al.2021AAAI 2021
Algorithmic Selection of Patients for Case Management: Alternative Proxies to Healthcare CostsMoninder Singh2021AAAI 2021
Your Fairness May Vary: Pretrained Language Model Fairness in Toxic Text ClassificationIoana Baldini SoaresDennis Weiet al.2022ACL 2022