Weakly Supervised Detection of Hallucinations in LLM ActivationsMiriam RateikeCelia Cintaset al.2023NeurIPS 2023
Influence Based Approaches to Algorithmic Fairness: A Closer LookSoumya GhoshPrasanna Sattigeriet al.2023NeurIPS 2023
Adversarial Auditing of Machine Learning Models under Compound ShiftKaran BhanotDennis Weiet al.2023ESANN 2023
Balancing Social Impact, Opportunities, and Ethical Constraints of Using AI in the Documentation and Vitalization of Indigenous LanguagesClaudio S. PinhanezPaulo Cavalinet al.2023IJCAI 2023
Skin Tone Analysis for Representation in Educational Materials (STAR-ED) Using Machine LearningGirmaw Abebe TadesseCelia Cintaset al.2023npj Digital Medicine
An AI-assisted Workbench for Material DiscoveryEmilio Ashton Vital BrazilRenato Fontoura de Gusmao Cerqueiraet al.2023ACS Fall 2023
Stress-Testing Bias Mitigation Algorithms to Understand Fairness VulnerabilitiesKaran BhanotIoana Baldiniet al.2023AIES 2023
Beyond Black Box AI-Generated Plagiarism Detection: From Sentence to Document LevelMujahid Ali QuidwaiChunhui Liet al.2023ACL 2023
Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children's Fairy TalesPaulina Toro IsazaGuangxuan Xuet al.2023ACL 2023