Conference paper
Localizing Persona Representations in LLMs
Celia Cintas, Miriam Rateike, et al.
AIES 2025
Trustworthy artificial intelligence researchers should seek to better detect and characterize systematic deviations in data and models (that is, bias). This article provides data scientists with motivation, theory, code, and examples on how to perform disciplined discovery of systematic deviations in data and models at the subset level.
Celia Cintas, Miriam Rateike, et al.
AIES 2025
Paul Grefen, Irene Vanderfeesten, et al.
Machines
Sijia Liu, Pin-Yu Chen, et al.
IEEE SPM
Luke Dicks, David E. Graff, et al.
MSDE