The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models. Yan Liu, Yu Liu, et al. ICLR 2024.
Uncovering and Quantifying Social Biases in Code Generation. Yan Liu, Xiaokang Chen, et al. NeurIPS 2023.