Uncovering and Quantifying Social Biases in Code GenerationYan LiuXiaokang Chenet al.2023NeurIPS 2023
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion ModelsSheng-yen ChoPin-Yu Chenet al.2023NeurIPS 2023
Weakly Supervised Detection of Hallucinations in LLM ActivationsMiriam RateikeCelia Cintaset al.2023NeurIPS 2023
Using Foundation Models to Promote Digitization and Reproducibility in Scientific ExperimentationAmol ThakkarAndrea Giovanniniet al.2023NeurIPS 2023
Hierarchical Reinforcement Learning with AI Planning ModelsJunkyu LeeMichael Katzet al.2023NeurIPS 2023
Influence Based Approaches to Algorithmic Fairness: A Closer LookSoumya GhoshPrasanna Sattigeriet al.2023NeurIPS 2023
Characterizing pre-trained and task-adapted molecular representationsCelia CintasPayel Daset al.2023NeurIPS 2023