Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models. Chia-yi Hsu, Yu-Lin Tsai, et al. NeurIPS 2024.
Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion Models? Yu-Lin Tsai, Chia-yi Hsu, et al. ICLR 2024.
Formalizing Generalization and Adversarial Robustness of Neural Networks to Weight Perturbations. Yu-Lin Tsai, Chia-Yi Hsu, et al. NeurIPS 2021.
Formalizing Generalization and Robustness of Neural Networks to Weight Perturbations. Yu-Lin Tsai, Chia-Yi Hsu, et al. ICLR 2021.