On Robustness-Accuracy Characterization of Language Models using Synthetic Datasets. Ching-yun Ko, Pin-Yu Chen, et al. COLM 2024.
Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised Learning. Zhiyuan He, Yijun Yang, et al. ICML 2024.
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts. Zhi-yi Chin, Chieh-ming Jiang, et al. ICML 2024.
What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian Benchmarks. Irene Ko, Pin-Yu Chen, et al. ICML 2024.
Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation. Tomas Bueno Momcilovic, Beat Buesser, et al. xAI 2024.
Improving Membership Inference Attacks against Classification Models. Shlomit Shachor, Natalia Razinkov, et al. KES-IDT 2024.
Overload: Latency Attacks on Object Detection for Edge Devices. Erh-Chung Chen, Pin-Yu Chen, et al. CVPR 2024.
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing. Jiabao Ji, Bairu Hou, et al. NAACL 2024.
Evaluating the Impact of Skin Tone Representation on Out-of-Distribution Detection Performance in Dermatology. Assala Benmalek, Celia Cintas, et al. ISBI 2024.
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Xiangyu Qi, Yi Zeng, et al. ICLR 2024.