Publications

910 results for Trustworthy AI

Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI
- - Ambrish Rawat
  - Stefan Schoepf
  - et al.
- 2024
- NeurIPS 2024
Global Area Sampling for Geospatial Foundation Model
- - Daiki Kimura
  - Naomi Simumba
  - et al.
- 2024
- AGU 2024
Advanced Physics-AI Models for Rain Enhancement in Arid Regions
- - Lloyd Treinish
  - Mukul Tewari
  - et al.
- 2024
- AGU 2024
Advancing Applications of Remote Sensing for Detection of and Long-Term Monitoring of Harmful Algal Blooms (HABs)
- - Lloyd Treinish
  - Vincent Moriarty
- 2024
- AGU 2024
Modelling the Extreme July 2023 Hudson Valley Precipitation Event Using WRF
- - Anthony Praino
  - Lloyd Treinish
  - et al.
- 2024
- AGU 2024
Membership Inference Attacks Against Time-Series Models
- - Noam Koren
  - Abigail Goldsteen
  - et al.
- 2024
- ACML 2024
Value Alignment from Unstructured Text
- - Inkit Padhi
  - Karthikeyan Natesan Ramamurthy
  - et al.
- 2024
- EMNLP 2024
Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion
- - Kerem Zaman
  - Leshem Choshen
  - et al.
- 2024
- EMNLP 2024
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
- - Erik Miehling
  - Manish Nagireddy
  - et al.
- 2024
- EMNLP 2024
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
- - Samuel Ackerman
  - Ella Rabinovich
  - et al.
- 2024
- EMNLP 2024