Publications

104 results for Trustworthy Generation

GP-MOLFORMER-SIM: Test Time Molecular Optimization through Contextual Similarity Guidance
- - Jiri Navratil
  - Jerret Ross
  - et al.
- 2026
- AAAI 2026
Small Models Exhibit Limited Answer Consistency in Repetition Trials of the Multiple-Choice MMLU-Redux and MedQA Benchmarks
- - Claudio Santos Pinhanez
  - Paulo Rodrigo Cavalin
  - et al.
- 2026
- AAAI 2026
Black-Box Uncertainty Quantification for Large Language Models via Ensemble-of-Ensembles
- - Wang Ma
  - Debarun Bhattacharjya
  - et al.
- 2026
- AAAI 2026
SOFAI-LM: A Cognitive Architecture for Building Efficient and Reliable Reasoning Systems with LLMs
- - Vedant Khandelwal
  - Francesca Rossi
  - et al.
- 2026
- AAAI 2026
Efficient Decoding Methods for Language Models on Encrypted Data
- - Matan Avitan
  - Moran Baruch
  - et al.
- 2025
- IJCNLP-AACL 2025
Advances in Emulating Earth System Models
- - Björn Lütjens
  - Kalyn Dorheim
  - et al.
- 2025
- AGU 2025
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports
- - Razi Mahmood
  - Diego Machado Reyes
  - et al.
- 2025
- MICCAI 2025
3rd TrustAI Workshop: Building Public Awareness and Engagement
- - Miriam Rateike
  - Brian Mboya
  - et al.
- 2025
- DLI 2025
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models
- - George Kour
  - Itay Nakash
  - et al.
- 2025
- ACL 2025
Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs
- - Megh Thakkar
  - Quentin Fournier
  - et al.
- 2025
- ACL 2025