A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts. Mohammed Nowaz Rabbani Chowdhury, Meng Wang, et al. ICML 2024.
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding. Hongkang Li, Meng Wang, et al. ICML 2024.
Asymmetry in Low-Rank Adapters of Foundation Models. Jiacheng Zhu, Kristjan Greenewald, et al. ICML 2024.
Humans Linguistically Align to their Conversational Partners, and Language Models Should Too. Rachel Ostrand, Sara Berger. ICML 2024.
Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs. Swanand Ravindra Kadhe, Farhan Ahmed, et al. ICML 2024.
A Multi-View Mixture-of-Experts based on Language and Graphs for Molecular Properties Prediction. Victor Shirasuna, Eduardo Almeida Soares, et al. ICML 2024.