Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss LandscapesXiaomeng XuPin-Yu Chenet al.2024NeurIPS 2024
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time SeriesVijay EArindam Jatiet al.2024NeurIPS 2024
Multi-Scale Representation Learning for Protein Fitness PredictionZuobai ZhangPascal Notinet al.2024NeurIPS 2024
Reducing Transformer Key-Value Cache Size with Cross-Layer AttentionWilliam BrandonMayank Mishraet al.2024NeurIPS 2024
Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMsMegh ThakkarYash Moreet al.2024NeurIPS 2024
Unified Lookup Tables: Privacy-Preserving Foundation ModelsNikita JanakarajanIrina Espejo Moraleset al.2024NeurIPS 2024
Multi-View Mixture-of-Experts for Predicting Molecular Properties Using SMILES, SELFIES, and Graph-Based RepresentationsEduardo Almeida SoaresIndra Priyadarsini Set al.2024NeurIPS 2024
Thought of Search: Planning with Language Models Through The Lens of EfficiencyMichael KatzHarsha Kokelet al.2024NeurIPS 2024
Fine-Tuned MLP-Mixers as data-driven Numerical Surrogates?Imran NasimJoao Lucas de Sousa Almeida2024NeurIPS 2024
Automating Thought of Search: A Journey Towards Soundness and CompletenessDaniel CaoMichael Katzet al.2024NeurIPS 2024