Efficient Scaling of Large Language Models with Mixture of Experts and 3D Analog In-Memory Computing. Julian Büchel, A. Vasilopoulos, et al. 2025. Nat. Comput. Sci.
Beyond neuropsychological tests: AI speech analysis in PKU. Susan Waisbren, Kely Norel, et al. 2024. J. Inherit. Metab. Dis.
UNIFIEDGT: Towards a Universal Framework of Transformers in Large-Scale Graph Learning. Lin Junhong, Xiaojie Guo, et al. 2024. Big Data 2024.
Privacy without Noisy Gradients: Slicing Mechanism for Generative Model Training. Kristjan Greenewald, Yuancheng Yu, et al. 2024. NeurIPS 2024.
Distributional Preference Alignment of LLMs via Optimal Transport. Igor Melnyk, Youssef Mroueh, et al. 2024. NeurIPS 2024.
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models. Yuchen Hu, Chen Chen, et al. 2024. NeurIPS 2024.
GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models. Zhaitang Li, Pin-Yu Chen, et al. 2024. NeurIPS 2024.
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes. Xiaomeng Xu, Pin-Yu Chen, et al. 2024. NeurIPS 2024.