From Multiple-Choice to Extractive QA: A Case Study for English and ArabicTeresa LynnMalik Altakroriet al.2025COLING 2025
Graph-based Uncertainty Metrics for Long-form Language Model GenerationsMingjian JiangYangjun Yangjunet al.2024NeurIPS 2024
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular TasksIbrahim AbdelazizKinjal Basuet al.2024EMNLP 2024
CHRONOS: A Schema-Based Event Understanding and Prediction SystemMaria ChangAchille Fokoueet al.2024IAAI 2024
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMsYoung-Suk LeeArafat Sultanet al.2023EMNLP 2023
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of RerankersJon Saad-FalconOmar Khattabet al.2023EMNLP 2023
MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error TypesKeerthiram MurugesanSarathkrishna Swaminathanet al.2023ACL 2023
Moving Beyond Downstream Task Accuracy for Information Retrieval BenchmarkingKeshav SanthanamJon Saad-Falconet al.2023ACL 2023
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and DevelopmentAvi SilJaydeep Senet al.2023ACL 2023
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and DevelopmentAvi SilJaydeep Senet al.2023arXiv