Few shot chain-of-thought driven reasoning to prompt LLMs for open-ended medical question answering. Saeel Sandeep Nachane, Ojas Gramopadhye, et al. 2024. EMNLP 2024.
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios. Samuel Ackerman, Ella Rabinovich, et al. 2024. EMNLP 2024.