SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language ModelsDebarun BhattacharjyaBalaji Ganesanet al.2025EMNLP 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssistElizabeth DalyErik Miehlinget al.2025EMNLP 2025
Optimistic Exploration for Risk-Averse Constrained Reinforcement LearningRadu MarinescuElizabeth Dalyet al.2025ECAI 2025
XABPs: Towards eXplainable Autonomous Business ProcessesPeter FettkeFabiana Fournieret al.2025ECAI 2025
Agentic Process Observability: Discovering Behavioral VariabilityFabiana FournierLior Limonadet al.2025ECAI 2025
Exposing AI Bias by Crowdsourcing: Democratizing Critique of Large Language ModelsHangzhi GuoPranav Venkitet al.2025AIES 2025
Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality IndicatorsHyo Jin DoRachel Ostrandet al.2025AIES 2025