QueryGym: Step-by-Step Interaction with Relational DatabasesHaritha AnanthakrishnanHarsha Kokelet al.2026AAAI 2026
AssetOpsBench-Live: Privacy-Aware Online Evaluation of Multi-Agent Performance in Industrial OperationsDhaval PatelNianjun Zhouet al.2026AAAI 2026
AutoTuneX: Interactive Automated Fine-Tuning for Large Language ModelsDaniel Karl I. WeidelePriyanshu Raiet al.2026AAAI 2026
DFAgent: From Natural Language Data Interactions to Reusable Agent-Ready ToolsNeelamadhav GantayatRenuka Sindhgattaet al.2026AAAI 2026
Auto-BenchmarkCard: Automated Synthesis of Benchmark DocumentationAris HofmannInge Vejsbjerget al.2026AAAI 2026
ToolSmith: A Multi-Agent Framework for Enterprise Tool CreationPurna Chandra Sekhar VakudavathuKushal Mukherjeeet al.2026AAAI 2026
Agentic Code Generation for Heuristic Rules in Equipment MonitoringFabio LorenziAbigail Langbridgeet al.2026AAAI 2026