A 360 review of AI agent benchmarksResearchKim Martineau04 Jun 2025AIGenerative AINatural Language ProcessingTrustworthy Generation
A 360 review of AI agent benchmarksResearchKim Martineau04 Jun 2025AIGenerative AINatural Language ProcessingTrustworthy Generation
An invisible watermark to keep tabs on tabular dataResearchKim Martineau19 May 2025Adversarial Robustness and PrivacyAIGenerative AITrustworthy Generation
AI is changing how we work — is it time to change how we credit AI’s involvement?ResearchKim Martineau13 May 2025AIAI TransparencyGenerative AINatural Language Processing
ASTER: Natural and multi-language unit test generation with LLMsTechnical noteRangeet Pan, Rahul Krishna, Raju Pavuluri, and Saurabh Sinha30 Apr 2025AIAI Testing
IBM’s safety checkers top a new AI benchmarkNewsKim Martineau09 Apr 2025AIAI TransparencyFairness, Accountability, TransparencyGenerative AINatural Language Processing
IBM’s Mikhail Yurochkin wants to make AI’s “cool” factor tangibleResearchKim Martineau05 Mar 2025AIFairness, Accountability, TransparencyGenerative AINatural Language ProcessingTrustworthy AI
Why we’re teaching LLMs to forget things ExplainerKim Martineau07 Oct 2024AIGenerative AINatural Language ProcessingTrustworthy AI
A toxic language filter built for speedNewsKim Martineau09 Sep 2024AIFoundation ModelsOpen SourceTrustworthy AI
Teaching AI models to improve themselves ResearchPeter Hess14 Aug 2024AIComputer ScienceExplainable AIGenerative AINatural Language ProcessingTrustworthy AITrustworthy Generation
IBM and RPI researchers demystify in-context learning in large language modelsNewsPeter Hess25 Jul 2024AIAI TransparencyExplainable AITrustworthy AI
IBM reaffirms its commitment to the Rome Call for AI ethicsNewsMike Murphy15 Jul 2024AIFairness, Accountability, Transparency
Tiny benchmarks for large language modelsNewsKim Martineau03 Jun 2024AIAI TestingFoundation ModelsGranite
IBM’s Granite model is one of the most transparent LLMs in the worldNewsMike Murphy22 May 2024AIAI TransparencyGraniteOpen Source
What is red teaming for generative AI?ExplainerKim Martineau11 Apr 2024Adversarial Robustness and PrivacyAIAI TestingFairness, Accountability, TransparencyFoundation ModelsNatural Language ProcessingSecurityTrustworthy AI