Computer Vision
Modern computer vision systems have superhuman accuracy when it comes to image recognition and analysis, but they don’t really understand what they see. At IBM Research, we’re designing AI systems with the ability to see the world like we do.
Our work
IBM Granite now has eyes
ResearchKim MartineauIBM’s new benchmark changes monthly to avoid teaching to the test
ResearchKim MartineauEnvironmental analysis made easier with IBM’s Geospatial Studio
NewsKim MartineauWhat is prompt-tuning?
NewsKim MartineauIBM and NASA team up to spur new discoveries about our planet
NewsKim Martineau3 minute readDebugging foundation models for bias
ResearchKim Martineau- See more of our work on Computer Vision
Publications
Explain Yourself, Briefly! Self-Explaining Neural Networks with Concise Sufficient Reasons
- Shahaf Bassan
- Ron Eliav
- et al.
- 2025
- ICLR 2025
Text-Guided Few-Shot Semantic Segmentation with Training-Free Multimodal Feature Matching
- Guillaume Buthmann
- Tomoya Sakai
- et al.
- 2025
- ICASSP 2025
Self-supervised learning of monocular depth estimators in autonomous vehicles with federated learning
- Elton Figueiredo de Souza Soares
- Carlos Alberto Viera Campos
- 2025
- Eng Appl Artif Intell
Foundation models for materials discovery – current state and future directions
- Edward Pyzer-knapp
- Matteo Manica
- et al.
- 2025
- npj Computational Materials
LionHeart: A Layer-based Mapping Framework for Heterogeneous Systems with Analog In-Memory Computing Tiles
- Corey Liam Lammie
- Yuxuan Wang
- et al.
- 2025
- IEEE TETC
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion
- 2025
- AAAI 2025