Computer Vision
Modern computer vision systems have superhuman accuracy when it comes to image recognition and analysis, but they don’t really understand what they see. At IBM Research, we’re designing AI systems with the ability to see the world like we do.
Our work
IBM’s new benchmark changes monthly to avoid teaching to the test
ResearchKim MartineauEnvironmental analysis made easier with IBM’s Geospatial Studio
NewsKim MartineauWhat is prompt-tuning?
NewsKim MartineauIBM and NASA team up to spur new discoveries about our planet
NewsKim Martineau3 minute readDebugging foundation models for bias
ResearchKim MartineauNew smartphone app to navigate blind people to stand in lines with distances
ResearchHironobu Takagi, Chieko Asakawa, Masaki Kuribayashi, and Seita Kayukawa3 minute read- See more of our work on Computer Vision
Publications
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion
- 2025
- AAAI 2025
The Inherent Adversarial Robustness of Analog In-Memory Computing
- 2025
- Nature Communications
Enhancing Federated Averaging of Self-Supervised Monocular Depth Estimators for Autonomous Vehicles with Bayesian Optimization
- 2025
- Future Generation Computer Systems
ConvNLP: Image-based AI Text Detection
- Suriya Prakash Jambunathan
- Ashwath Shankarnarayan
- et al.
- 2024
- Big Data 2024
Kernel approximation using analogue in-memory computing
- Julian Büchel
- Giacomo Camposampiero
- et al.
- 2024
- Nature Machine Intelligence
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs
- Irene Huang
- Wei Lin
- et al.
- 2024
- NeurIPS 2024