Even the best AI models fail at visual tasks toddlers handle easily

AI for Software Engineering (Copilots, SDLC, Testing) EN-US 18.01.2026

1 min read AI for Software Engineering (Copilots, SDLC, Testing) -/5

In short

A new study exposes a fundamental weakness in today's AI systems: even the most capable multimodal language models struggle with basic visual tasks that toddlers master before they learn to
This finding raises questions about the capabilities and limitations of current AI technologies.
In this context, it is important to note that the ability to perceive and interpret visual information develops in humans during early childhood.

Read previous title Read next article in this category

Previous: Sequoia's Bold Move: Investing in Anthropic Amidst a $25 Billion Raise · Next: GPT-5.2 Pro Solves Another Erdős Problem While New Database Reveals Most Attempts Still Fail

Editor: Martin Haak

A new study exposes a fundamental weakness in today's AI systems: even the most capable multimodal language models struggle with basic visual tasks that toddlers master before they learn to speak. This finding raises questions about the capabilities and limitations of current AI technologies. In this context, it is important to note that the ability to perceive and interpret visual information develops in humans during early childhood. The study's results could have far-reaching implications for the development of future AI systems, particularly in areas where visual skills are critical. A final assessment of these technologies would be premature at this point, as further research is needed to understand the underlying causes of these weaknesses.

Source:

Even the best AI models fail at visual tasks toddlers handle easily — The Decoder (EN-US)

HAI

In short

More in this category