
Medical Market Report


AI Struggles With A Task So Basic Most 8-Year-Old Humans Can Do It

March 18, 2025 by Deborah Bloomfield

Artificial intelligence (AI) has come a long way over the last decade, moving from early horror-show outputs to pretty impressive image generation, and to text generation that gets its facts right a lot of the time and confidently tells you the wrong answer when it can’t.


But there are quite a few tasks where humans cannot be beaten. For instance, image generators struggle with hands, teeth, or a glass of wine that is full to the brim.



One task where AI fails to beat young children is reading the time.

“The ability to interpret and reason about time from visual inputs is critical for many real-world applications—ranging from event scheduling to autonomous systems,” the authors of a new study write, adding that despite this, AI research has focused on object detection, image captioning, and scene understanding.

While researchers attempt to make AI that can understand complex geometry and math, models struggle with the basics of understanding clocks and calendars. It may seem simple for humans, but not for machines.

“In particular, analogue clock reading and calendar comprehension involve intricate cognitive steps: they demand fine-grained visual recognition (e.g., clock-hand position, day-cell layout) and non-trivial numerical reasoning (e.g., calculating day offsets),” the study authors explain.


In the new paper, which has not yet been peer-reviewed, researchers from the University of Edinburgh in the UK tested seven AI models with some simple questions related to time. These included identifying the time from images of analog clocks, including clocks with different hands and numerals, as well as a number of reasoning tasks involving calendars.

The AIs did not perform well on the most basic of tasks – reading the time – getting the correct answer less than a quarter of the time, and struggling especially with clocks with Roman numerals or stylized hands. For instance, shown a clock reading 4:00, OpenAI’s GPT-o1 guessed “12:15”, while Claude-3.5-S took a punt with “11:35”.
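For context, once the hands have been located, turning their positions into a time is a small calculation. The following is a minimal sketch in Python, not the study's method. It assumes both angles are measured in degrees clockwise from the 12 o'clock position, and the function name `time_from_hand_angles` is hypothetical.

```python
def time_from_hand_angles(hour_angle: float, minute_angle: float) -> str:
    """Convert clock-hand angles to a time string.

    Assumption: angles are in degrees, measured clockwise from 12 o'clock.
    """
    # The minute hand moves 6 degrees per minute (360 / 60).
    minute = round(minute_angle / 6) % 60
    # The hour hand moves 30 degrees per hour plus 0.5 degrees per minute,
    # so subtract the minute contribution before dividing by 30.
    hour = round((hour_angle - 0.5 * minute) / 30) % 12
    if hour == 0:
        hour = 12  # a 12-hour dial shows 12, not 0
    return f"{hour}:{minute:02d}"


# The clock from the example above: hour hand on the 4, minute hand on the 12.
print(time_from_hand_angles(120, 0))
```

The hard part for the models is the perception step that produces those angles, especially with stylized hands or Roman numerals, which is the step this sketch takes as given.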

On calendar-based tasks, the models did perform a little better, getting answers wrong around 20 percent of the time. Here they were asked questions like “Which day of the week is Christmas?” and “Which weekday is the 100th day of the year?”

“Closed-source models like GPT-o1 and Claude-3.5 outshine open-source ones on popular holidays, potentially reflecting memorized patterns in the training data,” the team explains.


“However, accuracy diminishes substantially for lesser-known or arithmetically demanding queries (e.g., 153rd day), indicating that performance does not transfer well to offset-based reasoning. The drop is especially evident among smaller or open-source models (MiniCPM, Qwen2-VL-7B, and Llama3.2-Vision), which exhibit near-random performance on less popular or offset-based queries.”
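The offset-based questions described here reduce to simple date arithmetic. As a minimal sketch in Python: the year 2025 is an assumption chosen for illustration, and the helper name `weekday_of_nth_day` is hypothetical. Neither comes from the study.

```python
from datetime import date, timedelta

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def weekday_of_nth_day(year: int, n: int) -> str:
    """Return the weekday name of the n-th day of `year` (1 = January 1)."""
    days_in_year = (date(year + 1, 1, 1) - date(year, 1, 1)).days
    if not 1 <= n <= days_in_year:
        raise ValueError(f"{year} has no day number {n}")
    # Day n is an offset of n - 1 days from January 1.
    target = date(year, 1, 1) + timedelta(days=n - 1)
    return WEEKDAYS[target.weekday()]  # weekday(): Monday is 0


# Assumed year: 2025.
print(weekday_of_nth_day(2025, 100))          # Thursday (April 10)
print(weekday_of_nth_day(2025, 153))          # Monday (June 2)
print(WEEKDAYS[date(2025, 12, 25).weekday()]) # Thursday (Christmas)
```

A popular holiday can be answered from memory, whereas the 100th or 153rd day requires actually carrying out the offset, which is where the study reports accuracy falling away.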

According to the team, the results indicate that these models are still struggling with understanding and reasoning around time, which needs a combination of visual perception, numerical computation, and structured logical inference. Without improvements in these areas, real-world applications such as scheduling may be error-prone.

“AI research today often emphasises complex reasoning tasks, but ironically, many systems still struggle when it comes to simpler, everyday tasks,” Aryo Gema from Edinburgh’s School of Informatics, and co-author on the paper, said in a statement. “Our findings suggest it’s high time we addressed these fundamental gaps. Otherwise, integrating AI into real-world, time-sensitive applications might remain stuck at the eleventh hour.”

The study is available on the pre-print server arXiv.


