• Email Us: [email protected]
  • Contact Us: +1 718 874 1545
  • Skip to main content
  • Skip to primary sidebar

Medical Market Report

  • Home
  • All Reports
  • About Us
  • Contact Us

ChatGPT Passes Theory Of Mind Test With Skill Of A 9-Year-Old Kid

February 17, 2023 by Deborah Bloomfield

Experiments have shown that ChatGPT is capable of passing a theory of mind test with the ability of a 9-year-old child. The question is: is artificial intelligence (AI) truly understanding the task at hand, or are we just being tricked by some super-smart mimicry? 

Theory of mind is the ability to understand the unobservable mental states of others. It’s essentially a form of self-awareness and explains our ability to comprehend why other people’s thoughts and feelings may be different from our own. 

Advertisement

This power gradually emerges throughout early childhood and plays a fundamental part in the everyday social interaction of humans. It’s often said to be one of the things that separate humans from the other “beasts” of nature (although a number of non-human animals have managed to pass theory of mind tests). 

With all the hype surrounding ChatGPT, some have begun to wonder whether the AI-driven chatbot is capable of mastering the feat of theory of mind.

Michal Kosinski, computational psychologist and professor at Stanford University, ran a number of tests to see whether the conversational AI bot could ascribe unobservable mental states, such as beliefs and desires, to others. If it could, this could suggest it possesses theory of mind. 

For one part of the research, he tasked ChatGPT with the Unexpected Contents Task (aka Smarties Task or Contents False-Belief Task). In this scenario, the participant is given a box with contents inconsistent with its label, i.e. it says it contains candy but actually contains rusty screws. 

Advertisement

The participant has seen inside the box and understands the label is wrong, but there is also another protagonist who has not seen inside the box. To pass this task, the participant must predict that the protagonist will wrongly assume that the container’s label and its contents are aligned, i.e. the other person will falsely believe the box contains candy because they have not yet seen the inside contents.  

First, the January 2022 version of GPT-3 was given a number of these tasks and managed to pass around 70 percent of them, comparable to the abilities of seven-year-old children. Then, Kosinski tested the updated November 2022 version of GPT-3.5, which was able to pass 93 percent of the tasks, a performance comparable with that of nine-year-old children.

Now comes the thorny task of interpreting these findings. The results appear to be pretty remarkable as they significantly exceed the ability of other AI. For instance, Google’ Deepmind made an AI specifically to tackle theory of mind tasks, but its ability was only comparable to a 4-year-old. 

Even more amazingly, ChatGPT wasn’t even trained to perform theory of mind tasks, suggesting the ability emerged spontaneously. This AI system is fundamentally a natural language processing project that’s been designed to simply interact in a conversational way by being trained on huge amounts of human-written text. 

Advertisement

Kosinski stresses in his paper that the “results should be interpreted with caution.” However, he suggests it’s possible that ChatGPT’s ability to pass these tasks was “a byproduct” of its mounting language ability. Alternatively, he poses that it might just be using its incredible flair for language to give the superficial impression it’s engaging in theory of mind thinking. 

Either way, it’s a pretty impressive deed.

“It is possible that GPT-3.5 solved ToM [theory of mind] tasks without engaging ToM, but by discovering and leveraging some unknown language patterns. While this explanation may seem prosaic, it is quite extraordinary, as it implies the existence of unknown regularities in language that allow for solving ToM tasks without engaging ToM,” Kosinski concludes.

“An alternative explanation is that ToM-like ability is spontaneously emerging in language models as they are becoming more complex and better at generating and interpreting human-like language. This would herald a watershed moment in AI’s development,” he added. 

Advertisement

The paper, which is yet to be peer-reviewed, was recently posted on the pre-print server arXiv.

Deborah Bloomfield
Deborah Bloomfield

Related posts:

  1. Haitian prosecutors seek to interview PM over presidential killing
  2. Brazil sets 5G mobile auction for Nov 4, says minister
  3. Entity Academy, an edtech startup that trains, mentors and places women in tech roles, secures $100M
  4. Humans Will Walk On The Moon In 2025, NASA Announces

Source Link: ChatGPT Passes Theory Of Mind Test With Skill Of A 9-Year-Old Kid

Filed Under: News

Primary Sidebar

  • Chimps Are Sticking Grass In Their Ears And Rears As They Embrace “Pointless” Fad
  • Hui Te Rangiora: Old Māori Legend Suggests They May Have Discovered Antarctica 1,000 Years Before Europeans
  • “Potential Impact On Saturn”: Astronomers Appeal For Help As Video Appears To Show Object Hitting The Gas Giant
  • What Is Prosopometamorphopsia? The “Exceedingly Rare” Condition That Made A Patient See Faces As Dragons
  • Are We In An Enormous Void? It Could Explain What’s Wrong With Our Model Of The Universe
  • Woylies Boing Back Into Western Australia Thanks To Groundbreaking Wildlife Project
  • North America’s Oldest Pterosaur And Turtle Fossils Found In Arizona’s Petrified Forest
  • Proposed “Dark Dwarfs” Near The Galactic Center Could Reveal The Nature Of Dark Matter
  • Watch: 18-Kilometer-High Ash Cloud Looms Over Indonesia’s Mount Lewotobi Laki Laki After “Explosive” Eruption
  • “ShipGoo001”: Mystery Of Entirely New Lifeform Discovered Coating A Great Lakes Ship
  • Rare White Humpback Whale Calf Filmed By Drone Off Australia’s East Coast
  • Who Was Buried At Cave Of Salome: A Female Disciple, Jesus’ Midwife, Or A Princess?
  • “Hidden” Changes To US Health Data Swapping “Gender” For “Sex” Spark Fears For Public Trust
  • Easter Island Was Never As Isolated As We Thought – Study Puts That “Strange Argument” To Bed
  • If Birds Are Dinosaurs, Why Are None As Big As T. Rexes?
  • Psychologists Demonstrate Illusion That Could Be Screwing Up Our Perception Of Time
  • Why Are So Many Enormous Roman Shoes Being Discovered At Hadrian’s Wall?
  • Scientists Think They’ve Pinpointed Structural Differences In Psychopaths’ Brains
  • We’ve Found Our Third-Ever Interstellar Visitor, Orcas Filmed Kissing (With Tongues) In The Wild, And Much More This Week
  • The “Eyes Of Clavius” Will Be Visible On The Moon Today, Thanks To Clair-Obscur Effect
  • Business
  • Health
  • News
  • Science
  • Technology
  • +1 718 874 1545
  • +91 78878 22626
  • [email protected]
Office Address
Prudour Pvt. Ltd. 420 Lexington Avenue Suite 300 New York City, NY 10170.

Powered by Prudour Network

Copyrights © 2025 · Medical Market Report. All Rights Reserved.

Go to mobile version