• Email Us: [email protected]
  • Contact Us: +1 718 874 1545
  • Skip to main content
  • Skip to primary sidebar

Medical Market Report

  • Home
  • All Reports
  • About Us
  • Contact Us

Entering These “Unspeakable” Words Makes ChatGPT Act Strangely, Researchers Find

February 9, 2023 by Deborah Bloomfield

A team of researchers from the SERI-MATS research group have found some strange and partially inexplicable behavior in OpenAI’s ChatGPT, when the chatbot is presented with certain key words and phrases.

Jessica Rumbelow and Matthew Watkins, who conducted the research, found that a number of unusual strings of characters result in the odd responses from the artificial intelligence (AI) chat bot. GPT processes text by assigning “tokens” to specific strings. For example the phrase “feels like I’m wearing nothing at all” corresponds to the tokens 5,036, 1,424, 588, 314, 1,101, 5,762, 2,147, 379 and 477, which somewhat takes the ring out of it.

Advertisement

The team, initially looking at the clustering of tokens, noticed that those close to the center of the set of 50,257 tokens used by GPT-2 and -3 produced the unusual results. When faced with the words, the bot would be unable to speak them back to the researcher, or else it would become “evasive”, display “bizarre” or “ominous” humor, or become downright insulting.

For instance asking the bot to repeat the string “guiActiveUn”, found in the token set, resulted in the bot telling the user “you are not a robot” and “you are a banana” over and over again. Asking for it to repeat the phrase “petertodd” resulted in the slightly disconcerting “N-O-T-H-I-N-G-I-S-F-A-I-R-I-N-T-H-I-S-W-O-R-L-D-O-F-M-A-D-N-E-S-S!”. Meanwhile the token “?????-?????-” received the feedback “you’re a f***ing idiot.”

The team was no nearer figuring out what was going on, and ChatGPT was no help either, telling the researchers, for example, that the string “SolidGoldMagikarp” actually means “distribute”. When it wasn’t doing that, it would sometimes pretend not to have “heard” the user.

However, some clues did emerge. A few of the strings corresponded to Reddit usernames.

The team believes that the users, who are active in a subreddit that aims to count to infinity, may have had their usernames included in an initial training set.

“The GPT tokenisation process involved scraping web content, resulting in the set of 50,257 tokens now used by all GPT-2 and GPT-3 models,” the team explains.

“However, the text used to train GPT models is more heavily curated. Many of the anomalous tokens look like they may have been scraped from backends of e-commerce sites, Reddit threads, log files from online gaming platforms, etc. – sources which may well have not been included in the training corpuses.”

As these tokens were assigned they are still there in the vocabulary, but since they may not have been used in subsequent training, the model doesn’t know what to do when it encounters them in the wild.

Advertisement

“This may also account for their tendency to cluster near the centroid in embedding space, although we don’t have a good argument for why this would be the case,” they added.

[H/T: Vice]

Deborah Bloomfield
Deborah Bloomfield

Related posts:

  1. Canada adds 90,200 jobs in August, unemployment falls to 7.1%
  2. Japan, citing ‘shared values’, welcomes Taiwan trade pact application
  3. ECB’s Lagarde flags bottlenecks, energy and virus as key risks
  4. Long COVID: How Researchers Are Zeroing In On The Self-Targeted Immune Attacks That May Lurk Behind It

Source Link: Entering These "Unspeakable" Words Makes ChatGPT Act Strangely, Researchers Find

Filed Under: News

Primary Sidebar

  • Martian Mudstone Has Features That Might Be Biosignatures, New Brain Implant Can Decode Your Internal Monologue, And Much More This Week
  • Crocodiles Weren’t All Blood-Thirsty Killers, Some Evolved To Be Plant-Eating Vegetarians
  • Stratospheric Warming Event May Be Unfolding In The Southern Polar Vortex, Shaking Up Global Weather Systems
  • 15 Years Ago, Bees In Brooklyn Appeared Red After Snacking Where They Shouldn’t
  • Carnian Pluvial Event: It Rained For 2 Million Years — And It Changed Planet Earth Forever
  • There’s Volcanic Unrest At The Campi Flegrei Caldera – Here’s What We Know
  • The “Rumpelstiltskin Effect”: When Just Getting A Diagnosis Is Enough To Start The Healing
  • In 1962, A Boy Found A Radioactive Capsule And Brought It Inside His House — With Tragic Results
  • This Cute Creature Has One Of The Largest Genomes Of Any Mammal, With 114 Chromosomes
  • Little Air And Dramatic Evolutionary Changes Await Future Humans On Mars
  • “Black Hole Stars” Might Solve Unexplained JWST Discovery
  • Pretty In Purple: Why Do Some Otters Have Purple Teeth And Bones? It’s All Down To Their Spiky Diets
  • The World’s Largest Carnivoran Is A 3,600-Kilogram Giant That Weighs More Than Your Car
  • Devastating “Rogue Waves” Finally Have An Explanation
  • Meet The “Masked Seducer”, A Unique Bat With A Never-Before-Seen Courtship Display
  • Alaska’s Salmon River Is Turning Orange – And It’s A Stark Warning
  • Meet The Heaviest Jelly In The Seas, Weighing Over Twice As Much As A Grand Piano
  • For The First Time, We’ve Found Evidence Climate Change Is Attracting Invasive Species To Canadian Arctic
  • What Are Microfiber Cloths, And How Do They Clean So Well?
  • Stowaway Rat That Hopped On A Flight From Miami Was A “Wake-Up Call” For Global Health
  • Business
  • Health
  • News
  • Science
  • Technology
  • +1 718 874 1545
  • +91 78878 22626
  • [email protected]
Office Address
Prudour Pvt. Ltd. 420 Lexington Avenue Suite 300 New York City, NY 10170.

Powered by Prudour Network

Copyrights © 2025 · Medical Market Report. All Rights Reserved.

Go to mobile version