
Medical Market Report

Meet DarkBERT, The Only AI Trained On The Dark Web

May 18, 2023 by Deborah Bloomfield

In case you were worried that the current iteration of generative AIs is too nice and empathetic, scientists have got you covered – a new language model has been trained on the worst part of the internet, the Dark Web. 

Given perhaps the funniest name yet, DarkBERT (yes, that’s actually its name) is a language model trained exclusively on the Dark Web in order to compare it to a vanilla counterpart. The team behind it – reporting their findings in a preprint paper that is yet to undergo peer review – wanted to understand whether using the Dark Web as a dataset would give an AI better context on the language used there, making it more valuable to researchers trawling the Dark Web and to law enforcement fighting cyber crime. 

It also did an extensive trawl of a place that most humans don’t really want to go and indexed its various domains, so thanks for taking one for the team, DarkBERT. 

The Dark Web is an area of the internet that Google and other search engines ignore, so the vast majority of people never end up there. It is accessible only through specialized software such as Tor, and as such has gained quite the reputation for what goes on there. Urban legends have talked of torture rooms, contract killers, and all sorts of horrific crimes, but the truth is that most of it is just scams and other ways to steal your data without the safety of browser security, which we all take very much for granted. Still, the Dark Web is supposedly used by cyber crime networks to communicate anonymously, making it an extremely important target for law enforcement. 

A team from South Korea crawled the Dark Web through Tor and used the raw text they collected to train a language model that could make better sense of the language used there. Once done, they compared its performance with that of existing models, including RoBERTa and BERT.  

The findings presented in the preprint showed that DarkBERT outperformed the others on all datasets, but it was close. Because all the models are built on a similar framework, similar performance is to be expected, but DarkBERT excelled on Dark Web text specifically. 

So, what will DarkBERT be used for? Hopefully it won’t be given the nuclear launch codes, but the team expect it to be a powerful tool in scanning the Dark Web for cybersecurity threats, as well as keeping tabs on forums to identify illicit activity.  

Let’s just hope this doesn’t give OpenAI any ideas. 

The preprint, which is a preliminary version of a study that has not yet been peer-reviewed, can be found on the arXiv. 

