• Email Us: [email protected]
  • Contact Us: +1 718 874 1545
  • Skip to main content
  • Skip to primary sidebar

Medical Market Report

  • Home
  • All Reports
  • About Us
  • Contact Us

Meet DarkBERT, The Only AI Trained On The Dark Web

May 18, 2023 by Deborah Bloomfield

In case you were worried that the current iteration of generative AIs are too nice and empathetic, scientists have got you covered – a new language model has been trained on the worst part of the internet, the Dark Web. 

Given perhaps the funniest name yet, DarkBERT (yes, that’s actually its name) is a generative AI trained exclusively on the Dark Web in order to compare it to a vanilla counterpart. The team behind it – reporting their findings in a preprint paper that is yet to undergo peer-review – wanted to understand whether using the Dark Web as a dataset would give an AI better context on the language used there, making it more valuable to people wishing to trawl the Dark Web for research and for law enforcement fighting cyber crime. 

Advertisement

It also did an extensive trawl of a place that most humans don’t really want to go and indexed its various domains, so thanks for taking one for the team DarkBERT. 

The Dark Web is an area of the internet that Google and other search engines ignore, preventing the vast majority of people from going there. It is only accessible by using specialized software called Tor (or similar), and as such has gained quite the reputation for what goes on there. Urban legends have talked of torture rooms, contract killers, and all sorts of horrific crimes, but the truth is that most of it is just scams and other ways to steal your data without the safety of browser security, which we all take very much for granted. Still, the Dark Web is supposedly used by cyber crime networks to anonymously talk, making it an extremely important target for law enforcement. 

A team from South Korea hooked up a language model to trawl through the Dark Web using Tor and to return the raw data it found, creating a model that could make better sense of the language used there. Once done, they compared how it performed to existing models the researchers had created prior, including RoBERTa and BERT.  

The findings presented in the preprint showed that DarkBERT outperformed the others in all datasets, but it was close. As all the AIs were from a similar framework, it is expected that they would have similar performance, but DarkBERT excelled on the Dark Web specifically. 

Advertisement

So, what will DarkBERT be used for? Hopefully it won’t be given the nuclear launch codes, but the team expect it to be a powerful tool in scanning the Dark Web for cybersecurity threats, as well as keeping tabs on forums to identify illicit activity.  

Let’s just hope this doesn’t give OpenAI any ideas. 

The preprint, which is a preliminary version of a study that has not yet been peer-reviewed, can be found on the arXiv. 

Deborah Bloomfield
Deborah Bloomfield

Related posts:

  1. Italian film brings circus freaks to Venice festival
  2. Soccer – Too many meaningless matches not good for the international game, says FIFA president Infantino
  3. Former F1 driver Rosberg, Agnelli’s Exor invest in adopt-a-tree site Treedom
  4. Peru community says it won’t end Glencore mine blockade until demands met

Source Link: Meet DarkBERT, The Only AI Trained On The Dark Web

Filed Under: News

Primary Sidebar

  • If Birds Are Dinosaurs, Why Are None As Big As T. Rexes?
  • Psychologists Demonstrate Illusion That Could Be Screwing Up Our Perception Of Time
  • Why Are So Many Enormous Roman Shoes Being Discovered At Hadrian’s Wall?
  • Scientists Think They’ve Pinpointed Structural Differences In Psychopaths’ Brains
  • We’ve Found Our Third-Ever Interstellar Visitor, Orcas Filmed Kissing (With Tongues) In The Wild, And Much More This Week
  • The “Eyes Of Clavius” Will Be Visible On The Moon Today, Thanks To Clair-Obscur Effect
  • Shockingly High Microplastic Levels Found On Remote Mediterranean Coral Reef Island
  • Interstellar Object, Cheesy Nightmares, And Smooching Orcas
  • World’s Largest Martian Meteorite Up For Auction Could Reach Whopping $2-4 Million
  • Kimalu The Beluga Whale Undergoes Pioneering Surgery And Becomes First Beluga To Survive General Aesthetic
  • The 1986 Soviet Space Mission That’s Never Been Repeated: Mir To Salyut And Back Again
  • Grisly Incident In Yellowstone National Park Shows Just How Dangerous This Vibrant Wilderness Can Be
  • Out Of All Greenhouse Gas Emitters On Earth, One US Organization Takes The Biscuit
  • Overly Ambitious Adder Attempts To Eat Hare 10 Times Its Mass In Gnarly Video
  • How Fast Does A Spacecraft Need To Go To Escape The Solar System?
  • President Trump’s Cuts To USAID Could Result In A “Staggering” 14 Million Avoidable Deaths By 2030
  • Dzo: Hybrids Beasts That Are Perfectly Crafted For Life On Earth’s Highest Mountains
  • “Rarest Event Ever” Had A Half-Life 1 Trillion Times Longer Than The Age Of The Universe – How Did We See It?
  • Meet The Bille, A Self-Righting Tetrahedron That Nobody Was Sure Could Exist
  • Neurogenesis Confirmed: Adult Brains Really Do Make New Hippocampal Neurons
  • Business
  • Health
  • News
  • Science
  • Technology
  • +1 718 874 1545
  • +91 78878 22626
  • [email protected]
Office Address
Prudour Pvt. Ltd. 420 Lexington Avenue Suite 300 New York City, NY 10170.

Powered by Prudour Network

Copyrights © 2025 · Medical Market Report. All Rights Reserved.

Go to mobile version