• Email Us: [email protected]
  • Contact Us: +1 718 874 1545
  • Skip to main content
  • Skip to primary sidebar

Medical Market Report

  • Home
  • All Reports
  • About Us
  • Contact Us

Meet DarkBERT, The Only AI Trained On The Dark Web

May 18, 2023 by Deborah Bloomfield

In case you were worried that the current iteration of generative AIs are too nice and empathetic, scientists have got you covered – a new language model has been trained on the worst part of the internet, the Dark Web. 

Given perhaps the funniest name yet, DarkBERT (yes, that’s actually its name) is a generative AI trained exclusively on the Dark Web in order to compare it to a vanilla counterpart. The team behind it – reporting their findings in a preprint paper that is yet to undergo peer-review – wanted to understand whether using the Dark Web as a dataset would give an AI better context on the language used there, making it more valuable to people wishing to trawl the Dark Web for research and for law enforcement fighting cyber crime. 

Advertisement

It also did an extensive trawl of a place that most humans don’t really want to go and indexed its various domains, so thanks for taking one for the team DarkBERT. 

The Dark Web is an area of the internet that Google and other search engines ignore, preventing the vast majority of people from going there. It is only accessible by using specialized software called Tor (or similar), and as such has gained quite the reputation for what goes on there. Urban legends have talked of torture rooms, contract killers, and all sorts of horrific crimes, but the truth is that most of it is just scams and other ways to steal your data without the safety of browser security, which we all take very much for granted. Still, the Dark Web is supposedly used by cyber crime networks to anonymously talk, making it an extremely important target for law enforcement. 

A team from South Korea hooked up a language model to trawl through the Dark Web using Tor and to return the raw data it found, creating a model that could make better sense of the language used there. Once done, they compared how it performed to existing models the researchers had created prior, including RoBERTa and BERT.  

The findings presented in the preprint showed that DarkBERT outperformed the others in all datasets, but it was close. As all the AIs were from a similar framework, it is expected that they would have similar performance, but DarkBERT excelled on the Dark Web specifically. 

Advertisement

So, what will DarkBERT be used for? Hopefully it won’t be given the nuclear launch codes, but the team expect it to be a powerful tool in scanning the Dark Web for cybersecurity threats, as well as keeping tabs on forums to identify illicit activity.  

Let’s just hope this doesn’t give OpenAI any ideas. 

The preprint, which is a preliminary version of a study that has not yet been peer-reviewed, can be found on the arXiv. 

Deborah Bloomfield
Deborah Bloomfield

Related posts:

  1. Italian film brings circus freaks to Venice festival
  2. Soccer – Too many meaningless matches not good for the international game, says FIFA president Infantino
  3. Former F1 driver Rosberg, Agnelli’s Exor invest in adopt-a-tree site Treedom
  4. Peru community says it won’t end Glencore mine blockade until demands met

Source Link: Meet DarkBERT, The Only AI Trained On The Dark Web

Filed Under: News

Primary Sidebar

  • Watch Out For Aurorae Tonight – The Strongest Solar Flare Of 2025 So Far Just Erupted From The Sun
  • First Radio Detection Received From Interstellar Object 3I/ATLAS. What Does That Mean?
  • “Drop Crocs”: Australia Once Had Ancient Crocs That Climbed Trees To Jump On Their Prey
  • How We Know Interstellar Object 3I/ATLAS Is Not An Alien Mothership
  • First-Of-Its-Kind Evidence Shows Bees Can Learn “Morse Code” – Well, Kinda
  • Humans Have A “Seventh Sense” That Lets You Touch Things From A Distance
  • The Longest Place Name Has 111 Letters – And It’s Visited By Millions Of People Each Year
  • We Now Know Why Neanderthal Faces Looked So Different To Our Own
  • Why Does Africa Have So Many Of The World’s Largest Land Animals?
  • This “Ant-Mimicking” Spider Produces Its Own Kind Of Milk And Nurses Its Babies
  • 1972 Was The Longest Year In Modern History – Here’s Why
  • Why Did “Magic Mushrooms” Evolve To Be Hallucinogenic – What’s In It For The Mushrooms?
  • Why Can’t You Domesticate All Wild Animals? The Process Relies On 6 Characteristics Few Mammals Possess
  • Meet Some Of Earth’s Mightiest Predators
  • Canada Officially Loses Its Measles Elimination Status After Nearly 30 Years. The US Is Not Far Behind
  • Two “Anomalies” Detected In Egypt’s Menkaure Pyramid Using Electrical Resistance Tomography
  • Invasive “Tree Of Heaven” Unleashes Hell As “Double Invasion” Sweeps Across Virginia
  • Hamman’s Crunch: A Man Covered His Nose And Mouth Whilst Sneezing And Ended Up In Hospital
  • “One Of The Most Beautiful Experiments In Evolutionary Biology”: What The Peppered Moth Taught Us About Evolution
  • Why Do Microwaved Eggs Explode When You Bite Into Them?
  • Business
  • Health
  • News
  • Science
  • Technology
  • +1 718 874 1545
  • +91 78878 22626
  • [email protected]
Office Address
Prudour Pvt. Ltd. 420 Lexington Avenue Suite 300 New York City, NY 10170.

Powered by Prudour Network

Copyrights © 2025 · Medical Market Report. All Rights Reserved.

Go to mobile version