Medical Market Report

Meet DarkBERT, The Only AI Trained On The Dark Web

May 18, 2023 by Deborah Bloomfield

In case you were worried that the current iteration of generative AIs is too nice and empathetic, scientists have got you covered: a new language model has been trained on the worst part of the internet, the Dark Web.

Given perhaps the funniest name yet, DarkBERT (yes, that’s actually its name) is a language model trained on Dark Web text so that it could be compared with a vanilla counterpart. The team behind it, reporting their findings in a preprint paper that has yet to undergo peer review, wanted to understand whether using the Dark Web as a dataset would give an AI better context on the language used there, making it more valuable to researchers trawling the Dark Web and to law enforcement fighting cyber crime.

The project also involved an extensive trawl of a place that most humans don’t really want to go, indexing its various domains along the way, so thanks for taking one for the team, DarkBERT.

The Dark Web is an area of the internet that Google and other search engines do not index, which keeps the vast majority of people away. It is only accessible using specialized software such as Tor, and as a result has gained quite the reputation for what goes on there. Urban legends have talked of torture rooms, contract killers, and all sorts of horrific crimes, but the truth is that most of it is just scams and other ways to steal your data, without the protection of the browser security we all take very much for granted. Still, the Dark Web is reportedly used by cyber crime networks to talk anonymously, making it an extremely important target for law enforcement.

A team from South Korea crawled the Dark Web using Tor and used the raw text collected to train a language model that could make better sense of the language used there. Once done, they compared its performance with that of existing models, including RoBERTa and BERT.

The findings presented in the preprint showed that DarkBERT outperformed the others on all datasets, but it was close. As all the models are built on a similar framework, they are expected to perform similarly, but DarkBERT excelled on the Dark Web specifically.

So, what will DarkBERT be used for? Hopefully it won’t be given the nuclear launch codes, but the team expect it to be a powerful tool for scanning the Dark Web for cybersecurity threats, as well as for keeping tabs on forums to identify illicit activity.

Let’s just hope this doesn’t give OpenAI any ideas. 

The preprint, which is a preliminary version of a study that has not yet been peer-reviewed, can be found on the arXiv. 
