
Medical Market Report


Synthetic dataset of human trafficking victims could allow big data work without privacy compromises

September 23, 2021 by David Barret

In order to combat human trafficking effectively, those combating it must understand it — and these days, that means data. Unfortunately, for obvious reasons there is no convenient index of trafficking victims, though this confidential information is in some ways abundant. Microsoft and the International Organization for Migration may have found a way forward with a new synthetic database that has all the important characteristics of the real trafficking data, but is completely artificial.

While each victim is unquestionably individual, basic high-level questions like what countries are increasingly the source or means of trafficking, what routes and methods are used, and where the victims end up are a matter of statistics. The evidence to identify trends and patterns, crucial to prevention, is locked up in thousands of these individual stories that most would prefer not to publicize.

“Administrative data on identified cases of human trafficking represent one of the main sources of data available but such information is highly sensitive,” said IOM program coordinator Harry Cook in a news release describing the dataset. “IOM has been delighted to work with Microsoft Research over the past two years to make progress on the critical challenge of sharing such data for analysis while protecting the safety and privacy of victims.”

Historically, for things like crime databases and medical info, the strategy has been to redact liberally, but this method of “de-identifying” has been shown to be ineffective against any serious attempt to reconstruct the data. With numerous databases public or leaked and computing power on tap, the redacted information can be filled back in quite reliably.

The option taken by Microsoft Research is to use the original data as the basis for a synthetic dataset that retains all the important statistical relationships of the source but none of the identifiable information. And it’s not just turning “Jane Doe” into “Janet Doeman” and her hometown from Cleveland to Queens. Instead, groups of no fewer than ten people with similar or overlapping data are merged to create a set of attributes that accurately represents them statistically but can’t be used to identify them individually.
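To make the grouping idea concrete, here is a minimal Python sketch. It is not Microsoft Research's actual method: it swaps in a simpler stand-in that suppresses any attribute combination shared by fewer than ten records and then resamples synthetic records from the combinations that remain. The function name `synthesize`, the field names, and the sample data are all hypothetical.

```python
import random
from collections import Counter

K = 10  # minimum group size, taken from the article's "no fewer than ten"


def synthesize(records, k=K, seed=0):
    """Build synthetic records from attribute combinations shared by >= k real records.

    Illustrative assumption only: rare combinations are dropped rather than
    merged, and the output is resampled in proportion to the surviving counts.
    """
    # Count how many real records share each exact combination of attributes.
    counts = Counter(tuple(sorted(r.items())) for r in records)

    # Keep only combinations common enough that no small group stands out.
    kept = {combo: n for combo, n in counts.items() if n >= k}
    if not kept:
        return []

    # Draw synthetic rows from the kept combinations, weighted by frequency,
    # so that high-level proportions roughly follow the retained source data.
    rng = random.Random(seed)
    combos = list(kept)
    weights = [kept[c] for c in combos]
    sampled = rng.choices(combos, weights=weights, k=sum(weights))
    return [dict(combo) for combo in sampled]


if __name__ == "__main__":
    # Hypothetical toy data: one common profile and one rare profile.
    real = [{"origin": "A", "sector": "labour"}] * 12
    real += [{"origin": "B", "sector": "domestic"}] * 3
    synthetic = synthesize(real)
    print(len(synthetic), Counter(r["origin"] for r in synthetic))
```

In this toy run the three-person group never appears in the output, which is the loss of granularity the approach accepts in exchange for data that can be shared.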

Caption: Statistics relating to human trafficking around the world.

Image Credits: Microsoft Research / IOM

Naturally this doesn’t have the granularity of the original data, but unlike the sensitive source, this data can actually be used. It’s not necessarily for some task force to analyze and say “okay, the next smuggling operation will be based out of…” but rather this data, grounded in firsthand evidence, can be pointed to as a factual record for addressing the problem at a policy and diplomacy level. Where before one may have had to say in a more general way that Country X or Government Z was neglectful or complicit in these matters, having hard data to back that up allows one to say “36 percent of sex trafficking victims pass through your jurisdiction.”

Not that the data has to be used in strongarm tactics: simply understanding the global trade in human misery as a system, and not just a series of disconnected events, is valuable in and of itself. You can peruse the data and request to use it here, and learn more about the process for creating it at the program’s GitHub.


