Medical Market Report

Almost All Languages Appear To Follow Zipf’s Law, And We Have No Idea Why

November 28, 2024 by Deborah Bloomfield

Humans like to think that we are, to a certain extent, unpredictable beings governed by free will that somehow emerges from physical processes. Well, here’s one weird thing to send you into a linguistics-based existential crisis: most languages appear to follow an equation known as Zipf’s law, and we have no idea why.

Words are used with varying frequency, as you might expect. You have more use for the word “the” than you do for the word “ecumenical” or “phubbing”, for example. But analyzing the frequency of word use in large texts reveals that it closely follows a specific statistical law.

“About 80 years ago, George Kingsley Zipf reported an observation that the frequency of a word seems to be a power law function of its frequency rank, formulated as f(r) ∝ r^α, where f is word frequency, r is the rank of frequency, and α is the exponent,” a paper on the topic explains.

To put it simply, the most frequently used word in a language (in English, “the”) is used about twice as often as the second most common word, three times as often as the third, four times as often as the fourth, and so on, with the pattern holding surprisingly far down the ranking. In terms of the formula, that corresponds to an exponent α close to -1.
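
As a rough illustration of that arithmetic, the sketch below computes the counts an idealized Zipf distribution would predict, assuming an exponent of exactly -1 (real texts only approximate this). The function name `zipf_expected_counts` and the top-word count of 60,000 are made up for the example and do not come from any real corpus.

```python
def zipf_expected_counts(top_count, n_ranks, alpha=-1.0):
    """Expected count at each rank r under f(r) = top_count * r**alpha."""
    return [top_count * r ** alpha for r in range(1, n_ranks + 1)]


# Hypothetical top-word count, chosen only to make the ratios easy to read.
counts = zipf_expected_counts(top_count=60_000, n_ranks=4)
for rank, count in enumerate(counts, start=1):
    print(f"rank {rank}: about {count:,.0f} occurrences")
```

With these assumed numbers, the second-ranked word comes out at half the count of the first, the third at a third, and the fourth at a quarter.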

You may think this is some weird quirk of English, but it isn’t. Zipf’s law appears to apply to almost all languages that have been looked into. No matter whether you are speaking English, Hindi, French, Mandarin, or Spanish, the frequency of a word appears to fall off in inverse proportion to its popularity rank.

Graph: Zipf’s law applied to the first 10 million words in 30 different languages on Wikipedia.

Weirder still, it applies even to languages we haven’t deciphered yet: the words appearing in the mysterious Voynich Manuscript appear to follow this law. Individual texts, if they are large enough, will roughly follow it too, with the top-ranked word appearing about twice as often as the next, and so on. Even Charles Darwin can’t evolve his way out of this one, with one analysis finding it applies fairly neatly to his text On the Origin of Species. In fact, it crops up all over the place.

So, that’s pretty weird, no? 

“It is worth reflecting on the peculiarity of this law,” a review of the topic explains. “It is certainly a nontrivial property of human language that words vary in frequency at all; it might have been reasonable to expect that all words should be about equally frequent. But given that words do vary in frequency, it is unclear why words should follow such a precise mathematical rule – in particular, one that does not reference any aspect of each word’s meaning.”

There are many potential explanations for the pattern, from statistical artifacts to constraints imposed by human memory and vocabulary. George Zipf himself proposed that the law comes from a balance of effort minimization: speakers (or writers) try to minimize their own effort by using more frequently occurring words, while listeners (or readers) seek clarity from less frequently used words. An extension of this idea is that humans attempt to convey meaning as efficiently as possible, tending towards words that maximize the amount of information they can convey.

Another idea is that more common words tend to become more popular over time as language spreads and develops, leading to a sort of snowball effect. But none of these is widely accepted as the explanation, and the cause remains a bit of a mystery.

If you would really like to send yourself into a linguistics-based existential crisis, you can even paste your own (long) text, novel, or paper into a distribution calculator and see if it obeys Zipf’s law. You might not like how predictable your use of language may appear, but fear not: even Shakespeare’s Hamlet appears to follow it.
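
A check of this kind can be approximated in a few lines of code. The sketch below is one possible approach, not the method of any particular calculator or study: it splits a text into lowercase words with a deliberately crude pattern, ranks the words by count, and fits a straight line to log-count against log-rank by ordinary least squares. The slope of that line is a rough estimate of the exponent α. The function names and the synthetic sample text are invented for the illustration.

```python
import math
import re
from collections import Counter


def rank_frequency(text):
    """Return word counts sorted from most to least frequent."""
    # Crude tokenizer (an assumption): runs of ASCII letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return [count for _, count in Counter(words).most_common()]


def fit_zipf_exponent(counts):
    """Least-squares slope of log(count) against log(rank)."""
    if len(counts) < 2:
        raise ValueError("need at least two distinct words")
    xs = [math.log(rank) for rank in range(1, len(counts) + 1)]
    ys = [math.log(count) for count in counts]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic sample: the word at rank r is repeated 120 // r times,
# so it follows the idealized law exactly. Real texts will not.
vocabulary = ["the", "of", "and", "to", "in", "is"]
sample = " ".join(
    " ".join([word] * (120 // rank))
    for rank, word in enumerate(vocabulary, start=1)
)

alpha = fit_zipf_exponent(rank_frequency(sample))
print(f"estimated exponent: {alpha:.2f}")
```

For a real text, the contents of a file could be passed to `rank_frequency` instead of `sample`. An unweighted fit like this is heavily influenced by the long tail of rare words, so the estimate should be treated as indicative only.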

Source Link: Almost All Languages Appear To Follow Zipf's Law, And We Have No Idea Why

Filed Under: News
