Content ID:
Field:


  • About Smithsonian
  • Email Updates
  • Member Services
  • Shop
  • Archive

Smithsonian.com

  • Smithsonian Institution
  • Smithsonian Channel
  • goSmithsonian
  • Air & Space magazine
  • Home
  • History & Archaeology
  • People & Places
  • Science & Nature
  • Arts & Culture
  • Travel
  • Photos & Videos
  • Subscribe
  • Art & Artists
  • Music & Literature
  • Photo of the Day
  • Smithsonian Institution
  • Trends & Traditions
Dictionary

iStockPhoto.com

  • Trends & Traditions

The Million Word March

What defines a word? Lexicographers and other experts don’t always agree

  • By Anika Gupta
  • Smithsonian.com, September 24, 2008

Article Tools

 
  • Font
  • Share/Save/Bookmark Share
     
  • Email
  •  
  • Print
  • Digg Digg
     
  • Comments
  • StumbleUpon StumbleUpon
     
  • RSS
  • Reddit Reddit
     

    History's "Global Languages"

    Anika Gupta

    Related Links

    Global Language Monitor

    Most Popular

    • Viewed
    • Emailed
    1. Gobekli Tepe: The World’s First Temple?
    2. Tattoos
    3. The 'Secret Jews' of San Luis Valley
    4. The Coldest Place in the Universe
    5. A Brief History of the Salem Witch Trials
    6. John Hodgman Gives “More Information Than You Require”
    7. America's First True "Pilgrims"
    8. One Man's Korean War
    9. New Light on Stonehenge
    10. Bugs, Brains and Trivia
    1. Gobekli Tepe: The World’s First Temple?
    2. The 'Secret Jews' of San Luis Valley
    3. Sarah Vowell on the Puritans' Legacy
    4. Bugs, Brains and Trivia
    5. Jukebox: A Choir of Turkeys
    6. The Coldest Place in the Universe
    7. America's First True "Pilgrims"
    8. The Financial Panic of 1907: Running from History
    9. Munich at 850
    10. John Hodgman Gives “More Information Than You Require”

    It used to be that the expert source on what was or wasn't a word was that school-day staple: the dictionary. American Heritage, Webster's Third, the Oxford English: there were a few trusted players in the game.

    But what if those players are losing their edge?

    Take the word "staycation." Staycation, which means to spend a vacation at home, recently appeared in the New York Times, USA Today and MSNBC. But it isn't likely to appear anytime soon in a dictionary. The same goes for "bracketology," (the science of NCAA March Madness betting) Facebook and Wikipedia.

    "We try to cover the most salient" words," says Joe Pickett, executive editor of the American Heritage Dictionary. "What does the educated layperson need to know?"

    The people who make dictionaries are known as lexicographers ("authors or editors of a dictionary." Thanks, Merriam-Webster). And they have a time-tested method for choosing which new words to certify and which ones to toss before the next edition or update of a dictionary's Web site.

    Groups of editors at a dictionary watch specific subject areas, logging the hits a new word gets. A "hit" is a mention in a book, newspaper or Web site. Then they put the hits in a database and compare the new terms to words they already have. So although Facebook, being a brand name, doesn't qualify, every word in Shakespeare's plays does – including cap-a-pie ("from head to foot") and fardel ("burden"). Being the granddaddy of creative linguistics, Shakespeare invented more than 1,700 words. All of them appear in an unabridged dictionary.

    Dictionaries reject words for being too technical (even the most die-hard "Grey's Anatomy" fan will never need to know what a mammosomatotroph is) or for being too young (staycation).

    They don't count brand names (Coke, Facebook, Wikipedia) or most foreign words and phrases.

    "We aren't trying to be Wikipedia," Pickett said.

    So who is? Who's keeping track of, counting and sorting the words English speakers use on an everyday basis?

    The Global Language Monitor, based in Austin, Tex., has been tracking words for the past five years. Using its own teams of experts and its own algorithm, they say English adds a new word every 98 minutes. This means there are more than 900,000 English words in the world, and the one-millionth will appear sometime in April 2009.

    1 2 3

    It used to be that the expert source on what was or wasn't a word was that school-day staple: the dictionary. American Heritage, Webster's Third, the Oxford English: there were a few trusted players in the game.

    But what if those players are losing their edge?

    Take the word "staycation." Staycation, which means to spend a vacation at home, recently appeared in the New York Times, USA Today and MSNBC. But it isn't likely to appear anytime soon in a dictionary. The same goes for "bracketology," (the science of NCAA March Madness betting) Facebook and Wikipedia.

    "We try to cover the most salient" words," says Joe Pickett, executive editor of the American Heritage Dictionary. "What does the educated layperson need to know?"

    The people who make dictionaries are known as lexicographers ("authors or editors of a dictionary." Thanks, Merriam-Webster). And they have a time-tested method for choosing which new words to certify and which ones to toss before the next edition or update of a dictionary's Web site.

    Groups of editors at a dictionary watch specific subject areas, logging the hits a new word gets. A "hit" is a mention in a book, newspaper or Web site. Then they put the hits in a database and compare the new terms to words they already have. So although Facebook, being a brand name, doesn't qualify, every word in Shakespeare's plays does – including cap-a-pie ("from head to foot") and fardel ("burden"). Being the granddaddy of creative linguistics, Shakespeare invented more than 1,700 words. All of them appear in an unabridged dictionary.

    Dictionaries reject words for being too technical (even the most die-hard "Grey's Anatomy" fan will never need to know what a mammosomatotroph is) or for being too young (staycation).

    They don't count brand names (Coke, Facebook, Wikipedia) or most foreign words and phrases.

    "We aren't trying to be Wikipedia," Pickett said.

    So who is? Who's keeping track of, counting and sorting the words English speakers use on an everyday basis?

    The Global Language Monitor, based in Austin, Tex., has been tracking words for the past five years. Using its own teams of experts and its own algorithm, they say English adds a new word every 98 minutes. This means there are more than 900,000 English words in the world, and the one-millionth will appear sometime in April 2009.

    In contrast, most standard dictionaries have about 200,000 words, unabridged dictionaries about 600,000.

    But the Monitor is so sure of its numbers it has started a Million Word March, a countdown to the one-millionth word.

    "We went back to the Middle English and saw that the definition of a word was 'a thought spoken,'" said Paul JJ Payack, president and chief word analyst at the Monitor, "which means if I say a word, and you understand me, it's a real word."

    Payack counts staycation, Facebook and Wikipedia as words. But he also follows some of the old rules. For example, words that are both noun and verb, such as "water" are counted only once. He doesn't count all the names there are for chemicals, because there are hundreds of thousands.

    Once the Monitor identifies a word, it tracks it over time, watching to see where the word appears. Based on that measurement, they decide if the word has "momentum," basically, whether it's becoming more popular or if it's a one-hit wonder of the linguistic world.

    At first glance, this seems a lot like a dictionary's system.

    "It's the same as the old [method], just recognizing the new reality," Payack said. The Monitor's method gives a lot more weight to online citations.

    But is Payack's "new reality" well, real? He claims that the fast flow of information and the advent of global English have changed the way people use words. And that the gap between the words people use and the words that appear in dictionaries might be on the rise.

    "It turns out that once something enters the Internet, it's like an echo chamber," said Payack. Since the first web browser appeared in 1991, the Internet has added a lot of words to the English language—dot-com, blog—and it's added these words fast. The Web has also taken existing words to new ears.

    "Back in the mid-'90s, getting a couple of thousand browser hits for a word made us inclined to enter it; now the threshold has changed," Pickett said. "You can find so much evidence for obscure words and expressions."

    But dictionaries are used to playing catch-up. After all, it's hard to define a word before it's coined.

    Payack says the Internet isn't the most pressing challenge to traditional word-counting methodology. That, in his opinion, is "global English."

    English has nearly 400 million native speakers, putting it second in the world, but it has 1.3 billion speakers overall, making it the world's most widely understood language, explains Payack . It's spoken by over 300 million people in India as a second language, and by at least that many second speakers in China.

    "Anyone who speaks English right now feels like they own it," Payack says. For example, look at the adjective "brokeback." After director Ang Lee's called his movie about two cowboys who fall in love "Brokeback Mountain," the word "brokeback" wormed its way into the English vernacular as a synonym for 'gay.' Although "brokeback" may be past its glory days in the United States, the word, with this new meaning, is still popular in China, Payack said. It appears on blogs and Web sites, which means it has momentum, which means it's a word.

    "Nowadays we have much more human traffic going in all directions around the world," said Salikoko Mufwene, a linguistics professor at the University of Chicago, who has studied the development of regional dialects. Whether or not Chinese-inspired words will become part of American English, for example, "depends on how regularly Americans are going to interact with Asians in English," he said.

    And if they did, would Americans become, on average, more verbose? Average Americans use about 7,500 words a day and know about 20,000 total. Even Shakespeare only knew about 60,000.

    So the number of words in the English language will always be many, many more than any one person knows or uses.

    Both Mufwene and American Heritage's Pickett said English could very well have a million words already. Counting words, after all, is an imprecise science.

    It's also not the dictionary's science. The job of dictionaries has always been, Mufwene said, "to reflect how people speak, not to teach them how to speak." If the dictionary reflection grows narrower, it can still be valuable.

    "You need people to edit the dictionary and take responsibility for it, so that it's reliable," Pickett said. "And I don't think that's going to change."


     
    Comments

    The way one looks at this topic really depends on one's point of view. It's similar to the idea of "traditional writing versus text messaging"--there's really no right or wrong answer; there are only different points of view.

    Posted by Tori Myers on September 30,2008 | 11:08AM

    Perhaps Smithsonian and GLM should arrange a joint celebration of this wonderful event. Unless I'm woefully ignorant, early next spring, English will become the first language in human history to have over one-million words. --Mike Perry, Untangling Tolkien

    Posted by Mike Perry on October 6,2008 | 12:56PM

    Yiddish words, long used in conversations, are now appearing in various English language writings. i.e: nosh, putz, goniff,shikseh, etc. Will such as these be added?

    Posted by hpuziss on October 6,2008 | 02:52PM

    Two words in use in my family are: shrammed - when you are so cold your bones feel as if they are aching, and twitten - a paved alley between buildings. I believe they both originate from Sussex, England, where my mother (now 85) was brought up by her grandmother, born in 1855.

    Posted by Lyn Hewitt-Jones on October 8,2008 | 08:50AM

    Very Interesting. Just as words get added as tey gain "momentum", I guess, we also need someone to watch out for words that are losing it, and hence to sort of "de-notify" them, else imagine a world where this Language has , say 4 million "words" and the average person continues to know only about 20,000 - since the capacity of the human brain is unlikely to change anytime soon, it would make for a world where people feel dumber by the day!!

    Posted by Amitabh Jaipuria on October 12,2008 | 10:20PM

    Last year I married my husband who happens to be Jewish. He said I was the one in the family that uses Yiddish the most. He also said I use it correctly the least. I have since then made it a point to understand the definitions of Yiddish words and find their roots and usage. I think words nowadays are becoming more technologically related. For example; friends text message me with sayings such as, "LOL", and "TTFN" which mean laughing out loud and ta ta for now. The funny thing will be to see how advanced our language will become in the future as far as this new age kind of techno-talk we use without really realizing we do. In addition, I found out that UCLA offers a course in Ebonics, which is very interesting due to the cultural understanding and background!

    Posted by Amy Schlossberg on October 22,2008 | 07:26PM

    Post a Comment


    Name: (required)

    Email: (required)

    Comment:



    Advertisement

    Smithsonian Videos

    Star-Spangled Salute

    Re-enactors relive the Battle of Baltimore


    One Life: The Mask of Lincoln

    National Portrait Gallery historian David C. Ward discusses images of Abraham Lincoln


    Fallow Groan

    Watch a fallow buck groan


    Fishermen's Fate

    In the town of Fort Bragg, California, fishermen scramble to make a living


    Coral Reefs and Creatures

    The Phoenix Islands provide an unspoiled center for marine science


    Advertisement

    Culturespotter

    Experience Mexico

    Choose from seven videos to learn more about Mexico and its rich history.

    Cultured Collector

    Cultured Furnishings

    Bernhardt Furniture, in collaboration with the Smithsonian Institution, announces new additions to its line of home furnishings.

    Window Shopping

    Gifts, Gadgets and Great Finds!

    From Our Advertisers: Products, Offers and Free Info

    Travel & Adventure

    Subscribe Today & Win a FREE Trip to Paris!


    Sojourners

    Love to travel? We've collected some of the best offerings from our most valued travel partners, across the country and around the world

    In The Magazine

    November 2008

    • Looking Up
    • The World's First Temple?
    • One Man's Korean War
    • Banner Days
    • Munich at 850

    View Table of Contents



    Enter Now!

    Smithsonian's 6th Annual Photo Contest

    Enter the Smithsonian magazine 6th annual photo contest now >>

    Ecocenter

    The Oceans

    Global health from an underwater perspective and why what you eat matters

    Smithsonian Journeys

    Villas-and-Vistas
    Villas and Vistas of the Italian Lake District
    A stay amid romantic Lake Como and Lake Maggiore






    View full archiveRecent Issues


    • Nov 2008


    • Oct 2008


    • Sep 2008

    Newsletter

    Sign up for regular email updates from Smithsonian magazine, including free newsletters, special offers and current news updates.

    Subscribe Now

    About Us

    Smithsonian.com expands on Smithsonian magazine's in-depth coverage of history, science, nature, the arts, travel, world culture and technology. Join us regularly as we take a dynamic and interactive approach to exploring modern and historic perspectives on the arts, sciences, nature, world culture and travel, including videos, blogs and a reader forum.

    Explore our Brands

    • goSmithsonian.com
    • Smithsonian Air & Space Museum
    • Smithsonian Institution
    • Smithsonian Catalogue
    • Smithsonian Journeys
    • Smithsonian Channel
    • Site Map
    • Privacy Policy
    • Copyright
    • About Smithsonian
    • Contact Us
    • Advertising
    • Reader Panel
    • Subscribe
    • RSS

    Smithsonian Institution

    Produced by Clickability