Skip to main content

Test announcement

Announcement here about some event or update. Or maybe link to promoted article. 

Main navigation

  • Home
  • Culture
    • Humor
    • Mathematics
    • Random Thoughts
    • Science & Society
    • Sports Science
    • Technology
  • Earth Sciences
    • Atmospheric
    • Energy
    • Environment
    • Geology
    • Oceanography
    • Paleontology
  • Life Sciences
    • Ecology & Zoology
    • Evolution
    • Immunology
    • Microbiology
    • Neuroscience
  • Medicine
    • Aging
    • Cancer Research
    • Clinical Research
    • Pharmacology
    • Public Health
    • Vision
  • Physical Sciences
    • Aerospace
    • Applied Physics
    • Chemistry
    • Optics
    • Physics
    • Space
  • Social Sciences
    • Anthropology
    • Archaeology
    • Philosophy & Ethics
    • Psychology
    • Science History
  • Contributors
X XD

User menu

  • Log in

Language Translation: A Problem of Vector Space Mathematics

By Hank Campbell in Science 2.0
September 27, 2013
Profile picture for user Hank
Submitted by Hank on Fri, 09/27/2013 - 01:30
Old NID
121176

To translate one language into another, find the linear transformation that maps one to the other. Simple, if you are part of an elite team of Google engineers.

A new translation technique being created by Google does not rely on versions of the same document in different languages, the old dictionary approach. Instead, it uses data mining techniques to model the structure of a single language and then compares this to the structure of another language. The new approach relies on the notion that every language must describe a similar set of ideas, so the words that do this must also be similar. For example, most languages will have words for common animals such as cat, dog, cow and so on. And these words are probably used in the same way in sentences such as “a cat is an animal that is smaller than a dog.”

The same is true of numbers. The image above shows the vector representations of the numbers one to five in English and Spanish and demonstrates how similar they are. The set of all the relationships, the so-called “language space”, can be thought of as a set of vectors that each point from one word to another. And in recent years, linguists have discovered that it is possible to handle these vectors mathematically. For example, the operation ‘king’ – ‘man’ + ‘woman’ results in a vector that is similar to ‘queen’.

Citation: Tomas Mikolov, Quoc V. Le, Ilya Sutskever, 'Exploiting Similarities among Languages for Machine Translation', arXiv:1309.4168

Link: How Google Converted Language Translation Into a Problem of Vector Space Mathematics - Technology Review

Donate

Please donate so science experts can write for the public.

At Science 2.0, scientists are the journalists, with no political bias or editorial control. We can't do it alone so please make a difference.

Donate with PayPal button 
We are a nonprofit science journalism group operating under Section 501(c)(3) of the Internal Revenue Code that's educated over 300 million people.

You can help with a tax-deductible donation today and 100 percent of your gift will go toward our programs, no salaries or offices.

Latest reads

Article teaser image
No, Trump’s Executive Orders Can’t Cancel Your Rights.
Donald Trump does not have the power to rescind either constitutional amendments or federal laws by mere executive order, no matter how strongly he might wish otherwise. No president of the United…
Article teaser image
The US Discourages Pregnant Women From Drinking Alcohol - Vegetarian Diets Are Worse
The Biden administration recently issued a new report showing causal links between alcohol and cancer, and it's about time. The link has been long-known, but alcohol carcinogenic properties have been…
Article teaser image
In British Iron Age Culture, Margaret Thatcher Was The Norm
In British Iron Age society, land was inherited through the female line and husbands moved to live with the wife’s community. Strong women like Margaret Thatcher resulted.That was inferred due to DNA…

More reads

Featured Image

Working Memory: A Psychological Reason Some Wouldn't Social Distance Earlier During COVID-19?

Some people would not or said they could not socially distance effectively during the early days of the COVID-19 pandemic, the 2019 coronavirus mutation that originated in Wuhan, China and spread…
Featured Image

If You Must Smoke, Red Wine Before May Prevent Short Term Vascular Damage

You should not smoke cigarettes and if you already do, you should stop. Though the U.S.
Featured Image

Flu Vaccination Rates Haven't Gone Up Due To The COVID-19 Pandemic

COVID-19 has done little to boost vaccination rates against other viruses, according to a new analysis.
Featured Image

Ray Tracing Is Back Again. Is It Real This Time? Nvidia Thinks So

Ray tracing has been a hot topic since...well, at least 350 B.C. in the western world, when Aristotle described his camera obscura and wrote that the eye is 'a darkened chamber awaiting light.' Da…

Footer

  • About Us
  • Copyright and Removal
  • Privacy Policy
  • Terms