Google presents Wikipedia in Arabic, Gujarati, Hindi, Kannada, Swahili, Tamil and Telugu

Google believes that translation is key to its mission of making information useful to everyone. Wikipedia, for example, is a phenomenal source of knowledge, especially for speakers of widely used languages such as English, German and French, where hundreds of thousands, or even millions, of articles are available. For many smaller languages, however, Wikipedia doesn’t yet have anywhere near the same amount of content.

To make Wikipedia more helpful to speakers of smaller languages, Google is working with volunteers, translators and Wikipedians across India, the Middle East and Africa to translate more than 16 million words for Wikipedia into Arabic, Gujarati, Hindi, Kannada, Swahili, Tamil and Telugu. The team began these efforts in 2008 by translating Wikipedia articles into Hindi, a language spoken by tens of millions of Internet users. At that time the Hindi Wikipedia had only 3.4 million words across 21,000 articles, whereas the English Wikipedia had 1.3 billion words across 2.5 million articles.

The team selected the Wikipedia articles in three steps. First, they used Google search data to determine the most popular English Wikipedia articles read in India. Next, using Google Trends, they narrowed the list to articles that were consistently read over time rather than just temporarily popular. Finally, they used Translator Toolkit to translate articles that either did not exist in Hindi Wikipedia or existed only as placeholder articles, or “stubs”. In three months, they used a combination of human and machine translation tools to translate 600,000 words from more than 100 articles in English Wikipedia, growing Hindi Wikipedia by almost 20 percent. They have since repeated this process for other languages, bringing the total number of words translated to 16 million.
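As a rough sanity check on these figures, the short Python sketch below recomputes the growth of Hindi Wikipedia and the average article length in each edition. It assumes that growth is measured in words against the 2008 baseline of 3.4 million words, which the post does not state explicitly. The function and constant names are illustrative only and do not come from any Google or Wikipedia tool.

```python
# Figures quoted in the post (2008 baseline and the three-month Hindi pilot).
HINDI_WORDS = 3_400_000
HINDI_ARTICLES = 21_000
ENGLISH_WORDS = 1_300_000_000
ENGLISH_ARTICLES = 2_500_000
PILOT_WORDS_TRANSLATED = 600_000


def growth_percent(added_words: int, baseline_words: int) -> float:
    """Percentage growth, assuming it is measured against the word-count baseline."""
    return 100.0 * added_words / baseline_words


def words_per_article(words: int, articles: int) -> float:
    """Average article length in words."""
    return words / articles


pilot_growth = growth_percent(PILOT_WORDS_TRANSLATED, HINDI_WORDS)
hindi_avg = words_per_article(HINDI_WORDS, HINDI_ARTICLES)
english_avg = words_per_article(ENGLISH_WORDS, ENGLISH_ARTICLES)

print(f"Pilot growth of Hindi Wikipedia: {pilot_growth:.1f}%")
print(f"Average words per article, Hindi: {hindi_avg:.0f}")
print(f"Average words per article, English: {english_avg:.0f}")
```

Under that assumption, the pilot works out to a little under 18 percent growth, which is in line with the “almost 20 percent” quoted above.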

They are off to a good start but, as you can see in the graph below, they have a lot more work to do to bring the information in Wikipedia to people worldwide:

They have also found that many Internet users have used Google’s tools to translate more than 100 million words of Wikipedia content into various languages worldwide. If you speak another language, they hope you’ll join them in bringing Wikipedia content to other languages and cultures with Translator Toolkit.

They presented these results last Saturday, July 10, at Wikimania 2010 in Gdańsk, Poland. They look forward to continuing to support the creation of the world’s largest encyclopedia and they can’t wait to work with Wikipedians and volunteers to create more content worldwide.
