Machine translation

From Niidae Wiki
Revision as of 20:09, 9 March 2009 by imported>Savig (Interlingua approach)

Help us develop tools for translating Wikipedia.

The purpose of the Wiki(pedia) Machine Translation Project is to develop ideas, methods and tools that can help translate Wikipedia articles (and Wikimedia pages) from one language to another (particularly out of English and into languages with small numbers of fluent speakers).

Motivation

Wikimedia projects in small languages cannot produce articles as fast as the English projects because they have too few contributors. Translating content from the English projects would help, but some languages will not have enough translators either. Machine translation can improve the productivity of these communities.

Machine-translated text can then be revised by hand later for greater accuracy.

Approaches

Interlingua approach

A different but related approach would be to translate articles by hand into a machine translation interlingua, such as Rick Morneau's interlingua or Marcos Franco's UTL, and then write software modules that translate automatically from that interlingua into each target language. Compared with direct translation, this saves work only if several target languages have sufficiently developed modules. However, such modules are much easier to write, and can be expected to be more accurate, than full automatic translation systems between two natural languages.
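The architecture described above can be illustrated with a minimal Python sketch. Everything in it is hypothetical and invented for illustration: the `Statement` structure, the concept identifiers such as `CAT`, and the two generator functions are not taken from Morneau's interlingua or UTL, which are far richer. The point is only to show that the article is encoded once and each target language needs just its own generation module.

```python
from dataclasses import dataclass


# Hypothetical toy interlingua: one language-neutral
# subject-predicate-object statement built from concept identifiers.
@dataclass(frozen=True)
class Statement:
    subject: str
    predicate: str
    obj: str


def to_spanish(s: Statement) -> str:
    """Generation module for Spanish (subject-verb-object order)."""
    lex = {"CAT": "el gato", "EAT": "come", "FISH": "pescado"}
    return f"{lex[s.subject]} {lex[s.predicate]} {lex[s.obj]}"


def to_japanese(s: Statement) -> str:
    """Generation module for Japanese (subject-object-verb order)."""
    lex = {"CAT": "猫", "EAT": "食べる", "FISH": "魚"}
    return f"{lex[s.subject]}は{lex[s.obj]}を{lex[s.predicate]}"


# Registry of available target-language modules.
GENERATORS = {"es": to_spanish, "ja": to_japanese}


def translate_all(s: Statement) -> dict:
    """Render one hand-encoded statement in every supported language."""
    return {lang: generate(s) for lang, generate in GENERATORS.items()}


if __name__ == "__main__":
    sentence = Statement("CAT", "EAT", "FISH")
    for lang, text in translate_all(sentence).items():
        print(lang, text)
```

In this sketch, supporting another language means adding one entry to `GENERATORS`; every statement already encoded by hand then becomes available in that language without further manual work.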

TradWiki/WikiTran

Main article: Translation Wiki

TradWiki/WikiTran (WikipediaTranslator/WikiTranslator/BabelWiki) is a wiki that will be coded to help wikipedians translate articles from English to other languages.

I would imagine it would be an easier task to translate between similar languages than between dissimilar ones. For example, we have Wikipedias in Catalan and Spanish, in Macedonian and Bulgarian, and perhaps even in Dutch and Afrikaans (more study would be needed to evaluate which pairs are most appropriate). There is some free software being produced in Spain called Apertium that might be useful here. - FrancisTyers 21:44, 2 March 2006 (UTC)

Existing free software

  • Apertium
    • Apertium is an open-source platform (engine, tools) to build machine translation systems, mainly between related languages. Code, documentation, language pairs available from the Apertium website.
  • Moses (licensed under the LGPL)
    • "Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair. All you need is a collection of translated texts (parallel corpus)." (from the Moses website)
  • Wikipedia translation
    • Tool designed to help populate smaller Wikipedias, for example by quickly translating country templates
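To make the Moses description above more concrete, the following Python sketch shows how translation probabilities can be learned from nothing but a parallel corpus. It is not Moses's actual pipeline, which is phrase-based and much more elaborate. It implements IBM Model 1, a classic word-based statistical model, trained with expectation-maximization on a three-sentence toy corpus. The function names and the corpus are assumptions made for illustration.

```python
from collections import defaultdict


def train_ibm1(pairs, iterations=10):
    """Estimate t[(f, e)] = P(source word f | target word e) with EM.

    pairs: list of (source_tokens, target_tokens) sentence pairs.
    """
    src_vocab = {f for fs, _ in pairs for f in fs}
    # Start from a uniform distribution over source words.
    t = defaultdict(lambda: 1.0 / len(src_vocab))
    for _ in range(iterations):
        count = defaultdict(float)  # expected count of (f, e) links
        total = defaultdict(float)  # expected count of e
        # E-step: distribute each source word over the target words
        # of its sentence, in proportion to the current probabilities.
        for fs, es in pairs:
            for f in fs:
                norm = sum(t[(f, e)] for e in es)
                for e in es:
                    frac = t[(f, e)] / norm
                    count[(f, e)] += frac
                    total[e] += frac
        # M-step: re-estimate probabilities from the expected counts.
        for (f, e), c in count.items():
            t[(f, e)] = c / total[e]
    return t


def best_translation(t, e):
    """Return the source word with the highest probability given e."""
    candidates = [f for (f, e2) in t if e2 == e]
    return max(candidates, key=lambda f: t[(f, e)])


# Toy German-English parallel corpus (illustrative only).
CORPUS = [
    (["das", "Haus"], ["the", "house"]),
    (["das", "Buch"], ["the", "book"]),
    (["ein", "Buch"], ["a", "book"]),
]

if __name__ == "__main__":
    table = train_ibm1(CORPUS)
    for word in ["the", "house", "book", "a"]:
        print(word, "->", best_translation(table, word))
```

No dictionary is supplied: the word correspondences emerge purely from which words co-occur across the sentence pairs, which is why a larger parallel corpus generally yields better models.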

Resources

General

Dictionaries

Code - inactive

The idea is to semi-automatically create a UNL text from, say, English, then have it translated fully automatically into up to 150 languages on the fly. The project is now an independent organization.

Discussion

See the talk:Wikipedia Machine Translation Project page.

See also