Content Translation

<- Back to OER Interoperability Scenarios

Content Translation
One way to extend the reach of OERs is to translate them into multiple languages to facilitate access of those materials to others. (Translation can be viewed as a first step in localizing content.) A number of organizations have been translating OpenCourseWare materials, especially those of MIT OCW and other English-based OCWs, to extend the reach of the materials. Other approaches include embedding multi-language thesauri into an application, and facilitating volunteer or paid human translators in web-based collaborative workflows.

Examples:
 * Translations of MIT OCW materials are in process by Universia, CORE and OOPS (into Spanish and Portugese, Simplified Chinese and Traditional Chinese respectively).
 * Translations of Connexions Digital Signal Processing materials into Spanish -- see http://cnx.org/content/col10373/latest/ -- resulted in the Spanish versions of the content becoming very popular despite the translation being uneven in some portions.
 * European Schoolnet LRE thesauri across EU languages for curriculum-related terms
 * Web-based cooperative writing platforms with integrated translation tools; cooperatively created dictionaries (such as Worldwide Lexicon, see: http://opentranslation.aspirationtech.org/index.php/Open_Source_Translation_Tools#Translation_Workflow_Tool for other projects)

Challenges:
 * Transcribing text from presentation only or non-editable views (such as PDF documents or images) or to extract content from complex HTML markup from web pages. Additional challenges include the role of machine translation and the integration of projects such as those developed by Meadan - human and machine language interaction, geared to/for Arabic and English speakers)
 * Determining the accuracy and quality of translations
 * Especially for STEM, the accuracy of the translation is labor intensive to ensure but necessary for broad acceptance and use
 * Keeping translated material synchronized with the original material (if the original is constantly changing)
 * Managing multiple versions and derivative works

Enablers:
 * Editable, non-stylized versions of text
 * XML formats
 * Review mechanisms
 * Trackable attribution of (translated) versions
 * Tools to convert content from multiple formats
 * Government agencies in other countries could be great enablers/partners because they may have the funds to help and the desire to ensure high quality translations
 * The approach of LinguaPlone for the Plone content management system may be of interest (and of other CMSs).

Use Case Collections etc.
 * Open Source Translation Tools
 * Open Content Translation Use Cases