Evaluation of the suitability of machine learning techniques to build a post-editing recommendation system for Basque

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Published 25-11-2018
Nora Aranberri Jose A. Pascual

Abstract

The overall machine translation quality available for professional translators working with the Spanish{Basque pair is rather poor, which is a deterrent for its adoption. This work investigates the plausibility of building a comprehensive recommendation system to speed up decision time between postediting or translation from scratch using the very limited training data available. First, we build a set of regression models that predict the postediting eort in terms of overall quality, time and edits. Secondly, we build classication models that recommend the most ecient editing approach using postediting eort features on top of linguistic features. Results show high correlations between the predictions of the regression models and the expected HTER, time and edit number values. Similarly, the results for the classiers show that they are able to predict with high accuracy whether it is more ecient to translate or to postedit a new segment.
Abstract 372 | PDF (Euskara) Downloads 431

##plugins.themes.bootstrap3.article.details##

Keywords

Machine translation, post-editing, recommendation system, Basque.

Section
Ale Arrunta