Publication
ACL 2007
Conference paper
Guiding statistical word alignment models with prior knowledge
Abstract
We present a general framework to incorporate prior knowledge such as heuristics or linguistic features in statistical generative word alignment models. Prior knowledge plays a role of probabilistic soft constraints between bilingual word pairs that shall be used to guide word alignment model training. We investigate knowledge that can be derived automatically from entropy principle and bilingual latent semantic analysis and show how they can be applied to improve translation performance. © 2007 Association for Computational Linguistics.