Abstract
This article proposes a new distortion model for phrase-based statistical machine translation. In decoding, a distortion model estimates the source word position to be translated next (subsequent position; SP) given the last translated source word position (current position; CP). We propose a distortion model that can simultaneously consider the word at the CP, the word at an SP candidate, the context of the CP and an SP candidate, relative word order among the SP candidates, and the words between the CP and an SP candidate. These considered elements are called rich context. Our model considers rich context by discriminating label sequences that specify spans from the CP to each SP candidate. It enables our model to learn the effect of relative word order among SP candidates as well as to learn the effect of distances from the training data. In contrast to the learning strategy of existing methods, our learning strategy is that the model learns preference relations among SP candidates in each sentence of the training data. This leaning strategy enables consideration of all of the rich context simultaneously. In our experiments, our model had higher BLUE and RIBES scores for Japanese-English, Chinese-English, and German-English translation compared to the lexical reordering models.
- Yaser Al-Onaizan and Kishore Papineni. 2006. Distortion models for statistical machine translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 529--536. DOI:http://dx.doi.org/10.3115/1220175.1220242. Google ScholarDigital Library
- Adam L. Berger, Vincent J. Della Pietra, and Stephen A. Della Pietra. 1996. A maximum entropy approach to natural language processing. Comput. Linguist. 22, 1, 39--71. http://dl.acm.org/citation.cfm?id=234285.234289. Google ScholarDigital Library
- Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz, and Josh Schroeder. 2008. Further meta-evaluation of machine translation. In Proceedings of the 3rd Workshop on Statistical Machine Translation. Association for Computational Linguistics, 70--106. http://www.aclweb.org/anthology/W/W08/W08-0309. Google ScholarDigital Library
- Stanley F. Chen and Joshua T. Goodman. 1998. An empirical study of smoothing techniques for language modeling. Tech. rep. TR-10-98. Computer Science Group, Harvard University.Google Scholar
- Stanley F. Chen and Ronald Rosenfeld. 1999. A Gaussian prior for smoothing maximum entropy models. Tech. rep., School of Computer Science, Carnegie Mellon University.Google Scholar
- Colin Cherry. 2013. Improved reordering for phrase-based translation using sparse features. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 22--31. http://www.aclweb.org/anthology/N13-1003.Google Scholar
- Colin Cherry, Robert C. Moore, and Chris Quirk. 2012. On hierarchical re-ordering and permutation parsing for phrase-based decoding. In Proceedings of the 7th Workshop on Statistical Machine Translation. Association for Computational Linguistics, 200--209. http://www.aclweb.org/anthology/W12-3125. Google ScholarDigital Library
- David Chiang. 2007. Hierarchical phrase-based translation. Comput. Linguistics 33, 2, 201--228. Google ScholarDigital Library
- David Chiang. 2010. Learning to translate with source and target syntax. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 1443--1452. http://www.aclweb.org/anthology/P10-1146. Google ScholarDigital Library
- John DeNero and Jakob Uszkoreit. 2011. Inducing sentence structure from parallel corpora for reordering. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 193--203. http://www.aclweb.org/anthology/D11-1018. Google ScholarDigital Library
- Chris Dyer and Philip Resnik. 2010. Context-free reordering, finite-state translation. Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 858--866. http://www.aclweb.org/anthology/N10-1128. Google ScholarDigital Library
- Theodoros Evgeniou and Massimiliano Pontil. 2002. Learning preference relations from data. In Proceedings of the 13th Italian Workshop on Neural Nets. Lecture Notes in Computer Science, vol. 2486, 23--32. Google ScholarDigital Library
- Minwei Feng, Jan-Thorsten Peter, and Hermann Ney. 2013. Advancements in reordering models for statistical machine translation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Vol. 1: Long Papers). Association for Computational Linguistics, 322--332. http://www.aclweb.org/anthology/P13-1032.Google Scholar
- Yang Feng, Haitao Mi, Yang Liu, and Qun Liu. 2010. An efficient shift-reduce decoding algorithm for phrased-based machine translation. In Proceedings of the International Conference on Computational Linguistics. Coling 2010 Organizing Committee, 285--293. http://www.aclweb.org/anthology/C10-2033. Google ScholarDigital Library
- Michel Galley and Christopher D. Manning. 2008. A simple and effective hierarchical phrase reordering model. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 848--856. http://www.aclweb.org/anthology/D08-1089. Google ScholarDigital Library
- Michel Galley, Mark Hopkins, Kevin Knight, and Daniel Marcu. 2004. What’s in a translation rule? In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL’04). D. Marcu, S. Dumais, and S. Roukos Eds., Association for Computational Linguistics, 273--280.Google ScholarCross Ref
- Niyu Ge. 2010. A direct syntax-driven reordering model for phrase-based machine translation. Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 849--857. http://www.aclweb.org/anthology/N10-1127. Google ScholarDigital Library
- Dmitriy Genzel. 2010. Automatically learning source-side reordering rules for large scale machine translation. In Proceedings of the 23rd International Conference on Computational Linguistics (Coling’10). Coling 2010 Organizing Committee, 376--384. http://www.aclweb.org/anthology/C10-1043. Google ScholarDigital Library
- Isao Goto, Bin Lu, Ka Po Chow, Eiichiro Sumita, and Benjamin K. Tsou. 2011. Overview of the patent machine translation task at the NTCIR-9 workshop. In Proceedings of the 9th NTCIR Workshop (NTCIR-9). 559--578.Google Scholar
- Isao Goto, Masao Utiyama, Eiichiro Sumita, Akihiro Tamura, and Sadao Kurohashi. 2013. Distortion model considering rich context for statistical machine translation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Vol. 1: Long Papers). Association for Computational Linguistics, 155--165. http://www.aclweb.org/anthology/P13-1016.Google Scholar
- Spence Green, Michel Galley, and Christopher D. Manning. 2010. Improved models of distortion cost for statistical machine translation. Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 867--875. http://www.aclweb.org/anthology/N10-1129. Google ScholarDigital Library
- Hieu Hoang, Philipp Koehn, and Adam Lopez. 2009. A unified framework for phrase-based, hierarchical, and syntax-based statistical machine translation. In Proceedings of the International Workshop on Spoken Language Translation. 152--159.Google Scholar
- Liang Huang, Kevin Knight, and Aravind Joshi. 2006. Statistical syntax-directed translation with extended domain of locality. In Proceedings of the 7th Conference of the Association for Machine Translation of the Americas. 66--73.Google Scholar
- Hideki Isozaki, Tsutomu Hirao, Kevin Duh, Katsuhito Sudoh, and Hajime Tsukada. 2010a. Automatic evaluation of translation quality for distant language pairs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 944--952. http://www.aclweb.org/anthology/D10-1092. Google ScholarDigital Library
- Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, and Kevin Duh. 2010b. Head finalization: A simple reordering rule for SOV languages. In Proceedings of the Joint 5th Workshop on Statistical Machine Translation and MetricsMATR. Association for Computational Linguistics, 244--251. http://www.aclweb.org/anthology/W10-1736. Google ScholarDigital Library
- Philipp Koehn. 2004. Statistical significance tests for machine translation evaluation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’04). D. Lin and D. Wu Eds., Association for Computational Linguistics, 388--395.Google Scholar
- Philipp Koehn, Amittai Axelrod, Alexandra Birch Mayne, Chris Callison-Burch, Miles Osborne, and David Talbot. 2005. Edinburgh system description for the 2005 IWSLT speech translation evaluation. In Proceedings of the International Workshop on Spoken Language Translation.Google Scholar
- Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open source toolkit for statistical machine translation. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions. Association for Computational Linguistics, 177--180. http://www.aclweb.org/anthology/P07-2045. Google ScholarDigital Library
- John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ICML’01). Morgan Kaufmann Publishers Inc., San Francisco, CA, 282--289. http://dl.acm.org/citation.cfm?id=645530.655813. Google ScholarDigital Library
- D. C. Liu and J. Nocedal. 1989. On the limited memory method for large scale optimization. Math. Program. B 45, 3, 503--528. Google ScholarDigital Library
- Yang Liu, Qun Liu, and Shouxun Lin. 2006. Tree-to-string alignment template for statistical machine translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 609--616. DOI:http://dx.doi.org/10.3115/1220175.1220252. Google ScholarDigital Library
- Yang Liu, Yajuan Lü, and Qun Liu. 2009. Improving tree-to-tree translation with packed forests. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. Association for Computational Linguistics, 558--566. http://www.aclweb.org/anthology/P/P09/P09-1063. Google ScholarDigital Library
- Dennis Nolan Mehay and Christopher Hardie Brew. 2012. CCG syntactic reordering models for phrase-based machine translation. In Proceedings of the 7th Workshop on Statistical Machine Translation. Association for Computational Linguistics, 210--221. http://www.aclweb.org/anthology/W12-3126. Google ScholarDigital Library
- Graham Neubig, Taro Watanabe, and Shinsuke Mori. 2012. Inducing a discriminative parser to optimize machine translation reordering. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, 843--853. http://www.aclweb.org/anthology/D12-1077. Google ScholarDigital Library
- Yizhao Ni, Craig Saunders, Sandor Szedmak, and Mahesan Niranjan. 2009. Handling phrase reorderings for machine translation. In Proceedings of the ACL-IJCNLP Conference Short Papers. Association for Computational Linguistics, 241--244. http://www.aclweb.org/anthology/P/P09/P09-2061. Google ScholarDigital Library
- Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 160--167. DOI:http://dx.doi.org/10.3115/1075096.1075117. Google ScholarDigital Library
- Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: A method for automatic evaluation of machine translation. In Proceedings of 40th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 311--318. DOI:http://dx.doi.org/10.3115/1073083.1073135. Google ScholarDigital Library
- Libin Shen, Jinxi Xu, and Ralph Weischedel. 2008. A new string-to-dependency machine translation algorithm with a target dependency language model. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 577--585. http://www.aclweb.org/anthology/P/P08/P08-1066.Google Scholar
- Andreas Stolcke, Jing Zheng, Wen Wang, and Victor Abrash. 2011. SRILM at sixteen: Update and outlook. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop.Google Scholar
- Christoph Tillman. 2004. A unigram orientation model for statistical machine translation. In Proceedings of the Human Language Technologies Conference of the North American Chapter of the Association for Computational Linguistics (Short Papers). D. Marcu, S. Dumais, and S. Roukos Eds., Association for Computational Linguistics, 101--104. Google ScholarDigital Library
- Karthik Visweswariah, Rajakrishnan Rajkumar, Ankur Gandhe, Ananthakrishnan Ramanathan, and Jiri Navratil. 2011. A word reordering model for improved machine translation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 486--496. http://www.aclweb.org/anthology/D11-1045. Google ScholarDigital Library
- Chao Wang, Michael Collins, and Philipp Koehn. 2007. Chinese syntactic reordering for statistical machine translation. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). Association for Computational Linguistics, 737--745. http://www.aclweb.org/anthology/D/D07/D07-1077.Google Scholar
- Fei Xia and Michael McCord. 2004. Improving a statistical MT system with automatically learned rewrite patterns. In Proceedings of the 20th International Conference on Computational Linguistics. 508--514. Google ScholarDigital Library
- Deyi Xiong, Qun Liu, and Shouxun Lin. 2006. Maximum entropy based phrase reordering model for statistical machine translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 521--528. DOI:http://dx.doi.org/10.3115/1220175.1220241. Google ScholarDigital Library
- Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li. 2008. Linguistically annotated BTG for statistical machine translation. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling’08). Coling 2008 Organizing Committee, 1009--1016. http://www.aclweb.org/anthology/C08-1127. Google ScholarDigital Library
- Deyi Xiong, Min Zhang, and Haizhou Li. 2012. Modeling the translation of predicate-argument structure for SMT. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 902--911. http://www.aclweb.org/anthology/P12-1095. Google ScholarDigital Library
- Kenji Yamada and Kevin Knight. 2001. A syntax-based statistical translation model. In Proceedings of 39th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 523--530. DOI:http://dx.doi.org/10.3115/1073012.1073079. Google ScholarDigital Library
- Richard Zens and Hermann Ney. 2006. Discriminative reordering models for statistical machine translation. In Proceedings of the Workshop on Statistical Machine Translation. Association for Computational Linguistics, 55--63. DOI:http://www.aclweb.org/anthology/W/W06/W06-3108. Google ScholarDigital Library
- Richard Zens, Hermann Ney, Taro Watanabe, and Eiichiro Sumita. 2004. Reordering constraints for phrase-based statistical machine translation. In Proceedings of the 20th International Conference on Computational Linguistics. 205--211. Google ScholarDigital Library
Index Terms
- Distortion Model Based on Word Sequence Labeling for Statistical Machine Translation
Recommendations
Improving phrase-based statistical machine translation with morphosyntactic transformation
We present a phrase-based statistical machine translation approach which uses linguistic analysis in the preprocessing phase. The linguistic analysis includes morphological transformation and syntactic transformation. Since the word-order problem is ...
Post-Ordering by Parsing with ITG for Japanese-English Statistical Machine Translation
Word reordering is a difficult task for translation between languages with widely different word orders, such as Japanese and English. A previously proposed post-ordering method for Japanese-to-English translation first translates a Japanese sentence ...
Word Sense Based Hindi-Tamil Statistical Machine Translation
Corpus based natural language processing has emerged with great success in recent years. It is not only used for languages like English, French, Spanish, and Hindi but also is widely used for languages like Tamil, Telugu etc. This paper focuses to ...
Comments