skip to main content
10.5555/1557690.1557731dlproceedingsArticle/Chapter ViewAbstractPublication PageshltConference Proceedingsconference-collections
research-article
Free Access

A linguistically annotated reordering model for BTG-based statistical machine translation

Authors Info & Claims
Published:16 June 2008Publication History

ABSTRACT

In this paper, we propose a linguistically annotated reordering model for BTG-based statistical machine translation. The model incorporates linguistic knowledge to predict orders for both syntactic and non-syntactic phrases. The linguistic knowledge is automatically learned from source-side parse trees through an annotation algorithm. We empirically demonstrate that the proposed model leads to a significant improvement of 1.55% in the BLEU score over the baseline reordering model on the NIST MT-05 Chinese-to-English translation task.

References

  1. Franz Josef Och. 2003. Minimum Error Rate Training in Statistical Machine Translation. In Proceedings of ACL 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Andreas Stolcke. 2002. SRILM - an extensible language modeling toolkit. In Proceedings of International Conference on Spoken Language Processing, volume 2, pages 901--904.Google ScholarGoogle Scholar
  3. Chao Wang, Michael Collins and Philipp Koehn. 2007. Chinese Syntactic Reordering for Statistical Machine Translation. In Proceedings of EMNLP-CoNLL 2007.Google ScholarGoogle Scholar
  4. Dekai Wu. 1997. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora. Computational Linguistics, 23(3):377--403. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Deyi Xiong, Shuanglong Li, Qun Liu, Shouxun Lin, Yueliang Qian. 2005. Parsing the Penn Chinese Treebank with Semantic Knowledge. In Proceedings of IJCNLP, Jeju Island, Korea. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Deyi Xiong, Qun Liu and Shouxun Lin. 2006. Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation. In Proceedings of ACL-COLING 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Dongdong Zhang, Mu Li, Chi-Ho Li and Ming Zhou. 2007. Phrase Reordering Model Integrating Syntactic Knowledge for SMT. In Proceedings of EMNLP-CoNLL 2007.Google ScholarGoogle Scholar
  8. Le Zhang. 2004. Maximum Entropy Modeling Tooklkit for Python and C++. Available at http://homepages.inf.ed.ac.uk/s0450736/maxent_toolkit.html.Google ScholarGoogle Scholar

Index Terms

  1. A linguistically annotated reordering model for BTG-based statistical machine translation

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in
                • Published in

                  cover image DL Hosted proceedings
                  HLT-Short '08: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
                  June 2008
                  307 pages

                  Publisher

                  Association for Computational Linguistics

                  United States

                  Publication History

                  • Published: 16 June 2008

                  Qualifiers

                  • research-article

                  Acceptance Rates

                  Overall Acceptance Rate240of768submissions,31%

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader