skip to main content
10.3115/1220175.1220296dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free Access

Scalable inference and training of context-rich syntactic translation models

Authors Info & Claims
Published:17 July 2006Publication History

ABSTRACT

Statistical MT has made great progress in the last few years, but current translation models are weak on re-ordering and target language fluency. Syntactic approaches seek to remedy these problems. In this paper, we take the framework for acquiring multi-level syntactic translation rules of (Galley et al., 2004) from aligned tree-string pairs, and present two main extensions of their approach: first, instead of merely computing a single derivation that minimally explains a sentence pair, we construct a large number of derivations that include contextually richer rules, and account for multiple interpretations of unaligned words. Second, we propose probability estimates and a training procedure for weighting these rules. We contrast different approaches on real examples, show that our estimates based on multiple derivations favor phrasal re-orderings that are linguistically better motivated, and establish that our larger rules provide a 3.63 BLEU point increase over minimal rules.

References

  1. D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In Proc. of ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. H. Fox. 2002. Phrasal cohesion and statistical machine translation. In Proc. of EMNLP, pages 304--311. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Galley, M. Hopkins, K. Knight, and D. Marcu. 2004. What's in a translation rule? In Proc. of HLT/NAACL-04.Google ScholarGoogle Scholar
  4. J. Graehl and K. Knight. 2004. Training tree transducers. In Proc. of HLT/NAACL-04, pages 105--112.Google ScholarGoogle Scholar
  5. F. Och and H. Ney. 2004. The alignment template approach to statistical machine translation. Computational Linguistics, 30(4):417--449. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Poutsma. 2000. Data-oriented translation. In Proc. of COLING, pages 635--641. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3):377--404. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. K. Yamada and K. Knight. 2001. A syntax-based statistical translation model. In Proc. of ACL, pages 523--530. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Zhang, L. Huang, D. Gildea, and K. Knight. 2006. Synchronous binarization for machine translation. In Proc. of HLT/NAACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Scalable inference and training of context-rich syntactic translation models

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image DL Hosted proceedings
        ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
        July 2006
        1214 pages

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        • Published: 17 July 2006

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate85of443submissions,19%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader