Article

Free Access

Scalable inference and training of context-rich syntactic translation models

Authors:
Michel Galley

Columbia University, New York, NY

Columbia University, New York, NY
View Profile

,
Jonathan Graehl

University of Southern California, Marina del Rey, CA

University of Southern California, Marina del Rey, CA
View Profile

,
Kevin Knight

University of Southern California, Marina del Rey, CA and Language Weaver, Inc., Marina del Rey, CA

University of Southern California, Marina del Rey, CA and Language Weaver, Inc., Marina del Rey, CA
View Profile

,
Daniel Marcu

University of Southern California, Marina del Rey, CA and Language Weaver, Inc., Marina del Rey, CA

University of Southern California, Marina del Rey, CA and Language Weaver, Inc., Marina del Rey, CA
View Profile

,
Steve DeNeefe

University of Southern California, Marina del Rey, CA

University of Southern California, Marina del Rey, CA
View Profile

,
Wei Wang

Language Weaver, Inc., Marina del Rey, CA

Language Weaver, Inc., Marina del Rey, CA
View Profile

,
Ignacio Thayer

University of Southern California, Marina del Rey, CA

University of Southern California, Marina del Rey, CA
View Profile

ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational LinguisticsJuly 2006Pages 961–968https://doi.org/10.3115/1220175.1220296

Published:17 July 2006Publication History

ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics

Pages 961–968

ABSTRACT

Statistical MT has made great progress in the last few years, but current translation models are weak on re-ordering and target language fluency. Syntactic approaches seek to remedy these problems. In this paper, we take the framework for acquiring multi-level syntactic translation rules of (Galley et al., 2004) from aligned tree-string pairs, and present two main extensions of their approach: first, instead of merely computing a single derivation that minimally explains a sentence pair, we construct a large number of derivations that include contextually richer rules, and account for multiple interpretations of unaligned words. Second, we propose probability estimates and a training procedure for weighting these rules. We contrast different approaches on real examples, show that our estimates based on multiple derivations favor phrasal re-orderings that are linguistically better motivated, and establish that our larger rules provide a 3.63 BLEU point increase over minimal rules.

References

D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In Proc. of ACL. Google ScholarDigital Library
H. Fox. 2002. Phrasal cohesion and statistical machine translation. In Proc. of EMNLP, pages 304--311. Google ScholarDigital Library
M. Galley, M. Hopkins, K. Knight, and D. Marcu. 2004. What's in a translation rule? In Proc. of HLT/NAACL-04.Google Scholar
J. Graehl and K. Knight. 2004. Training tree transducers. In Proc. of HLT/NAACL-04, pages 105--112.Google Scholar
F. Och and H. Ney. 2004. The alignment template approach to statistical machine translation. Computational Linguistics, 30(4):417--449. Google ScholarDigital Library
A. Poutsma. 2000. Data-oriented translation. In Proc. of COLING, pages 635--641. Google ScholarDigital Library
D. Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3):377--404. Google ScholarDigital Library
K. Yamada and K. Knight. 2001. A syntax-based statistical translation model. In Proc. of ACL, pages 523--530. Google ScholarDigital Library
H. Zhang, L. Huang, D. Gildea, and K. Knight. 2006. Synchronous binarization for machine translation. In Proc. of HLT/NAACL. Google ScholarDigital Library

Scalable inference and training of context-rich syntactic translation models
1. Computing methodologies
  1. Artificial intelligence
2. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

Incremental syntactic language models for phrase-based translation
HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1

This paper describes a novel technique for incorporating syntactic knowledge into phrase-based machine translation through incremental syntactic parsing. Bottom-up and top-down parsers typically require a completed string as input. This requirement ...
Read More
CCG syntactic reordering models for phrase-based machine translation
WMT '12: Proceedings of the Seventh Workshop on Statistical Machine Translation

Statistical phrase-based machine translation requires no linguistic information beyond word-aligned parallel corpora (Zens et al., 2002; Koehn et al., 2003). Unfortunately, this linguistic agnosticism often produces ungrammatical translations. Syntax, ...
Read More
Learning parse and translation decisions from examples with rich context
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
July 2006
1214 pages
Sponsors
In-Cooperation
Publisher
Association for Computational Linguistics
United States
Publication History
- Published: 17 July 2006
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate85of443submissions,19%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 125
  Total Citations
  View Citations
- 643
  Total Downloads
- Downloads (Last 12 months)28
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Scalable inference and training of context-rich syntactic translation models

ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

Incremental syntactic language models for phrase-based translation

CCG syntactic reordering models for phrase-based machine translation

Learning parse and translation decisions from examples with rich context

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Scalable inference and training of context-rich syntactic translation models

ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

Incremental syntactic language models for phrase-based translation

CCG syntactic reordering models for phrase-based machine translation

Learning parse and translation decisions from examples with rich context

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media