The mathematics of statistical machine translation: parameter estimation

Authors:
Peter F. Brown, IBM T.J. Watson Research Center
Vincent J. Della Pietra, IBM T.J. Watson Research Center
Stephen A. Della Pietra, IBM T.J. Watson Research Center
Robert L. Mercer, IBM T.J. Watson Research Center

Computational Linguistics, Volume 19, Issue 2, pp. 263–311

Published: 1 June 1993

Abstract

We describe a series of five statistical models of the translation process and give algorithms for estimating the parameters of these models given a set of pairs of sentences that are translations of one another. We define a concept of word-by-word alignment between such pairs of sentences. For any given pair of such sentences each of our models assigns a probability to each of the possible word-by-word alignments. We give an algorithm for seeking the most probable of these alignments. Although the algorithm is suboptimal, the alignment thus obtained accounts well for the word-by-word relationships in the pair of sentences. We have a great deal of data in French and English from the proceedings of the Canadian Parliament. Accordingly, we have restricted our work to these two languages; but we feel that because our algorithms have minimal linguistic content they would work well on other pairs of languages. We also feel, again because of the minimal linguistic content of our algorithms, that it is reasonable to argue that word-by-word alignments are inherent in any sufficiently large bilingual corpus.
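The abstract does not spell out the models themselves. As an illustration only, the sketch below estimates word-translation probabilities from sentence pairs with expectation-maximization and then reads off a word-by-word alignment. It is a deliberately simplified stand-in, not any of the paper's five models: it assumes a uniform prior over alignments, has no null word, and has no fertility or distortion parameters. The names `train_lexical_model` and `best_alignment` and the two-sentence toy corpus are assumptions made for this example.

```python
from collections import defaultdict


def train_lexical_model(pairs, iterations=10):
    """Estimate t[(f, e)], an approximation of P(f | e), by EM.

    pairs: list of (french_tokens, english_tokens) sentence pairs.
    Assumption: every alignment of a French word to an English word in the
    same sentence pair is equally likely a priori.
    """
    french_vocab = {f for fs, _ in pairs for f in fs}
    uniform = 1.0 / len(french_vocab)
    t = defaultdict(lambda: uniform)  # uniform starting point

    for _ in range(iterations):
        count = defaultdict(float)  # expected number of (f, e) connections
        total = defaultdict(float)  # expected number of connections to e
        # E-step: share each French word among the English words of its pair.
        for fs, es in pairs:
            for f in fs:
                z = sum(t[(f, e)] for e in es)
                for e in es:
                    c = t[(f, e)] / z
                    count[(f, e)] += c
                    total[e] += c
        # M-step: renormalize expected counts into probabilities.
        t = defaultdict(float)
        for (f, e), c in count.items():
            t[(f, e)] = c / total[e]
    return t


def best_alignment(fs, es, t):
    """For each French word, return the index of its most probable English word."""
    return [max(range(len(es)), key=lambda i: t[(f, es[i])]) for f in fs]


pairs = [
    (["la", "maison"], ["the", "house"]),
    (["la", "fleur"], ["the", "flower"]),
]
t = train_lexical_model(pairs)
print(best_alignment(["la", "maison"], ["the", "house"], t))  # [0, 1]
```

In this simplified setting each French word can be aligned independently, so choosing the best English word per French word already maximizes the alignment probability. Richer models couple these choices, which is why the abstract speaks of seeking, rather than guaranteeing, the most probable alignment.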





Published in

Computational Linguistics, Volume 19, Issue 2
Special issue on using large corpora: II
June 1993
185 pages
ISSN: 0891-2017
EISSN: 1530-9312
Publisher: MIT Press, Cambridge, MA, United States

