Abstract
We describe a series of five statistical models of the translation process and give algorithms for estimating the parameters of these models given a set of pairs of sentences that are translations of one another. We define a concept of word-by-word alignment between such pairs of sentences. For any given pair of such sentences, each of our models assigns a probability to each of the possible word-by-word alignments. We give an algorithm for seeking the most probable of these alignments. Although the algorithm is suboptimal, the alignment thus obtained accounts well for the word-by-word relationships in the pair of sentences. We have a great deal of data in French and English from the proceedings of the Canadian Parliament. Accordingly, we have restricted our work to these two languages; but we feel that, because our algorithms have minimal linguistic content, they would work well on other pairs of languages. We also feel, again because of the minimal linguistic content of our algorithms, that it is reasonable to argue that word-by-word alignments are inherent in any sufficiently large bilingual corpus.
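To make the idea of estimating parameters from sentence pairs concrete, here is a minimal sketch in the spirit of the simplest model in such a series: every alignment of a sentence pair is treated as equally likely a priori, and only word-translation probabilities t(f | e) are learned, using the EM algorithm. The function names, the `<null>` token, the toy corpus, and the iteration count are illustrative assumptions, not taken from the paper. For a model this simple, the most probable alignment can be read off exactly, one French word at a time; a suboptimal search of the kind the abstract mentions would only be needed for richer models.

```python
from collections import defaultdict

NULL = "<null>"  # assumed empty-word token: lets a French word align to no English word


def train_word_translation(pairs, iterations=10):
    """EM estimate of t[(f, e)] ~ P(french word f | english word e).

    pairs: list of (english_tokens, french_tokens) that are mutual translations.
    Assumes every alignment is equally likely a priori.
    """
    pairs = [([NULL] + list(es), list(fs)) for es, fs in pairs]
    french_vocab = {f for _, fs in pairs for f in fs}
    # Start from a uniform distribution over the French vocabulary.
    t = defaultdict(lambda: 1.0 / len(french_vocab))

    for _ in range(iterations):
        count = defaultdict(float)   # expected number of times e generates f
        total = defaultdict(float)   # expected number of times e generates anything
        # E-step: distribute each French word over the English words in its sentence.
        for es, fs in pairs:
            for f in fs:
                z = sum(t[(f, e)] for e in es)
                for e in es:
                    p = t[(f, e)] / z
                    count[(f, e)] += p
                    total[e] += p
        # M-step: renormalise expected counts into probabilities.
        t = defaultdict(float, {(f, e): c / total[e] for (f, e), c in count.items()})
    return t


def best_alignment(es, fs, t):
    """For each French word, the index of its most probable English word.

    Index 0 is the NULL word; index i >= 1 is es[i - 1].
    """
    es = [NULL] + list(es)
    return [max(range(len(es)), key=lambda i: t[(f, es[i])]) for f in fs]


# Toy corpus (invented for illustration only).
corpus = [
    ("the house".split(), "la maison".split()),
    ("the flower".split(), "la fleur".split()),
    ("a flower".split(), "une fleur".split()),
]
t = train_word_translation(corpus)
alignment = best_alignment("the house".split(), "la maison".split(), t)
```

On this toy corpus, "la" ends up aligned with "the" and "maison" with "house", even though no dictionary was supplied: the co-occurrence statistics alone separate them. That is the sense in which such algorithms have minimal linguistic content.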
Index Terms
- The mathematics of statistical machine translation: parameter estimation