Machine translation (MT) is the translation of natural language (NL) text by computer: a source language (SL) text is mapped onto a target language (TL) text while its meaning is preserved. Interlingua-based MT systems first map the SL text into an intermediate language, or interlingua (IL), and then map the IL out to the TL. There are, however, no specifications for constructing an interlingua that are generally accepted among MT researchers.
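As a rough illustration of the two-step mapping described above (SL into IL, then IL out to TL), the following toy sketch may help. It is not the IL developed in the thesis: the `SpatialIL` record, the `ANALYSIS` lexicons, and the functions `analyze`, `generate`, and `translate` are all hypothetical names introduced here, and the sketch assumes input of the fixed form "figure relation ground" with no articles or inflection.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpatialIL:
    """Hypothetical language-neutral form for a simple locative phrase."""
    figure: str    # the located object
    relation: str  # the spatial relation, e.g. "ON"
    ground: str    # the reference object


# Toy lexicons (illustrative only): surface word -> IL concept, per language.
ANALYSIS = {
    "en": {"book": "BOOK", "table": "TABLE", "on": "ON"},
    "fr": {"livre": "BOOK", "table": "TABLE", "sur": "ON"},
}


def analyze(sentence: str, lang: str) -> SpatialIL:
    """Map an SL phrase of the form 'figure relation ground' into the IL."""
    lex = ANALYSIS[lang]
    figure, relation, ground = (lex[word] for word in sentence.lower().split())
    return SpatialIL(figure, relation, ground)


def generate(il: SpatialIL, lang: str) -> str:
    """Map an IL form out to a TL word string using the inverted lexicon."""
    inverse = {concept: word for word, concept in ANALYSIS[lang].items()}
    return " ".join(inverse[c] for c in (il.figure, il.relation, il.ground))


def translate(sentence: str, src: str, tgt: str) -> str:
    """Translate by going through the IL; no SL-to-TL rules are consulted."""
    return generate(analyze(sentence, src), tgt)


print(translate("book on table", "en", "fr"))
```

In this sketch, neither `analyze` nor `generate` refers to the other language, so adding a third language would only require one more lexicon entry in `ANALYSIS`. What the IL record itself should contain is exactly the open question the abstract raises.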
This thesis addresses the basic question of what belongs in an interlingua. The work demonstrates that the following “division of labor” is viable in an MT system: an interlingua can be constructed as its own level of representation, distinct from NL syntactic and conceptual levels of representation and linguistically relevant to the task of preserving meaning in MT. This IL captures a level of abstraction between NL syntactic representations and conceptual or knowledge representations (KR).
The thesis focuses on the translation of spatial expressions, i.e., natural language sentences that convey the location, orientation, or motion of physical objects in the real, three-dimensional world. The focus is both computationally and linguistically motivated: localist research holds that spatial expressions are structurally and semantically more basic than expressions in other fields and are therefore critical, for language learners and computational systems alike, in establishing the basic linguistic relations found in other types of expressions.
The research presented in the thesis spells out the consequences of the division-of-labor approach for an MT system. The contributions of the thesis are: (i) the design of a modular, multi-level MT system, (ii) the formalization of a theory of tiered IL forms and associated processing algorithms, and (iii) the encoding of single-language and cross-language generalizations in an IL for spatial expressions.
Index Terms
- Interlingua-based machine translation of spatial expressions