Input Method for Human Translators: A Novel Approach to Integrate Machine Translation Effectively and Imperceptibly

Authors:
Guoping Huang (ORCID: 0000-0001-6481-6250)
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Tencent AI Lab, Shenzhen

Jiajun Zhang
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Tencent AI Lab, Shenzhen

Yu Zhou
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Tencent AI Lab, Shenzhen

Chengqing Zong
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences, CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Nanshan District, Shenzhen

ACM Transactions on Asian and Low-Resource Language Information Processing, Volume 18, Issue 1, Article No. 4, pp. 1–22. https://doi.org/10.1145/3230638

Published: 12 November 2018

Abstract

Computer-aided translation (CAT) systems are the most popular tools for helping human translators work efficiently. To further improve translation efficiency, there is increasing interest in applying machine translation (MT) technology to upgrade CAT. To integrate MT thoroughly into CAT systems, we propose in this article a novel approach: a new input method that makes full use of the knowledge adopted by MT systems, such as translation rules, decoding hypotheses, and n-best translation lists. The proposed input method contains two parts: a phrase generation model, which allows human translators to type target sentences quickly, and an n-gram prediction model, which helps users choose perfect MT fragments smoothly. In addition, to tune the underlying MT system toward results better suited to the input method, we design a new evaluation metric for the MT system. The proposed input method integrates MT effectively and imperceptibly, and it is particularly suitable for target languages with complex characters, such as Chinese and Japanese. Extensive experiments demonstrate that our method saves more than 23% in time and over 42% in keystrokes, and that it improves translation quality by more than 5 absolute BLEU points compared with a strong baseline, namely post-editing using Google Pinyin.
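
To make the idea of predicting fragments from an n-best translation list more concrete, the following is a minimal illustrative sketch, not the authors' model. It assumes whitespace-tokenized hypotheses (English tokens are used only for readability) and ranks continuations by raw frequency in the n-best list; the function names `build_continuations` and `predict` are hypothetical, and the scoring used in the article may differ substantially.

```python
from collections import Counter


def build_continuations(nbest, max_n=3):
    """Map each word to a Counter of the 1..max_n-word fragments that follow it.

    `nbest` is a list of whitespace-tokenized MT hypotheses (an assumption
    made for this sketch; real systems would use their own tokenization).
    """
    table = {}
    for hyp in nbest:
        tokens = hyp.split()
        for i, tok in enumerate(tokens):
            for n in range(1, max_n + 1):
                cont = tokens[i + 1 : i + 1 + n]
                if len(cont) == n:  # skip fragments cut off by the sentence end
                    table.setdefault(tok, Counter())[" ".join(cont)] += 1
    return table


def predict(table, last_word, k=3):
    """Return up to k fragments likely to follow `last_word`.

    Ranking (assumed for illustration): higher count first, then longer
    fragments, then alphabetical order so that the result is deterministic.
    """
    counts = table.get(last_word, Counter())
    ranked = sorted(
        counts.items(),
        key=lambda kv: (-kv[1], -len(kv[0].split()), kv[0]),
    )
    return [fragment for fragment, _ in ranked[:k]]


if __name__ == "__main__":
    nbest = [
        "the cat sat on the mat",
        "the cat sat on a mat",
        "a cat sat on the mat",
    ]
    table = build_continuations(nbest)
    print(predict(table, "cat", k=2))
```

In this toy setting, a translator who has just typed "cat" would be offered multi-word fragments drawn from the MT hypotheses, so accepting one suggestion replaces several keystrokes. How much this saves in practice depends on MT quality and on the candidate ranking, neither of which this sketch models.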

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Machine translation
2. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction techniques
      1. Text input

Published in
ACM Transactions on Asian and Low-Resource Language Information Processing, Volume 18, Issue 1
March 2019, 196 pages
ISSN: 2375-4699
EISSN: 2375-4702
DOI: 10.1145/3292011
Editor: Nianwen Xue, Brandeis University, Waltham, USA
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery, New York, NY, United States
Publication History
- Published: 12 November 2018
- Accepted: 1 May 2018
- Revised: 1 March 2018
- Received: 1 November 2016

Author Tags
Machine translation
computer-aided translation
evaluation metric
input method
Qualifiers
- research-article
- Research
- Refereed