article

An analysis of a high-performance japanese question answering system

Author:
Hideki Isozaki

NTT Communication Science Laboratories, NTT Corporation, Kyoto, Japan

NTT Communication Science Laboratories, NTT Corporation, Kyoto, Japan
View Profile

ACM Transactions on Asian Language Information Processing Volume 4 Issue 3pp 263–279https://doi.org/10.1145/1111667.1111670

Published:01 September 2005Publication History

ACM Transactions on Asian Language Information Processing

Abstract

Twenty-five Japanese Question Answering systems participated in NTCIR QAC2 subtask 1. Of these, our system SAIQA-QAC2 performed the best: MRR = 0.607. SAIQA-QAC2 is an improvement on our previous system SAIQA-Ii that achieved MRR = 0.46 for QAC1. We mainly improved the answer-type determination module and the retrieval module. In general, a fine-grained answer taxonomy improves QA performance but it is difficult to build an accurate answer extraction module for the fine-grained taxonomy because Machine Learning methods require a huge training corpus and hand-crafted rules are hard to maintain. Therefore, we built a fine-grained system by using a coarse-grained named entity recognizer and a Japanese lexicon “Nihongo Goi-taikei.” Our experiments show that named entity/numerical expression recognition and word sense-based answer extraction mainly contributed to the performance. In addition, we developed a new proximity-based document retrieval module that performs better than BM25. We also compared its performance with MultiText, a conventional proximity-based retrieval method developed for QA.

References

Akiba, T., Itou, K., and Fujii, A. 2004. Question answering using “common sense” and utility maximization principle. In Working Notes of NTCIR-4. 297--303.Google Scholar
Clarke, C. L. A. and Terra, E. L. 2003. Passage retrieval vs. document retrieval for factoid question answering. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 427--428. Google Scholar
Clarke, C. L. A., Cormack, G. V., and Lynam, T. R. 2001. Exploiting redundancy in question answering. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 358--365. Google Scholar
Harabagiu, S., Moldovan, D., Pasca, M., Mihalcea, R., Surdeanu, M., Bunescu, R., and Girju, R. 2000. FALCON: Boosting knowledge for answer engines. In Proceedings of Ninth Text REtrieval Conference. 479--488.Google Scholar
Hayashi, Y., Kikui, G., and Tomita, J. 2000. Searching text-rich XML documents with relevance ranking. In Proceedings of SIGIR 2000 Workshop on XML and Information Retrieval.Google Scholar
Hirao, T., Sasaki, Y., and Isozaki, H. 2001. An extrinsic evaluation for question-biased text summarization on QA tasks. In Proceedings of the Workshop on Automatic Summarization, The Second Meeting of the North American Chapter of the Association for Computational Linguistics. 61--68.Google Scholar
Ichimura, Y., Saito, Y., Sakai, T., Kokubu, T., and Koyama, M. 2004. A study of the relations among question answering, Japanese named entity extraction, and named entity taxonomy (in Japanese). In IPSJ SIG Technical Report NL-161. 17--24.Google Scholar
Ikehara, S., Miyazaki, M., Shirai, S., Yokoo, A., Nakaiwa, H., Ogura, K., Ooyama, Y., and Hayashi, Y. 1997. Goi-Taikei---A Japanese Lexicon (in Japanese). Iwanami Shoten.Google Scholar
Isozaki, H. and Kazawa, H. 2002. Efficient support vector classifiers for named entity recognition. In Proceedings of the 19th International Conference on Computational Linguistics. 390--396. Google Scholar
Jones, K. S., Walker, S., and Robertson, S. E. 2000. A probabilistic model of information retrieval: development and comparative experiments. Information Processing and Management 36, 779--840. Google Scholar
Mori, T. 2004. Japanese Q/A systems using A&ast; search and its improvement. In Working Notes of NTCIR-4. 345--352.Google Scholar
Murata, M., Utiyama, M., and Isahara, H. 2004. Japanese question-answering system using decreased adding with multiple answers. In Working Notes of NTCIR-4. 353--360.Google Scholar
Nomoto, M., Fukushige, Y., Sato, M., and Suzuki, H. 2004. NTCIR-4 QAC experiments at Matsushita. In Working Notes of NTCIR-4. 373--380.Google Scholar
Ravichandran, D. and Hovy, E. 2002. Learning surface text patterns for a question answering system. In Proceedings of the 40th Annual Meeting of the Assocication for Computational Linguistics. 41--47. Google Scholar
Sakai, T., Saito, Y., and Ichimura, Y. 2004. Toshiba ASKMi at NTCIR-4 QAC2. In Working Notes of NTCIR-4. 387--394.Google Scholar
Sasaki, Y. 2003. Question answering as abduction: A feasibility study at NTCIR QAC1. IEICE Transaction on Information and Systems E86-D, 9, 1669--1676.Google Scholar
Sasaki, Y., Isozaki, H., Hirao, T., Kokuryou, K., and Maeda, E. 2002. NTT's QA systems for NTCIR QAC-1. In Working Notes of the Third NTCIR Workshop Meeting, Part IV: Question Answering Challenge (QAC1). 63--70.Google Scholar
Sekine, S. and Eriguchi, Y. 2000. Japanese named entity extraction evaluation---analysis of results. In Proceedings of the 18th International Conference on Computational Linguistics. 1106--1110. Google Scholar
Soricut, R. and Brill, E. 2003. Automatic question answering: Beyond the factoid. In Proceedings of the Human Language Technology and North American Association for Computational Linguistics Conference. 149--156. Google Scholar
Suzuki, J., Sasaki, Y., and Maeda, E. 2002. SVM answer selection for open-domain question answering. In Proceedings of the 19th International Conference on Computational Linguistics. 974--980. Google Scholar

Index Terms

An analysis of a high-performance japanese question answering system
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval

Recommendations

Architecture and evaluation of BRUJA, a multilingual question answering system
Abstract
Given a user question, the goal of a Question Answering (QA) system is to retrieve answers rather than full documents or even best-matching passages, as most Information Retrieval systems currently do. In this paper, we present BRUJA, a QA system ...
Read More
A Factoid Question Answering System for Vietnamese
WWW '18: Companion Proceedings of the The Web Conference 2018

In this paper, we describe the development of an end-to-end factoid question answering system for the Vietnamese language. This system combines both statistical models and ontology-based methods in a chain of processing modules to provide high-quality ...
Read More
Human question answering performance using an interactive document retrieval system
IIIX '12: Proceedings of the 4th Information Interaction in Context Symposium

Every day, people answer their questions by using document retrieval systems. Compared to document retrieval systems, question answering (QA) systems aim to speed the rate at which users find answers by retrieving answers rather than documents. To ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Asian Language Information Processing Volume 4, Issue 3
September 2005
138 pages
ISSN:1530-0226
EISSN:1558-3430
DOI:10.1145/1111667
Issue’s Table of Contents

Copyright © 2005 ACM
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 September 2005
Published in talip Volume 4, Issue 3

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Question answering
document retrieval
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 548
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

An analysis of a high-performance japanese question answering system

ACM Transactions on Asian Language Information Processing

Abstract

References

Cited By

Index Terms

Recommendations

Architecture and evaluation of BRUJA, a multilingual question answering system

A Factoid Question Answering System for Vietnamese

Human question answering performance using an interactive document retrieval system

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

An analysis of a high-performance japanese question answering system

ACM Transactions on Asian Language Information Processing

Abstract

References

Cited By

Index Terms

Recommendations

Architecture and evaluation of BRUJA, a multilingual question answering system

A Factoid Question Answering System for Vietnamese

Human question answering performance using an interactive document retrieval system

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media