Unsupervised relation extraction by mining Wikipedia texts using information from the web

Authors:
Yulan Yan, Naoaki Okazaki, Yutaka Matsuo, Zhenglu Yang, Mitsuru Ishizuka

The University of Tokyo, Bunkyo-ku, Tokyo, Japan

ACL '09: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2, August 2009, Pages 1021–1029

Published: 02 August 2009

ABSTRACT

This paper presents an unsupervised relation extraction method for discovering and enhancing relations in which a specified concept in Wikipedia participates. Using respective characteristics of Wikipedia articles and Web corpus, we develop a clustering approach based on combinations of patterns: dependency patterns from dependency analysis of texts in Wikipedia, and surface patterns generated from highly redundant information related to the Web. Evaluations of the proposed approach on two different domains demonstrate the superiority of the pattern combination over existing approaches. Fundamentally, our method demonstrates how deep linguistic patterns contribute complementarily with Web surface patterns to the generation of various relations.
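
As a rough illustration of the idea of clustering on a combination of patterns, the following minimal sketch groups concept pairs by a weighted mix of dependency-pattern overlap and surface-pattern overlap. It is not the authors' algorithm: the Jaccard similarity, the single-link merging, the `weight` and `threshold` parameters, the function names, and the toy patterns and entity pairs are all assumptions made for illustration only.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def combined_similarity(p, q, weight=0.5):
    """Weighted mix of dependency-pattern and surface-pattern overlap.

    `weight` is a hypothetical knob: 1.0 uses dependency patterns only,
    0.0 uses surface patterns only.
    """
    dep = jaccard(p["dep"], q["dep"])
    surf = jaccard(p["surface"], q["surface"])
    return weight * dep + (1 - weight) * surf


def cluster_pairs(pairs, threshold=0.2, weight=0.5):
    """Single-link clustering of concept pairs using union-find.

    Two pairs end up in the same cluster when their combined similarity
    reaches `threshold` (an illustrative value, not taken from the paper).
    """
    names = list(pairs)
    parent = {n: n for n in names}

    def find(n):
        # Follow parent links to the cluster root, compressing the path.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for a, b in combinations(names, 2):
        if combined_similarity(pairs[a], pairs[b], weight) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(sorted(c) for c in clusters.values())


# Toy, hand-written patterns (hypothetical; real ones would come from a
# dependency parser over Wikipedia text and from Web search snippets).
toy_pairs = {
    ("Microsoft", "Bill Gates"): {
        "dep": {"found<-nsubj"},
        "surface": {"X was founded by Y", "Y founder of X"},
    },
    ("Apple", "Steve Jobs"): {
        "dep": {"found<-nsubj"},
        "surface": {"X was founded by Y"},
    },
    ("Microsoft", "Redmond"): {
        "dep": {"base<-prep_in"},
        "surface": {"X is headquartered in Y"},
    },
    ("Apple", "Cupertino"): {
        "dep": set(),  # no dependency pattern extracted for this pair
        "surface": {"X is headquartered in Y", "X headquarters in Y"},
    },
}

if __name__ == "__main__":
    for cluster in cluster_pairs(toy_pairs, threshold=0.2, weight=0.5):
        print(cluster)
```

In this toy data the last pair has no dependency pattern, so with `weight=1.0` it stays in its own cluster; mixing in the surface patterns lets it join the other headquarters pair. This is one simple way to picture the complementary contribution the abstract describes.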

Published in
ACL '09: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2
August 2009, 595 pages
ISBN: 9781932432466
General Chair: Keh-Yih Su, Behavior Design Corp., Taiwan
Publisher: Association for Computational Linguistics, United States
Qualifier: research-article
Overall acceptance rate: 85 of 443 submissions, 19%