Article

DIRT @SBT@discovery of inference rules from text

Authors:
Dekang Lin

University of Alberta, Edmonton, Alberta T6H 2E1 Canada

University of Alberta, Edmonton, Alberta T6H 2E1 Canada
View Profile

,
Patrick Pantel

University of Alberta, Edmonton, Alberta T6H 2E1 Canada

University of Alberta, Edmonton, Alberta T6H 2E1 Canada
View Profile

KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data miningAugust 2001Pages 323–328https://doi.org/10.1145/502512.502559

Published:26 August 2001Publication History

KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 323–328

ABSTRACT

In this paper, we propose an unsupervised method for discovering inference rules from text, such as "X is author of Y ≈ X wrote Y", "X solved Y ≈ X found a solution to Y", and "X caused Y ≈ Y is triggered by X". Inference rules are extremely important in many fields such as natural language processing, information retrieval, and artificial intelligence in general. Our algorithm is based on an extended version of Harris' Distributional Hypothesis, which states that words that occurred in the same contexts tend to be similar. Instead of using this hypothesis on words, we apply it to paths in the dependency trees of a parsed corpus.

References

1.Aniek, P.G. and Tipimeni, S. 1999. The Paraphrase Search Assistant: Terminological Feedback for Iterative Information Seeking. In Proceedings of SIGIR-99. pp. 153-159. Berkeley, CA. Google ScholarDigital Library
2.Arampatzis, A. T., Tsoris, T., Koster, C. H. A., and van der Weide, T. P. 1998. Phrase-based infase-bas retrieval. Information Processing & Management, 34(6):693-707. Google ScholarDigital Library
3.Barzitay, R., McKeewn, K., and Elhadad, M. 1999. Information Fusion in the Context of Multi-Document Summarization. In Proceedings of ACL-99. College Park, Maryland. Google ScholarDigital Library
4.Dras, M. 1999. A meta-level Grammar: Redefining Synchronous TAGs for Translation and Paraphrase. In Proceedings of ACL-99. pp. 80-97. College Park, Maryland. Google ScholarDigital Library
5.Feldman, R., Fresko, M., Kinar, Y., Lindell, Y., Liphstat, O., Rajman, M., Schler, Y., and Zamir, O. 1998. Text Mining at the Term Level. In Proceedings of the 2nd European Symposiura on Principles of Data Mining and Knowledge Discovery. pp. 65-73. Nantes, France. Google ScholarDigital Library
6.Hahn, U. and Schnattinger, K. 1998. Towards Text Knowledge Engineering. In Proceedings ofAAA1-98/IAAI-98. Menlo Park, CA. pp. 524-531. Google ScholarDigital Library
7.Harris, Z. 1985. Distributional Structure. In: Katz, J. J. (ed.) The Philosophy of Linguistics. New York: Oxford University Press. pp. 26-47.Google Scholar
8.Hays, D. 1964. Dependency Theory: A Formalism and Some Observations. Language, 40:511-525.Google Scholar
9.Hearst, M. 1992. Automatic Acquisition of Hyponyms from Large Text Corpora. In Proceedings of A CL-92. Nantes, France. Google ScholarDigital Library
10.Hindle, D. 1990. Noun Classification from Predicate-Argument Structures. In Proceedings of ACL-90. pp. 268-275. Pittsburgh. Google ScholarDigital Library
11.Iordanskaja, L, Kittredge, R., and Polguere, A. 1991. Natural Language Generation in Artificial Intelligence and Computational Linguistics. Kluwer. Boston, MA.Google Scholar
12.Jacquemin, C., Klavans, J. L., and Tzoukermann, E. 1999. NLP for Term Variant Extraction: A Synergy of Morphology, Lexicon, and Syntax. Natural Language Information Retrieval, T. Strzalkowski, editor, pp. 25-74. Kluwer. Boston, MA.Google Scholar
13.Larsen, B. and Aone, C. 1999. Fast and effective text mining using lineartime document clustering. In Proceedings of KDD-99. pp. 16-22. San Diego, CA. Google ScholarDigital Library
14.Lin, S. H., Shih, C. S., Chen, M. C., et al. 1998. Extracting Classification Knowledge of Internet Doeumeuts with Mining Term Associations: A Semantic Approach. In Proceedings of SIGIR-98. Melbourne, Australia. Google ScholarDigital Library
15.Lin, D. 1998. Extracting Collocations from Text Corpora. Workshop on Computational Terminology. pp. 57-63. Montreal, Canada.Google Scholar
16.Lin, D. 1993. Principle-Based Parsing Without OverGeneration. In Proceedings of A CL-93. pp. 112-120. Columbus, OH. Google ScholarDigital Library
17.MeCallum, A., Nigam, K., and Ungar, L. H. 2000. Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching. In Proceedings of KDD-2000. Boston, MA. Google ScholarDigital Library
18.Mercer, M. M. and Shaked, V. 1988. Strategies for Effective Paraphrasing. In Proceedings of COLING-88. pp. 431-436 Budapest. Google ScholarDigital Library
19.Pereira, F., Tishby, N., and Lee, L. 1993. Distributional Clustering of English Words. In Proceedings of ACL-93. pp. 183-190. Columbus, Ohio. Google ScholarDigital Library
20.Rajman, M. and Besancon, R. 1997. Text Mining: Natural Language Techniques and Text Mining Applications. In Proceedings of the seventh IFIP 2.6 Working Conference on Database Semantics (DS-7).Google Scholar
21.Richardson, S. D. 1997. Determining Similarity and the Inferring Relations in a LexicalKnowledge-Base. Ph.D. Thesis. The City University of New York. Google ScholarDigital Library
22.Robin, J. 1994. Revision-based Generation of Natural Language Summaries Providing Historical Background. Ph.D. Dissertation. Columbia University. Google ScholarDigital Library
23.Sampson, G. 1995. English for the Computer - The SUSANNE Corpus and Analytic Scheme. Clarendon Press. Oxford, England.Google Scholar
24.Sparek Jones, K. and Tait, J. 1. 1984. Automatic Search Term Variant Generation. Journal of Documentation, 40(1):50-66.Google ScholarCross Ref

Index Terms

DIRT @SBT@discovery of inference rules from text
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Language resources
2. Mathematics of computing
  1. Discrete mathematics
    1. Graph theory
      1. Graph algorithms

Recommendations

From Raw Text to Morphological Rules for Iban Morphological Analyser
IALP '12: Proceedings of the 2012 International Conference on Asian Language Processing

To extend a complete workflow of automatic acquisition of morphological rules for morphological analyser, we propose a semi-automatic workflow for under-resourced language, which is Iban language. The workflow focuses in determining the rules to be used ...
Read More
A Chinese Text Synthesis Method Based on Word Order Rules
Abstract
Chinese language processing is a basic subject in Natural Language Processing (NLP). The analysis of Chinese sentences is a difficult point and a research focus in this field. We propose a method for Chinese text analysis and synthesis based on ...
Read More
Learning verb inference rules from linguistically-motivated evidence
EMNLP-CoNLL '12: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

Learning inference relations between verbs is at the heart of many semantic applications. However, most prior work on learning such rules focused on a rather narrow set of information sources: mainly distributional similarity, and to a lesser extent ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
August 2001
493 pages
ISBN:158113391X
DOI:10.1145/502512
Conference Chair:
Doheon Lee
Chonnam National University, Korea
,
General Chair:
Mario Schkolnick
SGI
,
Program Chairs:
Foster Provost
New York University
,
Ramakrishnan Srikant
IBM Almaden Research Center
Copyright © 2001 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 August 2001
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- Article
Conference

Acceptance Rates
KDD '01 Paper Acceptance Rate31of237submissions,13%Overall Acceptance Rate1,133of8,635submissions,13%
More
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 239
  Total Citations
  View Citations
- 1,753
  Total Downloads
- Downloads (Last 12 months)17
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

DIRT @SBT@discovery of inference rules from text

KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

From Raw Text to Morphological Rules for Iban Morphological Analyser

A Chinese Text Synthesis Method Based on Word Order Rules

Learning verb inference rules from linguistically-motivated evidence