skip to main content
10.1145/502512.502559acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

DIRT @SBT@discovery of inference rules from text

Authors Info & Claims
Published:26 August 2001Publication History

ABSTRACT

In this paper, we propose an unsupervised method for discovering inference rules from text, such as "X is author of Y ≈ X wrote Y", "X solved Y ≈ X found a solution to Y", and "X caused Y ≈ Y is triggered by X". Inference rules are extremely important in many fields such as natural language processing, information retrieval, and artificial intelligence in general. Our algorithm is based on an extended version of Harris' Distributional Hypothesis, which states that words that occurred in the same contexts tend to be similar. Instead of using this hypothesis on words, we apply it to paths in the dependency trees of a parsed corpus.

References

  1. 1.Aniek, P.G. and Tipimeni, S. 1999. The Paraphrase Search Assistant: Terminological Feedback for Iterative Information Seeking. In Proceedings of SIGIR-99. pp. 153-159. Berkeley, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.Arampatzis, A. T., Tsoris, T., Koster, C. H. A., and van der Weide, T. P. 1998. Phrase-based infase-bas retrieval. Information Processing & Management, 34(6):693-707. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.Barzitay, R., McKeewn, K., and Elhadad, M. 1999. Information Fusion in the Context of Multi-Document Summarization. In Proceedings of ACL-99. College Park, Maryland. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.Dras, M. 1999. A meta-level Grammar: Redefining Synchronous TAGs for Translation and Paraphrase. In Proceedings of ACL-99. pp. 80-97. College Park, Maryland. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.Feldman, R., Fresko, M., Kinar, Y., Lindell, Y., Liphstat, O., Rajman, M., Schler, Y., and Zamir, O. 1998. Text Mining at the Term Level. In Proceedings of the 2nd European Symposiura on Principles of Data Mining and Knowledge Discovery. pp. 65-73. Nantes, France. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.Hahn, U. and Schnattinger, K. 1998. Towards Text Knowledge Engineering. In Proceedings ofAAA1-98/IAAI-98. Menlo Park, CA. pp. 524-531. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.Harris, Z. 1985. Distributional Structure. In: Katz, J. J. (ed.) The Philosophy of Linguistics. New York: Oxford University Press. pp. 26-47.Google ScholarGoogle Scholar
  8. 8.Hays, D. 1964. Dependency Theory: A Formalism and Some Observations. Language, 40:511-525.Google ScholarGoogle Scholar
  9. 9.Hearst, M. 1992. Automatic Acquisition of Hyponyms from Large Text Corpora. In Proceedings of A CL-92. Nantes, France. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.Hindle, D. 1990. Noun Classification from Predicate-Argument Structures. In Proceedings of ACL-90. pp. 268-275. Pittsburgh. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.Iordanskaja, L, Kittredge, R., and Polguere, A. 1991. Natural Language Generation in Artificial Intelligence and Computational Linguistics. Kluwer. Boston, MA.Google ScholarGoogle Scholar
  12. 12.Jacquemin, C., Klavans, J. L., and Tzoukermann, E. 1999. NLP for Term Variant Extraction: A Synergy of Morphology, Lexicon, and Syntax. Natural Language Information Retrieval, T. Strzalkowski, editor, pp. 25-74. Kluwer. Boston, MA.Google ScholarGoogle Scholar
  13. 13.Larsen, B. and Aone, C. 1999. Fast and effective text mining using lineartime document clustering. In Proceedings of KDD-99. pp. 16-22. San Diego, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.Lin, S. H., Shih, C. S., Chen, M. C., et al. 1998. Extracting Classification Knowledge of Internet Doeumeuts with Mining Term Associations: A Semantic Approach. In Proceedings of SIGIR-98. Melbourne, Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.Lin, D. 1998. Extracting Collocations from Text Corpora. Workshop on Computational Terminology. pp. 57-63. Montreal, Canada.Google ScholarGoogle Scholar
  16. 16.Lin, D. 1993. Principle-Based Parsing Without OverGeneration. In Proceedings of A CL-93. pp. 112-120. Columbus, OH. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.MeCallum, A., Nigam, K., and Ungar, L. H. 2000. Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching. In Proceedings of KDD-2000. Boston, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.Mercer, M. M. and Shaked, V. 1988. Strategies for Effective Paraphrasing. In Proceedings of COLING-88. pp. 431-436 Budapest. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.Pereira, F., Tishby, N., and Lee, L. 1993. Distributional Clustering of English Words. In Proceedings of ACL-93. pp. 183-190. Columbus, Ohio. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20.Rajman, M. and Besancon, R. 1997. Text Mining: Natural Language Techniques and Text Mining Applications. In Proceedings of the seventh IFIP 2.6 Working Conference on Database Semantics (DS-7).Google ScholarGoogle Scholar
  21. 21.Richardson, S. D. 1997. Determining Similarity and the Inferring Relations in a LexicalKnowledge-Base. Ph.D. Thesis. The City University of New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. 22.Robin, J. 1994. Revision-based Generation of Natural Language Summaries Providing Historical Background. Ph.D. Dissertation. Columbia University. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. 23.Sampson, G. 1995. English for the Computer - The SUSANNE Corpus and Analytic Scheme. Clarendon Press. Oxford, England.Google ScholarGoogle Scholar
  24. 24.Sparek Jones, K. and Tait, J. 1. 1984. Automatic Search Term Variant Generation. Journal of Documentation, 40(1):50-66.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. DIRT @SBT@discovery of inference rules from text

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2001
        493 pages
        ISBN:158113391X
        DOI:10.1145/502512

        Copyright © 2001 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 26 August 2001

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        KDD '01 Paper Acceptance Rate31of237submissions,13%Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader