skip to main content
10.1145/3313991.3314006acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaeConference Proceedingsconference-collections
research-article

TEST: A Terminology Extraction System for Technology Related Terms

Authors Info & Claims
Published:23 February 2019Publication History

ABSTRACT

Tracking developments in the highly dynamic data-technology landscape are vital to keeping up with novel technologies and tools, in the various areas of Artificial Intelligence (AI). However, It is difficult to keep track of all the relevant technology keywords. In this paper, we propose a novel system that addresses this problem. This tool is used to automatically detect the existence of new technologies and tools in text, and extract terms used to describe these new technologies. The extracted new terms can be logged as new AI technologies as they are found on-the-fly in the web. It can be subsequently classified into the relevant semantic labels and AI domains. Our proposed tool is based on a two-stage cascading model--the first stage classifies if the sentence contains a technology term or not; and the second stage identifies the technology keyword in the sentence. We obtain a competitive accuracy for both tasks of sentence classification and text identification.

References

  1. C. C. Aggarwal and C. Zhai. 2012. A survey of text classification algorithms. In Mining text data. Springer, 163--222.Google ScholarGoogle Scholar
  2. S. Chakrabarti, B. Dom, R. Agrawal, and P. Raghavan. 1997. Using taxonomy, discriminants, and signatures for navigating in text databases. In VLDB, Vol. 97. 446--455. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. R. Finkel, T. Grenager, and C. Manning. 2005. Incorporating non-local information into information extraction systems by Gibbs sampling. In Proceedings of the 43rd annual meeting on association for computational linguistics. Association for Computational Linguistics, 363--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Hossari, S. Dev, M. Nicholson, K. McCabe, A. Nautiyal, C. Conran, J. Tang, X. Wei, and F. Pitie. 2018. ADNet: A Deep Network for Detecting Adverts. In Proc. Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2018).Google ScholarGoogle Scholar
  5. A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov. 2016. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759 (2016).Google ScholarGoogle Scholar
  6. J. D. Kelleher and B. Tierney. 2018. Data Science. The MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Nautiyal, K. McCabe, M. Hossari, S. Dev, M. Nicholson, C. Conran, D. McKibben, J. Tang, X. Wei, and F. Pitié. 2018. An Advert Creation System for Next-Gen Publicity. In Proc. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD).Google ScholarGoogle Scholar
  8. M. Sahami, S. Dumais, D. Heckerman, and E. Horvitz. 1998. A Bayesian approach to filtering junk e-mail. In Learning for Text Categorization: Papers from the 1998 workshop, Vol. 62. Madison, Wisconsin, 98--105.Google ScholarGoogle Scholar

Index Terms

  1. TEST: A Terminology Extraction System for Technology Related Terms

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICCAE 2019: Proceedings of the 2019 11th International Conference on Computer and Automation Engineering
          February 2019
          160 pages
          ISBN:9781450362870
          DOI:10.1145/3313991

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 23 February 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader