skip to main content
10.1145/1816123.1816156acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

Evaluating topic models for digital libraries

Published:21 June 2010Publication History

ABSTRACT

Topic models could have a huge impact on improving the ways users find and discover content in digital libraries and search interfaces through their ability to automatically learn and apply subject tags to each and every item in a collection, and their ability to dynamically create virtual collections on the fly. However, much remains to be done to tap this potential, and empirically evaluate the true value of a given topic model to humans. In this work, we sketch out some sub-tasks that we suggest pave the way towards this goal, and present methods for assessing the coherence and interpretability of topics learned by topic models. Our large-scale user study includes over 70 human subjects evaluating and scoring almost 500 topics learned from collections from a wide range of genres and domains. We show how scoring model -- based on pointwise mutual information of word-pair using Wikipedia, Google and MEDLINE as external data sources - performs well at predicting human scores. This automated scoring of topics is an important first step to integrating topic modeling into digital libraries

References

  1. L. AlSumait, D. Barbará, J. Gentle, and C. Domeniconi. Topic significance ranking of LDA generative models. In ECML/PKDD (1), pages 67--82, 2009.Google ScholarGoogle Scholar
  2. D. Andrzejewski, X. Zhu, and M. Craven. Incorporating domain knowledge into topic modeling via Dirichlet forest priors. In ICML, page 4, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Armstrong, A. Moffat, W. Webber, and J. Zobel. Improvements that don't add up: ad-hoc retrieval results since 1998. In CIKM, pages 601--610, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Blei and J. Lafferty. Dynamic topic models. In ICML, pages 113--120, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. JMLR, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. W. Buntine and A. Jakulin. Applying discrete PCA in data analysis. In UAI, pages 59--66, Banff, Canada, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Chang, J. Boyd-Graber, S. Gerrish, C. Wang, and D. Blei. Reading tea leaves: How humans interpret topic models. In NIPS, pages 288--296, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T. Griffiths and M. Steyvers. Finding scientific topics. In PNAS, volume 101, pages 5228--5235, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  9. Q. Mei, X. Shen, and C. Zhai. Automatic labeling of multinomial topic models. In SIGKDD, pages 490--499, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Mimno and A. McCallum. Organizing the OCA: learning faceted subjects from a library of digital books. In JCDL, pages 376--385, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Newman, T. Baldwin, L. Cavedon, S. Karimi, D. Martinez, and J. Zobel. Visualizing document collections and search results using topic mapping. Journal of Web Semantics, to appear.Google ScholarGoogle Scholar
  12. D. Newman, K. Hagedorn, C. Chemudugunta, and P. Smyth. Subject metadata enrichment using statistical topic models. In JCDL, pages 366--375, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Newman, S. Karimi, and L. Cavedon. External evaluation of topic models. In ADCS, pages 11--18, 2009.Google ScholarGoogle Scholar
  14. D. Newman, J. Lau, K. Grieser, and T. Baldwin. Automatic evaluation of topic coherence. In NAACL HLT 2010, Los Angeles, USA, to appear. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. Teh, M. Jordan, M. Beal, and D. Blei. Hierarchical Dirichlet processes. JASA, 101(476):1566--1581, 2006.Google ScholarGoogle Scholar
  16. H. Wallach, D. Mimno, and A. McCallum. Rethinking LDA: Why priors matter. In NIPS, pages 1973--1981, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Evaluating topic models for digital libraries

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              JCDL '10: Proceedings of the 10th annual joint conference on Digital libraries
              June 2010
              424 pages
              ISBN:9781450300858
              DOI:10.1145/1816123

              Copyright © 2010 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 21 June 2010

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              Overall Acceptance Rate415of1,482submissions,28%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader