ABSTRACT
Citation function is defined as the author's reason for citing a given paper (e.g., acknowledgement of the use of the cited method). The automatic recognition of the rhetorical function of citations in scientific text has many applications, from improvement of impact factor calculations to text summarisation and more informative citation indexers. We show that our annotation scheme for citation function is reliable, and present a supervised machine learning framework to automatically classify citation function, using both shallow and linguistically inspired features. We find, amongst other things, a strong relationship between citation function and sentiment classification.
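To make the idea of supervised citation function classification concrete, here is a minimal sketch that uses only shallow features (lowercased word unigrams from the citing sentence). It is an illustration under stated assumptions, not the framework described in the abstract: the label names (`use`, `weakness`, `neutral`), the toy training sentences, the `CITE` placeholder token, and the choice of a multinomial Naive Bayes classifier are all hypothetical and were chosen for brevity.

```python
import math
from collections import Counter, defaultdict


def tokenize(sentence):
    # Shallow features only: lowercased word unigrams, punctuation stripped.
    return sentence.lower().replace(",", " ").replace(".", " ").split()


def train(examples):
    """Collect counts for multinomial Naive Bayes from (sentence, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for sentence, label in examples:
        label_counts[label] += 1
        for tok in tokenize(sentence):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return label_counts, word_counts, vocab


def classify(model, sentence):
    """Return the label with the highest add-one-smoothed log probability."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(sentence):
            if tok in vocab:  # tokens unseen in training are ignored
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data: invented sentences and invented label names.
TRAIN = [
    ("We use the parser of CITE for all experiments.", "use"),
    ("Our system adopts the method of CITE.", "use"),
    ("The approach of CITE fails on long sentences.", "weakness"),
    ("CITE cannot handle unseen words.", "weakness"),
    ("Citation analysis has been studied by CITE.", "neutral"),
    ("CITE describes a related corpus.", "neutral"),
]
model = train(TRAIN)
```

Calling `classify(model, "We adopt the parser of CITE.")` assigns one of the three toy labels to a new citing sentence. A realistic system would need an annotated corpus and richer, linguistically informed features of the kind the abstract mentions.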
- Automatic classification of citation function