skip to main content
10.1145/2700171.2791023acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
research-article

Machine Classification and Analysis of Suicide-Related Communication on Twitter

Published:24 August 2015Publication History

ABSTRACT

The World Wide Web, and online social networks in particular, have increased connectivity between people such that information can spread to millions of people in a matter of minutes. This form of online collective contagion has provided many benefits to society, such as providing reassurance and emergency management in the immediate aftermath of natural disasters. However, it also poses a potential risk to vulnerable Web users who receive this information and could subsequently come to harm. One example of this would be the spread of suicidal ideation in online social networks, about which concerns have been raised. In this paper we report the results of a number of machine classifiers built with the aim of classifying text relating to suicide on Twitter. The classifier distinguishes between the more worrying content, such as suicidal ideation, and other suicide-related topics such as reporting of a suicide, memorial, campaigning and support. It also aims to identify flippant references to suicide. We built a set of baseline classifiers using lexical, structural, emotive and psychological features extracted from Twitter posts. We then improved on the baseline classifiers by building an ensemble classifier using the Rotation Forest algorithm and a Maximum Probability voting classification decision method, based on the outcome of base classifiers. This achieved an F-measure of 0.728 overall (for 7 classes, including suicidal ideation) and 0.69 for the suicidal ideation class. We summarise the results by reflecting on the most significant predictive principle components of the suicidal ideation class to provide insight into the language used on Twitter to express suicidal ideation.

References

  1. A. Abboute, Y. Boudjeriou, G. Entringer, J. Aze, S. Bringay, and P. Poncelet. Mining twitter for suicide prevention. In Natural Language Processing and Information Systems, volume 8455 of Lecture Notes in Computer Science, pages 250--253. Springer, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  2. D. Baker and S. Fortune. Understanding self-harm and suicide websites. Crisis: The Journal of Crisis Intervention and Suicide Prevention, 29(3):118--122, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  3. L. Barbosa and J. Feng. Robust sentiment detection on twitter from biased and noisy data. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pages 36--44. Association for Computational Linguistics, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. K. Becker and M. H. Schmidt. When kids seek help on-line: Internet chat rooms and suicide. reclaiming children and youth, 13(4):229--230, 2005.Google ScholarGoogle Scholar
  5. L. Biddle, J. Donovan, K. Hawton, N. Kapur, and D. Gunnell. Suicide and the internet. Bmj, 336(7648):800--802, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  6. L. Breiman. Bagging predictors. Machine learning, 24(2):123--140, 1996. Google ScholarGoogle ScholarCross RefCross Ref
  7. P. Burnap, O. F. Rana, N. Avis, M. Williams, W. Housley, A. Edwards, J. Morgan, and L. Sloan. Detecting tension in online communities with computational twitter analysis. Technological Forecasting and Social Change, 2013.Google ScholarGoogle Scholar
  8. M. D. C. S. Counts and M. Gamon. Not all moods re created equal! a exploring human emotional states in social media. 2012.Google ScholarGoogle Scholar
  9. K. Daine, K. Hawton, V. Singaravelu, A. Stewart, S. Simkin, and P. Montgomery. The power of the web: a systematic review of studies of the influence of the internet on self-harm and suicide in young people. PloS one, 8(10):e77555, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  10. M. De Choudhury, S. Counts, E. J. Horvitz, and A. Hoff. Characterizing and predicting postpartum depression from shared facebook data. In Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work & Social Computing, CSCW '14, pages 626--638, New York, NY, USA, 2014. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. De Choudhury, M. Gamon, S. Counts, and E. Horvitz. Predicting depression via social media. In ICWSM, 2013.Google ScholarGoogle Scholar
  12. B. Desmet and V. Hoste. Emotion detection in suicide notes. Expert Systems with Applications, 40(16):6351--6358, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. Freund and R. E. Schapire. A desicion-theoretic generalization of on-line learning and an application to boosting. In Computational learning theory, pages 23--37. Springer, 1995. Google ScholarGoogle ScholarCross RefCross Ref
  14. M. Gould, P. Jamieson, and D. Romer. Media contagion and suicide among the young. American Behavioral Scientist, 46(9):1269--1284, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  15. J. F. Gunn and D. Lester. Twitter postings and suicide: An analysis of the postings of a fatal suicide in the 24 hours prior to death. Present tense, 27(16):42, 2012.Google ScholarGoogle Scholar
  16. C. Homan, R. Johar, T. Liu, M. Lytle, V. Silenzio, and C. Ovesdotter Alm. Toward macro-insights for suicide prevention: Analyzing fine-grained distress at scale. In Proceedings of the Workshop on Computational Linguistics and Clinical Psychology, pages 107--117, Baltimore, Maryland, USA, June 2014. Association for Computational Linguistics.Google ScholarGoogle ScholarCross RefCross Ref
  17. Y.-P. Huang, T. Goh, and C. L. Liew. Hunting suicide notes in web 2.0-preliminary findings. In Multimedia Workshops, 2007. ISMW'07. Ninth IEEE International Symposium on, pages 517--521. IEEE, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Ikunaga, S. R. Nath, and K. A. Skinner. Internet suicide in japan: A qualitative content analysis of a suicide bulletin board. Transcultural psychiatry, page 1363461513487308, 2013.Google ScholarGoogle Scholar
  19. N. Jacob, J. Scourfield, and R. Evans.Suicide prevention via the internet: A descriptive review. Crisis: The Journal of Crisis Intervention and Suicide Prevention, 35(4):261, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  20. J. Jashinsky, S. H. Burton, C. L. Hanson, J. West, C. Giraud-Carrier, M. D. Barnes, and T. Argyle. Tracking suicide risk factors through twitter in the us. 2013.Google ScholarGoogle Scholar
  21. V. Kolhatkar, H. Zinsmeister, and G. Hirst. Interpreting anaphoric shell nouns using antecedents of cataphoric shell nouns as training data. In EMNLP, pages 300--310, 2013.Google ScholarGoogle Scholar
  22. M. T. Lehrman, C. O. Alm, and R. A. Proaño. Detecting distressed and non-distressed affect states in short forum texts. In Proceedings of the Second Workshop on Language in Social Media, pages 9--18. Association for Computational Linguistics, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Liakata, J.-H. Kim, S. Saha, J. Hastings, and D. Rebholz-Schuhmann. Three hybrid classifiers for the detection of emotions in suicide notes. Biomedical informatics insights, 5(Suppl 1):175, 2012.Google ScholarGoogle Scholar
  24. P. Matykiewicz, W. Duch, and J. Pestian. Clustering semantic spaces of suicide notes and newsgroups articles. In Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, pages 179--184. Association for Computational Linguistics, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A. Pak and P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. In LREC, 2010.Google ScholarGoogle Scholar
  26. J. Pennebaker, M. Francis, and R. Booth. Linguistic Inquiry and Word Count: A computerized text analysis program. 2001.Google ScholarGoogle Scholar
  27. J. Pestian, H. Nasrallah, P. Matykiewicz, A. Bennett, and A. Leenaars. Suicide note classification using natural language processing: A content analysis. Biomedical informatics insights, 2010(3):19, 2010.Google ScholarGoogle Scholar
  28. J. P. Pestian, P. Matykiewicz, M. Linn-Gust, B. South, O. Uzuner, J. Wiebe, K. B. Cohen, J. Hurdle, and C. Brew. Sentiment analysis of suicide notes: A shared task. Biomedical informatics insights, 5(Suppl 1):3, 2012.Google ScholarGoogle Scholar
  29. J. Pirkis and R. W. Blood. Suicide and the media. Crisis: The Journal of Crisis Intervention and Suicide Prevention, 22(4):155--162, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  30. C. Poulin, B. Shiner, P. Thompson, L. Vepstas, Y. Young-Xu, B. Goertzel, B. Watts, L. Flashman, and T. McAllister. Predicting the risk of suicide by analyzing the text of clinical notes. PloS one, 9(1):e85733, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  31. P. R. Recupero, S. E. Harms, and J. M. Noble. Googling suicide: surfing for suicide information on the internet. Journal of Clinical Psychiatry, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  32. J. J. Rodriguez, L. I. Kuncheva, and C. J. Alonso. Rotation forest: A new classifier ensemble method. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 28(10):1619--1630, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. T. D. Ruder, G. M. Hatch, G. Ampanozi, M. J. Thali, and N. Fischer. Suicide announcement on facebook. Crisis: The Journal of Crisis Intervention and Suicide Prevention, 32(5):280--282, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  34. I. Spasić, P. Burnap, M. Greenwood, and M. Arribas-Ayllon. A naïve bayes approach to classifying topics in suicide notes. Biomedical informatics insights, 5(Suppl 1):87, 2012.Google ScholarGoogle Scholar
  35. H. Sueki. The association of suicide-related twitter use with suicidal behaviour: A cross-sectional study of young internet users in japan. Journal of affective disorders, 2014.Google ScholarGoogle Scholar
  36. M. Thelwall, K. Buckley, G. Paltoglou, D. Cai, and A. Kappas. Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology, 61(12):2544--2558, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. H.-H. Won, W. Myung, G.-Y. Song, W.-H. Lee, J.-W. Kim, B. J. Carroll, and D. K. Kim. Predicting national suicide numbers with social media data. PloS one, 8(4):e61809, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  38. C. Yang, K. H. Lin, and H.-H. Chen. Emotion classification using web blog corpora. In Web Intelligence, IEEE/WIC/ACM International Conference on, pages 275--278. IEEE, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. H. Yang, A. Willis, A. De Roeck, and B. Nuseibeh. A hybrid model for automatic emotion recognition in suicide notes. Biomedical informatics insights, 5(Suppl 1):17, 2012.Google ScholarGoogle Scholar

Index Terms

  1. Machine Classification and Analysis of Suicide-Related Communication on Twitter

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      HT '15: Proceedings of the 26th ACM Conference on Hypertext & Social Media
      August 2015
      360 pages
      ISBN:9781450333955
      DOI:10.1145/2700171

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 August 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      HT '15 Paper Acceptance Rate24of60submissions,40%Overall Acceptance Rate378of1,158submissions,33%

      Upcoming Conference

      HT '24
      35th ACM Conference on Hypertext and Social Media
      September 10 - 13, 2024
      Poznan , Poland

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader