research-article

Stance classification of multi-perspective consumer health information

Authors:
Anirban Sen

IIT Delhi

IIT Delhi
View Profile

,
Manjira Sinha

Conduent Labs India

Conduent Labs India
View Profile

,
Sandya Mannarswamy

Conduent Labs, India

Conduent Labs, India
View Profile

,
Shourya Roy

Big Data Labs, American Express

Big Data Labs, American Express
View Profile

CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of DataJanuary 2018Pages 273–281https://doi.org/10.1145/3152494.3152518

Published:11 January 2018Publication History

CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data

Pages 273–281

ABSTRACT

While search engines are effective in answering direct factual questions such as, 'What are the symptoms of a disease X?', they are not so effective in addressing complex consumer health queries, which do not have a single definitive answer, such as, 'Is treatment X effective for disease Y?'. Instead, the users are presented with a vast number of search results with often contradictory perspectives and no definitive answer. We denote such queries as Multi-Perspective Consumer Health Information (MPCHI) queries for which there is no single 'Yes or No' answer. While ascertaining the credibility of the claims requires domain expertise, an efficient categorization of the search results according to their stance (support or oppose) to the queries will help the searcher in decision making. Hence, this paper focuses on the problem of stance classification for MPCHI data at sentence level, presenting a new data set for MPCHI queries. Unlike typical debate or argumentative text, the linguistic characteristics of MPCHI is quite different, with extensive use of scientific formal language and absence of opinion bearing words. Hence, such inherently different characteristic of MPCHI text requires going beyond traditional Bag of Words (BoW) features for stance classification. Hence, we propose using a rich non-traditional set of features such as medical semantic relations, stance vectors, sentiment polarity, textual entailment, and study their impact on MPCHI stance classification using an SVM and a neural network classifier. We find that using novel non-traditional features improves MPCHI stance classification performance over traditional BoW model by 24% for the SVM classifier, and 44% for the neural network classifier respectively, for the best feature combination.

References

Ameeta Agrawal and Aijun An. 2012. Unsupervised emotion detection from text using semantic and syntactic relations. In Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on, Vol. 1. IEEE, 346--353. Google ScholarDigital Library
Rakesh Agrawal, Sridhar Rajagopalan, Ramakrishnan Srikant, and Yirong Xu. 2003. Mining Newsgroups Using Networks Arising from Social Behavior. In Proceedings of the 12th International Conference on World Wide Web (WWW '03). ACM, New York, NY, USA, 529--535. Google ScholarDigital Library
Pranav Anand, Marilyn Walker, Rob Abbott, Jean E. Fox Tree, Robeson Bowmani, and Michael Minor. 2011. Cats Rule and Dogs Drool!: Classifying Stance in Online Debate. In Proceedings of the 2Nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 1--9. http://dl.acm.org/citation.cfm?id=2107653.2107654 Google ScholarDigital Library
Alexandra Balahur, Zornitsa Kozareva, and Andrés Montoyo. 2009. Determining the Polarity and Source of Opinions Expressed in Political Debates. In Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing '09). Springer-Verlag, Berlin, Heidelberg, 468--480. Google ScholarDigital Library
O. Biran and O Rambow. 2011. Identifying Justifications in Written Dialogs,. In Proceedings of the 2011 IEEE Fifth International Conference on Semantic Computing. Google ScholarDigital Library
Official Google Blog. 2015. Google Health Information Knowledge Graph. (2015). https://googleblog.blogspot.co.uk/2015/02/health-info-knowledge-graph.htmlGoogle Scholar
Clinton Burfoot, Steven Bird, and Timothy Baldwin. 2011. Collective Classification of Congressional Floor-debate Transcripts. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 1506--1515. http://dl.acm.org/citation.cfm?id=2002472.2002655 Google ScholarDigital Library
Jacob Cohen. 1968. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological bulletin 70, 4 (1968), 213.Google Scholar
Janez Demšar. 2006. Statistical comparisons of classifiers over multiple data sets. Journal of Machine learning research 7, Jan (2006), 1--30. Google ScholarDigital Library
Shiri Dori-Hacohen. 2015. Controversy Detection and Stance Analysis. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '15). ACM, New York, NY, USA, 1057--1057. Google ScholarDigital Library
Shiri Dori-Hacohen and James Allan. 2013. Detecting controversy on the web. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (CIKM '13). ACM, New York, NY, USA, 1845--1848. Google ScholarDigital Library
Shiri Dori-Hacohen and James Allan. 2015. Automated Controversy Detection on the Web. In Proceedings of the 37th European Conference on Information Retrieval. 423--434. ECIR submission.Google ScholarCross Ref
Shiri Dori-Hacohen, Elad Yom-Tov, and James Allan. 2015. Navigating Controversy as a Complex Search Task. In ECIR 15. Electronic proceedings only.Google Scholar
Adam Faulkner. 2012. Automated Classification of Stance in Student Essays: An Approach Using Stance Target Information and the Wikipedia Link-Based Measure. In Proceedings of the Twenty-Seventh International Artificial Intelligence Research Conference.Google Scholar
W. Ferreira and A Vlachos. 2016. Emergent: a novel data-set for stance classification.. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics.Google ScholarCross Ref
Kazi Saidul Hasan and Vincent Ng. 2013. Stance Classification of Ideological Debates: Data, Models, Features, and Constraints. In Proceedings of the Sixth International Joint Conference on Natural Language Processing. 1348--1356.Google Scholar
Myungha Jang and James Allan. 2016. Improving Automated Controversy Detection on the Web. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '16). ACM, New York, NY, USA, 865--868. Google ScholarDigital Library
Tony Mullen and Robert Malouf. 2008. Taking sides: User classification for informal online political discourse. Internet Research 18 (2008), 177--190.Google ScholarCross Ref
Akiko Murakami and Rudy Raymond. 2010. Support or Oppose?: Classifying Positions in Online Debates from Reply Activities and Opinion Expressions. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (COLING '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 869--875. http://dl.acm.org/citation.cfm?id=1944566.1944666 Google ScholarDigital Library
SEBASTIAN PAD?!, TAE-GIL NOH, ASHER STERN, RUI WANG, and ROBERTO ZANOLI. 2015. Design and realization of a modular architecture for textual entailment. Natural Language Engineering 21 (3 2015), 167--200. Issue 02.Google Scholar
Thomas C. Rindflesh and Marcelo Fiszman. 2003. The Interaction of Domain Knowledge and Linguistic Structure in Natural Language Processing: Interpreting Hypernymic Propositions in Biomedical Text. J. of Biomedical Informatics 36, 6 (Dec. 2003), 462--477. Google ScholarDigital Library
Parinaz Sobhani, Diana Inkpen, and Stan Matwin. 2015. From argumentation mining to stance classification. NAACL HLT 2015 (2015).Google ScholarCross Ref
Swapna Somasundaran and Janyce Wiebe. 2009. Recognizing Stances in Online Debates. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1 (ACL '09). Association for Computational Linguistics, Stroudsburg, PA, USA, 226--234. http://dl.acm.org/citation.cfm?id=1687878.1687912 Google ScholarDigital Library
Swapna Somasundaran and Janyce Wiebe. 2010. Recognizing Stances in Ideological On-line Debates. In Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text (CAAGET '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 116--124. http://dl.acm.org/citation.cfm?id=1860631.1860645 Google ScholarDigital Library
Dhanya Sridhar, Lise Getoor, and Marilyn Walker. 2014. Collective Stance Classification of Posts in Online Debate Forums. In ACL Joint Workshop on Social Dynamics and Personal Attributes in Social Media.Google Scholar
Matt Thomas, Bo Pang, and Lillian Lee. 2006. Get out the Vote: Determining Support or Opposition from Congressional Floor-debate Transcripts. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP '06). Association for Computational Linguistics, Stroudsburg, PA, USA, 327--335. http://dl.acm.org/citation.cfm?id=1610075.1610122 Google ScholarDigital Library
Marilyn A. Walker, Pranav Anand, Robert Abbott, and Ricky Grant. 2012. Stance Classification Using Dialogic Properties of Persuasion (NAACL HLT '12). Association for Computational Linguistics, Stroudsburg, PA, USA, 592--596. http://dl.acm.org/citation.cfm?id=2382029.2382124 Google ScholarDigital Library
Yi-Chia Wang and Carolyn P. Rosé. 2010. Making Conversational Structure Explicit: Identification of Initiation-response Pairs Within Online Discussions. In 2010 North American Chapter of the Association for Computational Linguistics (HLT '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 673--676. http://dl.acm.org/citation.cfm?id=1857999.1858096 Google ScholarDigital Library
WebMD. 2016. WebMD information on link between vaccine and autism. (2016). http://www.webmd.com/children/vaccines/features/vaccine-linked-to-autism#1Google Scholar
Ainur Yessenalina, Yisong Yue, and Claire Cardie. 2010. Multi-level Structured Models for Document-level Sentiment Classification. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 1046--1056. http://dl.acm.org/citation.cfm?id=1870658.1870760 Google ScholarDigital Library
Elad Yom-Tov and Luis Fernández-Luque. 2014. Information is in the eye of the beholder: Seeking information on the MMR vaccine through an Internet search engine. In AMIA 2014, American Medical Informatics Association Annual Symposium, Washington, DC, USA, November 15-19, 2014. http://knowledge.amia.org/56638-amia-1.1540970/t-004-1.1544972/f-004-1.1544973/a-226-1.1544995/a-227-1.1544992Google Scholar
Charles Zaiontz. 2016. Real Statistics Using Excel. (2016). http://www.real-statistics.com/statistics-tables/wilcoxon-signed-ranks-table/Google Scholar

Recommendations

Stance Classification of Twitter Debates: The Encryption Debate as A Use Case
#SMSociety17: Proceedings of the 8th International Conference on Social Media & Society

Social media have enabled a revolution in user-generated content. They allow users to connect, build community, produce and share content, and publish opinions. To better understand online users' attitudes and opinions, we use stance classification. ...
Read More
Debate Stance Classification Using Word Embeddings
Big Data Analytics and Knowledge Discovery
Abstract
Online debate sites act as a popular platform for users to express and form opinions. In this paper, we propose a novel unsupervised approach to perform stance classification of two-sided online debate posts. We propose the use of word embeddings ...
Read More
Cross-target Stance Classification as Domain Adaptation
Advances in Computational Intelligence
Abstract
As in many NLP tasks, stance classification - the computational task of inferring attitudes towards a given target topic from text - often relies on supervised methods and annotated corpora. However, given the costs associated with corpus ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data
January 2018
379 pages
ISBN:9781450363419
DOI:10.1145/3152494
Conference Chair:
Sayan Ranu
IIT Delhi
,
General Chairs:
Niloy Ganguly
IIT Kharagpur
,
Raghu Ramakrishnan
Microsoft
,
Program Chairs:
Sunita Sarawagi
IIT Bombay
,
Shourya Roy
American Express Big Data Labs
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 January 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
health information mining
stance classification
text processing
Qualifiers
- research-article
Conference

Acceptance Rates
CODS-COMAD '18 Paper Acceptance Rate50of150submissions,33%Overall Acceptance Rate197of680submissions,29%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 13
  Total Citations
  View Citations
- 300
  Total Downloads
- Downloads (Last 12 months)20
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Stance classification of multi-perspective consumer health information

CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data

ABSTRACT

References

Cited By

Recommendations

Stance Classification of Twitter Debates: The Encryption Debate as A Use Case

Debate Stance Classification Using Word Embeddings

Cross-target Stance Classification as Domain Adaptation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Stance classification of multi-perspective consumer health information

CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data

ABSTRACT

References

Cited By

Recommendations

Stance Classification of Twitter Debates: The Encryption Debate as A Use Case

Debate Stance Classification Using Word Embeddings

Cross-target Stance Classification as Domain Adaptation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media