ABSTRACT
This paper explores the problem of detecting sentence-level forum authority claims in online discussions. Using a maximum entropy model, we explore a variety of strategies for extracting lexical features in a sparse training scenario, comparing knowledge- and data-driven methods (and combinations). The augmentation of lexical features with parse context is also investigated. We find that certain markup features perform remarkably well alone, but are outperformed by data-driven selection of lexical features augmented with parse context.
- R. Barzilay, M. Collins, J. Hirschberg, and S. Whittaker. 2000. The rules behind roles: Identifying speaker role in radio broadcasts. In Proceedings of AAAI, pages 679--684. Google ScholarDigital Library
- E. M. Bender, J. Morgan, M. Oxley, M. Zachry, B. Hutchinson, A. Marin, B. Zhang, and M. Ostendorf. 2011. Annotating social acts: Authority claims and alignment moves in wikipedia talk pages. In Proceedings of ACL -- Workshop on Language in Social Media. Google ScholarDigital Library
- J. S. Bunderson. 2003. Recognizing and utilizing expertise in work groups: A status characteristics perspective. Administrative Science Quarterly, 48(4):557--591.Google ScholarCross Ref
- E. Gilbert, T. Bergstrom, and K. Karahalios. 2009. Blogs Are Echo Chambers: Blogs Are Echo Chambers. In Proceedings of HICSS, pages 1--10. Google ScholarDigital Library
- T. Hastie, R. Tibshirani, and J. Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition. Springer Series in Statistics. Springer, September.Google Scholar
- B. Hutchinson, B. Zhang, and M. Ostendorf. 2010. Unsupervised broadcast conversation speaker role labeling. In Proceedings of ICASSP, pages 5322--5325.Google Scholar
- S. M. Kim and E. Hovy. 2006. Automatic identification of pro and con reasons in online reviews. In Proceedings of COLING-ACL, pages 483--490. Google ScholarDigital Library
- K. Laskowski, M. Ostendorf, and T. Schultz. 2008. Modeling vocal interaction for text-independent participant characterization in multi-party conversation. In ISCA/ACL SIGdial Workshop on Discourse and Dialogue, pages 194--201. Google ScholarDigital Library
- F. Liu and Y. Liu. 2007. Soundbite identification using reference and automatic transcripts of broadcast news speech. In Proceedings of ASRU, pages 653--658.Google Scholar
- Y. Liu. 2006. Initial study on automatic identification of speaker role in broadcast news speech. In Proceedings of HLT, pages 81--84. Google ScholarDigital Library
- A. Marin, M. Ostendorf, B. Zhang, J. T. Morgan, M. Oxley, M. Zachry, and E. M. Bender. 2010. Detecting authority bids in online discussions. In Proceedings of SLT, pages 49--54.Google Scholar
- S. Maskey and J. Hirschberg. 2006. Soundbite detection in broadcast news domain. In Proceedings of Interspeech, pages 1543--1546.Google Scholar
- A. K. McCallum. 2002. MALLET: A machine learning for language toolkit. http://mallet.cs.umass.edu.Google Scholar
- B. Pang and L. Lee. 2004. A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. In Proceedings of ACL, pages 271--278. Google ScholarDigital Library
- B. Pang and L. Lee. 2005. Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of ACL, pages 115--124. Google ScholarDigital Library
- S. Petrov, L. Barrett, R. Thibaux, and D. Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proceedings of COLING-ACL, pages 433--440. Google ScholarDigital Library
- A. Vinciarelli. 2007. Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling. IEEE Transactions on Multimedia, 9(6):1215--1226. Google ScholarDigital Library
- Y. Yang and J. O. Pedersen. 1997. A Comparative Study on Feature Selection in Text Categorization. In Proceedings of ICML, pages 412--420. Google ScholarDigital Library
Index Terms
- Detecting forum authority claims in online discussions
Recommendations
Detecting Opinionated Claims in Online Discussions
ICSC '12: Proceedings of the 2012 IEEE Sixth International Conference on Semantic ComputingThis paper explores the automatic detection of sentences that are opinionated claims, in which the author expresses a belief. We use a machine learning based approach, investigating the impact of features such as sentiment and the output of a system ...
Detecting frauds in online advertising systems
EC-Web'06: Proceedings of the 7th international conference on E-Commerce and Web TechnologiesOnline advertising is aimed to promote and sell products and services of various companies in the global market through internet. In 2005, it was estimated that companies spent $10B in web advertisements, and it is expected to grow by 25-30% in the next ...
Patterns for online discussions
PLOP '10: Proceedings of the 17th Conference on Pattern Languages of ProgramsNowadays, online education is very popular. Almost every school has developed at least a few online courses in order to be competitive in the education market. The main interaction for online courses is based on online discussions.
Also, numerous ...
Comments