ABSTRACT
Wikipedia, "the free encyclopedia", now contains over two million English articles, and is widely regarded as a high-quality, authoritative encyclopedia. Some Wikipedia articles, however, are of questionable quality, and it is not always apparent to the visitor which articles are good and which are bad. We propose a simple metric -- word count -- for measuring article quality. In spite of its striking simplicity, we show that this metric significantly outperforms the more complex methods described in related work.
- B. T. Adler and L. de Alfaro. A content-driven reputation system for the wikipedia. Proc. 16th Intl. Conf. on the World Wide Web, pages 261--270, 2007. Google ScholarDigital Library
- T. Cross. Puppy smoothies: Improving the reliability of open, collaborative wikis. First Monday, 11, 2006.Google Scholar
- A. Lih. Wikipedia as participatory journalism: Reliable sources? metrics for evaluating collaborative media as a news resource. 13th Asian Media Information and Communications Centre Annual Conference, 2004.Google Scholar
- B. Stvilia, M. Twidale, L. Gasser, and L. Smith. Information quality discussions in wikipedia. Proc. 2005 ICKM, pages 101--113, 2005.Google Scholar
- B. Stvilia, M. B. Twidale, L. C. Smith, and L. Gasser. Assessing information quality of a community-based encyclopedia. Proc. ICIQ, pages 442--454, 2005.Google Scholar
- H. Zeng, M. Alhossaini, L. Ding, R. Fikes, and D. L. McGuinness. Computing trust from revision history. Intl. Conf. on Privacy, Security and Trust, 2006. Google ScholarDigital Library
Index Terms
- Size matters: word count as a measure of quality on wikipedia
Recommendations
Assessing the quality of health-related Wikipedia articles with generic and specific metrics
WWW '21: Companion Proceedings of the Web Conference 2021Wikipedia is an online, free, multi-language, and collaborative encyclopedia, currently one of the most significant information sources on the web. The open nature of Wikipedia contributions raises concerns about the quality of its information. Previous ...
A breakdown of quality flaws in Wikipedia
WebQuality '12: Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web QualityThe online encyclopedia Wikipedia is a successful example of the increasing popularity of user generated content on the Web. Despite its success, Wikipedia is often criticized for containing low-quality information, which is mainly attributed to its ...
Detection of text quality flaws as a one-class classification problem
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge managementFor Web applications that are based on user generated content the detection of text quality flaws is a key concern. Our research contributes to automatic quality flaw detection. In particular, we propose to cast the detection of text quality flaws as a ...
Comments