ABSTRACT
Chinese weblogs have been expanded in an incredible speed in recent years. There is plentiful personal information in weblogs. In this paper, we propose a text classification based approach to automatically identify the interests of a weblogger. To solve the problems arising out of classifying weblog documents, the technique of heterogeneous classifiers combination is used here. We also use hierarchical classification technique to identify much specific interests. Experiments show that our interest identification approach has a high accuracy and, for most webloggers in our experiments, their interests implied in the contents of blogs could be well identified by using this approach.
Index Terms
- Automatic Identification of Chinese Weblogger's Interests Based on Text Classification
Recommendations
Mining the interests of Chinese microbloggers via keyword extraction
Microblogging provides a new platform for communicating and sharing information among Web users. Users can express opinions and record daily life using microblogs. Microblogs that are posted by users indicate their interests to some extent. We aim to ...
Categorizing blogger's interests based on short snippets of blog posts
CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge managementBlogs have become an important medium for people to express opinions and share information on the web. Predicting the interests of bloggers can be beneficial for information retrieval and knowledge discovery in the blogosphere. In this paper, we propose ...
Content-based emotion classification in online social networks for Chinese Microblogs
ACSW '17: Proceedings of the Australasian Computer Science Week MulticonferenceRecent years, social networks are popular throughout the whole world. In China in particular, more people spend their time on social networks. Sina Weibo, as the most popular microblogs in China, records millions of microblogs from different population. ...
Comments