skip to main content
10.1109/WI.2006.47guideproceedingsArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
Article
Free Access

Automatic Identification of Chinese Weblogger's Interests Based on Text Classification

Published:18 December 2006Publication History

ABSTRACT

Chinese weblogs have been expanded in an incredible speed in recent years. There is plentiful personal information in weblogs. In this paper, we propose a text classification based approach to automatically identify the interests of a weblogger. To solve the problems arising out of classifying weblog documents, the technique of heterogeneous classifiers combination is used here. We also use hierarchical classification technique to identify much specific interests. Experiments show that our interest identification approach has a high accuracy and, for most webloggers in our experiments, their interests implied in the contents of blogs could be well identified by using this approach.

Index Terms

  1. Automatic Identification of Chinese Weblogger's Interests Based on Text Classification

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image Guide Proceedings
            WI '06: Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
            December 2006
            1023 pages
            ISBN:0769527477

            Publisher

            IEEE Computer Society

            United States

            Publication History

            • Published: 18 December 2006

            Qualifiers

            • Article

            Acceptance Rates

            Overall Acceptance Rate118of178submissions,66%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader