skip to main content
10.1145/288627.288651acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article
Free Access

Inductive learning algorithms and representations for text categorization

Authors Info & Claims
Published:01 November 1998Publication History
First page image

References

  1. 1.Apte, C., Damerau, F. and Weiss, S. Automated learning of decision rules for text categorization. A CM Transactions on Information Systems, 12(3), 233-251, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.Apte, C., Damerau, F. and Weiss, S. Text Mining with decision rules and decision trees. Proceedings of the Conference on Automated Learning and Discovery, CMU, June, 1998.Google ScholarGoogle Scholar
  3. 3.Boser, B. E., Guyon, I. M., and Vapnik, V., A Training Algorithm for Optimal Margin Classifiers. Fifth Annual Workshop on Computational Learning Theory, ACM, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.Chickering D., Heckerman D., and Meek, C. A Bayesian approach for learning Bayesian networks with local structure. In Proceedings of Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.Cohen, W.W. and Singer, Y. Context-sensitive learning methods for text categorization In SIGIR 96: Proceedings of the 19th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, 307-315, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.Cortes, C., and Vapnik, V., Support vector networks. Machine Learning, 20, 273-297, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.Fuhr, N., Hartmanna, S., Lustig, G., Schwantner, M., and Tzeras, K. Air/X- A rule-based multi-stage indexing system for lage subject fields. In Proceedings of RIAO'91, 606-623, 1991.Google ScholarGoogle Scholar
  8. 8.Good, I.J. The Estimation of Probabilities: An Essay on Modern Bayesian Methods. MIT Press, 1965.Google ScholarGoogle Scholar
  9. 9.Hayes, P.J. and Weinstein. S.P. CONSTRUE/TIS: A system for content-based indexing of a database of news stories. In Second Annual Conference on Innovative Applications of Artificial Intelligence, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.Heckerman, D. Geiger, D. and Chickering, D.M. Learning Bayesian networks: the combination of knowledge and statistical data. Machine Learning, 20, 131-163, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.Joachims, T. Text categorization with support vector machines: Learning with many relevant features. In Proceedings I0tn European Conference on Machine Learning (ECML), Springer Verlag, 1998. http://wwwai.cs.unidortmund.de/DOKIMENTE/Joachims 97a.ps.gz Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.LeCun, Y., Jackel, L. D., Bottou, L., Cortes, C., Denker, J. S., Drucker, H., Guyon, i., Muller, U. A., Sackinger, E., Simard, P. and Vapnik, V. Learning algorithms for classification: A comparison on handwritten digit recognition. Neural Networks: The Statistical Mechanics Perspective, 261-276, 1995.Google ScholarGoogle Scholar
  13. 13.Lewis, D.D. An evaluation of phrasal and clustered representations on a text categorization task. In SIGIR'92: Proceedings of the 15th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, 37-50, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.Lewis, D.D. and Hayes, P.J. (Eds.)ACM Transactions on Information Systems- Special Issue on Text Categorization, 12(3), 1994.Google ScholarGoogle Scholar
  15. 15.Lewis, D.D. and Ringuette, M. A comparison of two learning algorithms for text categorization. In Third Annual Symposium on Document Analysis and Information Retrieval, 81-93, 1994.Google ScholarGoogle Scholar
  16. 16.Lewis. D.D. and Sparck Jones. K. Natural language processing for information retrieval. Communications of the ACM, 39(1), 92-101, January 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.Lewis, D.D., Schapire, R., Callan, J.P., and Papka, R. Training algorithms for linear text classifiers. In SIGIR '96: Proceedings of the 19th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, 298-306, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.Osuna, E., Freund, R., and Girosi, F. Training support vector machines: An application to face detection. In Proceedings of Computer Vision and Pattern Recognition '97, 130-136, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.Platt, J. Fast training of SVMs using sequential minimal optimization. To appear in: B. Scholkopf, C. Burges, and A. Smola (Eds.) Advances in Kernel Methods- Support Vector Learning, MIT Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20.Rocchio, J.J. Jr. Relevance feedback in information retrieval, in G.Salton (Ed.), The SMART Retrieval System: Experiments in Automatic Document Processing, 313-323. Prentice Hall, 1971.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.Sahami, M. Learning Limited Dependence Bayesian Classifiers. In KDD-96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, 335-338, AAAI Press, 1996. httr>://robotics.stanford.edu/users/sahami/papersdir/kdd96-1earn-bn.psGoogle ScholarGoogle Scholar
  22. 22.Sahami, M., Dumais, S., Heckerman, D., Horvitz, E. A Bayesian approach to filtering junk e-mail. AAAI 98 Workshop on Text Categorization, July 1998. http://robotics.stanford.edu/users/sahami/papersdir/spam.psGoogle ScholarGoogle Scholar
  23. 23.Salton, G. and McGill, M. Introduction to Modern Information Retrieval. McGraw Hill, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. 24.Schapire, R., Freund, Y., Bartlett, P. and Lee, W. S. Boosting the margin: A new explanation for the effectiveness of voting methods. Annals of Statistics, to appear, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. 25.Schiitze, H., Hull, D. and Pedersen, J.O. A comparison of classifiers and document representations for the routing problem, in SIGIR 95: Proceedings of the 18th Annual international ACM SIGIR Conference on Research and Development in Information Retrieval, 229-237, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. 26.Vapnik, V., The Nature of Statistical Learning Theory, Springer-Verlag, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. 27.Wiener E., Pedersen, J.O. and Weigend, A.S. A neural network approach to topic spotting. In Proceedings of the Fourth Annual Symposium on Document Analysis and Information Retrieval (SDAIR'95), 1995.Google ScholarGoogle Scholar
  28. 28.Yang, Y. Expert network: Effective and efficient learning from human decisions in text categorization and retrieval. SIGIR '94: Proceedings of the 17th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, 13-22, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. 29.Yang. Y. and Chute, C.G. An example-based mapping method for text categorization and retrieval. A CM Transactions on Information Systems, 12(3), 252-277, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. 30.Yang, Y. and Pedersen, J.O. A comparative study on feature selection in text categorization, in Machine Learning: Proceedings of the Fourteenth International Conference (ICML'97), 412-420, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. 31.Yang, Y. An evaluation of statistical approaches to text categorization. CMU Technical Report, CMU-CS- 97-127, April 1997.Google ScholarGoogle Scholar
  32. 32.The Reuters-21578 collection is available at: http://www.research.att.conff-lewis/reuters2157 8.htmlGoogle ScholarGoogle Scholar

Index Terms

  1. Inductive learning algorithms and representations for text categorization

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                CIKM '98: Proceedings of the seventh international conference on Information and knowledge management
                November 1998
                450 pages
                ISBN:1581130619
                DOI:10.1145/288627

                Copyright © 1998 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 November 1998

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                Overall Acceptance Rate1,861of8,427submissions,22%

                Upcoming Conference

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader