skip to main content
Skip header Section
Data mining methods for knowledge discoveryAugust 1998
Publisher:
  • Kluwer Academic Publishers
  • 101 Philip Drive Assinippi Park Norwell, MA
  • United States
ISBN:978-0-7923-8252-2
Published:01 August 1998
Pages:
495
Skip Bibliometrics Section
Bibliometrics
Abstract

No abstract available.

Cited By

  1. Sandhu J, Verma A and Rana P (2020). An Expert Approach for Data Flow Prediction: Case Study of Wireless Sensor Networks, Wireless Personal Communications: An International Journal, 112:1, (325-352), Online publication date: 1-May-2020.
  2. Liu C, Guo S, Shi Y and Yang Y (2017). Deterministic binary matrix based compressive data aggregation in big data WSNs, Telecommunications Systems, 66:3, (345-356), Online publication date: 1-Nov-2017.
  3. Toprceanu A and Grosseck G (2017). Decision tree learning used for the classification of student archetypes in online courses, Procedia Computer Science, 112:C, (51-60), Online publication date: 1-Sep-2017.
  4. Afify A (2016). A fuzzy rule induction algorithm for discovering classification rules, Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, 30:6, (3067-3085), Online publication date: 1-Jan-2016.
  5. Augasta M and Kathirvalavakumar T An Empirical Comparison of Discretization Methods for Neural Classifier Proceedings of the First International Conference on Mining Intelligence and Knowledge Exploration - Volume 8284, (38-49)
  6. Huang S and Fang N (2013). Predicting student academic performance in an engineering dynamics course, Computers & Education, 61, (133-145), Online publication date: 1-Feb-2013.
  7. Minaei-Bidgoli B, Barmaki R and Nasiri M (2013). Mining numerical association rules via multi-objective genetic algorithms, Information Sciences: an International Journal, 233, (15-24), Online publication date: 1-Jun-2013.
  8. Liu Y, Wu C and Liu M (2011). Research of fast SOM clustering for text information, Expert Systems with Applications: An International Journal, 38:8, (9325-9333), Online publication date: 1-Aug-2011.
  9. Tian D, Zeng X and Keane J (2011). Core-generating approximate minimum entropy discretization for rough set feature selection in pattern classification, International Journal of Approximate Reasoning, 52:6, (863-880), Online publication date: 1-Sep-2011.
  10. Ciecholewski M Support vector machine approach to cardiac SPECT diagnosis Proceedings of the 14th international conference on Combinatorial image analysis, (432-443)
  11. Suraj Z and Pancerz K Synthesis of synchronized concurrent systems specified by information systems Proceedings of the 6th international conference on Rough sets and knowledge technology, (626-635)
  12. Hassanien A, Abdelhafez M and Own H (2008). Rough sets data analysis in knowledge discovery, Advances in Fuzzy Systems, 8, (1-13), Online publication date: 1-Jan-2008.
  13. Baeshen N Knowledge management and environmental decision support systems Proceedings of the 12th WSEAS international conference on Computers, (793-798)
  14. Yu J, Xi L and Zhou X (2008). Intelligent monitoring and diagnosis of manufacturing processes using an integrated approach of KBANN and GA, Computers in Industry, 59:5, (489-501), Online publication date: 1-May-2008.
  15. Yen C and Cios K (2008). Image recognition system based on novel measures of image similarity and cluster validity, Neurocomputing, 72:1-3, (401-412), Online publication date: 1-Dec-2008.
  16. Peters J and Skowron A Zdzisław pawlak Transactions on Rough Sets V, (1-24)
  17. Shaban K, Basir O and Kamel M Document mining based on semantic understanding of text Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications, (834-843)
  18. Zeng H, Lan H and Zeng X Redundant data processing based on rough-fuzzy approach Proceedings of the First international conference on Rough Sets and Knowledge Technology, (156-161)
  19. Guo H and Nandi A (2006). Breast cancer diagnosis using genetic programming generated feature, Pattern Recognition, 39:5, (980-987), Online publication date: 1-May-2006.
  20. Cui X, Gao J and Potok T (2006). A flocking based algorithm for document clustering analysis, Journal of Systems Architecture: the EUROMICRO Journal, 52:8, (505-515), Online publication date: 1-Aug-2006.
  21. Rawat S, Gulati V and Pujari A A fast host-based intrusion detection system using rough set theory Transactions on Rough Sets IV, (144-161)
  22. Suraj Z and Grochowalski P The rough set database system Transactions on Rough Sets III, (190-201)
  23. Kurgan L and Cios K (2004). CAIM Discretization Algorithm, IEEE Transactions on Knowledge and Data Engineering, 16:2, (145-153), Online publication date: 1-Feb-2004.
  24. Hammouda K and Kamel M (2004). Efficient Phrase-Based Document Indexing for Web Document Clustering, IEEE Transactions on Knowledge and Data Engineering, 16:10, (1279-1296), Online publication date: 1-Oct-2004.
  25. Cios K and Kurgan L (2004). CLIP4, Information Sciences: an International Journal, 163:1-3, (37-83), Online publication date: 14-Jun-2004.
  26. Ghosh A and Nath B (2004). Multi-objective rule mining using genetic algorithms, Information Sciences: an International Journal, 163:1-3, (123-133), Online publication date: 14-Jun-2004.
  27. Duncan M, Fung K, Wang H, Yen C and Cios K Identification of Contaminants in Proteomics Mass Spectrometry Data Proceedings of the IEEE Computer Society Conference on Bioinformatics
  28. Abonyi J and Szeifert F (2003). Supervised fuzzy clustering for the identification of fuzzy classifiers, Pattern Recognition Letters, 24:14, (2195-2207), Online publication date: 1-Oct-2003.
  29. Kuo T and Yajima Y Approximate reducts of an information system Proceedings of the 9th international conference on Rough sets, fuzzy sets, data mining, and granular computing, (291-294)
  30. He M and Leung H (2002). Agents in E-commerce, Knowledge and Information Systems, 4:3, (257-282), Online publication date: 1-Jul-2002.
  31. Skowron A, Komorowski J, Pawlak Z and Polkowski L Rough sets perspective on data and knowledge Handbook of data mining and knowledge discovery, (134-149)
  32. Cios K and Kurgan Ł Hybrid inductive machine learning New learning paradigms in soft computing, (276-321)
  33. Zytkow J and Klösgen W Multidisciplinary contributions to knowledge discovery Handbook of data mining and knowledge discovery, (22-32)
  34. ACM
    Garcke J and Griebel M Data mining with sparse grids using simplicial basis functions Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, (87-96)
  35. ACM
    Han J and Cercone N RuleViz Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, (244-253)
  36. Holena M Observational Logic Integrates Data Mining Based on Statistics and Neural Networks Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, (440-445)
Contributors
  • Virginia Commonwealth University
  • Systems Research Institute of the Polish Academy of Sciences
  • San Diego State University

Recommendations

Friedrich Gebhardt

Books on data mining usually concentrate on results and on particular methods employed to achieve them. This one is different; it collects some methods that are essential for the field. This is a good idea. Unfortunately, the realization of the plan leaves something to be desired. After a short introduction to data mining in general, four chapters introduce basic techniques. The first one concerns rough sets. Here the reduction problem—finding minimal subsets of the variables that suffice for classification—takes much of the space. The chapter on fuzzy sets abounds with definitions. In the chapter on Bayesian methods, the authors emphasize classification and introduce probabilistic neural networks. The chapter on “Evolutionary Computing” gives a first introduction to genetic algorithms. These are the building blocks for the advanced methods in the next three chapters. Machine learning is used for discrimination between two or more groups by rules or decision trees; two pages discuss the application of this technique to knowledge discovery. The neural networks presented are the radial basis function network and Kohonen's self-organizing map network. In this book, clustering means finding representatives for the clusters that minimize some criterion; the authors use neural nets and fuzzy sets to achieve this goal. Data are mostly not as clean as is assumed by data mining methods; therefore, a chapter on preprocessing concludes the book. It centers on the use of principal component analysis and similar techniques to reduce the dimensions and create orthogonal variables. In all chapters, the theory is illustrated with many small examples and sometimes with larger examples. The exercises are adequate. Of course, a book on so many different subjects cannot replace a series of textbooks on each of them, and the selection of the material is a matter of taste. As the authors stress, the techniques described here presuppose that the variables are fairly independent; if they are not, orthogonal transformations should be applied in the preprocessing stage. Thus, methods whose purpose is to find such perhaps unknown dependencies (such as regression techniques) are not covered. While one could argue about the choice of subjects, the technical details are well below average. Misspellings and grammatical errors could be considered a nuisance, but numerous mistakes in the formulas (wrong indices, max instead of min, union instead of Cartesian product, inconsistent variable names, and so on) hinder comprehension. Some sections abound with definitions, while others dwell on undefined and poorly explained notions. The index could be expanded; even some terms that are used repeatedly are missing. Since the references are scattered over the chapters, an author index would be helpful.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.