skip to main content
Skip header Section
Data mining solutions: methods and tools for solving real-world problemsJuly 1998
Publisher:
  • John Wiley & Sons, Inc.
  • 605 Third Ave. New York, NY
  • United States
ISBN:978-0-471-25384-6
Published:16 July 1998
Pages:
617
Skip Bibliometrics Section
Bibliometrics
Abstract

No abstract available.

Cited By

  1. Kyun D, Sohn S and Kim J (2009). Multi-level association rules for MP3P marketing strategies based on extensive marketing survey data, Expert Systems with Applications: An International Journal, 36:9, (11582-11588), Online publication date: 1-Nov-2009.
  2. ACM
    Smith G, Tan D and Lee B iSee Proceedings of the International Conference on Advances in Computer Entertainment Technology, (190-197)
  3. Goh Y, Giess M and McMahon C (2009). Facilitating design learning through faceted classification of in-service information, Advanced Engineering Informatics, 23:4, (497-511), Online publication date: 1-Oct-2009.
  4. Chang C, Chu C and Yeh Y (2009). Integrating in-process software defect prediction with association mining to discover defect pattern, Information and Software Technology, 51:2, (375-384), Online publication date: 1-Feb-2009.
  5. Tong C, Sharma D and Shadabi F A Multi-agents Approach to Knowledge Discovery Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03, (571-574)
  6. Hung C and Tsai C (2008). Market segmentation based on hierarchical self-organizing map for markets of multimedia on demand, Expert Systems with Applications: An International Journal, 34:1, (780-787), Online publication date: 1-Jan-2008.
  7. Chang C and Chu F Applying data mining and XML technology to build a web-based house trading and matching system Proceedings of the 7th WSEAS International Conference on Simulation, Modelling and Optimization, (110-115)
  8. Tang M, Zhang Y and Zhang G Type-2 Fuzzy Web Shopping Agents Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence, (499-503)
  9. Nocke T, Schumann H, Böhm U and Flechsig M Next generation modeling I Proceedings of the 35th conference on Winter simulation: driving innovation, (763-771)
  10. Kroeze J, Matthee M and Bothma T Differentiating data- and text-mining terminology Proceedings of the 2003 annual research conference of the South African institute of computer scientists and information technologists on Enablement through technology, (93-101)
  11. Kim W, Choi B, Hong E, Kim S and Lee D (2019). A Taxonomy of Dirty Data, Data Mining and Knowledge Discovery, 7:1, (81-99), Online publication date: 1-Jan-2003.
  12. Sang K Case studies: Commercial, multiple mining tasks systems Handbook of data mining and knowledge discovery, (589-600)
  13. Kreuseler M and Schumann H (2002). A Flexible Approach for Visual Data Mining, IEEE Transactions on Visualization and Computer Graphics, 8:1, (39-51), Online publication date: 1-Jan-2002.
  14. Wong P, Cowley W, Foote H, Jurrus E and Thomas J Visualizing Sequential Patterns for Text Mining Proceedings of the IEEE Symposium on Information Vizualization 2000
  15. Wong P, Whitney P and Thomas J Visualizing Association Rules for Text Mining Proceedings of the 1999 IEEE Symposium on Information Visualization
Contributors
  • University of Maryland School of Medicine

Recommendations

Reviews

Wilfried Lemahieu

The approach to data mining presented in this work entails a very broad perception of the concept. The book's focus is original in that it is not restricted to prediction problems based on highly structured data sets. On the contrary, less structured problem domains also feature prominently, and purely descriptive tasks and the need for comprehension frequently prevail over prediction. Moreover, the authors draw from their own problem-solving practice in such diverse fields as narco-terrorism, money laundering, insider trading, claims fraud, retail sales, biomedical research, and telecommunications to provide a colorful palette of examples from various domains. The book looks at data mining from a practice-oriented viewpoint along two dimensions: theoretical approaches to the data mining process are coupled with an overview of existing tools and technologies. The first section defines data mining and delimits the problems that can be solved using it. The second part surveys techniques for representing and analyzing information. The complete cycle of accessing and integrating data, analyzing patterns, and presenting results is covered. A distinction is made between visual and nonvisual analytical methods. The authors clearly favor the former, dedicating an entire chapter to data visualization. All other techniques, such as statistical methods, neural networks, and genetic algorithms, are crowded into a single chapter, which is still only half the length of the visual techniques chapter. The third part assesses existing tools, again with a focus on visualization. Tools are categorized as link analysis tools, landscape visualization tools, or quantitative data mining tools. Numerous examples and screen captures are provided. While such a tool overview may be useful at present, it obviously offers only a snapshot of a rapidly changing domain, and will inevitably be outdated before long. Therefore, an extensive set of references to tool vendors is included as an appendix, to which the reader is referred for the most up-to-date information. A separate chapter is dedicated to future trends in visual data mining. Support for navigation, particularly in a World Wide Web environment, and tools integrating both visual and nonvisual techniques are acknowledged as important developments. The last section presents a realm of case studies containing “war stories,” once more from varied domains such as pharmaceutical research, telecommunications, finance, retail, and criminal investigation. Descriptions of the actual situation and background associated with the problem domain predominate over mathematical proofs and theoretical discussions. Thus, these case studies contribute nice illustrations of what is possible, but do not provide in-depth insight into explicit formal methodologies based on theory. The book is accompanied by a CD-ROM that contains color representations of all the black-and-white images in the book. It also includes documentation, trial versions, and sample data sets for several of the tools discussed. All this information is accessible from an attractive user interface. Unfortunately, not all of the demo applications install and operate flawlessly. Written in down-to-earth language, avoiding overspecial ized terminology where possible, the book reads effortlessly. Sometimes, however, concepts that could be considered obvious are elucidated in too much detail. The discussion never becomes tedious, thanks again to the numerous real-life examples, which add a lively flavor. The book is aimed at readers in the business community, but the general principles are said to apply in any domain. The target audience is described as system developers, technical managers, and analysts. While the work undoubtedly gives a comprehensive and easy-to-understand introduction to data mining, it does not provide the definitive overview of the domain, because it tips the scale too much in favor of data visualization techniques. Nonvisual methods such as statistical testing, neural networks, and genetic algorithms are barely touched upon. The treatment may be useful as a first taste for the lay reader, but it lacks depth. Anyone hoping to acquire adequate insight to put the latter methodologies into practice will be left out in the cold. If anything, the book can be seen as a valuable introduction for newcomers to the field. For more experienced readers, its refreshing approach to data mining and its merit of being the only book that offers a thorough discussion of visual techniques make it an estimable complement to, although certainly not a substitute for, works that take a more statistical approach.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.