skip to main content
Skip header Section
Machine Learning: The Art and Science of Algorithms that Make Sense of DataNovember 2012
Publisher:
  • Cambridge University Press
  • 40 W. 20 St. New York, NY
  • United States
ISBN:978-1-107-42222-3
Published:12 November 2012
Pages:
409
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

As one of the most comprehensive machine learning texts around, this book does justice to the field's incredible richness, but without losing sight of the unifying principles. Peter Flach's clear, example-based approach begins by discussing how a spam filter works, which gives an immediate introduction to machine learning in action, with a minimum of technical fuss. Flach provides case studies of increasing complexity and variety with well-chosen examples and illustrations throughout. He covers a wide range of logical, geometric and statistical models and state-of-the-art topics such as matrix factorisation and ROC analysis. Particular attention is paid to the central role played by features. The use of established terminology is balanced with the introduction of new and useful concepts, and summaries of relevant background material are provided with pointers for revision if necessary. These features ensure Machine Learning will set a new standard as an introductory textbook.

Cited By

  1. Chen L, Ji P, Ma Y, Rong Y and Ren J (2024). Custom machine learning algorithm for large-scale disease screening - taking heart disease data as an example, Artificial Intelligence in Medicine, 146:C, Online publication date: 1-Dec-2023.
  2. Ribeiro C, Paes A and Oliveira D (2023). AIS-based maritime anomaly traffic detection, Expert Systems with Applications: An International Journal, 231:C, Online publication date: 30-Nov-2023.
  3. Alfaro J, Aledo J and Gámez J (2023). Multi-dimensional Bayesian network classifiers for partial label ranking, International Journal of Approximate Reasoning, 160:C, Online publication date: 1-Sep-2023.
  4. Gomes L, da Silva Torres R and Côrtes M (2023). BERT- and TF-IDF-based feature extraction for long-lived bug prediction in FLOSS, Information and Software Technology, 160:C, Online publication date: 1-Aug-2023.
  5. Sokol P, Antoni Ľ, Krídlo O, Marková E, Kováčová K and Krajči S (2023). Formal concept analysis approach to understand digital evidence relationships, International Journal of Approximate Reasoning, 159:C, Online publication date: 1-Aug-2023.
  6. Panda C and Singh T (2023). ML-based vehicle downtime reduction, Engineering Applications of Artificial Intelligence, 122:C, Online publication date: 1-Jun-2023.
  7. Stawarz K, Katz D, Ayobi A, Marshall P, Yamagata T, Santos-Rodriguez R, Flach P and O’Kane A (2023). Co-designing opportunities for Human-Centred Machine Learning in supporting Type 1 diabetes decision-making, International Journal of Human-Computer Studies, 173:C, Online publication date: 1-May-2023.
  8. Alkhalifa R, Kochkina E and Zubiaga A (2023). Building for tomorrow, Information Processing and Management: an International Journal, 60:2, Online publication date: 1-Mar-2023.
  9. ACM
    Freitas L and Lelli V Using Machine Learning on Testing IoT Applications: a systematic mapping Proceedings of the Brazilian Symposium on Multimedia and the Web, (348-358)
  10. Soares D, Henriques R, Gromicho M, de Carvalho M and Madeira S (2022). Learning prognostic models using a mixture of biclustering and triclustering, Journal of Biomedical Informatics, 134:C, Online publication date: 1-Oct-2022.
  11. Nasr Azadani M and Boukerche A (2022). DriverRep, Journal of Parallel and Distributed Computing, 162:C, (105-117), Online publication date: 1-Apr-2022.
  12. ACM
    Li W, Cao Y, Hu J and Li L Application of K-Nearest Neighbor Algorithm Based on Orthogonal Wavelet Feature Extraction In Fault Diagnosis 2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture, (1678-1685)
  13. Erdelyi O and Erdelyi G (2021). The AI Liability Puzzle and A Fund-Based Work-Around, Journal of Artificial Intelligence Research, 70, (1309-1334), Online publication date: 1-May-2021.
  14. Ngueilbaye A, Wang H, Mahamat D and Junaidu S (2021). Modulo 9 model-based learning for missing data imputation, Applied Soft Computing, 103:C, Online publication date: 1-May-2021.
  15. Zhang J and Wang Y (2021). An ensemble method to improve prediction of earthquake-induced soil liquefaction: a multi-dataset study, Neural Computing and Applications, 33:5, (1533-1546), Online publication date: 1-Mar-2021.
  16. Alfaro J, Aledo J and Gámez J (2020). Learning decision trees for the partial label ranking problem, International Journal of Intelligent Systems, 36:2, (890-918), Online publication date: 11-Jan-2021.
  17. ACM
    Du X, Hargreaves C, Sheppard J, Anda F, Sayakkara A, Le-Khac N and Scanlon M SoK Proceedings of the 15th International Conference on Availability, Reliability and Security, (1-10)
  18. Bertossi L An ASP-Based Approach to Counterfactual Explanations for Classification Rules and Reasoning, (70-81)
  19. ACM
    Karmelita M and Pawlak T CMA-ES for one-class constraint synthesis Proceedings of the 2020 Genetic and Evolutionary Computation Conference, (859-867)
  20. ACM
    Cestnik B On finding the optimal parameters for probability estimation with m-estimate Proceedings of the 21st International Conference on Computer Systems and Technologies, (162-168)
  21. Lemaire V, Ismaili O, Cornuéjols A and Gay D Predictive K-means with Local Models Trends and Applications in Knowledge Discovery and Data Mining, (91-103)
  22. ACM
    Dove G and Fayard A Monsters, Metaphors, and Machine Learning Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, (1-17)
  23. Strese M, Brudermueller L, Kirsch J and Steinbach E (2020). Haptic Material Analysis and Classification Inspired by Human Exploratory Procedures, IEEE Transactions on Haptics, 13:2, (404-424), Online publication date: 1-Apr-2020.
  24. Basiri M and Kabiri A (2020). HOMPer, Journal of Information Science, 46:1, (101-117), Online publication date: 1-Feb-2020.
  25. Boldt M, Borg A, Ickin S and Gustafsson J (2019). Anomaly detection of event sequences using multiple temporal resolutions and Markov chains, Knowledge and Information Systems, 62:2, (669-686), Online publication date: 1-Feb-2020.
  26. Castellanos-Garzón J, Costa E, Jaimes S. J and Corchado J (2019). An evolutionary framework for machine learning applied to medical data, Knowledge-Based Systems, 185:C, Online publication date: 1-Dec-2019.
  27. Gomes L, Torres R and Côrtes M (2019). Bug report severity level prediction in open source software, Information and Software Technology, 115:C, (58-78), Online publication date: 1-Nov-2019.
  28. Liu L, Cao D, Wu Y and Wei T (2019). Defective samples simulation through adversarial training for automatic surface inspection, Neurocomputing, 360:C, (230-245), Online publication date: 30-Sep-2019.
  29. Floyd S and Viktor H Soft Voting Windowing Ensembles for Learning from Partially Labelled Streams New Frontiers in Mining Complex Patterns, (85-99)
  30. Vychegzhanin S and Kotelnikov E (2019). Stance Detection Based on Ensembles of Classifiers, Programming and Computing Software, 45:5, (228-240), Online publication date: 1-Sep-2019.
  31. Popkov Y (2022). Randomized Machine Learning Procedures, Automation and Remote Control, 80:9, (1653-1670), Online publication date: 1-Sep-2019.
  32. Prati R and Said-Hung E (2019). Predicting the ideological orientation during the Spanish 24M elections in Twitter using machine learning, AI & Society, 34:3, (589-598), Online publication date: 1-Sep-2019.
  33. ACM
    Du X and Scanlon M Methodology for the Automated Metadata-Based Classification of Incriminating Digital Forensic Artefacts Proceedings of the 14th International Conference on Availability, Reliability and Security, (1-8)
  34. ACM
    Cestnik B and Kern A Supporting decisions with fast and frugal trees for improving trust in public housing services Proceedings of the 20th International Conference on Computer Systems and Technologies, (118-123)
  35. Garrett K, Ferreira G, Jia L, Sunshine J and Kästner C Detecting suspicious package updates Proceedings of the 41st International Conference on Software Engineering: New Ideas and Emerging Results, (13-16)
  36. Quin F, Weyns D, Bamelis T, Buttar S and Michiels S Efficient analysis of large adaptation spaces in self-adaptive systems using machine learning Proceedings of the 14th International Symposium on Software Engineering for Adaptive and Self-Managing Systems, (1-12)
  37. Oliva J and Rosa J (2019). Classification for EEG report generation and epilepsy detection, Neurocomputing, 335:C, (81-95), Online publication date: 28-Mar-2019.
  38. Flach P Performance evaluation in machine learning Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (9808-9814)
  39. Verwer S and Zhang Y Learning optimal classification trees using a binary linear program formulation Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, (1625-1632)
  40. Zhukova N and Andriyanova N (2019). Cognitive Monitoring of Distributed Objects, Automatic Documentation and Mathematical Linguistics, 53:1, (32-43), Online publication date: 1-Jan-2019.
  41. ACM
    Lagus J, Longi K, Klami A and Hellas A (2018). Transfer-Learning Methods in Programming Course Outcome Prediction, ACM Transactions on Computing Education, 18:4, (1-18), Online publication date: 13-Nov-2018.
  42. Boyer J Natural language question answering in the financial domain Proceedings of the 28th Annual International Conference on Computer Science and Software Engineering, (189-200)
  43. ACM
    Kronjee J, Hommersom A and Vranken H Discovering software vulnerabilities using data-flow analysis and machine learning Proceedings of the 13th International Conference on Availability, Reliability and Security, (1-10)
  44. ACM
    Pawlak T Performance improvements for evolutionary strategy-based one-class constraint synthesis Proceedings of the Genetic and Evolutionary Computation Conference, (873-880)
  45. ACM
    Sroka D and Pawlak T One-class constraint acquisition with local search Proceedings of the Genetic and Evolutionary Computation Conference, (363-370)
  46. Kudła P and Pawlak T (2018). One-class synthesis of constraints for Mixed-Integer Linear Programming with C4.5 decision trees, Applied Soft Computing, 68:C, (1-12), Online publication date: 1-Jul-2018.
  47. ACM
    Tripathi S and Hemachandra N Scalable linear classifiers based on exponential loss function Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, (190-200)
  48. González-Briones A, Castellanos-Garzón J, Mezquita Martín Y, Prieto J, Corchado J and Giannelli C (2018). A Framework for Knowledge Discovery from Wireless Sensor Networks in Rural Environments, Wireless Communications & Mobile Computing, 2018, Online publication date: 1-Jan-2018.
  49. Uylaş Satı N, Ordin B and Lanza-Gutiérrez J (2018). Application of the Polyhedral Conic Functions Method in the Text Classification and Comparative Analysis, Scientific Programming, 2018, Online publication date: 1-Jan-2018.
  50. Ivanovsky L, Khryashchev V, Lebedev A and Kosterin I Facial Expression Recognition Algorithm Based on Deep Convolution Neural Network Proceedings of the 21st Conference of Open Innovations Association FRUCT, (141-147)
  51. ACM
    Kim J, Lee H and Yoon T Automated Diagnosis of Lung Cancer with the Use of Deep Convolutional Neural Networks on Chest CT Proceedings of the 2017 4th International Conference on Biomedical and Bioinformatics Engineering, (126-132)
  52. Boyer J Federating natural language question answering services of a cognitive enterprise data platform Proceedings of the 27th Annual International Conference on Computer Science and Software Engineering, (241-246)
  53. Ariaeinejad A, Samavi R, Chan T and Doyle T A performance predictive model for emergency medicine residents Proceedings of the 27th Annual International Conference on Computer Science and Software Engineering, (28-37)
  54. Marco-Ruiz L, Bnes E, de la Asuncin E, Gabarron E, Aviles-Solis J, Lee E, Traver V, Sato K and Bellika J (2017). Combining multivariate statistics and the think-aloud protocol to assess Human-Computer Interaction barriers in symptom checkers, Journal of Biomedical Informatics, 74:C, (104-122), Online publication date: 1-Oct-2017.
  55. ACM
    Nieves D, Ferri C, Hernández-Orallo J and Monserrat C Low-level Event Detection System for Minimally-Invasive Surgery Training Proceedings of the 4th International Workshop on Sensor-based Activity Recognition and Interaction, (1-6)
  56. Pintas J, Correia L and Bicharra Garcia A (2017). Crowd-based Feature Selection for Document Retrieval in Highly Demanding Decision-making Scenarios, Procedia Computer Science, 112:C, (822-832), Online publication date: 1-Sep-2017.
  57. De K and Masilamani V (2017). No-reference image contrast measure using image statistics and random forest, Multimedia Tools and Applications, 76:18, (18641-18656), Online publication date: 1-Sep-2017.
  58. ACM
    Kostakos V and Musolesi M (2017). Avoiding pitfalls when using machine learning in HCI studies, Interactions, 24:4, (34-37), Online publication date: 23-Jun-2017.
  59. Gonalves t, Kumaira S and Guadagnin F (2017). A machine learning approach to the potential-field method for implicit modeling of geological structures, Computers & Geosciences, 103:C, (173-182), Online publication date: 1-Jun-2017.
  60. Strese M, Schuwerk C, Iepure A and Steinbach E (2017). Multimodal Feature-Based Surface Material Classification, IEEE Transactions on Haptics, 10:2, (226-239), Online publication date: 1-Apr-2017.
  61. ACM
    Chen J, Yan S and Wong K Aggressivity Detection on Social Network Comments Proceedings of the 2017 International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence, (103-107)
  62. Bermejo S (2017). Ensembles of wrappers for automated feature selection in fish age classification, Computers and Electronics in Agriculture, 134:C, (27-32), Online publication date: 1-Mar-2017.
  63. ACM
    Price S and Flach P (2017). Computational support for academic peer review, Communications of the ACM, 60:3, (70-79), Online publication date: 21-Feb-2017.
  64. ACM
    Sudathip K and Sodanil M Ontology knowledge-based framework for machine learning concept Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services, (50-53)
  65. Koutsopoulos I and Spentzouris P Native Advertisement Selection and Allocation in Social Media Post Feeds European Conference on Machine Learning and Knowledge Discovery in Databases - Volume 9851, (588-603)
  66. ACM
    Woodward J, Johnson C and Brownlee A GP vs GI Proceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion, (1155-1156)
  67. Sidhanta S, Golab W, Mukhopadhyay S and Basu S OptCon Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, (388-397)
  68. Elahi M, Ricci F and Rubens N (2016). A survey of active learning in collaborative filtering recommender systems, Computer Science Review, 20:C, (29-50), Online publication date: 1-May-2016.
  69. Erlandsson F, Borg A, Johnson H and Bródka P Predicting User Participation in Social Media Proceedings of the 12th International Conference and School on Advances in Network Science - Volume 9564, (126-135)
  70. Wei M, Chow T and Chan R (2015). Heterogeneous feature subset selection using mutual information-based feature transformation, Neurocomputing, 168:C, (706-718), Online publication date: 30-Nov-2015.
  71. Vranjković V, Struharik R and Novak L (2015). Hardware acceleration of homogeneous and heterogeneous ensemble classifiers, Microprocessors & Microsystems, 39:8, (782-795), Online publication date: 1-Nov-2015.
  72. Bolón-Canedo V, Sánchez-Maroño N and Alonso-Betanzos A (2015). Recent advances and emerging challenges of feature selection in the context of big data, Knowledge-Based Systems, 86:C, (33-45), Online publication date: 1-Sep-2015.
  73. da Silva P, Gonçalves E, Rios E, Muhammad A, Moss A, Pritchard T, Glassborow B, Plastino A and Azeredo R (2015). Automatic classification of carbonate rocks permeability from 1H NMR relaxation data, Expert Systems with Applications: An International Journal, 42:9, (4299-4309), Online publication date: 1-Jun-2015.
  74. ACM
    Hern´ndez-Orallo J (2014). Probabilistic Reframing for Cost-Sensitive Regression, ACM Transactions on Knowledge Discovery from Data, 8:4, (1-55), Online publication date: 7-Oct-2014.
  75. ACM
    Woodward J, Martin S and Swan J Benchmarks that matter for genetic programming Proceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation, (1397-1404)
  76. ACM
    Woodward J, Swan J and Martin S The 'composite' design pattern in metaheuristics Proceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation, (1439-1444)
  77. ACM
    Cuayáhuitl H, van Otterlo M, Dethlefs N and Frommberger L Machine learning for interactive systems and robots Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication, (19-28)
  78. ACM
    Križman V, Karalič A, Kukliński S and Cestnik B Implementation aspects of self-healing functionality in long term evolution (LTE) networks Proceedings of the 14th International Conference on Computer Systems and Technologies, (30-37)
  79. Popkov Y, Dubnov Y and Popkov A Randomized machine learning: Statement, solution, applications 2016 IEEE 8th International Conference on Intelligent Systems (IS), (27-39)
Contributors
  • University of Bristol

Index Terms

  1. Machine Learning: The Art and Science of Algorithms that Make Sense of Data

      Recommendations

      Reviews

      Fernando Berzal

      As the author highlights in his prologue, machine learning is the systematic study of algorithms and systems that improve their knowledge or performance with experience. Using a spam filter as an example, the reader is gently introduced to the "ingredients" of a machine learning system: tasks, models, and features. Machine learning tasks are stylized problems that can be solved with machine learning techniques, the most common of them being the classifier behind a spam filter, which distinguishes between spam (unwanted email) and ham (not spam). Two chapters are devoted to their description. The first one is focused on binary classification, as in the spam filter, and the related tasks of scoring and ranking. The second chapter extends the basic framework for classification problems with more than two classes and for the prediction of real-valued variables (regression). It also introduces other important machine learning tasks, including clustering, subgroup discovery, and association rules. Machine learning models are the output of machine learning algorithms that "make sense of data." Unsurprisingly, more than half of the chapters of Flach's textbook are dedicated to their study. The survey of available models starts with an overview of logical models, followed by two separate chapters on decision tree induction and rule induction, with discussions on ranking and probability estimation trees, descriptive rule learning for subgroup discovery, association rule mining, and learning first-order rules. After the 100 or so pages devoted to logical models, two chapters are devoted to geometric models. Geometric models encompass linear models (linear classifiers, linear regression, the perceptron, and support vector machines) and distance-based models (nearest-neighbor classifiers and the most common clustering techniques, for example k -means and hierarchical clustering). Short final sections nicely introduce the ideas behind kernel-based models that extend basic models and whose description is not always as readable as in this textbook. The final chapter on individual models is devoted to probabilistic models, of which na?ve Bayes and Gaussian mixture models are two prominent examples. After a detour through a separate chapter on features, a short 13-page chapter overviews model ensembles and completes a nice survey of existing machine learning models. Model ensembles combine individual models and often outperform them. In their chapter, readers will become acquainted with useful techniques such as bagging and boosting. Beyond the survey of machine learning models, the author devotes separate chapters to features and experiments. Features are the third ingredient of machine learning systems, and "the workhorses of machine learning," using the author's own words. Preprocessing them is often key in the performance of a machine learning system. Hence, many techniques have been proposed for transforming, selecting, and constructing features-discretization, normalization, calibration, and principal component analysis, to name just a few. The final chapter on experiments explores the importance of choosing the right performance measurement for the machine learning task at hand. Using a statistician's perspective, it takes a closer look at how to properly measure and interpret them. This textbook is clearly written and well organized. Starting from the basics, the author skillfully guides the reader through his learning process by providing useful facts and insight into the behavior of several machine learning techniques, as well as the high-level pseudocode of many key algorithms. Each chapter ends with a summary section that reinforces key points and provides just the right number of references for those interested in delving deeper. Summary sidebars are scattered throughout the book that review key mathematical concepts, making this book almost completely self-contained for novices. The book is also nicely typeset in color and includes plenty of cleverly designed figures that illustrate machine learning concepts as they are discussed in the text. There is only a minor caveat with respect to those figures: many of them contain so many details and their lines are so thin that they are hardly legible. It is surprising how much can be covered in around 400 pages. Did I mention the author's crystal clear explanations of computational learning theory (section 4.4, where concepts such as probably approximately correct (PAC) learning and the VC-dimension are introduced) and the minimum description length (section 9.5 on compression-based models), or his extensive use of coverage curves to visualize the behavior of machine learning models__?__ Obviously, breadth takes precedence over depth. The reader will often learn the intuition behind some models, but he will not be aware of all the technicalities that he should know in order to be able to implement the algorithms by himself. Of course, many might argue that well-tested software packages are available that provide the desired functionality. Hence, many readers would never try to implement their own machine learning algorithms. As users, however, they should always know the algorithms' theoretical foundations, key features, and, most of all, inherent limitations. This is precisely where this gentle introductory textbook excels. More reviews about this item: Amazon Online Computing Reviews Service

      Access critical reviews of Computing literature here

      Become a reviewer for Computing Reviews.