Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning
August 2008
Publisher: Springer Publishing Company, Incorporated
ISBN: 978-0-387-78188-4
Published: 28 August 2008
Pages: 734
Abstract

Remarkable advances in computation and data storage and the ready availability of huge data sets have been the keys to the growth of the new disciplines of data mining and machine learning, while the enormous success of the Human Genome Project has opened up the field of bioinformatics. These exciting developments, which led to the introduction of many innovative statistical tools for high-dimensional data analysis, are described here in detail. The author takes a broad perspective; for the first time in a book on multivariate analysis, nonlinear methods are discussed in detail as well as linear methods. Techniques covered range from traditional multivariate methods, such as multiple regression, principal components, canonical variates, linear discriminant analysis, factor analysis, clustering, multidimensional scaling, and correspondence analysis, to the newer methods of density estimation, projection pursuit, neural networks, multivariate reduced-rank regression, nonlinear manifold learning, bagging, boosting, random forests, independent component analysis, support vector machines, and classification and regression trees. Another unique feature of this book is the discussion of database management systems. This book is appropriate for advanced undergraduate students, graduate students, and researchers in statistics, computer science, artificial intelligence, psychology, cognitive sciences, business, medicine, bioinformatics, and engineering. Familiarity with multivariable calculus, linear algebra, and probability and statistics is required. The book presents a carefully integrated mixture of theory and applications, and of classical and modern multivariate statistical techniques, including Bayesian methods. There are over 60 interesting data sets used as examples in the book, over 200 exercises, and many color illustrations and photographs.

Cited By

  1. Yamashita N (2023). Principal component analysis constrained by layered simple structures, Advances in Data Analysis and Classification, 17:2, (347-367), Online publication date: 1-Jun-2023.
  2. Baran Á, Lerch S, El Ayari M and Baran S (2021). Machine learning for total cloud cover prediction, Neural Computing and Applications, 33:7, (2605-2620), Online publication date: 1-Apr-2021.
  3. Rani R, Singh A and Kumar R (2019). Impact of reduction in descriptor size on object detection and classification, Multimedia Tools and Applications, 78:7, (8965-8979), Online publication date: 1-Apr-2019.
  4. Perrot R, Aveneau L, Mora F and Meneveaux D (2019). Photon mapping with visible kernel domains, The Visual Computer: International Journal of Computer Graphics, 35:5, (707-720), Online publication date: 1-May-2019.
  5. Liu T, Shi Z and Liu Y (2019). Supervised dimensionality reduction on grassmannian for image set recognition, Neural Computation, 31:1, (156-175), Online publication date: 1-Jan-2019.
  6. Clark J and Provost F (2019). Unsupervised dimensionality reduction versus supervised regularization for classification from sparse data, Data Mining and Knowledge Discovery, 33:4, (871-916), Online publication date: 1-Jul-2019.
  7. Rapin J, Gallagher M, Kerschke P, Preuss M and Teytaud O. Exploring the MLDA benchmark on the nevergrad platform, Proceedings of the Genetic and Evolutionary Computation Conference Companion, (1888-1896)
  8. Gyamfi K, Brusey J, Hunt A and Gaura E (2018). Linear dimensionality reduction for classification via a sequential Bayes error minimisation with an application to flow meter diagnostics, Expert Systems with Applications: An International Journal, 91:C, (252-262), Online publication date: 1-Jan-2018.
  9. Chen S and Banerjee A. An improved analysis of alternating minimization for structured multi-response regression, Proceedings of the 32nd International Conference on Neural Information Processing Systems, (6617-6628)
  10. Ultsch A and Lötsch J (2017). Machine-learned cluster identification in high-dimensional data, Journal of Biomedical Informatics, 66:C, (95-104), Online publication date: 1-Feb-2017.
  11. Gyamfi K, Brusey J, Hunt A and Gaura E (2017). Linear classifier design under heteroscedasticity in Linear Discriminant Analysis, Expert Systems with Applications: An International Journal, 79:C, (44-52), Online publication date: 15-Aug-2017.
  12. Chen S and Banerjee A. Alternating estimation for structured high-dimensional multi-response models, Proceedings of the 31st International Conference on Neural Information Processing Systems, (2835-2844)
  13. Lorente D, Martínez-Martínez F, Rupérez M, Lago M, Martínez-Sober M, Escandell-Montero P, Martínez-Martínez J, Martínez-Sanchis S, Serrano-López A, Monserrat C and Martín-Guerrero J (2017). A framework for modelling the biomechanical behaviour of the human liver during breathing in real time using machine learning, Expert Systems with Applications: An International Journal, 71:C, (342-357), Online publication date: 1-Apr-2017.
  14. Rabusseau G and Kadri H. Low-rank regression with tensor responses, Proceedings of the 30th International Conference on Neural Information Processing Systems, (1875-1883)
  15. Hartmann H (2016). Statistics for Engineers, Queue, 14:1, (23-52), Online publication date: 1-Jan-2016.
  16. Hartmann H (2016). Statistics for engineers, Communications of the ACM, 59:7, (58-66), Online publication date: 24-Jun-2016.
  17. Hartmann H (2018). Statistics for Engineers, Queue, 14:1, (23-52), Online publication date: 1-Jan-2016.
  18. Uno K, Satomura H and Adachi K (2016). Fixed factor analysis with clustered factor score constraint, Computational Statistics & Data Analysis, 94:C, (265-274), Online publication date: 1-Feb-2016.
  19. Liu H, Wang L and Zhao T (2015). Calibrated multivariate regression with application to neural semantic basis discovery, The Journal of Machine Learning Research, 16:1, (1579-1606), Online publication date: 1-Jan-2015.
  20. Li Q, Niu W, Li G, Cao Y, Tan J and Guo L. Lingo, Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, (801-809)
  21. Efromovich S (2014). Nonparametric regression with missing data, WIREs Computational Statistics, 6:4, (265-275), Online publication date: 1-Jul-2014.
  22. Carrizosa E and Guerrero V (2014). Biobjective sparse principal component analysis, Journal of Multivariate Analysis, 132:C, (151-159), Online publication date: 1-Nov-2014.
  23. Sun M, Priebe C and Tang M (2013). Generalized canonical correlation analysis for disparate data fusion, Pattern Recognition Letters, 34:2, (194-200), Online publication date: 1-Jan-2013.
  24. Bijak K and Thomas L (2012). Does segmentation always improve model performance in credit scoring?, Expert Systems with Applications: An International Journal, 39:3, (2433-2442), Online publication date: 1-Feb-2012.
  25. Holena M, Linke D and Bajer L. Surrogate modeling in the evolutionary optimization of catalytic materials, Proceedings of the 14th annual conference on Genetic and evolutionary computation, (1095-1102)
  26. Gu Y and Wang C. A study of hierarchical correlation clustering for scientific volume data, Proceedings of the 6th international conference on Advances in visual computing - Volume Part III, (437-446)
  27. Misra V, Harmon D and Bar-Yam Y. Vulnerability analysis of high dimensional complex systems, Proceedings of the 12th international conference on Stabilization, safety, and security of distributed systems, (560-572)
  28. Takai K and Yada K. Relation between stay-time and purchase probability based on RFID data in a Japanese supermarket, Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III, (254-263)
  29. Holmström L and Koistinen P (2010). Pattern recognition, WIREs Computational Statistics, 2:4, (404-413), Online publication date: 1-Jul-2010.
  30. Gocheva-Ilieva S and Iliev I. Modeling and prediction of laser generation in UV copper bromide laser via MARS, Proceedings of the 2nd WSEAS international conference on Nanotechnology, (166-171)
  31. Gavrishchaka V, Koepke M and Ulyanova O. Boosting-based discovery of multi-component physiological indicators, Proceedings of the 1st ACM International Health Informatics Symposium, (790-799)
  32. Theijssen D. Variable selection in logistic regression, Proceedings of the 2008 international conference on Interfaces: explorations in logic, language and computation, (87-101)
Contributors
  • Temple University
