Designed for research scientists and graduate-level students in computer science and engineering, this book organizes all major concepts, theories, methodologies, trends, challenges and applications of DM and KDD into a coherent and unified repository.
Cited By
- Zhang J, Luximon Y, Shah P and Li P (2023). 3D Statistical Head Modeling for Face/head-Related Product Design, Computer-Aided Design, 159:C, Online publication date: 1-Jun-2023.
- Rivera A, Muñoz J, Pérez-Goody M, de San Pedro B, Charte F, Elizondo D, Rodríguez C, Abolafia M, Perea A and del Jesus M (2023). XAIRE, Artificial Intelligence in Medicine, 137:C, Online publication date: 1-Mar-2023.
- Xue M, Zhang H, Huang Q, Song J and Song M (2022). Learn decision trees with deep visual primitives, Journal of Visual Communication and Image Representation, 89:C, Online publication date: 1-Nov-2022.
- Ding M, Wang T and Wang X (2021). Establishing Smartphone User Behavior Model Based on Energy Consumption Data, ACM Transactions on Knowledge Discovery from Data, 16:2, (1-40), Online publication date: 30-Apr-2022.
- Moyano J and Ventura S (2022). Auto-adaptive Grammar-Guided Genetic Programming algorithm to build Ensembles of Multi-Label Classifiers, Information Fusion, 78:C, (1-19), Online publication date: 1-Feb-2022.
- Gosiewska A, Kozak A and Biecek P (2022). Simpler is better, Decision Support Systems, 150:C, Online publication date: 1-Nov-2021.
- Mongelli M (2021). Design of countermeasure to packet falsification in vehicle platooning by explainable artificial intelligence, Computer Communications, 179:C, (166-174), Online publication date: 1-Nov-2021.
- Bishay M, Palasek P, Priebe S and Patras I (2021). SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis, IEEE Transactions on Affective Computing, 12:4, (949-961), Online publication date: 1-Oct-2021.
- Abnane I, Idri A, Hosni M and Abran A Heterogeneous ensemble imputation for software development effort estimation Proceedings of the 17th International Conference on Predictive Models and Data Analytics in Software Engineering, (1-10)
- Garg A and Mago V (2022). Role of machine learning in medical research, Computer Science Review, 40:C, Online publication date: 1-May-2021.
- Paul S, Alvi A and Rahman R (2021). An analysis of the most accident prone regions within the Dhaka Metropolitan Region using clustering, International Journal of Advanced Intelligence Paradigms, 18:3, (294-315), Online publication date: 1-Jan-2021.
- Heldens S, Hijma P, van Werkhoven B, Maassen J, Bal H and van Nieuwpoort R Rocket Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
- Chaabane I, Guermazi R and Hammami M (2019). Enhancing techniques for learning decision trees from imbalanced data, Advances in Data Analysis and Classification, 14:3, (677-745), Online publication date: 1-Sep-2020.
- Asemi A and Ebrahimi F (2020). A Thematic Analysis of the Articles on the Internet of Things in the Web of Science With HAC Approach, International Journal of Distributed Systems and Technologies, 11:2, (1-17), Online publication date: 1-Apr-2020.
- de la Cal E, Villar J, Vergara P, Herrero Á and Sedano J (2019). Design issues in Time Series dataset balancing algorithms, Neural Computing and Applications, 32:5, (1287-1304), Online publication date: 1-Mar-2020.
- Teixeira J, Alves N and Fernandes P (2020). Vocal Acoustic Analysis, International Journal of E-Health and Medical Communications, 11:1, (37-51), Online publication date: 1-Jan-2020.
- Chen Y and Chang C (2019). Early prediction of the future popularity of uploaded videos, Expert Systems with Applications: An International Journal, 133:C, (59-74), Online publication date: 1-Nov-2019.
- Shah N and Patil H (2022). A novel approach to remove outliers for parallel voice conversion, Computer Speech and Language, 58:C, (127-152), Online publication date: 1-Nov-2019.
- Khan N, Chaudhuri U, Banerjee B and Chaudhuri S (2019). Graph convolutional network for multi-label VHR remote sensing scene recognition, Neurocomputing, 357:C, (36-46), Online publication date: 10-Sep-2019.
- Singh V and Hofenbitzer C Fairness across network positions in cyberbullying detection algorithms Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, (557-559)
- Rastin P, Cabanes G, Matei B, Bennani Y and Marty J (2022). A new sparse representation learning of complex data, Pattern Recognition, 91:C, (291-307), Online publication date: 1-Jul-2019.
- Wang P and He Y Uni-Detect Proceedings of the 2019 International Conference on Management of Data, (811-828)
- Saeed N, Nam H, Haq M and Muhammad Saqib D (2018). A Survey on Multidimensional Scaling, ACM Computing Surveys, 51:3, (1-25), Online publication date: 31-May-2019.
- Stopar L, Skraba P, Grobelnik M and Mladenic D (2019). StreamStory, IEEE Transactions on Visualization and Computer Graphics, 25:4, (1788-1802), Online publication date: 1-Apr-2019.
- Gu X, Angelov P and Zhao Z (2019). A distance-type-insensitive clustering approach, Applied Soft Computing, 77:C, (622-634), Online publication date: 1-Apr-2019.
- Nazari A, Dehghan A, Nejatian S, Rezaie V and Parvin H (2019). A comprehensive study of clustering ensemble weighting based on cluster quality and diversity, Pattern Analysis & Applications, 22:1, (133-145), Online publication date: 1-Feb-2019.
- Brodić D and Amelio A (2018). Association rule mining for the usability of the CAPTCHA interfaces, Multimedia Systems, 24:6, (625-644), Online publication date: 1-Nov-2018.
- Alhazmi K, Moubayed A and Shami A Distributed SDN Controller Placement Using Betweenness Centrality & Hierarchical Clustering Proceedings of the 8th ACM Symposium on Design and Analysis of Intelligent Vehicular Networks and Applications, (15-20)
- Youssar S, Bahtaoui M, Jarmouni Y and Berrado A Clustering of Pharmaceutical products using Random Forest algorithm Proceedings of the 12th International Conference on Intelligent Systems: Theories and Applications, (1-6)
- Zhao M, Tian Y, Zhao H, Alsheikh M, Li T, Hristov R, Kabelac Z, Katabi D and Torralba A RF-based 3D skeletons Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, (267-281)
- tabash K and Happa J (2018). Insider-threat detection using Gaussian Mixture Models and Sensitivity Profiles, Computers and Security, 77:C, (838-859), Online publication date: 1-Aug-2018.
- Luque Ruiz I, Cerruela García G and Gómez-Nieto M (2018). Interactive mosaic building and its application to marketing strategies using NFC, Multimedia Tools and Applications, 77:12, (15291-15320), Online publication date: 1-Jun-2018.
- Rabie R, Eltoukhy M, al-Shatouri M and Rashed E Computer Aided Diagnosis System for Liver Cirrhosis Based on Ultrasound Images Proceedings of the 7th International Conference on Software and Information Engineering, (68-71)
- He J and Xiong N (2018). An effective information detection method for social big data, Multimedia Tools and Applications, 77:9, (11277-11305), Online publication date: 1-May-2018.
- Ghofrani J, Bozorgmehr A and Panah A A Fast Algorithm Based on Apriori Algorithms to Explore the Set of Repetitive Items of Large Transaction Data Proceedings of the 2nd International Conference on Compute and Data Analysis, (13-19)
- Abelln J, Mantas C and Castellano J (2018). AdaptativeCC4.5, Expert Systems with Applications: An International Journal, 92:C, (363-379), Online publication date: 1-Feb-2018.
- Parameshwarappa P, Chen Z and Gangopadhyay A Analyzing attack strategies against rule-based intrusion detection systems Proceedings of the Workshop Program of the 19th International Conference on Distributed Computing and Networking, (1-4)
- Tongman S, Chanama S, Chanama M, Plaimas K and Lursinsap C (2017). Metabolic pathway synthesis based on predicting compound transformable pairs by using neural classifiers with imbalanced data handling, Expert Systems with Applications: An International Journal, 88:C, (45-57), Online publication date: 1-Dec-2017.
- Aqil A, Khalil K, Atya A, Papalexakis E, Krishnamurthy S, Jaeger T, Ramakrishnan K, Yu P and Swami A Jaal Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies, (134-146)
- Villuendas-Rey Y, Rey-Bengura C, Ferreira-Santiago n, Camacho-Nieto O and Yez-Mrquez C (2017). The Nave Associative Classifier (NAC), Neurocomputing, 265:C, (105-115), Online publication date: 22-Nov-2017.
- Oliveira R, Pereira A and Tavares J (2017). Skin lesion computational diagnosis of dermoscopic images, Computer Methods and Programs in Biomedicine, 149:C, (43-53), Online publication date: 1-Oct-2017.
- Braun P, Cuzzocrea A, Doan L, Kim S, Leung C, Matundan J and Robby Singh R (2017). Enhanced Prediction of User-Preferred YouTube Videos Based on Cleaned Viewing Pattern History, Procedia Computer Science, 112:C, (2230-2239), Online publication date: 1-Sep-2017.
- Zhao K, Li B, Peng Z, Bu J and Wang C Navigation objects extraction for better content structure understanding Proceedings of the International Conference on Web Intelligence, (629-636)
- Rosenfeld A and Kraus S When security games hit traffic Proceedings of the 26th International Joint Conference on Artificial Intelligence, (3814-3822)
- CK Y, M H, R Y, Ngadiran R, H A, Yaacob S and Polat K (2017). Bispectral features and mean shift clustering for stress and emotion recognition from natural speech, Computers and Electrical Engineering, 62:C, (676-691), Online publication date: 1-Aug-2017.
- Spanakis G, Weiss G, Boh B, Lemmens L and Roefs A (2017). Machine learning techniques in eating behavior e-coaching, Personal and Ubiquitous Computing, 21:4, (645-659), Online publication date: 1-Aug-2017.
- Addawood A, Schneider J and Bashir M Stance Classification of Twitter Debates Proceedings of the 8th International Conference on Social Media & Society, (1-10)
- Zhou L, Wang Q and Fujita H (2017). One versus one multi-class classification fusion using optimizing decision directed acyclic graph for predicting listing status of companies, Information Fusion, 36:C, (80-89), Online publication date: 1-Jul-2017.
- Alcaide D and Aerts J DaaG Proceedings of the Eurographics/IEEE VGTC Conference on Visualization: Posters, (61-63)
- Abell K, Theurer M, Larson R, White B, Hardin D and Randle R (2017). Predicting bull behavior events in a multiple-sire pasture with video analysis, accelerometers, and classification algorithms, Computers and Electronics in Agriculture, 136:C, (221-227), Online publication date: 15-Apr-2017.
- Stein A, Rauh D, Tomforde S and Hhner J (2017). Interpolation in the eXtended Classifier System, Journal of Systems Architecture: the EUROMICRO Journal, 75:C, (79-94), Online publication date: 1-Apr-2017.
- C. S and Sherimon V A proposed onto-Apriori algorithm to mine frequent patterns of high quality seafood Proceedings of the Second International Conference on Internet of things, Data and Cloud Computing, (1-6)
- Mitra T, Wright G and Gilbert E A Parsimonious Language Model of Social Media Credibility Across Disparate Events Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, (126-145)
- Yazdanbakhsh O, Zhou Y and Dick S (2017). An intelligent system for livestock disease surveillance, Information Sciences: an International Journal, 378:C, (26-47), Online publication date: 1-Feb-2017.
- Ta V and Liu C Stock market analysis using clustering techniques Proceedings of the 7th Symposium on Information and Communication Technology, (99-106)
- Li Z, Kawamoto J, Feng Y and Sakurai K Cyberbullying detection using parent-child relationship between comments Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services, (325-334)
- Chou J, Wang Y and Ma K Privacy preserving event sequence data visualization using a Sankey diagram-like representation SIGGRAPH ASIA 2016 Symposium on Visualization, (1-8)
- Piltaver R, Luštrek M, Gams M and Martinčić-Ipšić S (2016). What makes classification trees comprehensible?, Expert Systems with Applications: An International Journal, 62:C, (333-346), Online publication date: 15-Nov-2016.
- Neves Y, Sindeaux M, Souza W, Kozievitch N, Loureiro A and Silva T Study of Google Popularity Times Series for Commercial Establishments of Curitiba and Chicago Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web, (303-310)
- Sprint G, Cook D and Schmitter-Edgecombe M (2016). Unsupervised detection and analysis of changes in everyday physical activity data, Journal of Biomedical Informatics, 63:C, (54-65), Online publication date: 1-Oct-2016.
- Soltanifar B, Erdem A and Bener A Predicting Defectiveness of Software Patches Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, (1-10)
- Singh V, Huang Q and Atrey P Cyberbullying detection using probabilistic socio-textual information fusion Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, (884-887)
- Yang Y, Chen D, Gu R, Gu Y and Yu S (2016). Consumers’ Kansei Needs Clustering Method for Product Emotional Design Based on Numerical Design Structure Matrix and Genetic Algorithms, Computational Intelligence and Neuroscience, 2016, (12), Online publication date: 1-Aug-2016.
- Maldonado A, Aguilera P and Salmerón A (2016). Modeling zero-inflated explanatory variables in hybrid Bayesian network classifiers for species occurrence prediction, Environmental Modelling & Software, 82:C, (31-43), Online publication date: 1-Aug-2016.
- Panichella A, Alexandru C, Panichella S, Bacchelli A and Gall H A Search-based Training Algorithm for Cost-aware Defect Prediction Proceedings of the Genetic and Evolutionary Computation Conference 2016, (1077-1084)
- Jian C, Gao J and Ao Y (2016). A new sampling method for classifying imbalanced data based on support vector machine ensemble, Neurocomputing, 193:C, (115-122), Online publication date: 12-Jun-2016.
- Asadi S and Shahrabi J (2016). RipMC, Neurocomputing, 191:C, (19-33), Online publication date: 26-May-2016.
- Müller S and Fritz T Using (bio)metrics to predict code quality online Proceedings of the 38th International Conference on Software Engineering, (452-463)
- Haris N, Abdullah M, Hasim N and Rahman F A Study on Students Enrollment Prediction using Data Mining Proceedings of the 10th International Conference on Ubiquitous Information Management and Communication, (1-5)
- Palamakumbure D, Flentje P and Stirling D (2015). Consideration of optimal pixel resolution in deriving landslide susceptibility zoning within the Sydney Basin, New South Wales, Australia, Computers & Geosciences, 82:C, (13-22), Online publication date: 1-Sep-2015.
- Küçükdeniz T and Esnaf Ş Data Clustering by Particle Swarm Optimization with the Focal Particles Revised Selected Papers of the First International Workshop on Machine Learning, Optimization, and Big Data - Volume 9432, (280-292)
- Joia P, Petronetto F and Nonato L (2015). Uncovering Representative Groups in Multidimensional Projections, Computer Graphics Forum, 34:3, (281-290), Online publication date: 1-Jun-2015.
- Zhang X, Song Q, Wang G, Zhang K, He L and Jia X (2015). A dissimilarity-based imbalance data classification algorithm, Applied Intelligence, 42:3, (544-565), Online publication date: 1-Apr-2015.
- del Río S, López V, Benítez J and Herrera F (2014). On the use of MapReduce for imbalanced big data using Random Forest, Information Sciences: an International Journal, 285:C, (112-137), Online publication date: 20-Nov-2014.
- Huang Q, Singh V and Atrey P Cyber Bullying Detection Using Social and Textual Analysis Proceedings of the 3rd International Workshop on Socially-Aware Multimedia, (3-6)
- Bhat S, Abulaish M and Mirza A Spammer Classification Using Ensemble Methods over Structural Social Network Features Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02, (454-458)
- An A, Dauletbakov B and Levner E Multi-attribute Classification of Text Documents as a Tool for Ranking and Categorization of Educational Innovation Projects Proceedings of the 15th International Conference on Computational Linguistics and Intelligent Text Processing - Volume 8404, (404-416)
- Ziegler A and König I (2014). Mining data with random forests, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 4:1, (55-63), Online publication date: 1-Jan-2014.
- Armengol E and García-Cerdaña À Refining discretizations of continuous-valued attributes Proceedings of the 9th international conference on Modeling Decisions for Artificial Intelligence, (258-269)
- Haribhakta Y, Kalamkar S and Kulkarni P Feature annotation for text categorization Proceedings of the CUBE International Information Technology Conference, (308-313)
- Merhav Y, Mesquita F, Barbosa D, Yee W and Frieder O (2012). Extracting information networks from the blogosphere, ACM Transactions on the Web, 6:3, (1-33), Online publication date: 1-Sep-2012.
- Lopes R, Freitas A, Silva R and Guimarães F Differential evolution and perceptron decision trees for classification tasks Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning, (550-557)
- Mahmoud S, Lotfi A and Langensiepen C User activities outlier detection system using principal component analysis and fuzzy rule-based system Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments, (1-8)
- Gorzałczany M and Rudziński F Accuracy vs. interpretability of fuzzy rule-based classifiers Proceedings of the 2012 international conference on Swarm and Evolutionary Computation, (222-230)
- Yildizer E, Balci A, Hassan M and Alhajj R (2012). Efficient content-based image retrieval using Multiple Support Vector Machines Ensemble, Expert Systems with Applications: An International Journal, 39:3, (2385-2396), Online publication date: 1-Feb-2012.
- Hayashi Y, Kojiri T and Watanabe T Interaction based on contribution awareness in collaborative learning Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part II, (104-113)
- Hughes C, Hughes T and Lazar A Discovering coherence and justification clusters in digital transcripts using epistemic analysis Proceedings of the 13th International Conference on Artificial Intelligence and Law, (219-223)
- Mahmoud S, Lotfi A and Langensiepen C Abnormal behaviours identification for an elder's life activities using dissimilarity measurements Proceedings of the 4th International Conference on PErvasive Technologies Related to Assistive Environments, (1-5)
- Zhang Y, Li H, Wöhrer A, Brezany P and Dai G Decomposing data mining by a process-oriented execution plan Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I, (97-106)
- Petr P, Křupka J and Provazníková R Statistical approach to analysis of the regions Proceedings of the 10th WSEAS international conference on Applied computer science, (280-285)
- Guruler H, Istanbullu A and Karahasan M (2010). A new student performance analysing system using knowledge discovery in higher educational databases, Computers & Education, 55:1, (247-254), Online publication date: 1-Aug-2010.
- Bifet A Adaptive Stream Mining Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams, (1-212)
- Bhatnagar V and Ahuja S Robust clustering using discriminant analysis Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects, (143-157)
- Zaidi F, Archambault D and Melançon G Evaluating the quality of clustering algorithms using cluster path lengths Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects, (42-56)
- Drozdz K and Kwasnicka H Feature set reduction by evolutionary selection and construction Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II, (140-149)
- Gómez I, Franco L, Jerez J and Subirats J Extension of the generalization complexity measure to real valued input data sets Proceedings of the 7th international conference on Advances in Neural Networks - Volume Part I, (86-94)
- Wu J, Xiong H and Chen J (2010). COG, Data Mining and Knowledge Discovery, 20:2, (191-220), Online publication date: 1-Mar-2010.
- Gomolińska A Satisfiability judgement under incomplete information Transactions on Rough Sets XI, (66-91)
- Zhou L, Lai K and Yu L (2010). Least squares support vector machines ensemble models for credit scoring, Expert Systems with Applications: An International Journal, 37:1, (127-133), Online publication date: 1-Jan-2010.
- Saha S, Murthy C and Pal S Hypertext Classification Using Tensor Space Model and Rough Set Based Ensemble Classifier Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence, (213-218)
- Piton T, Blanchard J, Briand H and Guillet F Domain driven data mining to improve promotional campaign ROI and select marketing channels Proceedings of the 18th ACM conference on Information and knowledge management, (1057-1066)
- Lairenjam B and Wasan S Neural network with classification based on multiple association rule for classifying mammographic data Proceedings of the 10th international conference on Intelligent data engineering and automated learning, (465-476)
- Tseng V and Lee C (2009). Effective temporal data classification by integrating sequential pattern mining and probabilistic induction, Expert Systems with Applications: An International Journal, 36:5, (9524-9532), Online publication date: 1-Jul-2009.
- del Pilar Bautista Morales S, Fandiño H and Rodríguez J Hypertext classification to filtrate information on the web Proceedings of the 2009 Euro American Conference on Telematics and Information Systems: New Opportunities to increase Digital Citizenship, (1-7)
- Sutcliffe A (2008). The socio-economics of software architecture, Automated Software Engineering, 15:3-4, (343-363), Online publication date: 1-Dec-2008.
- Csorba K and Vajk I Cascaded search for similar documents between mobile devices Proceedings of the 12th WSEAS international conference on Computers, (122-127)
- Zhang Z, Shi Y, Gao G and Chai Y An Effective Feature Selection Method Using the Contribution Likelihood Ratio of Attributes for Classification Revised Selected Papers of the APWeb 2008 International Workshops on Advanced Web and Network Technologies, and Applications - Volume 4977, (165-171)
- Tsumoto S and Hirano S Contingency matrix theory I Transactions on computational science II, (161-179)
- Graening L, Olhofer M and Sendhoff B Knowledge extraction from unstructured surface meshes Proceedings of the 8th international conference on Intelligent data engineering and automated learning, (497-506)
- Bhatnagar V and Kaur S Exclusive and complete clustering of streams Proceedings of the 18th international conference on Database and Expert Systems Applications, (629-638)
- Wu J, Xiong H, Wu P and Chen J Local decomposition for rare class analysis Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, (814-823)
- Abad-Grau M and Sebastiani P Multivariate imputation of genotype data using short and long range disequilibrium Proceedings of the 11th international conference on Computer aided systems theory, (187-194)
- Yuan B, Orlowska M and Sadiq S Real-time acquisition of buyer behaviour data Proceedings of the 1st international conference on Business intelligence for the real-time enterprises, (106-117)
- Cao L and Zhang C Domain-Driven actionable knowledge discovery in the real world Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining, (821-830)
- Nguyen H Approximate boolean reasoning Transactions on Rough Sets V, (334-506)
- Welcome message from conference co-chairs 2015 IEEE International Conference on Intelligence and Security Informatics (ISI), (1-2)
Index Terms
- Data Mining and Knowledge Discovery Handbook
Recommendations
Advancing Knowledge Discovery and Data Mining
WKDD '08: Proceedings of the First International Workshop on Knowledge Discovery and Data MiningKnowledge discovery and data mining have become areas of growing significance because of the recent increasing demand for KDD techniques, including those used in machine learning, databases, statistics, knowledge acquisition, data visualization, and ...
Knowledge discovery and data mining in biological databases
The new technologies for Knowledge Discovery from Databases (KDD) and data mining promise to bring new insights into a voluminous growing amount of biological data. KDD technology is complementary to laboratory experimentation and helps speed up ...