skip to main content
Skip header Section
Dataset Shift in Machine LearningFebruary 2009
Publisher:
  • The MIT Press
ISBN:978-0-262-17005-5
Published:27 February 2009
Pages:
248
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

Dataset shift is a common problem in predictive modeling that occurs when the joint distribution of inputs and outputs differs between training and test stages. Covariate shift, a particular case of dataset shift, occurs when only the input distribution changes. Dataset shift is present in most practical applications, for reasons ranging from the bias introduced by experimental design to the irreproducibility of the testing conditions at training time. (An example is -email spam filtering, which may fail to recognize spam that differs in form from the spam the automatic filter has been built on.) Despite this, and despite the attention given to the apparently similar problems of semi-supervised learning and active learning, dataset shift has received relatively little attention in the machine learning community until recently. This volume offers an overview of current efforts to deal with dataset and covariate shift. The chapters offer a mathematical and philosophical introduction to the problem, place dataset shift in relationship to transfer learning, transduction, local learning, active learning, and semi-supervised learning, provide theoretical views of dataset and covariate shift (including decision theoretic and Bayesian perspectives), and present algorithms for covariate shift. Contributors: Shai Ben-David, Steffen Bickel, Karsten Borgwardt, Michael Brckner, David Corfield, Amir Globerson, Arthur Gretton, Lars Kai Hansen, Matthias Hein, Jiayuan Huang, Takafumi Kanamori, Klaus-Robert Mller, Sam Roweis, Neil Rubens, Tobias Scheffer, Marcel Schmittfull, Bernhard Schlkopf, Hidetoshi Shimodaira, Alex Smola, Amos Storkey, Masashi Sugiyama, Choon Hui Teo Neural Information Processing series

Cited By

  1. ACM
    Caton S and Haas C (2024). Fairness in Machine Learning: A Survey, ACM Computing Surveys, 56:7, (1-38), Online publication date: 31-Jul-2024.
  2. Alvarez J, Colmenarejo A, Elobaid A, Fabbrizzi S, Fahimi M, Ferrara A, Ghodsi S, Mougan C, Papageorgiou I, Reyero P, Russo M, Scott K, State L, Zhao X and Ruggieri S (2024). Policy advice and best practices on bias and fairness in AI, Ethics and Information Technology, 26:2, Online publication date: 1-Jun-2024.
  3. Su J, Shen H, Peng L and Hu D (2024). Few-Shot Domain-Adaptive Anomaly Detection for Cross-Site Brain Images, IEEE Transactions on Pattern Analysis and Machine Intelligence, 46:3, (1819-1835), Online publication date: 1-Mar-2024.
  4. Wen L, Chen S, Xie M, Liu C and Zheng L (2024). Training multi-source domain adaptation network by mutual information estimation and minimization, Neural Networks, 171:C, (353-361), Online publication date: 1-Mar-2024.
  5. Paldino G, Lebichot B, Le Borgne Y, Siblini W, Oblé F, Boracchi G and Bontempi G (2024). The role of diversity and ensemble learning in credit card fraud detection, Advances in Data Analysis and Classification, 18:1, (193-217), Online publication date: 1-Mar-2024.
  6. Huang W, Ye M, Shi Z and Du B (2024). Generalizable Heterogeneous Federated Cross-Correlation and Instance Similarity Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, 46:2, (712-728), Online publication date: 1-Feb-2024.
  7. Hognon C, Conze P, Bourbonne V, Gallinato O, Colin T, Jaouen V and Visvikis D (2024). Contrastive image adaptation for acquisition shift reduction in medical imaging, Artificial Intelligence in Medicine, 148:C, Online publication date: 1-Feb-2024.
  8. Danks N, Ray S and Shmueli G (2024). The Composite Overfit Analysis Framework, Management Science, 70:1, (647-669), Online publication date: 1-Jan-2024.
  9. ACM
    Rajapakse V, Karunanayake I and Ahmed N (2023). Intelligence at the Extreme Edge: A Survey on Reformable TinyML, ACM Computing Surveys, 55:13s, (1-30), Online publication date: 31-Dec-2024.
  10. Yuan J, Ma X, Chen D, Wu F, Lin L and Kuang K (2023). Collaborative Semantic Aggregation and Calibration for Federated Domain Generalization, IEEE Transactions on Knowledge and Data Engineering, 35:12, (12528-12541), Online publication date: 1-Dec-2023.
  11. ACM
    Chien J, Roberts M and Ustun B Algorithmic Censoring in Dynamic Learning Systems Proceedings of the 3rd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization, (1-20)
  12. Li J, Pan Y, Lyu Y, Yao Y, Sui Y and Tsang I (2023). Earning Extra Performance From Restrictive Feedbacks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45:10, (11753-11765), Online publication date: 1-Oct-2023.
  13. Shui C, Pu R, Xu G, Wen J, Zhou F, Gagné C, Ling C and Wang B (2023). Towards More General Loss and Setting in Unsupervised Domain Adaptation, IEEE Transactions on Knowledge and Data Engineering, 35:10, (10140-10150), Online publication date: 1-Oct-2023.
  14. ACM
    Yuan J, Ma X, Xiong R, Gong M, Liu X, Wu F, Lin L and Kuang K (2023). Instrumental Variable-Driven Domain Generalization with Unobserved Confounders, ACM Transactions on Knowledge Discovery from Data, 17:8, (1-21), Online publication date: 30-Sep-2023.
  15. ACM
    Mohseni S, Wang H, Xiao C, Yu Z, Wang Z and Yadawa J (2022). Taxonomy of Machine Learning Safety: A Survey and Primer, ACM Computing Surveys, 55:8, (1-38), Online publication date: 31-Aug-2023.
  16. Biondi N, Pernici F, Bruni M and Del Bimbo A (2023). CoReS: Compatible Representations via Stationarity, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45:8, (9567-9582), Online publication date: 1-Aug-2023.
  17. Wang J, Lan C, Liu C, Ouyang Y, Qin T, Lu W, Chen Y, Zeng W and Yu P (2023). Generalizing to Unseen Domains: A Survey on Domain Generalization, IEEE Transactions on Knowledge and Data Engineering, 35:8, (8052-8072), Online publication date: 1-Aug-2023.
  18. Lee K, Rahman M and Kocaoglu M Finding invariant predictors efficiently via causal structure Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, (1196-1206)
  19. ACM
    Paleyes A, Urma R and Lawrence N (2022). Challenges in Deploying Machine Learning: A Survey of Case Studies, ACM Computing Surveys, 55:6, (1-29), Online publication date: 31-Jul-2023.
  20. Kulinski S and Inouye D Towards explaining distribution shifts Proceedings of the 40th International Conference on Machine Learning, (17931-17952)
  21. Xie B, Li S, Li M, Liu C, Huang G and Wang G (2023). SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45:7, (9004-9021), Online publication date: 1-Jul-2023.
  22. ACM
    Gong T, Kim Y, Orzikulova A, Liu Y, Hwang S, Shin J and Lee S (2023). DAPPER, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 7:2, (1-27), Online publication date: 12-Jun-2023.
  23. ACM
    Alvarez J, Scott K, Berendt B and Ruggieri S Domain Adaptive Decision Trees: Implications for Accuracy and Fairness Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, (423-433)
  24. Lee K, Shrivastava A and Kacorri H (2023). Leveraging Hand-Object Interactions in Assistive Egocentric Vision, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45:6, (6820-6831), Online publication date: 1-Jun-2023.
  25. ACM
    Wu R, Bendeck A, Chu X and He Y (2023). Ground Truth Inference for Weakly Supervised Entity Matching, Proceedings of the ACM on Management of Data, 1:1, (1-28), Online publication date: 26-May-2023.
  26. ACM
    Ji X, Choi H, Sokolsky O and Lee I Incremental Anomaly Detection with Guarantee in the Internet of Medical Things Proceedings of the 8th ACM/IEEE Conference on Internet of Things Design and Implementation, (327-339)
  27. Xu Y, Gao X, Zhang C, Tan J and Li X (2023). High Quality Superpixel Generation Through Regional Decomposition, IEEE Transactions on Circuits and Systems for Video Technology, 33:4, (1802-1815), Online publication date: 1-Apr-2023.
  28. Nakajima S and Sugiyama M (2022). Positive-unlabeled classification under class-prior shift: a prior-invariant approach based on density ratio estimation, Machine Language, 112:3, (889-919), Online publication date: 1-Mar-2023.
  29. ACM
    Vellenga K, Karlsson A, Steinhauer H, Falkman G and Sjogren A Surrogate Deep Learning to Estimate Uncertainties for Driver Intention Recognition Proceedings of the 2023 15th International Conference on Machine Learning and Computing, (252-258)
  30. Chen S, Wang L, Hong Z and Yang X (2023). Domain Generalization by Joint-Product Distribution Alignment, Pattern Recognition, 134:C, Online publication date: 1-Feb-2023.
  31. Poličar P, Stražar M and Zupan B (2021). Embedding to reference t-SNE space addresses batch effects in single-cell classification, Machine Language, 112:2, (721-740), Online publication date: 1-Feb-2023.
  32. Hotvedt M, Grimstad B and Imsland L (2022). Passive learning to address nonstationarity in virtual flow metering applications, Expert Systems with Applications: An International Journal, 210:C, Online publication date: 30-Dec-2022.
  33. Zhang M, Levine S and Finn C MEMO Proceedings of the 36th International Conference on Neural Information Processing Systems, (38629-38642)
  34. Huang X, Lee D, Dobriban E and Hassani H Collaborative learning of discrete distributions under heterogeneity and communication constraints Proceedings of the 36th International Conference on Neural Information Processing Systems, (31915-31928)
  35. ACM
    Naing H, Cai W, Nan H, Tiantian W and Liang Y (2022). Dynamic Data-driven Microscopic Traffic Simulation using Jointly Trained Physics-guided Long Short-Term Memory, ACM Transactions on Modeling and Computer Simulation, 32:4, (1-27), Online publication date: 31-Oct-2022.
  36. ACM
    Davvetas A, Klampanos I, Skiadopoulos S and Karkaletsis V (2022). Evidence Transfer: Learning Improved Representations According to External Heterogeneous Task Outcomes, ACM Transactions on Knowledge Discovery from Data, 16:5, (1-22), Online publication date: 31-Oct-2022.
  37. ACM
    Yuan J, Hou F, Du Y, Shi Z, Geng X, Fan J and Rui Y Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation Proceedings of the 30th ACM International Conference on Multimedia, (3907-3916)
  38. Christiansen R, Pfister N, Jakobsen M, Gnecco N and Peters J (2022). A Causal Framework for Distribution Generalization, IEEE Transactions on Pattern Analysis and Machine Intelligence, 44:10_Part_2, (6614-6630), Online publication date: 1-Oct-2022.
  39. Huang L, Ruan S, Decazes P and Denœux T (2022). Lymphoma segmentation from 3D PET-CT images using a deep evidential network, International Journal of Approximate Reasoning, 149:C, (39-60), Online publication date: 1-Oct-2022.
  40. Esuli A, Moreo A, Sebastiani F and Sperduti G A Concise Overview of LeQua@CLEF 2022: Learning to Quantify Experimental IR Meets Multilinguality, Multimodality, and Interaction, (362-381)
  41. Wang N, Liu T, Wang J, Liu Q, Alibhai S and He X (2022). Locality-based transfer learning on compression autoencoder for efficient scientific data lossy compression, Journal of Network and Computer Applications, 205:C, Online publication date: 1-Sep-2022.
  42. Zhang X, Chan F, Yan C and Bose I (2022). Towards risk-aware artificial intelligence and machine learning systems, Decision Support Systems, 159:C, Online publication date: 1-Aug-2022.
  43. Bogatinovski J, Todorovski L, Džeroski S and Kocev D (2022). Explaining the performance of multilabel classification methods with data set properties, International Journal of Intelligent Systems, 37:9, (6080-6122), Online publication date: 30-Jul-2022.
  44. ACM
    Hullman J, Kapoor S, Nanayakkara P, Gelman A and Narayanan A The Worst of Both Worlds: A Comparative Analysis of Errors in Learning from Data in Psychology and Machine Learning Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, (335-348)
  45. Peng M, Li Z and Juan X (2022). Similarity-based domain adaptation network, Neurocomputing, 493:C, (462-473), Online publication date: 7-Jul-2022.
  46. ACM
    Abebe R, Hardt M, Jin A, Miller J, Schmidt L and Wexler R Adversarial Scrutiny of Evidentiary Statistical Software Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, (1733-1746)
  47. Esuli A, Moreo A and Sebastiani F LeQua@CLEF2022: Learning to Quantify Advances in Information Retrieval, (374-381)
  48. Pérez Maurera F, Ferrari Dacrema M and Cremonesi P An Evaluation Study of Generative Adversarial Networks for Collaborative Filtering Advances in Information Retrieval, (671-685)
  49. Tieppo E, Santos R, Barddal J and Nievola J (2022). Hierarchical classification of data streams: a systematic literature review, Artificial Intelligence Review, 55:4, (3243-3282), Online publication date: 1-Apr-2022.
  50. Fop M, Mattei P, Bouveyron C and Murphy T (2022). Unobserved classes and extra variables in high-dimensional discriminant analysis, Advances in Data Analysis and Classification, 16:1, (55-92), Online publication date: 1-Mar-2022.
  51. ACM
    Moreo A, Esuli A and Sebastiani F (2021). Lost in Transduction: Transductive Transfer Learning in Text Classification, ACM Transactions on Knowledge Discovery from Data, 16:1, (1-21), Online publication date: 28-Feb-2022.
  52. Coma-Puig B and Carmona J (2022). Non-technical losses detection in energy consumption focusing on energy recovery and explainability, Machine Language, 111:2, (487-517), Online publication date: 1-Feb-2022.
  53. Jhaveri R, Revathi A, Ramana K, Raut R, Dhanaraj R and Hakak S (2022). A Review on Machine Learning Strategies for Real-World Engineering Applications, Mobile Information Systems, 2022, Online publication date: 1-Jan-2022.
  54. Yin G, Wang W, Yuan Z, Ji W, Yu D, Sun S, Chua T and Wang C (2022). Conditional Hyper-Network for Blind Super-Resolution With Multiple Degradations, IEEE Transactions on Image Processing, 31, (3949-3960), Online publication date: 1-Jan-2022.
  55. Casimiro M, Garlan D, Cámara J, Rodrigues L and Romano P A Probabilistic Model Checking Approach to Self-adapting Machine Learning Systems Software Engineering and Formal Methods. SEFM 2021 Collocated Workshops, (317-332)
  56. ACM
    Lee K, Sato D, Asakawa S, Asakawa C and Kacorri H Accessing Passersby Proxemic Signals through a Head-Worn Camera: Opportunities and Limitations for the Blind Proceedings of the 23rd International ACM SIGACCESS Conference on Computers and Accessibility, (1-15)
  57. Casimiro M, Romano P, Garlan D, Moreno G, Kang E and Klein M Self-adaptive Machine Learning Systems: Research Challenges and Opportunities Software Architecture, (133-155)
  58. Gurumoorthy K, Jawanpuria P and Mishra B SPOT: A Framework for Selection of Prototypes Using Optimal Transport Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, (535-551)
  59. Bennequin E, Bouvier V, Tami M, Toubhans A and Hudelot C Bridging Few-Shot Learning and Adaptation: New Challenges of Support-Query Shift Machine Learning and Knowledge Discovery in Databases. Research Track, (554-569)
  60. ACM
    Li X and Zhan D FedRS Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, (995-1005)
  61. Cheng Z, Chen C, Chen Z, Fang K and Jin X (2021). Robust and high-order correlation alignment for unsupervised domain adaptation, Neural Computing and Applications, 33:12, (6891-6903), Online publication date: 1-Jun-2021.
  62. Ke L, Wang J, Bhattacharjee T, Boots B and Srinivasa S Grasping with Chopsticks: Combating Covariate Shift in Model-free Imitation Learning for Fine Manipulation 2021 IEEE International Conference on Robotics and Automation (ICRA), (6185-6191)
  63. Xavier Á, Qiu F and Ahmed S (2020). Learning to Solve Large-Scale Security-Constrained Unit Commitment Problems, INFORMS Journal on Computing, 33:2, (739-756), Online publication date: 1-May-2021.
  64. Ramachandra B, Jones M and Vatsavai R (2021). Perceptual metric learning for video anomaly detection, Machine Vision and Applications, 32:3, Online publication date: 1-May-2021.
  65. Hamborg F, Donnay K and Gipp B Towards Target-Dependent Sentiment Classification in News Articles Diversity, Divergence, Dialogue, (156-166)
  66. Zellinger W, Moser B and Saminger-Platz S (2021). On generalization in moment-based domain adaptation, Annals of Mathematics and Artificial Intelligence, 89:3-4, (333-369), Online publication date: 1-Mar-2021.
  67. Luo H and Paal S (2021). Reducing the effect of sample bias for small data sets with double‐weighted support vector transfer regression, Computer-Aided Civil and Infrastructure Engineering, 36:3, (248-263), Online publication date: 15-Feb-2021.
  68. Xu Z, Yang D, Tang J, Tang Y, Yuan T, Wang Y and Xue G (2021). An Actor-Critic-Based Transfer Learning Framework for Experience-Driven Networking, IEEE/ACM Transactions on Networking, 29:1, (360-371), Online publication date: 1-Feb-2021.
  69. Heinze-Deml C and Meinshausen N (2021). Conditional variance penalties and domain shift robustness, Machine Language, 110:2, (303-348), Online publication date: 1-Feb-2021.
  70. Nguyen M, Ngo G and Chen N (2021). Domain-Shift Conditioning Using Adaptable Filtering Via Hierarchical Embeddings for Robust Chinese Spell Check, IEEE/ACM Transactions on Audio, Speech and Language Processing, 29, (2027-2036), Online publication date: 1-Jan-2021.
  71. Liu H, Long M, Wang J and Wang Y Learning to adapt to evolving domains Proceedings of the 34th International Conference on Neural Information Processing Systems, (22338-22348)
  72. Wang X, Long M, Wang J and Jordan M Transferable calibration with lower bias and variance in domain adaptation Proceedings of the 34th International Conference on Neural Information Processing Systems, (19212-19223)
  73. Taori R, Dave A, Shankar V, Carlini N, Recht B and Schmidt L Measuring robustness to natural distribution shifts in image classification Proceedings of the 34th International Conference on Neural Information Processing Systems, (18583-18599)
  74. Goldwasser S, Kalai A, Kalai Y and Montasser O Beyond perturbations Proceedings of the 34th International Conference on Neural Information Processing Systems, (15859-15870)
  75. Oneto L, Donini M, Luise G, Ciliberto C, Maurer A and Pontil M Exploiting MMD and sinkhorn divergences for fair and transferable representation learning Proceedings of the 34th International Conference on Neural Information Processing Systems, (15360-15370)
  76. Jung Y, Tian J and Bareinboim E Learning causal effects via weighted empirical risk minimization Proceedings of the 34th International Conference on Neural Information Processing Systems, (12697-12709)
  77. Fang T, Lu N, Niu G and Sugiyama M Rethinking importance weighting for deep learning under distribution shift Proceedings of the 34th International Conference on Neural Information Processing Systems, (11996-12007)
  78. Jesson A, Mindermann S, Shalit U and Gal Y Identifying causal-effect inference failure with uncertainty-aware models Proceedings of the 34th International Conference on Neural Information Processing Systems, (11637-11649)
  79. Nandy J, Hsu W and Lee M Towards maximizing the representation gap between in-domain & out-of-distribution examples Proceedings of the 34th International Conference on Neural Information Processing Systems, (9239-9250)
  80. Rosenfeld N, Hilgard S, Ravindranath S and Parkes D From predictions to decisions Proceedings of the 34th International Conference on Neural Information Processing Systems, (4115-4126)
  81. ACM
    AlShehhi M and Wang D Machine Learning Pipeline for Reusing Pretrained Models Proceedings of the 12th International Conference on Management of Digital EcoSystems, (72-75)
  82. Souza V, dos Reis D, Maletzke A and Batista G (2020). Challenges in benchmarking stream learning algorithms with real-world data, Data Mining and Knowledge Discovery, 34:6, (1805-1858), Online publication date: 1-Nov-2020.
  83. Jin Q, Ding H, Li L, Huang H, Wang L and Yan J Tackling MeSH Indexing Dataset Shift with Time-Aware Concept Embedding Learning Database Systems for Advanced Applications, (474-488)
  84. Bouvier V, Very P, Chastagnol C, Tami M and Hudelot C Robust Domain Adaptation: Representations, Weights and Inductive Bias Machine Learning and Knowledge Discovery in Databases, (353-377)
  85. Qi L, Khaleel M, Tavanapong W, Sukul A and Peterson D A Framework for Deep Quantification Learning Machine Learning and Knowledge Discovery in Databases, (232-248)
  86. Tan S, Peng X and Saenko K Class-Imbalanced Domain Adaptation: An Empirical Odyssey Computer Vision – ECCV 2020 Workshops, (585-602)
  87. Su P, Wang K, Zeng X, Tang S, Chen D, Qiu D and Wang X Adapting Object Detectors with Conditional Domain Normalization Computer Vision – ECCV 2020, (403-419)
  88. Jin Y, Wang X, Long M and Wang J Minimum Class Confusion for Versatile Domain Adaptation Computer Vision – ECCV 2020, (464-480)
  89. Duan H, Zhao Y, Xiong Y, Liu W and Lin D Omni-Sourced Webly-Supervised Learning for Video Recognition Computer Vision – ECCV 2020, (670-688)
  90. Srivastava M, Hashimoto T and Liang P Robustness to spurious correlations via human annotations Proceedings of the 37th International Conference on Machine Learning, (9109-9119)
  91. Locatello F, Poole B, Rätsch G, Schölkopf B, Bachem O and Tschannen M Weakly-supervised disentanglement without compromises Proceedings of the 37th International Conference on Machine Learning, (6348-6359)
  92. Lakkaraju H, Arsov N and Bastani O Robust and stable black box explanations Proceedings of the 37th International Conference on Machine Learning, (5628-5638)
  93. Kumar A, Ma T and Liang P Understanding self-training for gradual domain adaptation Proceedings of the 37th International Conference on Machine Learning, (5468-5479)
  94. Jiang X, Lao Q, Matwin S and Havaei M Implicit class-conditioned domain alignment for unsupervised domain adaptation Proceedings of the 37th International Conference on Machine Learning, (4816-4827)
  95. Filos A, Tigas P, McAllister R, Rhinehart N, Levine S and Gal Y Can autonomous vehicles identify, recover from, and adapt to distribution shifts? Proceedings of the 37th International Conference on Machine Learning, (3145-3153)
  96. Feldman Y and Indelman V (2020). Spatially-dependent Bayesian semantic perception under model and localization uncertainty, Autonomous Robots, 44:6, (1091-1119), Online publication date: 1-Jul-2020.
  97. ACM
    Fariha A, Tiwari A, Radhakrishna A and Gulwani S ExTuNe: Explaining Tuple Non-conformance Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, (2741-2744)
  98. Chen Z, Chen C, Jin X, Liu Y and Cheng Z (2019). Deep joint two-stream Wasserstein auto-encoder and selective attention alignment for unsupervised domain adaptation, Neural Computing and Applications, 32:11, (7489-7502), Online publication date: 1-Jun-2020.
  99. Fujita H, Matsukawa T and Suzuki E (2019). Detecting outliers with one-class selective transfer machine, Knowledge and Information Systems, 62:5, (1781-1818), Online publication date: 1-May-2020.
  100. ACM
    Zhang J, Li W, Ogunbona P and Xu D (2019). Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition, ACM Computing Surveys, 52:1, (1-38), Online publication date: 31-Jan-2020.
  101. Abdelaty M, Doriguzzi-Corin R and Siracusa D AADS: A Noise-Robust Anomaly Detection Framework for Industrial Control Systems Information and Communications Security, (53-70)
  102. Hanneke S and Kpotufe S On the value of target data in transfer learning Proceedings of the 33rd International Conference on Neural Information Processing Systems, (9871-9881)
  103. Roelofs R, Fridovich-Keil S, Miller J, Shankar V, Hardt M, Recht B and Schmidt L A meta-analysis of overfitting in machine learning Proceedings of the 33rd International Conference on Neural Information Processing Systems, (9179-9189)
  104. Wang X, Jin Y, Long M, Wang J and Jordan M Transferable normalization Proceedings of the 33rd International Conference on Neural Information Processing Systems, (1953-1963)
  105. Cummings M and Stimpson A (2019). Identifying Critical Contextual Design Cues Through a Machine Learning Approach, AI Magazine, 40:4, (28-39), Online publication date: 1-Dec-2019.
  106. ACM
    Li X, Li W, Yang Q, Yan W and Zomaya A Building an Online Defect Detection System for Large-scale Photovoltaic Plants Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, (253-262)
  107. Nakajima S Distortion and Faults in Machine Learning Software Structured Object-Oriented Formal Language and Method, (29-41)
  108. Poličar P, Stražar M and Zupan B Embedding to Reference t-SNE Space Addresses Batch Effects in Single-Cell Classification Discovery Science, (246-260)
  109. Nakajima S and Chen T Generating Biased Dataset for Metamorphic Testing of Machine Learning Programs Testing Software and Systems, (56-64)
  110. Páez A (2019). The Pragmatic Turn in Explainable Artificial Intelligence (XAI), Minds and Machines, 29:3, (441-459), Online publication date: 1-Sep-2019.
  111. Pendlebury F, Pierazzi F, Jordaney R, Kinder J and Cavallaro L TESSERACT Proceedings of the 28th USENIX Conference on Security Symposium, (729-746)
  112. Correa J and Bareinboim E From statistical transportability to estimating the effect of stochastic interventions Proceedings of the 28th International Joint Conference on Artificial Intelligence, (1661-1667)
  113. Tran L, Kossaifi J, Panagakis Y and Pantic M (2019). Disentangling Geometry and Appearance with Regularised Geometry-Aware Generative Adversarial Networks, International Journal of Computer Vision, 127:6-7, (824-844), Online publication date: 1-Jun-2019.
  114. Razzaghi P (2019). Self-taught support vector machines, Knowledge and Information Systems, 59:3, (685-709), Online publication date: 1-Jun-2019.
  115. Lin X, Guo P, Florensa C and Held D Adaptive Variance for Changing Sparse-Reward Environments 2019 International Conference on Robotics and Automation (ICRA), (3210-3216)
  116. ACM
    Israelsen B and Ahmed N (2019). “Dave...I can assure you ...that it’s going to be all right ...” A Definition, Case for, and Survey of Algorithmic Assurances in Human-Autonomy Trust Relationships, ACM Computing Surveys, 51:6, (1-37), Online publication date: 27-Feb-2019.
  117. ACM
    Coston A, Ramamurthy K, Wei D, Varshney K, Speakman S, Mustahsan Z and Chakraborty S Fair Transfer Learning with Missing Protected Attributes Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, (91-98)
  118. Malinin A and Gales M Predictive uncertainty estimation via prior networks Proceedings of the 32nd International Conference on Neural Information Processing Systems, (7047-7058)
  119. Magliacane S, van Ommen T, Claassen T, Bongers S, Versteeg P and Mooij J Domain adaptation by using causal inference to predict invariant conditional distributions Proceedings of the 32nd International Conference on Neural Information Processing Systems, (10869-10879)
  120. Chen I, Johansson F and Sontag D Why is my classifier discriminatory? Proceedings of the 32nd International Conference on Neural Information Processing Systems, (3543-3554)
  121. Long M, Cao Z, Wang J and Jordan M Conditional adversarial domain adaptation Proceedings of the 32nd International Conference on Neural Information Processing Systems, (1647-1657)
  122. ACM
    Liu W, Chang X, Yan Y, Yang Y and Hauptmann A (2018). Few-Shot Text and Image Classification via Analogical Transfer Learning, ACM Transactions on Intelligent Systems and Technology, 9:6, (1-20), Online publication date: 30-Nov-2018.
  123. Silva-Palacios D, Ferri C and Ramirez-Quintana M Adapting Hierarchical Multiclass Classification to Changes in the Target Concept Advances in Artificial Intelligence, (118-127)
  124. Zhu X, Zhou H, Yang C, Shi J and Lin D Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation Computer Vision – ECCV 2018, (587-603)
  125. Landeiro V and Culotta A (2019). Robust text classification under confounding shift, Journal of Artificial Intelligence Research, 63:1, (391-419), Online publication date: 1-Sep-2018.
  126. Aral A and Brandic I Consistency of the Fittest: Towards Dynamic Staleness Control for Edge Data Analytics Euro-Par 2018: Parallel Processing Workshops, (40-52)
  127. ACM
    Veale M, Van Kleek M and Binns R Fairness and Accountability Design Needs for Algorithmic Support in High-Stakes Public Sector Decision-Making Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, (1-14)
  128. Rojas-Carulla M, Schölkopf B, Turner R and Peters J (2018). Invariant models for causal transfer learning, The Journal of Machine Learning Research, 19:1, (1309-1342), Online publication date: 1-Jan-2018.
  129. Zuo H, Zhang G, Pedrycz W, Behbood V and Lu J (2017). Fuzzy Regression Transfer Learning in Takagi–Sugeno Fuzzy Models, IEEE Transactions on Fuzzy Systems, 25:6, (1795-1807), Online publication date: 1-Dec-2017.
  130. Shi Y and Knoblock C Learning with previously unseen features Proceedings of the 26th International Joint Conference on Artificial Intelligence, (2722-2729)
  131. Long M, Zhu H, Wang J and Jordan M Deep transfer learning with joint adaptation networks Proceedings of the 34th International Conference on Machine Learning - Volume 70, (2208-2217)
  132. Sechidis K, Sperrin M, Petherick E, Lujn M and Brown G (2017). Dealing with under-reported variables, International Journal of Approximate Reasoning, 85:C, (159-177), Online publication date: 1-Jun-2017.
  133. ACM
    Marcheggiani D and Sebastiani F (2017). On the Effects of Low-Quality Training Data on Information Extraction from Clinical Reports, Journal of Data and Information Quality, 9:1, (1-25), Online publication date: 31-Mar-2017.
  134. Pérez-Gállego P, Quevedo J and del Coz J (2017). Using ensembles for problems with characterizable changes in data distribution, Information Fusion, 34:C, (87-100), Online publication date: 1-Mar-2017.
  135. Alfeld S, Zhu X and Barford P Explicit defense actions against test-set attacks Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, (1274-1280)
  136. Tasche D (2017). Fisher consistency for prior probability shift, The Journal of Machine Learning Research, 18:1, (3338-3369), Online publication date: 1-Jan-2017.
  137. ACM
    Saha S, Banerjee B and Merchant S Unsupervised domain adaptation without source domain training samples Proceedings of the Tenth Indian Conference on Computer Vision, Graphics and Image Processing, (1-8)
  138. Steinhardt J and Liang P Unsupervised risk estimation using only conditional independence structure Proceedings of the 30th International Conference on Neural Information Processing Systems, (3664-3672)
  139. Niu G, du Plessis M, Sakai T, Ma Y and Sugiyama M Theoretical comparisons of positive-unlabeled learning against positive-negative learning Proceedings of the 30th International Conference on Neural Information Processing Systems, (1207-1215)
  140. ACM
    Ghassemi M, Sarwate A and Wright R Differentially Private Online Active Learning with Applications to Anomaly Detection Proceedings of the 2016 ACM Workshop on Artificial Intelligence and Security, (117-128)
  141. ACM
    Natarajan A, Angarita G, Gaiser E, Malison R, Ganesan D and Marlin B Domain adaptation methods for improving lab-to-field generalization of cocaine detection using wearable ECG Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, (875-885)
  142. ACM
    Wang J, Wang S, Cui Q and Wang Q Local-based active classification of test report to assist crowdsourced testing Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, (190-201)
  143. ACM
    Ribeiro M, Singh S and Guestrin C "Why Should I Trust You?" Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (1135-1144)
  144. Webb G, Hyde R, Cao H, Nguyen H and Petitjean F (2016). Characterizing concept drift, Data Mining and Knowledge Discovery, 30:4, (964-994), Online publication date: 1-Jul-2016.
  145. Fernández A, Elkano M, Galar M, Sanz J, Alshomrani S, Bustince H and Herrera F (2016). Enhancing evolutionary fuzzy systems for multi-class problems, International Journal of Approximate Reasoning, 73:C, (108-122), Online publication date: 1-Jun-2016.
  146. ACM
    Minkov E (2015). Event Extraction using Structured Learning and Rich Domain Knowledge, ACM Transactions on Intelligent Systems and Technology, 7:2, (1-34), Online publication date: 22-Jan-2016.
  147. Behbood V, Lu J, Zhang G and Pedrycz W (2015). Multistep Fuzzy Bridged Refinement Domain Adaptation Algorithm and Its Application to Bank Failure Prediction, IEEE Transactions on Fuzzy Systems, 23:6, (1917-1935), Online publication date: 1-Dec-2015.
  148. Ditzler G, Roveri M, Alippi C and Polikar R (2015). Learning in Nonstationary Environments: A Survey, IEEE Computational Intelligence Magazine, 10:4, (12-25), Online publication date: 1-Nov-2015.
  149. Vilalta R, Gupta K and Mahabal A Star classification under data variability Proceedings of the 2015th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part III, (241-244)
  150. Pozzolo A, Caelen O and Bontempi G When is undersampling effective in unbalanced classification tasks? Proceedings of the 2015th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I, (200-215)
  151. Raykar V and Saha A Data split strategies for evolving predictive models Proceedings of the 2015th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I, (3-19)
  152. Wen J, Greiner R and Schuurmans D Correcting covariate shift with the Frank-Wolfe algorithm Proceedings of the 24th International Conference on Artificial Intelligence, (1010-1016)
  153. ACM
    Esuli A and Sebastiani F (2015). Optimizing Text Quantifiers for Multivariate Loss Functions, ACM Transactions on Knowledge Discovery from Data, 9:4, (1-27), Online publication date: 1-Jun-2015.
  154. Gopalan R, Li R, Patel V and Chellappa R (2015). Domain Adaptation for Visual Recognition, Foundations and Trends® in Computer Graphics and Vision, 8:4, (285-378), Online publication date: 1-Mar-2015.
  155. Raza H, Prasad G and Li Y (2015). EWMA model based shift-detection methods for detecting covariate shifts in non-stationary environments, Pattern Recognition, 48:3, (659-669), Online publication date: 1-Mar-2015.
  156. Barranquero J, Díez J and José del Coz J (2015). Quantification-oriented learning based on reliable classifiers, Pattern Recognition, 48:2, (591-604), Online publication date: 1-Feb-2015.
  157. López V, del Río S, Benítez J and Herrera F (2015). Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data, Fuzzy Sets and Systems, 258:C, (5-38), Online publication date: 1-Jan-2015.
  158. ACM
    Vahdat A, Atwater A, McIntyre A and Heywood M On the application of GP to streaming data classification tasks with label budgets Proceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation, (1287-1294)
  159. Krell M, Feess D and Straube S (2014). Balanced Relative Margin Machine - The missing piece between FDA and SVM classification, Pattern Recognition Letters, 41:C, (43-52), Online publication date: 1-May-2014.
  160. Sebastiani F Text Quantification Proceedings of the 36th European Conference on IR Research on Advances in Information Retrieval - Volume 8416, (819-822)
  161. ACM
    Xia C, Schwartz R, Xie K, Krebs A, Langdon A, Ting J and Naaman M CityBeat Proceedings of the 23rd International Conference on World Wide Web, (167-170)
  162. ACM
    Freitas A (2014). Comprehensible classification models, ACM SIGKDD Explorations Newsletter, 15:1, (1-10), Online publication date: 17-Mar-2014.
  163. ACM
    Gao W and Yang P Democracy is good for ranking Proceedings of the 7th ACM international conference on Web search and data mining, (63-72)
  164. ACM
    Aran O and Gatica-Perez D Cross-domain personality prediction Proceedings of the 15th ACM on International conference on multimodal interaction, (127-130)
  165. Sugiyama M, Yamada M and du Plessis M (2013). Learning under nonstationarity, WIREs Computational Statistics, 5:6, (465-477), Online publication date: 1-Nov-2013.
  166. ACM
    Zhang C, Zhang Y, Wang S, Pang J, Liang C, Huang Q and Tian Q Undo the codebook bias by linear transformation for visual applications Proceedings of the 21st ACM international conference on Multimedia, (533-536)
  167. ACM
    Pan S, Toh Z and Su J (2013). Transfer joint embedding for cross-domain named entity recognition, ACM Transactions on Information Systems, 31:2, (1-27), Online publication date: 1-May-2013.
  168. Moreno-Torres J, Llorí X, Goldberg D and Bhargava R (2013). Repairing fractures between data using genetic programming-based feature extraction, Information Sciences: an International Journal, 222, (805-823), Online publication date: 1-Feb-2013.
  169. Hofer V and Krempl G (2013). Drift mining in data, Computational Statistics & Data Analysis, 57:1, (377-391), Online publication date: 1-Jan-2013.
  170. Guan Z and Zhu T An overview of transfer learning and computational cyberpsychology Proceedings of the 2012 international conference on Pervasive Computing and the Networked World, (209-215)
  171. Morvant E, Habrard A and Ayache S (2012). Parsimonious unsupervised and semi-supervised domain adaptation with good similarity functions, Knowledge and Information Systems, 33:2, (309-349), Online publication date: 1-Nov-2012.
  172. Hoffman J, Kulis B, Darrell T and Saenko K Discovering Latent Domains for Multisource Domain Adaptation Proceedings, Part II, of the 12th European Conference on Computer Vision --- ECCV 2012 - Volume 7573, (702-715)
  173. Khosla A, Zhou T, Malisiewicz T, Efros A and Torralba A Undoing the damage of dataset bias Proceedings of the 12th European conference on Computer Vision - Volume Part I, (158-171)
  174. Calandra R, Raiko T, Deisenroth M and Pouzols F Learning deep belief networks from non-stationary streams Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II, (379-386)
  175. López V, Fernández A, Moreno-Torres J and Herrera F (2012). Analysis of preprocessing vs. cost-sensitive learning for imbalanced classification. Open problems on intrinsic data characteristics, Expert Systems with Applications: An International Journal, 39:7, (6585-6608), Online publication date: 1-Jun-2012.
  176. Hong J, Chen B and Yin J Transfer learning with local smoothness regularizer Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications, (521-528)
  177. Hachiya H, Sugiyama M and Ueda N (2012). Importance-weighted least-squares probabilistic classifier for covariate shift adaptation with application to human activity recognition, Neurocomputing, 80:C, (93-101), Online publication date: 15-Mar-2012.
  178. Moreno-Torres J, Raeder T, Alaiz-RodríGuez R, Chawla N and Herrera F (2012). A unifying view on dataset shift in classification, Pattern Recognition, 45:1, (521-530), Online publication date: 1-Jan-2012.
  179. ACM
    Prettenhofer P and Stein B (2011). Cross-Lingual Adaptation Using Structural Correspondence Learning, ACM Transactions on Intelligent Systems and Technology, 3:1, (1-22), Online publication date: 1-Oct-2011.
  180. Morvant E, Habrard A and Ayache S On the usefulness of similarity based projection spaces for transfer learning Proceedings of the First international conference on Similarity-based pattern recognition, (1-16)
  181. ACM
    Cai P, Gao W, Zhou A and Wong K Relevant knowledge helps in choosing right teacher Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, (115-124)
  182. Fernández A, García S and Herrera F Addressing the classification with imbalanced data Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part I, (1-10)
  183. Abbasian H, Drummond C, Japkowicz N and Matwin S Robustness of classifiers to changing environments Proceedings of the 23rd Canadian conference on Advances in Artificial Intelligence, (232-243)
  184. Baehrens D, Schroeter T, Harmeling S, Kawanabe M, Hansen K and Müller K (2010). How to Explain Individual Classification Decisions, The Journal of Machine Learning Research, 11, (1803-1831), Online publication date: 1-Mar-2010.
  185. Cornuéjols A On-line learning Ubiquitous knowledge discovery, (129-147)
  186. Cornuéjols A On-line learning Ubiquitous knowledge discovery, (129-147)
  187. Kanamori T, Hido S and Sugiyama M (2009). A Least-squares Approach to Direct Importance Estimation, The Journal of Machine Learning Research, 10, (1391-1445), Online publication date: 1-Dec-2009.
  188. Tadipatri V, Gowreesunker B, Tewfik A, Ince N, Ashe J and Pellizzer G Spatial proximity based subspace decomposition for movement direction decoding of local field potentials Proceedings of the 43rd Asilomar conference on Signals, systems and computers, (1090-1093)
  189. Fernandez A, Galar M, Sanz J, Bustince H, Cordon O and Herrera F On the impact of Distance-based Relative Competence Weighting approach in One-vs-One classification for Evolutionary Fuzzy Systems: DRCW-FH-GBML algorithm 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), (1-7)
Contributors
  • Microsoft Corporation
  • University of Cambridge

Index Terms

  1. Dataset Shift in Machine Learning

      Recommendations