skip to main content
Skip header Section
Bayesian Learning for Neural NetworksJanuary 1996
Publisher:
  • Springer-Verlag
  • Berlin, Heidelberg
ISBN:978-0-387-94724-2
Published:01 January 1996
Pages:
183
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

From the Publisher:

Artificial "neural networks" are now widely used as flexible models for regression classification applications, but questions remain regarding what these models mean, and how they can safely be used when training data is limited. Bayesian Learning for Neural Networks shows that Bayesian methods allow complex neural network models to be used without fear of the "overfitting" that can occur with traditional neural network learning methods. Insight into the nature of these complex Bayesian models is provided by a theoretical investigation of the priors over functions that underlie them. Use of these models in practice is made possible using Markov chain Monte Carlo techniques. Both the theoretical and computational aspects of this work are of wider statistical interest, as they contribute to a better understanding of how Bayesian methods can be applied to complex problems. Presupposing only the basic knowledge of probability and statistics, this book should be of interest to many researchers in statistics, engineering, and artificial intelligence. Software for Unix systems that implements the methods described is freely available over the Internet.

Cited By

  1. Ott K, Tiemann M, Hennig P and Briol F Bayesian numerical integration with neural networks Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, (1606-1617)
  2. ACM
    Soldani J and Brogi A (2022). Anomaly Detection and Failure Root Cause Analysis in (Micro) Service-Based Cloud Applications: A Survey, ACM Computing Surveys, 55:3, (1-39), Online publication date: 30-Apr-2023.
  3. Wild V, Hu R and Sejdinovic D Generalized variational inference in function spaces Proceedings of the 36th International Conference on Neural Information Processing Systems, (3716-3730)
  4. Hubin A, Storvik G and Frommlet F (2021). Flexible Bayesian Nonlinear Model Configuration, Journal of Artificial Intelligence Research, 72, (901-942), Online publication date: 4-Jan-2022.
  5. Cobb A, Jalaian B, Bastian N and Russell S Robust decision-making in the internet of battlefield things using bayesian neural networks Proceedings of the Winter Simulation Conference, (1-12)
  6. Roy A (2021). Multivariate Gaussian RBF‐net for smooth function estimation and variable selection, Statistical Analysis and Data Mining, 14:5, (484-500), Online publication date: 16-Sep-2021.
  7. ACM
    Selitskiy S, Christou N and Selitskaya N Isolating Uncertainty of the Face Expression Recognition with the Meta-Learning Supervisor Neural Network 2021 5th International Conference on Artificial Intelligence and Virtual Reality (AIVR), (104-112)
  8. Alarab I, Prakoonwit S and Nacer M (2021). Illustrative Discussion of MC-Dropout in General Dataset: Uncertainty Estimation in Bitcoin, Neural Processing Letters, 53:2, (1001-1011), Online publication date: 1-Apr-2021.
  9. Zhou Y and Cheung Y (2020). Bayesian Low-Tubal-Rank Robust Tensor Factorization with Multi-Rank Determination, IEEE Transactions on Pattern Analysis and Machine Intelligence, 43:1, (62-76), Online publication date: 1-Jan-2021.
  10. Hu S, Xie X, Liu S, Yu J, Ye Z, Geng M, Liu X and Meng H (2021). Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, 29, (1514-1529), Online publication date: 1-Jan-2021.
  11. Liu W, Qiao W and Liu X (2021). Discovering the realistic paths towards the realization of patent valuation from technical perspectives: defense, implementation or transfer, Neural Computing and Applications, 33:2, (577-590), Online publication date: 1-Jan-2021.
  12. Rossi S, Marmin S and Filippone M Walsh-hadamard variational inference for Bayesian deep learning Proceedings of the 34th International Conference on Neural Information Processing Systems, (9674-9686)
  13. Costante G and Mancini M (2020). Uncertainty Estimation for Data-Driven Visual Odometry, IEEE Transactions on Robotics, 36:6, (1738-1757), Online publication date: 1-Dec-2020.
  14. Ranjan V, Wang B, Shah M and Hoai M Uncertainty Estimation and Sample Selection for Crowd Counting Computer Vision – ACCV 2020, (375-391)
  15. Ali A, Testa M, Bianchi T and Magli E BioMetricNet: Deep Unconstrained Face Verification Through Learning of Metrics Regularized onto Gaussian Distributions Computer Vision – ECCV 2020, (133-149)
  16. Li Y, Zeng X, Gao Z, Lin L, Tao J, Han J, Cheng X, Tahoori M and Zeng X Exploring a bayesian optimization framework compatible with digital standard flow for soft-error-tolerant circuit Proceedings of the 57th ACM/EDAC/IEEE Design Automation Conference, (1-6)
  17. Ajčević M, Miladinović A, Silveri G, Furlanis G, Cilotto T, Stella A, Caruso P, Ukmar M, Naccarato M, Cuzzocrea A, Manganotti P and Accardo A A Big-Data Variational Bayesian Framework for Supporting the Prediction of Functional Outcomes in Wake-Up Stroke Patients Computational Science and Its Applications – ICCSA 2020, (992-1002)
  18. De Sousa Ribeiro F, Calivá F, Swainson M, Gudmundsson K, Leontidis G and Kollias S (2019). Deep Bayesian Self-Training, Neural Computing and Applications, 32:9, (4275-4291), Online publication date: 1-May-2020.
  19. Shaker M and Hüllermeier E Aleatoric and Epistemic Uncertainty with Random Forests Advances in Intelligent Data Analysis XVIII, (444-456)
  20. Chen Z, Wang B and Gorban A (2019). Multivariate Gaussian and Student-t process regression for multi-output prediction, Neural Computing and Applications, 32:8, (3005-3028), Online publication date: 1-Apr-2020.
  21. Yin T and Zhu H (2019). An efficient algorithm for architecture design of Bayesian neural network in structural model updating, Computer-Aided Civil and Infrastructure Engineering, 35:4, (354-372), Online publication date: 4-Mar-2020.
  22. Hernández S, Vergara D, Valdenegro-Toro M and Jorquera F (2019). Improving predictive uncertainty estimation using Dropout–Hamiltonian Monte Carlo, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 24:6, (4307-4322), Online publication date: 1-Mar-2020.
  23. Binois M, Ginsbourger D and Roustant O (2019). On the choice of the low-dimensional domain for global optimization via random embeddings, Journal of Global Optimization, 76:1, (69-90), Online publication date: 1-Jan-2020.
  24. Jerfel G, Grant E, Griffiths T and Heller K Reconciling meta-learning and continual learning with online mixtures of tasks Proceedings of the 33rd International Conference on Neural Information Processing Systems, (9122-9133)
  25. Nemeth C, Lindsten F, Filippone M and Hensman J Pseudo-extended Markov chain Monte Carlo Proceedings of the 33rd International Conference on Neural Information Processing Systems, (4312-4322)
  26. Khan M, Immer A, Abedi E and Korzepa M Approximate inference turns deep networks into Gaussian processes Proceedings of the 33rd International Conference on Neural Information Processing Systems, (3094-3104)
  27. Wan R, Zhong M, Xiong H and Zhu Z Neural Control Variates for Monte Carlo Variance Reduction Machine Learning and Knowledge Discovery in Databases, (533-547)
  28. Xie H, Li C, Xu R and Mengersen K Robust Kernelized Bayesian Matrix Factorization for Video Background/Foreground Separation Machine Learning, Optimization, and Data Science, (484-495)
  29. Li L, Yan J, Yang X and Jin Y Learning interpretable deep state space model for probabilistic time series forecasting Proceedings of the 28th International Joint Conference on Artificial Intelligence, (2901-2908)
  30. Yang Y and Perdikaris P (2019). Conditional deep surrogate models for stochastic, high-dimensional, and multi-fidelity systems, Computational Mechanics, 64:2, (417-434), Online publication date: 1-Aug-2019.
  31. Raissi M, Babaee H and Karniadakis G (2019). Parametric Gaussian process regression for big data, Computational Mechanics, 64:2, (409-416), Online publication date: 1-Aug-2019.
  32. Sonoda S and Murata N (2019). Transport analysis of infinitely deep neural network, The Journal of Machine Learning Research, 20:1, (31-82), Online publication date: 1-Jan-2019.
  33. Osband I, Aslanides J and Cassirer A Randomized prior functions for deep reinforcement learning Proceedings of the 32nd International Conference on Neural Information Processing Systems, (8626-8638)
  34. Jacot A, Gabriel F and Hongler C Neural tangent kernel Proceedings of the 32nd International Conference on Neural Information Processing Systems, (8580-8589)
  35. Shalev G, Adi Y and Keshet J Out-of-distribution detection using multiple semantic label representations Proceedings of the 32nd International Conference on Neural Information Processing Systems, (7386-7396)
  36. Malinin A and Gales M Predictive uncertainty estimation via prior networks Proceedings of the 32nd International Conference on Neural Information Processing Systems, (7047-7058)
  37. Perrone V, Jenatton R, Seeger M and Archambeau C Scalable hyperparameter transfer learning Proceedings of the 32nd International Conference on Neural Information Processing Systems, (6846-6856)
  38. Wang Z, Kim B and Kaelbling L Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior Proceedings of the 32nd International Conference on Neural Information Processing Systems, (10498-10509)
  39. Mangoubi O and Vishnoi N Dimensionally tight bounds for second-order Hamiltonian Monte Carlo Proceedings of the 32nd International Conference on Neural Information Processing Systems, (6030-6040)
  40. Jungo A, Meier R, Ermis E, Blatti-Moreno M, Herrmann E, Wiest R and Reyes M On the Effect of Inter-observer Variability for a Reliable Estimation of Uncertainty of Medical Image Segmentation Medical Image Computing and Computer Assisted Intervention – MICCAI 2018, (682-690)
  41. Domingues R, Michiardi P, Zouaoui J and Filippone M (2018). Deep Gaussian Process autoencoders for novelty detection, Machine Language, 107:8-10, (1363-1383), Online publication date: 1-Sep-2018.
  42. ACM
    Lee Y and Vempala S Convergence rate of Riemannian Hamiltonian Monte Carlo and faster polytope volume computation Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, (1115-1121)
  43. Kassani P, Teoh A and Kim E (2018). Sparse pseudoinverse incremental extreme learning machine, Neurocomputing, 287:C, (128-142), Online publication date: 26-Apr-2018.
  44. Zhang N, Ding S, Zhang J and Xue Y (2018). An overview on Restricted Boltzmann Machines, Neurocomputing, 275:C, (1186-1199), Online publication date: 31-Jan-2018.
  45. Lakshminarayanan B, Pritzel A and Blundell C Simple and scalable predictive uncertainty estimation using deep ensembles Proceedings of the 31st International Conference on Neural Information Processing Systems, (6405-6416)
  46. Jang P, Loeb A, Davidow M and Wilson A Scalable lévy process priors for spectral kernel learning Proceedings of the 31st International Conference on Neural Information Processing Systems, (3943-3952)
  47. Song G, Wang S, Huang Q and Tian Q (2017). Multimodal Similarity Gaussian Process Latent Variable Model, IEEE Transactions on Image Processing, 26:9, (4168-4181), Online publication date: 1-Sep-2017.
  48. Fu C, Zhang P, Jiang J, Yang K and Lv Z (2017). A Bayesian approach for sleep and wake classification based on dynamic time warping method, Multimedia Tools and Applications, 76:17, (17765-17784), Online publication date: 1-Sep-2017.
  49. Wang Z and Jegelka S Max-value entropy search for efficient Bayesian Optimization Proceedings of the 34th International Conference on Machine Learning - Volume 70, (3627-3635)
  50. Tripuraneni N, Rowland M, Ghahramani Z and Turner R Magnetic Hamiltonian Monte Carlo Proceedings of the 34th International Conference on Machine Learning - Volume 70, (3453-3461)
  51. Pakman A, Gilboa D, Carlson D and Paninski L Stochastic Bouncy Particle Sampler Proceedings of the 34th International Conference on Machine Learning - Volume 70, (2741-2750)
  52. Molchanov D, Ashukha A and Vetrov D Variational dropout sparsifies deep neural networks Proceedings of the 34th International Conference on Machine Learning - Volume 70, (2498-2507)
  53. Cutajar K, Bonilla E, Michiardi P and Filippone M Random feature expansions for Deep Gaussian Processes Proceedings of the 34th International Conference on Machine Learning - Volume 70, (884-893)
  54. Rath J, Hutton P, Chen L and Roy S (2017). A hybrid empirical-Bayesian artificial neural network model of salinity in the San Francisco Bay-Delta estuary, Environmental Modelling & Software, 93:C, (193-208), Online publication date: 1-Jul-2017.
  55. Kotera J, Smidl V and Sroubek F (2017). Blind Deconvolution With Model Discrepancies, IEEE Transactions on Image Processing, 26:5, (2533-2544), Online publication date: 1-May-2017.
  56. Andersen M, Vehtari A, Winther O and Hansen L (2017). Bayesian inference for spatio-temporal spike-and-slab priors, The Journal of Machine Learning Research, 18:1, (5076-5133), Online publication date: 1-Jan-2017.
  57. Daniely A, Frostig R and Singer Y Toward deeper understanding of neural networks Proceedings of the 30th International Conference on Neural Information Processing Systems, (2261-2269)
  58. Robles-Kelly A Least-Squares Regression with Unitary Constraints for Network Behaviour Classification Structural, Syntactic, and Statistical Pattern Recognition, (26-36)
  59. ACM
    Sasaka Y, Ogawa T and Haseyama M Multimodal Interest Level Estimation via Variational Bayesian Mixture of Robust CCA Proceedings of the 24th ACM international conference on Multimedia, (387-391)
  60. Tripathy R, Bilionis I and Gonzalez M (2016). Gaussian processes with built-in dimensionality reduction, Journal of Computational Physics, 321:C, (191-223), Online publication date: 15-Sep-2016.
  61. Zhang Y, Henao R, Li C and Carin L Bayesian dictionary learning with Gaussian processes and sigmoid belief networks Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, (2364-2370)
  62. Gao T and Jojic V Degrees of freedom in deep neural networks Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, (232-241)
  63. Borrotti M, Pievatolo A, Critelli I, Degiorgi A and Colledani M (2016). A computer-aided methodology for the optimization of electrostatic separation processes in recycling, Applied Stochastic Models in Business and Industry, 32:1, (133-148), Online publication date: 1-Jan-2016.
  64. Soh H and Demiris Y (2015). Learning assistance by demonstration, Journal of Human-Robot Interaction, 4:3, (76-100), Online publication date: 6-Dec-2015.
  65. Altmann Y, Wallace A and McLaughlin S (2015). Spectral Unmixing of Multispectral Lidar Signals, IEEE Transactions on Signal Processing, 63:20, (5525-5534), Online publication date: 1-Oct-2015.
  66. Stulp F and Sigaud O (2015). Many regression algorithms, one unified model, Neural Networks, 69:C, (60-79), Online publication date: 1-Sep-2015.
  67. Mahani A and Sharabiani M (2015). SIMD parallel MCMC sampling with applications for big-data Bayesian analytics, Computational Statistics & Data Analysis, 88:C, (75-99), Online publication date: 1-Aug-2015.
  68. Young J, Mendelson A, Cardoso M, Modat M, Ashburner J and Ourselin S Improving MRI Brain Image Classification with Anatomical Regional Kernels Revised Selected Papers of the First International Workshop on Machine Learning Meets Medical Imaging - Volume 9487, (45-53)
  69. ACM
    Zhou J and Tung A SMiLer Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, (1871-1886)
  70. Guizilini V and Ramos F (2015). Online self-supervised learning for dynamic object segmentation, International Journal of Robotics Research, 34:4-5, (559-581), Online publication date: 1-Apr-2015.
  71. Schmidhuber J (2015). Deep learning in neural networks, Neural Networks, 61:C, (85-117), Online publication date: 1-Jan-2015.
  72. Kocadağlı O and Aşıkgil B (2014). Nonlinear time series forecasting with Bayesian neural networks, Expert Systems with Applications: An International Journal, 41:15, (6596-6610), Online publication date: 1-Nov-2014.
  73. Lu W and Wang D (2014). Learning machines, Applied Soft Computing, 24:C, (135-141), Online publication date: 1-Nov-2014.
  74. ACM
    Song L, Minku L and Yao X The potential benefit of relevance vector machine to software effort estimation Proceedings of the 10th International Conference on Predictive Models in Software Engineering, (52-61)
  75. Srivastava N, Hinton G, Krizhevsky A, Sutskever I and Salakhutdinov R (2014). Dropout, The Journal of Machine Learning Research, 15:1, (1929-1958), Online publication date: 1-Jan-2014.
  76. Ayhan M, Benton R, Raghavan V and Choubey S Composite Kernels for Automatic Relevance Determination in Computerized Diagnosis of Alzheimer's Disease Proceedings of the International Conference on Brain and Health Informatics - Volume 8211, (126-137)
  77. Priam R, Nadif M and Govaert G Gaussian Topographic Co-clustering Model Proceedings of the 12th International Symposium on Advances in Intelligent Data Analysis XII - Volume 8207, (345-356)
  78. Guizilini V and Ramos F (2013). Semi-parametric learning for visual odometry, International Journal of Robotics Research, 32:5, (526-546), Online publication date: 1-Apr-2013.
  79. Chen Z (2013). An overview of bayesian methods for neural spike train analysis, Computational Intelligence and Neuroscience, 2013, (1-1), Online publication date: 1-Jan-2013.
  80. Tasoulas E, Haugerud H and Begnum K Bayllocator Proceedings of the 26th international conference on Large Installation System Administration: strategies, tools, and techniques, (111-122)
  81. Hermans M and Schrauwen B Infinite sparse threshold unit networks Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I, (612-619)
  82. Chakraborty S, Ghosh M and Mallick B (2012). Bayesian nonlinear regression for large p small n problems, Journal of Multivariate Analysis, 108, (28-40), Online publication date: 1-Jul-2012.
  83. Zhao X and Cheung L (2011). Multiclass Kernel-Imbedded Gaussian Processes for Microarray Data Analysis, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 8:4, (1041-1053), Online publication date: 1-Jul-2011.
  84. Schaffernicht E and Gross H Weighted mutual information for feature selection Proceedings of the 21st international conference on Artificial neural networks - Volume Part II, (181-188)
  85. Dahl G, Ranzato M, Mohamed A and Hinton G Phone recognition with the mean-covariance restricted Boltzmann Machine Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 1, (469-477)
  86. Agakov F, McKeigue P, Krohn J and Storkey A Sparse Instrumental Variables (SPIV) for genome-wide studies Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 1, (28-36)
  87. Sun Y, Gomez F and Schmidhuber J Improving the asymptotic performance of Markov chain Monte-Carlo by inserting vortices Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2, (2235-2243)
  88. Ranzato M, Mnih V and Hinton G Generating more realistic images using gated MRF's Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2, (2002-2010)
  89. Navarro D Learning the context of a category Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2, (1795-1803)
  90. Nakajima S, Sugiyama M and Tomioka R Global analytic solution for variational Bayesian matrix factorization Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2, (1768-1776)
  91. Shen L, Qi Y, Kim S, Nho K, Wan J, Risacher S and Saykin A Sparse bayesian learning for identifying imaging biomarkers in AD prediction Proceedings of the 13th international conference on Medical image computing and computer-assisted intervention: Part III, (611-618)
  92. Viinikanoja J, Klami A and Kaski S Variational Bayesian mixture of robust CCA models Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III, (370-385)
  93. Viinikanoja J, Klami A and Kaski S Variational Bayesian mixture of robust CCA models Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III, (370-385)
  94. Viinikanoja J, Klami A and Kaski S Variational Bayesian mixture of robust CCA models Proceedings of the 2010th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part III, (370-385)
  95. Tzikas D and Likas A An incremental Bayesian approach for training multilayer perceptrons Proceedings of the 20th international conference on Artificial neural networks: Part I, (87-96)
  96. Lee M and Chen T Visualising intellectual structure of ubiquitous computing Proceedings of the 11th international conference on Knowledge management and acquisition for smart systems and services, (261-272)
  97. Drugan M and Thierens D (2010). Geometrical recombination operators for real-coded evolutionary mcmcs, Evolutionary Computation, 18:2, (157-198), Online publication date: 1-Jun-2010.
  98. Fan Y, Kaufer D and Shen D Joint estimation of multiple clinical variables of neurological diseases from imaging patterns Proceedings of the 2010 IEEE international conference on Biomedical imaging: from nano to Macro, (852-855)
  99. Titterington M (2009). Neural networks, WIREs Computational Statistics, 2:1, (1-8), Online publication date: 19-Jan-2010.
  100. Shahbaba B and Neal R (2009). Nonlinear Models Using Dirichlet Process Mixtures, The Journal of Machine Learning Research, 10, (1829-1850), Online publication date: 1-Dec-2009.
  101. Muramatsu D and Matsumoto T Online signature verification algorithm with a user-specific global-parameter fusion model Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics, (486-491)
  102. Berrones A (2009). Characterization of the convergence of stationary Fokker-Planck learning, Neurocomputing, 72:16-18, (3602-3608), Online publication date: 1-Oct-2009.
  103. Shimokawa T, Suzuki K, Misawa T and Okano Y (2009). Predicting investment behavior, Neurocomputing, 72:16-18, (3447-3461), Online publication date: 1-Oct-2009.
  104. Chakraborty S (2009). Bayesian binary kernel probit model for microarray based cancer classification and gene selection, Computational Statistics & Data Analysis, 53:12, (4198-4209), Online publication date: 1-Oct-2009.
  105. Chang X, Zheng Q and Lin P Ordinal regression with sparse Bayesian Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications, (591-599)
  106. Lima C, Coelho A and Chagas S (2009). Automatic EEG signal classification for epilepsy diagnosis with Relevance Vector Machines, Expert Systems with Applications: An International Journal, 36:6, (10054-10059), Online publication date: 1-Aug-2009.
  107. ACM
    Zhu J, Xing E and Zhang B Primal sparse Max-margin Markov networks Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, (1047-1056)
  108. ACM
    Foo C, Do C and Ng A A majorization-minimization algorithm for (multiple) hyperparameter learning Proceedings of the 26th Annual International Conference on Machine Learning, (321-328)
  109. O'Callaghan S, Ramos F and Durrant-Whyte H Contextual occupancy maps using Gaussian processes Proceedings of the 2009 IEEE international conference on Robotics and Automation, (3630-3636)
  110. Yuan J, Liu C, Liu X, Wang K and Yu T (2009). Incorporating prior model into Gaussian processes regression for WEDM process modeling, Expert Systems with Applications: An International Journal, 36:4, (8084-8092), Online publication date: 1-May-2009.
  111. Chen T and Ren J (2009). Bagging for Gaussian process regression, Neurocomputing, 72:7-9, (1605-1610), Online publication date: 1-Mar-2009.
  112. Srivastava A and Das S (2009). Detection and prognostics on low-dimensional systems, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 39:1, (44-54), Online publication date: 1-Jan-2009.
  113. Ren Y, Ding Y and Liang F (2008). Adaptive evolutionary Monte Carlo algorithm for optimization with applications to sensor placement problems, Statistics and Computing, 18:4, (375-390), Online publication date: 1-Dec-2008.
  114. ACM
    Guiver J and Snelson E Learning to rank with SoftRank and Gaussian processes Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, (259-266)
  115. ACM
    Zheng Y, Neo S, Chua T and Tian Q Probabilistic optimized ranking for multimedia semantic concept detection via RVM Proceedings of the 2008 international conference on Content-based image and video retrieval, (161-168)
  116. ACM
    Qi Y, Liu D, Dunson D and Carin L Multi-task compressive sensing with Dirichlet process priors Proceedings of the 25th international conference on Machine learning, (768-775)
  117. Seeger M (2008). Bayesian Inference and Optimal Design for the Sparse Linear Model, The Journal of Machine Learning Research, 9, (759-813), Online publication date: 1-Jun-2008.
  118. Motoi S, Nakada Y, Misu T, Matsumoto T and Yagi N A novel hierarchical Bayesian HMM for multi-dimensional discrete data Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications, (52-57)
  119. Novoa E Simple model-based exploration and exploitation of Markov decision processes using the elimination algorithm Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence, (327-336)
  120. Schaffernicht E, Stephan V and Groß H An efficient search strategy for feature selection using Chow-Liu trees Proceedings of the 17th international conference on Artificial neural networks, (190-199)
  121. Nakajima S and Watanabe S Generalization error of automatic relevance determination Proceedings of the 17th international conference on Artificial neural networks, (1-10)
  122. Rubio G, Pomares H, Herrera L and Rojas I Kernel methods applied to time series forecasting Proceedings of the 9th international work conference on Artificial neural networks, (782-789)
  123. ACM
    Kropotov D and Vetrov D On one method of non-diagonal regularization in sparse Bayesian learning Proceedings of the 24th international conference on Machine learning, (457-464)
  124. Cawley G, Janacek G, Haylock M and Dorling S (2007). 2007 Special Issue, Neural Networks, 20:4, (537-549), Online publication date: 1-May-2007.
  125. Kaderali L Individualized predictions of survival time distributions from gene expression data using a Bayesian MCMC approach Proceedings of the 1st international conference on Bioinformatics research and development, (77-89)
  126. Wood M and Bryson J Representations for action selection learning from real-time observation of task experts Proceedings of the 20th international joint conference on Artifical intelligence, (641-646)
  127. Shutin D, Kubin G and Fleury B (2007). Application of the evidence procedure to the estimation of wireless channels, EURASIP Journal on Advances in Signal Processing, 2007:1, (71-71), Online publication date: 1-Jan-2007.
  128. Zhou J, Foster D, Stine R and Ungar L (2006). Streamwise Feature Selection, The Journal of Machine Learning Research, 7, (1861-1885), Online publication date: 1-Dec-2006.
  129. Centeno T and Lawrence N (2006). Optimising Kernel Parameters and Regularisation Coefficients for Non-linear Discriminant Analysis, The Journal of Machine Learning Research, 7, (455-491), Online publication date: 1-Dec-2006.
  130. Liu F, Zhou J, Qiu F, Yang J and Liu L Nonlinear hydrological time series forecasting based on the relevance vector regression Proceedings of the 13th international conference on Neural Information Processing - Volume Part II, (880-889)
  131. Nakajima S and Watanabe S Analytic solution of hierarchical variational bayes in linear inverse problem Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II, (240-249)
  132. Wang L, Bo L and Jiao L Sparse Kernel ridge regression using backward deletion Proceedings of the 9th Pacific Rim international conference on Artificial intelligence, (365-374)
  133. ACM
    Le Q, Smola A and Gärtner T Simpler knowledge-based support vector machines Proceedings of the 23rd international conference on Machine learning, (521-528)
  134. Jung K, Kim H and Lee J A novel learning network for option pricing with confidence interval information Proceedings of the Third international conference on Advances in Neural Networks - Volume Part III, (491-497)
  135. Kim H and Lee J Pseudo-density estimation for clustering with gaussian processes Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I, (1238-1243)
  136. Cherkassky V, Krasnopolsky V, Solomatine D and Valdes J (2006). 2006 Special issue, Neural Networks, 19:2, (113-121), Online publication date: 1-Mar-2006.
  137. Skabar A Application of bayesian techniques for MLPs to financial time series forecasting Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence, (888-891)
  138. Eleuteri A, Tagliaferri R and Milano L (2005). A novel information geometric approach to variable selection in MLP networks, Neural Networks, 18:10, (1309-1318), Online publication date: 1-Dec-2005.
  139. Viaene S, Dedene G and Derrig R (2005). Auto claim fraud detection using Bayesian learning neural networks, Expert Systems with Applications: An International Journal, 29:3, (653-666), Online publication date: 1-Oct-2005.
  140. Skabar A Application of Bayesian MLP techniques to predicting mineralization potential from geoscientific data Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II, (963-968)
  141. Wang W, Van Gelder P and Vrijling J Some issues about the generalization of neural networks for time series prediction Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II, (559-564)
  142. Ito Y, Srinivasan C and Izumi H Bayesian learning of neural networks adapted to changes of prior probabilities Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II, (253-259)
  143. ACM
    Wang T, Lizotte D, Bowling M and Schuurmans D Bayesian sparse sampling for on-line reward optimization Proceedings of the 22nd international conference on Machine learning, (956-963)
  144. ACM
    Chu W and Ghahramani Z Preference learning with Gaussian processes Proceedings of the 22nd international conference on Machine learning, (137-144)
  145. Guo Y, Greiner R and Schuurmans D Learning coordination classifiers Proceedings of the 19th international joint conference on Artificial intelligence, (714-721)
  146. Shao J, Xu D, Wang L and Wang Y Bayesian neural networks for prediction of protein secondary structure Proceedings of the First international conference on Advanced Data Mining and Applications, (544-551)
  147. Liang F (2005). Evidence Evaluation for Bayesian Neural Networks Using Contour Monte Carlo, Neural Computation, 17:6, (1385-1410), Online publication date: 1-Jun-2005.
  148. Krishnapuram B, Carin L, Figueiredo M and Hartemink A (2005). Sparse Multinomial Logistic Regression, IEEE Transactions on Pattern Analysis and Machine Intelligence, 27:6, (957-968), Online publication date: 1-Jun-2005.
  149. Menchero A, Montes Diez R, Ríos Insua D and M¨ller P (2005). Bayesian Analysis of Nonlinear Autoregression Models Based on Neural Networks, Neural Computation, 17:2, (453-485), Online publication date: 1-Feb-2005.
  150. Fukumizu K, Bach F and Jordan M (2004). Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces, The Journal of Machine Learning Research, 5, (73-99), Online publication date: 1-Dec-2004.
  151. Haraldsson H, Edenbrandt L and Ohlsson M (2004). Detecting acute myocardial infarction in the 12-lead ECG using Hermite expansions and neural networks, Artificial Intelligence in Medicine, 32:2, (127-136), Online publication date: 1-Oct-2004.
  152. Cawley G, Talbot N, Janacek G and Peck M Bayesian kernel learning methods for parametric accelerated life survival analysis Proceedings of the First international conference on Deterministic and Statistical Methods in Machine Learning, (37-55)
  153. Roberts S and Choudrey R Bayesian independent component analysis with prior constraints Proceedings of the First international conference on Deterministic and Statistical Methods in Machine Learning, (159-179)
  154. Krishnapuram B, Hartemink A, Carin L and Figueiredo M (2004). A Bayesian Approach to Joint Feature Selection and Classifier Design, IEEE Transactions on Pattern Analysis and Machine Intelligence, 26:9, (1105-1111), Online publication date: 1-Sep-2004.
  155. ACM
    Grochow K, Martin S, Hertzmann A and Popović Z Style-based inverse kinematics ACM SIGGRAPH 2004 Papers, (522-531)
  156. ACM
    Grochow K, Martin S, Hertzmann A and Popović Z (2004). Style-based inverse kinematics, ACM Transactions on Graphics, 23:3, (522-531), Online publication date: 1-Aug-2004.
  157. Smola A and Schölkopf B (2004). A tutorial on support vector regression, Statistics and Computing, 14:3, (199-222), Online publication date: 1-Aug-2004.
  158. ACM
    Qi Y, Minka T, Picard R and Ghahramani Z Predictive automatic relevance determination by expectation propagation Proceedings of the twenty-first international conference on Machine learning
  159. Gustafson P, Macnab Y and Wen S (2004). On the Value of derivative evaluations and random walk suppression in Markov Chain Monte Carlo algorithms, Statistics and Computing, 14:1, (23-38), Online publication date: 1-Jan-2004.
  160. Manjunath R and Gurumurthy K Bayesian decisions with differentially fed neural networks Proceedings of the 2nd WSEAS International Conference on Electronics, Control and Signal Processing, (1-4)
  161. Quiñonero-Candela J and Rasmussen C Analysis of some methods for reduced rank gaussian process regression Switching and Learning in Feedback Systems, (98-127)
  162. Chu W, Keerthi S and Ong C (2003). Bayesian trigonometric support vector classifier, Neural Computation, 15:9, (2227-2254), Online publication date: 1-Sep-2003.
  163. Figueiredo M (2003). Adaptive Sparseness for Supervised Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, 25:9, (1150-1159), Online publication date: 1-Sep-2003.
  164. ACM
    Chudova D, Gaffney S, Mjolsness E and Smyth P Translation-invariant mixture models for curve clustering Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, (79-88)
  165. Ramakrishnan N and Bailey-Kellogg C Gaussian process models of spatial aggregation algorithms Proceedings of the 18th international joint conference on Artificial intelligence, (1045-1051)
  166. Liang F (2003). An effective Bayesian neural network classifier with a comparison study to support vector machine, Neural Computation, 15:8, (1959-1989), Online publication date: 1-Aug-2003.
  167. Eleuteri A, Tagliaferri R, Milano L, De Placido S and De Laurentiis M (2003). A novel neural network-based survival analysis model, Neural Networks, 16:5-6, (855-864), Online publication date: 1-Jun-2003.
  168. ACM
    Krishnapuram B, Carin L and Hartemink A Joint classifier and feature optimization for cancer diagnosis using gene expression data Proceedings of the seventh annual international conference on Research in computational molecular biology, (167-175)
  169. Liu Z and Bozdogan H (2003). RBF neural networks for classification using new Kernel functions, Neural, Parallel & Scientific Computations, 11:1 & 2, (41-52), Online publication date: 1-Mar-2003.
  170. Bakker B and Heskes T (2003). Clustering ensembles of neural network models, Neural Networks, 16:2, (261-269), Online publication date: 1-Mar-2003.
  171. Ueda N and Ghahramani Z (2002). Bayesian model search for mixture models based on optimizing variational bounds, Neural Networks, 15:10, (1223-1241), Online publication date: 1-Dec-2002.
  172. Valpola H and Karhunen J (2002). An unsupervised ensemble learning method for nonlinear dynamic state-space models, Neural Computation, 14:11, (2647-2692), Online publication date: 1-Nov-2002.
  173. Vehtari A and Lampinen J (2002). Bayesian model assessment and comparison using cross-validation predictive densities, Neural Computation, 14:10, (2339-2468), Online publication date: 1-Oct-2002.
  174. Hsu W, Welge M, Redman T and Clutter D (2002). High-Performance Commercial Data Mining, Data Mining and Knowledge Discovery, 6:4, (361-391), Online publication date: 1-Oct-2002.
  175. Hammer B and Villmann T (2002). Generalized relevance learning vector quantization, Neural Networks, 15:8-9, (1059-1068), Online publication date: 1-Oct-2002.
  176. Chipman H, George E and McCulloch R (2002). Bayesian Treed Models, Machine Language, 48:1-3, (299-320), Online publication date: 30-Sep-2002.
  177. Gunn S and Kandola J (2002). Structural Modelling with Sparse Kernels, Machine Language, 48:1-3, (137-163), Online publication date: 30-Sep-2002.
  178. Heskes T, Bakker B and Kappen B (2002). Approximate algorithms for neural-Bayesian approaches, Theoretical Computer Science, 287:1, (219-238), Online publication date: 25-Sep-2002.
  179. Doya K (2002). Metalearning and neuromodulation, Neural Networks, 15:4, (495-506), Online publication date: 1-Jun-2002.
  180. Feng X, Williams C and Felderhof S (2002). Combining Belief Networks and Neural Networks for Scene Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 24:4, (467-483), Online publication date: 1-Apr-2002.
  181. Gao J, Gunn S, Harris C and Brown M (2002). A Probabilistic Framework for SVM Regression and Error Bar Estimation, Machine Language, 46:1-3, (71-89), Online publication date: 11-Mar-2002.
  182. Ma Q, Wang J and Gattiker J Mining biomolecular data using background knowledge and artificial neural networks Handbook of massive data sets, (1141-1168)
  183. Nürnberger A, Pedrycz W and Kruse R Data mining tasks and methods: Classification Handbook of data mining and knowledge discovery, (304-317)
  184. Tipping M (2001). Sparse bayesian learning and the relevance vector machine, The Journal of Machine Learning Research, 1, (211-244), Online publication date: 1-Sep-2001.
  185. Roberts S, Holmes C and Denison D (2001). Minimum-Entropy Data Partitioning Using Reversible Jump Markov Chain Monte Carlo, IEEE Transactions on Pattern Analysis and Machine Intelligence, 23:8, (909-914), Online publication date: 1-Aug-2001.
  186. Sato M (2001). Online Model Selection Based on the Variational Bayes, Neural Computation, 13:7, (1649-1681), Online publication date: 1-Jul-2001.
  187. Ruiz De Angulo V and Torras C (2001). Architecture-Independent Approximation of Functions, Neural Computation, 13:5, (1119-1135), Online publication date: 1-May-2001.
  188. Sundararajan S and Keerthi S (2001). Predictive Approaches for Choosing Hyperparameters in Gaussian Processes, Neural Computation, 13:5, (1103-1118), Online publication date: 1-May-2001.
  189. Neal R (2001). Annealed importance sampling, Statistics and Computing, 11:2, (125-139), Online publication date: 1-Apr-2001.
  190. ACM
    Tresp V The generalized Bayesian committee machine Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, (130-139)
  191. Williams C and Vivarelli F (2000). Upper and Lower Bounds on the Learning Curve for Gaussian Processes, Machine Language, 40:1, (77-102), Online publication date: 1-Jul-2000.
  192. Schuurmans D and Southey F Monte Carlo inference via greedy importance sampling Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence, (523-532)
  193. Andrieu C, de Freitas N and Doucet A Reversible jump MCMC simulated annealing for neural networks Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence, (11-18)
  194. Jain A, Duin R and Mao J (2000). Statistical Pattern Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22:1, (4-37), Online publication date: 1-Jan-2000.
  195. Hsu W, Ray S and Wilkins D (2000). A Multistrategy Approach to Classifier Learning from Time Series, Machine Language, 38:1-2, (213-236), Online publication date: 1-Jan-2000.
  196. Williams C and Barber D (1998). Bayesian Classification With Gaussian Processes, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20:12, (1342-1351), Online publication date: 1-Dec-1998.
  197. Magni P, Bellazzi R and De Nicolao G (1998). Bayesian Function Learning Using MCMC Methods, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20:12, (1319-1331), Online publication date: 1-Dec-1998.
  198. Movellan J and Mineiro P (1998). Robust Sensor Fusion, Machine Language, 32:2, (85-100), Online publication date: 1-Aug-1998.
  199. Buntine W (1998). Will Domain-Specific Code Synthesis Become a Silver Bullet?, IEEE Intelligent Systems, 13:2, (9-15), Online publication date: 1-Mar-1998.
  200. Mackay D and Takeuchi R (1998). Interpolation models with multiple hyperparameters, Statistics and Computing, 8:1, (15-23), Online publication date: 1-Jan-1998.
  201. Bekiroglu Y, Damianou A, Detry R, Stork J, Kragic D and Ek C Probabilistic consolidation of grasp experience 2016 IEEE International Conference on Robotics and Automation (ICRA), (193-200)
Contributors
  • University of Toronto

Recommendations