DOSEN PROFIL LENGKAP

Draft article not currently submitted for review.

This is a draft Articles for creation (AfC) submission. It is not currently pending review. While there are no deadlines, abandoned drafts may be deleted after six months. To edit or make changes to this draft, simply click on the "Edit" tab at the top of the window.

To be accepted, a draft should:

Show the subject qualifies for a Wikipedia article by using multiple sources that meet four criteria. The sources should be (1) reliable (2) secondary (3) independent of the subject (4) talk about the subject in some depth. For some topics, there are alternative criteria.
Be written from a neutral point of view
Respect copyright and do not plagiarize. Do not copy-paste.

It is strongly discouraged to write about either yourself or your business or employer. If you do so, you must declare it.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Last edited by Uppsimba (talk | contribs) 59 days ago. (Update)

Submit the draft for review!

A Bayesian Neural Network (BNN) is a neural network, that trains a distribution over its network parameters using Bayesian inference. Once trained it uses this distribution over parameters to predict a probability distribution in the output space for a single input. BNNs are used in applications which require the quantification of uncertainties or in which multimodal distributions are expected that point-estimate predictions cannot express.^[1] The design of a BNN entails the choice of a functional model ${\textstyle m_{\omega }}$ (the neural network, with model parameters ${\textstyle \omega }$ ), and a stochastic model which contains both priors ${\textstyle p(\omega )}$ and ${\textstyle p(y|x,\omega )}$ .^[2]

Not to be confused with Bayesian Network.

The first use of BNNs was in 1993 by Hinton and Van Camp^[3].

Setup

From a Bayesian point of view, the model parameters ${\textstyle \omega }$ of a BNN are treated as latent random variables and the training process is their inference, conditional to the (observed) training data ${\textstyle D}$ . The distribution of the completely trained model parameters is directly given by Bayes' theorem

$p(\omega |D)={\frac {p(\omega )\,p(D|\omega )}{p(D)}}$

In practice, the marginal is often intractable, which makes it necessary to adapt strategies to approximate the true posterior. Once the parameter posterior is computed, other quantities of interest can be computed via marginalisation. For example, the predictive posterior is given by

$p(y|x,D)=\int p(y|x,\omega )\,p(\omega |D)d\omega$

where ${\textstyle p(y|x,\omega )=m_{\omega }(x)}$ is provided by the functional model.

Variants

The numerical difficulties that come with the computation of Bayes theorem have given rise to a family of BNN approaches, which can roughly be separated into parametric methods (Variational Inference (VI), Bayes by Backprop) and nonparametric methods (free-form VI, Monte Carlo Dropout (MCD), Direct sampling).

Variational inference

A simplification often performed is to introduce a surrogate posterior ${\textstyle q_{\theta }}$ with closed form and minimize its Kullback-Leibler (KL) divergence to the true parameter posterior with respect to the variational parameters ${\textstyle \theta }$ . This is possible because the objective function can be rewritten such that the intractable log marginal likelihood ${\textstyle p(D)}$ is separated from the variational term: ${\begin{aligned}{\text{KL}}(q_{\theta }(\omega )\mid \mid p(\omega |D))&={\text{KL}}(q_{\theta }(\omega )\mid \mid p(\omega ))-\mathbb {E} _{q}[\log p(D|\omega )]+\log p(D)\\&=\underbrace {\mathbb {E} _{q}[\log p(D|\omega )-\log q_{\theta }(\omega )+\log p(\omega )]} _{-{\text{ELBO}}}+\log p(D)\end{aligned}}$

The quantity to maximize therefore becomes the evidence lower bound (ELBO).

Using the reparameterization trick, ...

This allows the reformulation of each Bayesian update to an optimization problem, which can be solved using established gradient descent methods. This method is known as variational inference.

Common choices for the closed form include the mean-field approach, which assumes a complete factorisation of q, and the more general multivariate normal distribution with a low-rank covariance matrix. [Barber and Bishop (Ensemble Learning in Bayesian Neural Networks)] This choice is the equivalent of a loss function in point estimate ML.^[2]

In cases where the computation of the complete log-likelihood becomes infeasible due to the large volume of training data, VI also works with stochastic gradient descent (stochastic variational inference), where in each Bayesian update only a subset of data is used to approximate the likelihood term.

Monte Carlo dropout

Another approximation technique is Monte Carlo Dropout (MCD)^[4] where conventional dropout layers are kept enabled during inference time. This allows the sampling of predictions.

Direct sampling

In contrast to VI, samplers like MCMC converge to the true posterior without assumptions about its form.[Goan]

Often used Hamiltonian Monte Carlo (HMC) or Langevin Monte Carlo

d^[5]

Connection to Gaussian Processes

Neural network Gaussian process

...

Limitations

BNN are usually computation-heavy in training and inference, due to their need to generate many samples from the posterior distribution. For this reason, the VI method is often only suitable in its mean field form, which trades expressiveness for lower computational complexity.^[5] Further, it was found that VI approaches often underestimate the variance of their predictions. Direct sampling using Markov Chain Monte Carlo (MCMC) scales poorly with the amount of data as by default it requires the processing of the entire training dataset to perform an update. Stochastic MCMC methods, which consume only a subset of the data, have been shown to introduce a bias to the posterior.^[5]

VI: Placed assumptions and restrictions about the form may introduce a bias and induce inaccuracies in predictions.

References

^ Gawlikowski, Jakob; Tassi, Cedrique Rovile Njieutcheu; Ali, Mohsin; Lee, Jongseok; Humt, Matthias; Feng, Jianxiang; Kruspe, Anna; Triebel, Rudolph; Jung, Peter; Roscher, Ribana; Shahzad, Muhammad; Yang, Wen; Bamler, Richard; Zhu, Xiao Xiang (2023-10-01). "A survey of uncertainty in deep neural networks". Artificial Intelligence Review. 56 (1): 1513–1589. doi:10.1007/s10462-023-10562-9. ISSN 1573-7462.
^ ^a ^b Jospin, Laurent Valentin; Laga, Hamid; Boussaid, Farid; Buntine, Wray; Bennamoun, Mohammed (May 2022). "Hands-On Bayesian Neural Networks—A Tutorial for Deep Learning Users". IEEE Computational Intelligence Magazine. 17 (2): 29–48. doi:10.1109/MCI.2022.3155327. ISSN 1556-603X.
^ Hinton, Geoffrey E.; van Camp, Drew (1993). "Keeping the neural networks simple by minimizing the description length of the weights". ACM Press: 5–13. doi:10.1145/168304.168306. ISBN 978-0-89791-611-0. {{cite journal}}: Cite journal requires |journal= (help)
^ Gal, Yarin; Ghahramani, Zoubin (2015), Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning, arXiv, doi:10.48550/ARXIV.1506.02142, retrieved 2026-03-17
^ ^a ^b ^c Goan, Ethan; Fookes, Clinton (2020), Mengersen, Kerrie L.; Pudlo, Pierre; Robert, Christian P. (eds.), "Bayesian Neural Networks: An Introduction and Survey", Case Studies in Applied Bayesian Data Science: CIRM Jean-Morlet Chair, Fall 2018, Cham: Springer International Publishing, pp. 45–87, doi:10.1007/978-3-030-42553-1_3, ISBN 978-3-030-42553-1, retrieved 2026-03-17{{citation}}: CS1 maint: work parameter with ISBN (link)

Category:Bayesian statistics

[1] Gawlikowski, Jakob; Tassi, Cedrique Rovile Njieutcheu; Ali, Mohsin; Lee, Jongseok; Humt, Matthias; Feng, Jianxiang; Kruspe, Anna; Triebel, Rudolph; Jung, Peter; Roscher, Ribana; Shahzad, Muhammad; Yang, Wen; Bamler, Richard; Zhu, Xiao Xiang (2023-10-01). "A survey of uncertainty in deep neural networks". Artificial Intelligence Review. 56 (1): 1513–1589. doi:10.1007/s10462-023-10562-9. ISSN 1573-7462.

[:0-2] Jospin, Laurent Valentin; Laga, Hamid; Boussaid, Farid; Buntine, Wray; Bennamoun, Mohammed (May 2022). "Hands-On Bayesian Neural Networks—A Tutorial for Deep Learning Users". IEEE Computational Intelligence Magazine. 17 (2): 29–48. doi:10.1109/MCI.2022.3155327. ISSN 1556-603X.

[3] Hinton, Geoffrey E.; van Camp, Drew (1993). "Keeping the neural networks simple by minimizing the description length of the weights". ACM Press: 5–13. doi:10.1145/168304.168306. ISBN 978-0-89791-611-0. {{cite journal}}: Cite journal requires |journal= (help)

[4] Gal, Yarin; Ghahramani, Zoubin (2015), Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning, arXiv, doi:10.48550/ARXIV.1506.02142, retrieved 2026-03-17

[:1-5] Goan, Ethan; Fookes, Clinton (2020), Mengersen, Kerrie L.; Pudlo, Pierre; Robert, Christian P. (eds.), "Bayesian Neural Networks: An Introduction and Survey", Case Studies in Applied Bayesian Data Science: CIRM Jean-Morlet Chair, Fall 2018, Cham: Springer International Publishing, pp. 45–87, doi:10.1007/978-3-030-42553-1_3, ISBN 978-3-030-42553-1, retrieved 2026-03-17{{citation}}: CS1 maint: work parameter with ISBN (link)

[1]

[2]

[3]

[4]

[5]