Bayesian Approaches in Machine Learning
From point estimates to distributions over hypotheses.
Bayes' rule as the engine
Bayesian learning updates beliefs about parameters or models after observing data:
posterior = likelihood × prior / evidence.
Instead of committing to one best parameter vector, we maintain uncertainty and integrate over possibilities.
Why this matters
- Uncertainty calibration for high-stakes decisions.
- Better behavior in small-data settings via informative priors.
- Natural protection against overfitting through posterior averaging.
Core Bayesian methods
- Conjugate models: analytically tractable updates.
- MCMC: sample from complex posteriors.
- Variational inference: optimize a simpler approximate posterior.
- Bayesian neural networks: distributions over weights.
Decision-theoretic layer
Bayesian pipelines separate inference (what is true?) from decisions (what should we do?). Utility-aware decisions can be made from posterior predictive distributions using expected risk minimization.
Takeaway: Bayesian ML frames learning as uncertainty-aware inference, not only error minimization.