What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Bayesian Inference

Bayesian inference is a statistical method that uses Bayes' theorem to update the probability of a hypothesis as new evidence becomes available, providing a principled framework for reasoning under uncertainty.

6 min readLast updated May 2026Foundations

Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as additional evidence is observed. Unlike frequentist inference, which treats parameters as fixed unknown quantities and data as random, Bayesian inference treats parameters as random variables with their own probability distributions. This perspective provides a coherent framework for reasoning under uncertainty, combining prior knowledge with observed data to produce updated beliefs known as posterior distributions.

Theoretical Foundation

The central equation of Bayesian inference is Bayes' theorem, which expresses the conditional probability of a hypothesis H given evidence E as the product of the likelihood of the evidence under the hypothesis and the prior probability of the hypothesis, divided by the total probability of the evidence. In compact form: posterior is proportional to likelihood times prior. The denominator, called the marginal likelihood or evidence, normalises the result so that it integrates to one over the parameter space.

The prior distribution encodes beliefs about parameters before observing data. It may be informative — incorporating expert knowledge or previous experiments — or weakly informative, providing only loose constraints. The likelihood function describes how probable the observed data are under each candidate value of the parameter. The posterior distribution, obtained by combining prior and likelihood, summarises updated beliefs and serves as the basis for prediction, decision-making, and further analysis.

Computational Approaches

Closed-form solutions to Bayes' theorem exist only for a limited family of conjugate prior–likelihood pairs, such as beta-binomial or normal-normal models. For most practical problems, the posterior must be approximated numerically.

Markov Chain Monte Carlo

Markov chain Monte Carlo (MCMC) methods, including the Metropolis–Hastings algorithm and Gibbs sampling, construct a Markov chain whose stationary distribution is the target posterior. Samples drawn from the chain after a burn-in period approximate the posterior, allowing computation of expectations, credible intervals, and predictive distributions. Hamiltonian Monte Carlo (HMC) and its adaptive variant, the No-U-Turn Sampler (NUTS) used in the Stan probabilistic programming language, exploit gradient information for more efficient exploration of high-dimensional posteriors.

Variational Inference

Variational inference reformulates posterior approximation as an optimisation problem. A simpler distribution from a chosen family — often a factorised Gaussian — is fitted to the true posterior by minimising the Kullback–Leibler divergence between them. Variational methods scale better than MCMC to large datasets and high-dimensional models but provide an approximation rather than asymptotically exact samples.

Laplace Approximation

The Laplace approximation fits a Gaussian distribution centred at the posterior mode, using the curvature of the log-posterior as the precision matrix. It is computationally cheap and often used as an initial approximation or within larger inference pipelines.

Applications in Machine Learning

Bayesian methods underpin a wide range of machine learning techniques. Gaussian processes provide a non-parametric Bayesian framework for regression and classification, returning calibrated uncertainty estimates over predictions. Bayesian neural networks place prior distributions over network weights and approximate the posterior to capture predictive uncertainty — useful in safety-critical applications such as medical diagnosis and autonomous driving. Bayesian optimisation uses a probabilistic surrogate model to guide search over expensive black-box functions, widely applied to hyperparameter tuning of deep learning models.

In probabilistic programming, languages such as Stan, PyMC, NumPyro, and Edward allow practitioners to specify generative models in code and perform inference automatically. These tools have made Bayesian methods accessible to a broader community of data scientists and engineers.

Bayesian vs Frequentist Perspectives

The choice between Bayesian and frequentist approaches has been the subject of long-standing debate in statistics. Frequentist methods rely on long-run frequency interpretations of probability and avoid placing prior distributions on parameters, while Bayesian methods are explicit about prior beliefs and produce probabilistic statements about parameters directly. In practice, the two approaches often yield similar conclusions for well-identified problems with abundant data, but Bayesian methods are particularly valuable when data are scarce, when uncertainty quantification is critical, or when external knowledge must be incorporated formally.

Malaysian Context — Bayesian Methods in Industry and Research

Bayesian inference is applied across several Malaysian sectors where uncertainty quantification and incorporation of expert knowledge are important. In the financial services industry, banks including Maybank, CIMB, and Public Bank use Bayesian credit scoring and Bayesian network models for fraud detection and risk assessment, where the ability to combine historical data with expert judgement is particularly valuable. Bank Negara Malaysia (BNM) has published guidance on model risk management that recognises Bayesian approaches as acceptable methods for credit and market risk modelling under the Risk-Based Capital Framework.

In the energy and resources sector, Petronas applies Bayesian methods to reservoir characterisation, drilling decision support, and predictive maintenance of offshore infrastructure, where prior geological knowledge can be combined with sparse sensor data to produce calibrated forecasts. Bayesian inference is also used in palm oil yield modelling by research bodies such as the Malaysian Palm Oil Board (MPOB) to combine satellite imagery, weather data, and agronomic measurements.

Malaysian universities including Universiti Malaya, Universiti Sains Malaysia, and Universiti Kebangsaan Malaysia teach Bayesian statistics in their statistics, computer science, and actuarial science programmes. The Institute of Statistics Malaysia hosts workshops and short courses on Bayesian methods, and the Malaysian Institute of Statistics collaborates with regional partners on Bayesian biostatistics for public health research. Bayesian methods have been applied to dengue outbreak forecasting in Malaysia using data from the Ministry of Health (KKM), where they combine historical case counts with weather and demographic priors to estimate transmission parameters.

In policy and governance, the Department of Statistics Malaysia (DOSM) increasingly uses Bayesian small-area estimation techniques to produce reliable district-level estimates from sample survey data, supporting evidence-based planning under the MyDigital Blueprint and Twelfth Malaysia Plan.

References

Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B. (2013). Bayesian Data Analysis (3rd ed.). Chapman and Hall/CRC.
Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
Hoffman, M. D., and Gelman, A. (2014). The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15(1), 1593–1623.
Bank Negara Malaysia. (2019). Policy Document on Model Risk Management. Kuala Lumpur: BNM.
Carpenter, B. et al. (2017). Stan: A Probabilistic Programming Language. Journal of Statistical Software, 76(1).

Tags:bayesian probability statistical inference machine learning

Type	Statistical inference method
Named after	Reverend Thomas Bayes (1701–1761)
Core equation	P(H\|E) = P(E\|H) * P(H) / P(E)
Key concepts	Prior, likelihood, posterior, evidence
Common methods	MCMC, variational inference, Laplace approximation
Related	Probabilistic programming, Gaussian process, Hidden Markov model