What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Monte Carlo Methods

A broad class of computational algorithms that use repeated random sampling to obtain numerical results, widely used in machine learning for Bayesian inference, reinforcement learning, and uncertainty estimation.

5 min readLast updated May 2026Foundations

Monte Carlo methods are a family of computational algorithms that approximate quantities of interest — integrals, expectations, probabilities — by drawing repeated random samples from a probability distribution and aggregating the results. The methods take their name from the casinos of Monaco and were formalised in the 1940s at Los Alamos National Laboratory by Stanislaw Ulam, John von Neumann and Nicholas Metropolis as part of work on neutron diffusion. In machine learning they provide the foundational machinery for Bayesian inference, reinforcement learning, generative modelling and uncertainty estimation.

Core idea

If a quantity of interest can be written as an expectation of some function f under a distribution p, the Monte Carlo estimator is the simple sample mean: draw N independent samples from p, evaluate f on each, and average. By the law of large numbers the estimator converges to the true expectation, with an error that decreases as one over the square root of N. Crucially, this rate is independent of the dimension of the integral, which is why Monte Carlo dominates in high-dimensional problems where deterministic quadrature is intractable.

Markov Chain Monte Carlo

Direct sampling from a target distribution is often impossible, particularly for the posterior distributions that arise in Bayesian inference. Markov Chain Monte Carlo (MCMC) sidesteps this by constructing a Markov chain whose stationary distribution is the target. Running the chain long enough produces samples that, although correlated, behave as draws from the target for the purpose of estimating expectations.

The Metropolis-Hastings algorithm, introduced by Metropolis and colleagues in 1953 and generalised by Hastings in 1970, proposes candidate moves from a proposal distribution and accepts or rejects them according to a ratio of target densities. Gibbs sampling, a special case in which each variable is updated in turn from its conditional distribution, is widely used when those conditionals are tractable. Hamiltonian Monte Carlo (HMC) and the No-U-Turn Sampler (NUTS) employ gradient information to propose long, efficient moves and underpin probabilistic programming systems such as Stan and PyMC.

Monte Carlo in reinforcement learning

In reinforcement learning, Monte Carlo methods estimate the value of a state or state-action pair as the average return observed across many sampled trajectories. They differ from temporal-difference methods such as Q-learning in that they wait until the end of an episode before updating estimates, trading higher variance for lower bias. Modern policy gradient algorithms, including REINFORCE and Proximal Policy Optimization, are Monte Carlo estimators of the policy gradient.

Monte Carlo dropout and uncertainty

Monte Carlo dropout, proposed by Yarin Gal and Zoubin Ghahramani in 2016, reinterprets dropout at inference time as approximate Bayesian inference. By running multiple stochastic forward passes through a dropout-enabled network and averaging the predictions, practitioners obtain calibrated predictive uncertainty without modifying the training procedure. This technique is now standard in medical imaging and safety-critical applications.

Sequential Monte Carlo and particle filters

Sequential Monte Carlo, also known as particle filtering, maintains a population of weighted samples that are propagated, reweighted and resampled to track a posterior over time. It is widely used in robotics for localisation and mapping, in epidemiology for disease tracking, and in financial modelling for stochastic volatility models.

Limitations

Monte Carlo methods suffer from high variance when the target distribution is poorly explored by the sampler. Diagnosing convergence of MCMC chains is notoriously difficult, with standard tools including the Gelman-Rubin statistic and effective sample size. Variational inference offers a deterministic alternative that trades exactness for speed and is often combined with Monte Carlo estimators in modern Bayesian deep learning.

Malaysian Context — Monte Carlo in research, finance and public health

Monte Carlo methods are embedded in several Malaysian sectors. Bank Negara Malaysia (BNM) and the Securities Commission Malaysia (SC) rely on Monte Carlo simulation for stress testing, value-at-risk computation and capital adequacy assessments under the Basel framework. Domestic banks including Maybank, CIMB, RHB and Public Bank use Monte Carlo engines for derivatives pricing, mortgage prepayment modelling and operational risk quantification.

In public health, the Ministry of Health Malaysia and the Institute for Medical Research used particle filters and Bayesian Monte Carlo models extensively during the COVID-19 response to estimate reproduction numbers and project hospital demand. Universiti Malaya's Centre for Epidemiology and Evidence-Based Practice has continued this work in dengue and leptospirosis surveillance.

Petronas applies Monte Carlo simulation to reservoir engineering and exploration risk, while Tenaga Nasional Berhad (TNB) uses it for load forecasting and renewable integration studies. The Malaysian Nuclear Agency and Universiti Kebangsaan Malaysia maintain expertise in Monte Carlo radiation transport codes such as MCNP for medical physics and reactor safety analyses.

On the AI front, MDEC-supported research grants under the National AI Roadmap 2021–2025 include Bayesian deep learning projects at Universiti Sains Malaysia and Universiti Putra Malaysia focused on uncertainty quantification in medical imaging and agricultural disease detection.

References

Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H. and Teller, E. (1953). Equation of State Calculations by Fast Computing Machines. Journal of Chemical Physics 21(6).
Hastings, W. K. (1970). Monte Carlo Sampling Methods Using Markov Chains and Their Applications. Biometrika 57.
Neal, R. M. (2011). MCMC Using Hamiltonian Dynamics. Handbook of Markov Chain Monte Carlo, CRC Press.
Gal, Y. and Ghahramani, Z. (2016). Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. ICML.
Bank Negara Malaysia. (2023). Financial Stability Review. BNM.

Tags:monte-carlo sampling bayesian-inference MCMC

Type	Stochastic computational technique
Originated	1940s, Los Alamos (Stanislaw Ulam, John von Neumann)
Key variants	MCMC, Metropolis-Hastings, Gibbs, HMC, sequential MC
Used in	Bayesian ML, reinforcement learning, finance, physics
Related	Bayesian inference, variational inference