Mixture of Experts: Essentials
23 December 2025
Sparse routing for scalable Transformers
What I cannot create, I do not understand.
Richard Feynman
Flow Matching: A Minimal Guide
10 November 2025
Learning vector fields in continuous and discrete spaces
Generalized Reparametrization Tricks
09 August 2025
Backpropagation through continuous and discrete random variables
Discrete Diffusion Models: The Modern BERT
18 July 2025
Diffusion language models for images, languages, and general state spaces
Mean Flow: A Brief Introduction
31 May 2025
One-step, ultra-fast generation for images and videos
Feynman–Kac Formula Without the Mystery
15 March 2025
A popular tool in finance and stochastic optimal control
Sequential Monte Carlo: A Quick Guide
08 February 2025
A general framework for modeling nonlinear state-space models
11 November 2024
Interpreting Bayes’ law via optimal transport for filtering problems
Hutchinson Estimator, Explained
27 July 2024
An unbiased Monte Carlo sampler for implicit trace estimation
10 April 2024
A Monte Carlo sampler for radial basis function kernels and positional embeddings
09 March 2024
Does a straighter flow always yield more efficient transport?
07 March 2024
The optimal penalty can be zero or negative for real-world high-dimensional data.
The Triangle of Flow, Diffusion, and PDE
01 July 2023
Connections between probability flows, diffusions, and PDEs.
20 May 2023
A general coupling technique for characterizing a broad range of diffusions.
19 June 2022
A framework that unifies ODE, PDE, SDE, stochastic control, optimal transport, and fluid dynamics
Understanding Hamiltonian Monte Carlo
01 November 2021
An elegant sampler that utilizes Hamiltonian dynamics to propose new states in simulations.
Couplings and Monte Carlo Methods (I)
01 August 2021
A family of techniques to understand the convergence of random variables.
Lyapunov Function for Poincaré Inequality
01 June 2021
An inequality that unifies ODE, PDE, SDE, functional analysis, and Riemannian geometry.
Replica Exchange and Variance Reduction
01 May 2021
Running multiple MCMCs at different temperatures to explore the solution thoroughly.
Dynamic Importance Sampling and Beyond
05 November 2020
Negative learning rates help escape local traps.