Deriving the Diffusion Equation from a Random Walk

A short derivation of the diffusion equation from a random walk.

Author

Affiliation

NYU Abu Dhabi, NYU Tandon

Published

September 15, 2024

The Full Derivation

We derive the 2D diffusion equation from a discrete 2D random walk. To model our discrete random walk, we work with the Cartesian grid shown in Figure 1.

In the simplest case of a 2D random walk, initialized at the origin (white), we are allowed to take one step in the \(x\) or \(y\) direction for each unit of time, resulting in four possible locations (blue) after \(t=1\). At \(t=2\), we can take yet another step in any of the four directions. Assuming we took the first step upwards, the red circles \(and\) the origin are all possible locations we can arrive at after \(t=2\).

We leave out the other three red arrows to avoid visual clutter.

Figure 1: Beginning a random walk on the 2D coordinate system.

Pondering on \(t=2\), we can ask, what is the probability we arrived at the top most red location? Looking at the picture, it depends on the probability of picking that red spot (amongst all four red spots) when on the preceding blue spot \(and\) the probability we were at the blue spot. Symbolically,

\[ P(\text{top red after two steps}) = P(\text{selecting top red}) \cdot P(\text{neighbor blue after one step}). \]

With only two time steps, there is just one way that we could have arrived at the top red spot after \(t=2\). However, if we played this out for a few more steps, it is not hard to imagine that you could arrive at the top red spot from any of its four neighbors. In fact, we can already visualize that scenario by looking at the origin: a return to the origin could be from any of the four blue locations.

Since we are on the 2D coordinate system, an arbitrary location on our grid can be described by a coordinate pair \((x,y)\). And, we can refer to any of its four neighbors with the set \(\{(x-1, y), (x+1, y), (x, y-1), (x, y+1)\}\). So, extending the equation above, if we wanted to describe the probability of arriving at \((x,y)\), we would write

\[\begin{aligned} P(x,y,t) = \frac{1}{4} \bigl( P \left(x-1,y,t-1\right) + P \left(x+1,y,t-1\right) + \\ P \left(x,y-1,t-1\right)+ P \left(x,y+1,t-1\right) \bigr). \end{aligned}\]

Instead of accounting for the path from just one neighbor, we additively account for all four neighbors. In the equation above, we have just fixed the probability of selecting \((x,y)\) as \(\frac{1}{4}\) and factored it out.

While we have modeled discrete movement, we have not really considered a continuous movement. The key to making that transition is to ask the same question with a fraction of a step in a fraction of a time unit. So, if \(\frac{1}{2}t\) elapsed, we might ask about the probability we have taken a \(\frac{1}{2}x\) or \(\frac{1}{2}y\) step. Using \(\Delta\) to represent any fraction, we can say

\[\begin{aligned} P(x,y,t) = \frac{1}{4} \bigl( P \left(x+\Delta x,y,t-\Delta t \right) + P \left(x+\Delta x,y,t-\Delta t\right) + \\ P \left(x,y-\Delta y,t-\Delta t\right)+ P \left(x,y+\Delta y,t-\Delta t \right) \bigr). \end{aligned}\]

Recalling the general form of a second-order Taylor series expansion with two variables,

\[\begin{aligned} f(x, y) &= f(a, b) + (x-a) \frac{\partial f}{\partial x} \bigg|_{x=a, y=b} + (y-b) \frac{\partial f}{\partial y}\bigg|_{x=a, y=b} + \\ & (x-a)^{2} \frac{\partial^2 f}{\partial x^2}\bigg|_{x=a, y=b} + (x-a)(y-b) \frac{\partial^2 f}{\partial x \partial y}\bigg|_{x=a, y=b} + (y-b)^{2} \frac{\partial^2 f}{\partial y^2}\bigg|_{x=a, y=b}, \end{aligned}\]

we proceed with approximating the four terms:

\[\begin{aligned} P(x+\Delta x, y, t-\Delta t) &\approx {\color{blue} P \left(x,y,t\right)} + \Delta x \frac{\partial P}{\partial x} {\color{blue}- \Delta t \frac{\partial P}{\partial t}} + \\ & {\color{blue} \frac{(\Delta x)^2}{2} \frac{\partial^2 P}{\partial x^2}} - (\Delta x \Delta t) \frac{\partial^2 P}{\partial x \partial t} + {\color{blue}(-\Delta t)^2 \frac{\partial^2 P}{\partial t^2}} \end{aligned}\]
\[\begin{aligned} P(x-\Delta x, y, t-\Delta t) &\approx {\color{blue} P \left(x,y,t\right)} - \Delta x \frac{\partial P}{\partial x} {\color{blue}- \Delta t \frac{\partial P}{\partial t}} + \\ & {\color{blue} \frac{(-\Delta x)^2}{2} \frac{\partial^2 P}{\partial x^2}} + (\Delta x \Delta t) \frac{\partial^2 P}{\partial x \partial t} + {\color{blue}(-\Delta t)^2 \frac{\partial^2 P}{\partial t^2}} \end{aligned}\]
\[\begin{aligned} P(x, y + \Delta y, t-\Delta t) &\approx {\color{blue} P \left(x,y,t\right)} + \Delta y \frac{\partial P}{\partial y} {\color{blue}- \Delta t \frac{\partial P}{\partial t}} + \\ & {\color{blue} \frac{(\Delta y)^2}{2} \frac{\partial^2 P}{\partial y^2}} - (\Delta y \Delta t) \frac{\partial^2 P}{\partial y \partial t} + {\color{blue}(-\Delta t)^2 \frac{\partial^2 P}{\partial t^2}} \end{aligned}\]
\[\begin{aligned} P(x, y-\Delta y, t-\Delta t) &\approx {\color{blue} P \left(x,y,t\right)} - \Delta y \frac{\partial P}{\partial y} {\color{blue}- \Delta t \frac{\partial P}{\partial t}} + \\ & {\color{blue} \frac{(-\Delta y)^2}{2} \frac{\partial^2 P}{\partial y^2}} + (\Delta y \Delta t) \frac{\partial^2 P}{\partial y \partial t} + {\color{blue}(-\Delta t)^2 \frac{\partial^2 P}{\partial t^2}} \end{aligned}\]

When summing the four terms above, only the blue terms remain and the rest cancel out, resulting in

\[\begin{aligned} P(x,y,t) &\approx \frac{1}{4} \left( 4P(x,y,t) - 4\Delta t \frac{\partial P}{\partial t} + 4\Delta t^2 \frac{\partial^2 P}{\partial t^2} + \Delta x^2 \frac{\partial^2 P}{\partial x^2} + \Delta y^2 \frac{\partial^2 P}{\partial y^2} \right) \\ &\approx P(x,y,t) - \Delta t \frac{\partial P}{\partial t} + \Delta t^2 \frac{\partial^2 P}{\partial t^2} + \frac{\Delta x^2}{4} \frac{\partial^2 P}{\partial x^2} + \frac{\Delta y^2}{4} \frac{\partial^2 P}{\partial y^2} \\ \Delta t \frac{\partial P}{\partial t} - \Delta t^2 \frac{\partial^2 P}{\partial t^2} &\approx \frac{\Delta x^2}{4} \frac{\partial^2 P}{\partial x^2} + \frac{\Delta y^2}{4} \frac{\partial^2 P}{\partial y^2} \\ \frac{\partial P}{\partial t} - \Delta t \frac{\partial^2 P}{\partial t^2} &\approx \frac{\Delta x^2}{4\Delta t} \frac{\partial^2 P}{\partial x^2} + \frac{\Delta y^2}{4\Delta t} \frac{\partial^2 P}{\partial y^2} \\ \end{aligned}\]

At this point, we take the limits of \(\Delta x, \Delta y, \Delta t \rightarrow 0\), which allows us to simplify the equation above and define a diffusion coefficient, \(D\). Concretely, \(\frac{\Delta x^2}{4\Delta t} = \frac{\Delta y^2}{4\Delta t} = D\) with the assumption that \(\Delta x = \Delta y\), results in

\[ \frac{\partial P}{\partial t} - \Delta t \frac{\partial^2 P}{\partial t^2} \approx D \left( \frac{\partial^2 P}{\partial x^2} + \frac{\partial^2 P}{\partial y^2} \right). \]

Additionally, because we have taken the limit \(\Delta t \rightarrow 0\), it causes the \(\Delta t \frac{\partial^2 P}{\partial t^2}\) term to vanish relative to the other terms, returning the canonical form of the diffusion equation

\[ \frac{\partial P}{\partial t} \approx D \left( \frac{\partial^2 P}{\partial x^2} + \frac{\partial^2 P}{\partial y^2} \right). \]

The solution to the 2D diffusion equation is given by the 2D Gaussian distribution

\[ P(x,y,t) = \frac{1}{4\pi Dt} \exp{\left(- \frac{x^2+y^2}{4Dt} \right)} \]

Ultimately, our approach models the probability that a particle is at a given location and time. If we have a solution of particles in a container, we can also use the model to estimate the density in any area of the container at a given time since the diffusion began.

Simulation

We proceed with simulating the diffusion of a substance (consisting of \(N\) blue particles) from the origin by executing \(N\) random walks. Along with the particle trajectories, we also visualize the solution of the diffusion equation with \(D = \frac{0.05^2}{4}\). Empirically, the solution is a good model for a random walk!

Code

from copy import deepcopy
from dataclasses import dataclass

import matplotlib.pyplot as plt
import numpy as np

@dataclass
class Particle:
    x_pos: int
    y_pos: int

class Substance:
    def __init__(
        self,
        num_particles: int = 10,
        x_delta: float = 1,
        y_delta: float = 1,
    ):
        self.num_particles: int = num_particles
        self.particles = [Particle(0, 0) for _ in range(num_particles)]
        self.x_delta, self.y_delta = x_delta, y_delta

# Make substance
num_particles = 5_00
substance = Substance(num_particles=num_particles, x_delta=0.05, y_delta=0.05)

# Define simulation parameters
num_steps = 250
x_step_choices = [substance.x_delta, -substance.x_delta]
y_step_choices = [substance.y_delta, -substance.y_delta]

# Storage dict
location_dict = {}
all_distances = []

# Iterate over time
for step in range(num_steps):

    # Get choices over uniform distribution
    x_choices = np.random.choice(x_step_choices, size=substance.num_particles)
    y_choices = np.random.choice(y_step_choices, size=substance.num_particles)

    location_dict[step] = {}

    for i, p in enumerate(substance.particles):

        location_dict[step][i] = [p.x_pos, p.y_pos]

        # Move points
        p.x_pos += x_choices[i]
        p.y_pos += y_choices[i]

ojs_define(location_dict = location_dict)
ojs_define(num_particles = num_particles)

Code

{
  const width = 749;
  const height = 749;
  const margin = 50;
  const D = 0.000625; // Diffusion coefficient
  const grid_limit = 3;

  // Create the SVG
  const svg = d3.create("svg")
    .attr("width", width)
    .attr("height", height);

  // Add a border box
  svg.append("rect")
    .attr("x", margin)
    .attr("y", margin)
    .attr("width", width - 2 * margin)
    .attr("height", height - 2 * margin)
    .attr("fill", "none")
    .attr("stroke", "grey");


    // Set up scales
   const xScale = d3.scaleLinear()
    .domain([-grid_limit, grid_limit])
    .range([margin, width - margin]);

   const yScale = d3.scaleLinear()
    .domain([-grid_limit, grid_limit])
    .range([height - margin, margin]);

    // Create contour group
   const contourGroup = svg.append("g");

    // Create scatter plot objects
    const scatters = {};
    for (let i = 0; i < num_particles; i++) {
    scatters[i] = svg.append("circle")
        .attr("r", 3)
        .style("fill", "#0075ff")
        .style("opacity", 0.3); 
    }

    // Animation variables
    const frames = Object.keys(location_dict);
    let currentFrame = 0;
    let animationId;
    let isPlaying = false;

    // Generate grid for contour
    const n = 100;
    const x = d3.range(n).map(i => -grid_limit + (2 * grid_limit) * i / (n - 1));
    const y = d3.range(n).map(j => -grid_limit + (2 * grid_limit) * j / (n - 1));
    const grid = new Array(n * n);


  // Gaussian function
  function gaussian(x, y, t) {
    return (1 / (4 * Math.PI * D * t)) * Math.exp(-(x * x + y * y) / (4 * D * t));
  }

  // Update contours
  function updateContours(t) {
    for (let j = 0, k = 0; j < n; ++j) {
      for (let i = 0; i < n; ++i, ++k) {
        grid[k] = gaussian(x[i], y[j], t);
      }
    }

    const maxValue = d3.max(grid);
    const contours = d3.contours()
      .size([n, n])
      .thresholds(d3.range(20).map(i => maxValue * Math.pow(2, -i)))
      (grid);

    const color = d3.scaleSequential(d3.interpolateYlGn)
      .domain([0, 1]);

    contourGroup.selectAll("path")
      .data(contours)
      .join("path")
        .attr("d", d3.geoPath(d3.geoIdentity().scale((width - 2 * margin) / n)))
        .attr("fill", d => color(d.value))
        .attr("opacity", 0.5)
        .attr("transform", `translate(${margin},${margin})`);
  }

  // Animation function
  function animate() {
    updateFrame(currentFrame);
    if (isPlaying) {
      currentFrame = (currentFrame + 1) % frames.length;
      animationId = setTimeout(() => requestAnimationFrame(animate), 50);
    }
  }

  function updateFrame(frame) {
    const frameData = location_dict[frames[frame]];
    for (let i = 0; i < num_particles; i++) {
      if (i in frameData) {
        const [x, y] = frameData[i];
        scatters[i]
          .attr("cx", xScale(x))
          .attr("cy", yScale(y))
          .style("visibility", "visible");
      } else {
        scatters[i].style("visibility", "hidden");
      }
    }
    updateContours(frame + 1); // Add 1 to avoid t=0
    frameSlider.property("value", frame);
    frameDisplay.text(`Time: ${frame}`);
  }

  // Create controls
  const controls = d3.create("div")
    .style("display", "flex")
    .style("justify-content", "center")
    .style("align-items", "center")
    .style("gap", "10px")

  // Play/Pause button
  const playPauseButton = controls.append("button")
    .text("Play")
    .on("click", function() {
      isPlaying = !isPlaying;
      this.textContent = isPlaying ? "Pause" : "Play";
      if (isPlaying) animate();
      else clearTimeout(animationId);
    });

  // Frame slider
  const frameSlider = controls.append("input")
    .attr("type", "range")
    .attr("min", 0)
    .attr("max", frames.length - 1)
    .attr("value", 0)
    .style("width", "400px")
    .on("input", function() {
      currentFrame = +this.value;
      updateFrame(currentFrame);
    });

  // Frame display
  const frameDisplay = controls.append("span")
    .text("Time: 0");

  // Initial render
  updateFrame(0);

  // Combine SVG and controls in a container
  const container = d3.create("div");
  container.append(() => svg.node());
  container.append(() => controls.node());

  return container.node();
}

Citation

BibTeX citation:

@online{aswani2024,
  author = {Aswani, Nishant},
  title = {Deriving the {Diffusion} {Equation} from a {Random} {Walk}},
  date = {2024-09-15},
  url = {https://nishantaswani.com/articles/randomwalk/randomwalk.html},
  langid = {en}
}

For attribution, please cite this work as:

Aswani, Nishant. 2024. “Deriving the Diffusion Equation from a Random Walk.” September 15, 2024. https://nishantaswani.com/articles/randomwalk/randomwalk.html.