Formalising the Dirac Delta

The Dirac delta “function” \delta has been a bane in my teaching since I needed to start teaching it in engineering mathematics, since it is, really, not actually a function. Many of its results seem miraculously correct, and yet we don’t have any reasonably accessible introduction to its formal definition.

We will adopt a measure-theoretic interpretation of \delta and regard it as a measure. We will then define the relevant Laplace transform notions on measures that extend the classical setting. Finally, we assert that

\displaystyle \int_{(-\infty, t]} \mathrm d\delta = U(t) \equiv \mathbb I_{[0,\infty)}(t),

so that, in some limited sense, U' = \delta.

For any measure \mu on (\mathbb R, \frak{B}(\mathbb R)), if the measurable function f is \mu-integrable, make the definition

\displaystyle \langle  \mu, f \rangle := \int_{\mathbb R} f\, \mathrm d\mu.

whenever the right-hand side is well-defined. Using the Radon-Nikodým theorem, for suitably-defined functions f, g,

\langle \lambda_g, f\rangle = \langle f \cdot g, \lambda \rangle = \langle \lambda_f, g \rangle,

where the measure \lambda_f is defined by \lambda_f(S) := \int_S f\, \mathrm d\lambda. We remark that by the same theorem, the map f \to \lambda_f is injective \lambda-a.e. in the following sense: if \lambda_f = 0, then f = 0 \lambda-a.e.. Furthermore, it is bona-fide injective when restricted to the space of continuous functions (i.e. f is continuous).

Define the map K : \mathbb R^2 \to \mathbb R by K(s,t) := e^{-st} \cdot U(t).

Definition 1. For any measure \mu on (\mathbb R, \frak{B}(\mathbb R)), if f is \mu-integrable and K(s, \cdot) \cdot f is \mu-integrable, we define the \muLaplace transform by

\displaystyle \mathcal L_{\mu}\{f \} (s) := \langle \mu ,f \cdot K(s,\cdot)\rangle \equiv \int_{\mathbb R} K(s, \cdot) \cdot f\, \mathrm d\mu,

whenever the right-hand side is well-defined. In particular, define \mathcal L\{\mu\} := \mathcal L_{\mu}\{1\}.

Problem 1. If K(s,\cdot)\cdot f is \lambda-integrable, prove that \mathcal L_{\lambda}\{f \} = \mathcal L\{f\}, where the right-hand side refers to the classical Laplace transform on sub-exponential functions.

(Click for Solution)

Solution. By definition, for any s,

\begin{aligned} \mathcal L_{\lambda}\{f \}(s) &= \langle \lambda, f \cdot K(s,\cdot),  \rangle \\ &= \int_{\mathbb R} K(s, \cdot) \cdot f\, \mathrm d\lambda \\ &= \int_{-\infty}^{\infty} K(s, t) \cdot f(t)\, \mathrm d \lambda( t ) \\ &= \int_{-\infty}^{\infty} e^{-st} \cdot f(t)\, \mathrm dt = \mathcal L\{f\}(s). \end{aligned}

Hence, \mathcal L\{\lambda_f\} \equiv \mathcal L_{\lambda_f}\{1\} \equiv \mathcal L_{\lambda}\{f\} = \mathcal L\{f\}.

Problem 2. Prove that the Laplace transform on measures is linear in the measures: for suitably defined measures \mu,\nu, a suitably-defined function f, and \alpha \in \mathbb R,

\mathcal L_{\mu + \nu} \{f\} = \mathcal L_{\mu}\{f\} + \mathcal L_{\nu}\{f\},\quad \mathcal L_{\mu}\{\alpha f \} = \alpha \cdot \mathcal L_{\mu}\{f\}.

Furthermore, if \mu is signed, then \mathcal L_{\alpha \mu}\{f \} = \alpha \cdot \mathcal L_{\mu}\{f\}.

(Click for Solution)

Solution. For additivity, we first prove the additivity of measures: for S \in \frak B(\mathbb R) and suitably defined functions f, we first claim that

\begin{aligned} \langle \mu + \nu, f\rangle &= \int_{\mathbb R} f\, \mathrm d(\mu + \nu) \\ &= \int_{\mathbb R} f\, \mathrm d\mu + \int_{\mathbb R} f\, \mathrm d\nu \\ &= \langle  \mu,f \rangle + \langle  \nu, f\rangle, \end{aligned}

so that for suitably chosen s,

\begin{aligned}\mathcal L_{\mu + \nu}\{f\}(s) &= \langle \mu + \nu, K(s, \cdot) \cdot f\rangle \\ &= \langle \mu, K(s, \cdot) \cdot f \rangle + \langle \nu, K(s, \cdot) \cdot f\rangle \\ &= \mathcal L_{\mu }\{f\}(s) + \mathcal L_{ \nu}\{f\}(s) \\ &= (\mathcal L_{\mu }\{f\} + \mathcal L_{\nu }\{f\})(s).\end{aligned}

To that end, we prove the result for the non-negative simple function f = \sum_{i=1}^n a_i \cdot \mathbb I_{S_i} and leave the extensions via the monotone convergence theorem and decomposition f = f^+ - f^- as an exercise:

\begin{aligned} \int_{\mathbb R} f\, \mathrm d(\mu + \nu) &= \sum_{i=1}^n a_i (\mu + \nu)(S_i) \\ &= \sum_{i=1}^n a_i \cdot (\mu(S_i) + \nu(S_i)) \\ &= \sum_{i=1}^n a_i \cdot \mu(S_i) + \sum_{i=1}^n a_i \cdot \nu(S_i) \\ &= \int_{\mathbb R} f\, \mathrm d\mu + \int_{\mathbb R} f\, \mathrm d\nu, \end{aligned}

as required. Scalar multiplication follows similarly.

Problem 3. Prove that for any a \in \mathbb R, the map \delta_a defined by \delta_a(S) := \mathbb I_{S}(a) is a probability measure. We call \delta := \delta_0 the Dirac measure and observe that that \delta_a = \delta ( \cdot - a).

(Click for Solution)

Solution. We verify the three properties of a probability measure. Firstly,

\delta(\emptyset) = \mathbb I_{\emptyset}(0) = 0.

Secondly,

\begin{aligned} \delta \left( \bigsqcup_{i=1}^\infty S_i \right) &= \mathbb I_{\bigsqcup_{i=1}^\infty S_i }(0) = \sum_{i=1}^\infty \mathbb I_{ S_i }(0) = \sum_{i=1}^\infty \delta(S_i).\end{aligned}

Finally, \delta(\mathbb R) = \mathbb I_{\mathbb R}(0) = 1.

Problem 4. Prove the sifting property:

\displaystyle \langle \delta_a, f \rangle := \int_{\mathbb R} f\, \mathrm d\delta_a = f(a).

Deduce that \mathcal L_{\delta_a}\{f\}= K(\cdot, a) \cdot f. In particular, \mathcal L_{\delta_a}\{1\}(s) = e^{-as}.

(Click for Solution)

Solution. We can use the same non-negative simple \Rightarrow non-negative \Rightarrow integrable technique for the sifting property, which we will carry out for completeness.

Let f = \sum_{i=1}^n a_i \cdot \mathbb I_{S_i} be a non-negative simple function. We observe that

\delta_a(S_i) = 1 \quad \iff \quad a \in S_i \quad \iff \quad f(a) = a_i.

Then

\begin{aligned} \int_{\mathbb R} f\, \mathrm d\delta_a &= \sum_{i=1}^n a_i \cdot \delta_a(S_i) = a_i \cdot 1 = f(a).\end{aligned}

Now suppose f is non-negative. Find a sequence \varphi_n of simple functions such that \varphi_n \to f monotonically. By the monotone convergence theorem,

\begin{aligned} \int_{\mathbb R} f\, \mathrm d\delta_a &=\lim_{n \to \infty} \int_{\mathbb R} \varphi_n\, \mathrm d\delta_a = \lim_{n \to \infty} \varphi_n(a) = f(a).\end{aligned}

Finally, suppose f is integrable. Writing f = f^+ - f^-,

\begin{aligned}\int_{\mathbb R} f\, \mathrm d\delta_a &= \int_{\mathbb R} f^+\, \mathrm d\delta_a - \int_{\mathbb R} f^-\, \mathrm d\delta_a \\ &= f^+(a) - f^-(a) \\ &= (f^+ - f^-)(a) = f(a).\end{aligned}

In particular, for any fixed s,

\displaystyle \mathcal L_{\delta_a}\{f\}(s) = \langle \delta_a, K(s,\cdot) \cdot f\rangle = \int_{\mathbb R} K(s, \cdot) \cdot f\ \mathrm d\delta_a = K(s, a) \cdot f(a) .

In particular, \mathcal L_{\delta_a}\{1\} = K(\cdot, a).

Remark 1. We have shown that \mathcal L_{\mu}\{f\} is a meaningful generalisation of both the usual Laplace transform \mathcal L\{f\} and the Laplace transform of the measure \mathcal L\{ {\delta_a} \}:=\mathcal L_{\delta_a}\{1\}:

\begin{aligned} \mathcal L \{ \alpha \cdot \lambda_f + \beta \cdot \delta_a \} &= \mathcal L_{ \alpha \cdot \lambda_f + \beta \cdot \delta_a}\{1\} \\ &= \mathcal L_{ \alpha \cdot \lambda_f }\{1\} + \mathcal L_{ \beta \cdot \delta_a}\{1\}\\ &= \alpha \cdot \mathcal L_{\lambda_f}\{1\} + \beta \cdot \mathcal L_{\delta_a}\{1\} \\ &= \alpha \cdot \mathcal L \{\lambda_f\} + \beta \cdot \mathcal L \{\delta_a \} \\ &= \alpha \cdot \mathcal L \{f\} + \beta \cdot \mathcal L \{\delta_a \}. \end{aligned}

In this manner and usage, we embed f \to \lambda_f as a (signed-)measure, and define f + \beta \cdot \delta_a := \lambda_f + \beta \cdot \delta_a via said embedding, finally permitting engineers to write the following guiltlessly:

\begin{aligned} \mathcal L\{\alpha \cdot f + \beta \cdot \delta_a\} &\equiv \mathcal L \{ \alpha \cdot \lambda_f + \beta \cdot \delta_a \} \\ &= \alpha \cdot \mathcal L \{f\} + \beta \cdot \mathcal L \{\delta_a \}. \end{aligned}

Problem 5. Prove that \displaystyle \int_{(-\infty, t]} \mathrm d\delta_a = U_a(t). Deduce that for a > 0 and valid s,

\displaystyle \int_{\mathbb R} \mathrm d\delta_a = 1 \quad \text{and} \quad \mathcal L\{\delta_a\}(s) = s \cdot \mathcal L\{U_a\}(s),

where U_a = U( \cdot - a).

(Click for Solution)

Solution. For the first claim,

\begin{aligned}  \int_{(-\infty, t]} \mathrm d\delta_a &= \int_{\mathbb R} \mathbb I_{(-\infty, t]}\, \mathrm d\delta_a = \delta_a((-\infty, t]) = \begin{cases} 1, & t \geq a, \\ 0, & t < a. \end{cases}\end{aligned}

Taking T \to \infty and T > a in particular, by the monotonicity of measures,

\begin{aligned}  \int_{\mathbb R} \mathrm d\delta_a &= \lim_{T \to \infty} \int_{(-\infty, T]} \mathrm d\delta_a = \lim_{T \to \infty} U_a(T) = \lim_{T \to \infty} 1 = 1.\end{aligned}

For the final result,

\displaystyle \mathcal L\{U_a\}(s) = \frac{e^{-as}}{s} = \frac{\mathcal L\{\delta_a\}(s)}{s}.

Remark 2. In classical formulas, if f is differentiable such that f' \cdot \mathbb I_{(-\infty, t]} is \lambda-integrable for any t \in \mathbb R, \displaystyle \lim_{N \to \infty}f(-N) = 0, and has a Laplace transform, we have

\displaystyle \int_{(-\infty, t]} f'\, \mathrm d\lambda = f(t),\quad \mathcal L\{f'\}(s) = s \cdot \mathcal L\{f\}(s) - f(0).

Since Problems 3 and 4 agree with classical setups, engineers sloppily write U' = \delta and use these results on their happy merry way, leaving mathematicians infuriated in having to clean up after their…stuff.

This measure-theoretic formalisation only helps us with the integration aspect of the Dirac delta, since measure theory is, in some sense a “completion” of integral calculus. Yet, a full understanding requires us to develop a “completion” of differential calculus—this generalisation builds on measure theory and discusses distribution theory.

We hinted at these objects via the notation \langle \mu, f \rangle. The map

\langle \mu, \cdot \rangle : \{\text{good functions}\} \to \mathbb R

is called a distribution. In a sense, then we could define U := \langle U, \cdot \rangle as a distribution (though this requires much more effort), and by defining differentiation of distributions, we get the true equality U' = \delta. But all that work requires a lot more pain and suffering.

As completions, they become indispensable tools in solving many more differential equations than a usual post-secondary or even undergraduate course would present.

—Joel Kindiak, 16 Jul 25, 1150H

,

Published by


Leave a comment