Let denote the vector space of infinitely differentiable functions. Given a real constant
, what nonzero function
satisfies the property that
If we allowed , then clearly
works. That’s very uninteresting. For the
case, with some prodding, we will notice that
fits the bill (and doing some calculus, show that only functions of this type work).
In fact, we have a whole family of functions that work. Since is linear, for any
,
There are many directions we can take from this observation, but since we are writing in the context of linear algebra, we will take the eigenvector approach. To that end, let’s use the differentiation operator to illustrate: for any ,
Since kernels are vector spaces, the set of solutions
forms a vector space.
Let be a field,
be a vector space over
, and
be a linear transformation.
Definition 1. We call a nonzero vector an eigenvector of
if there exists a scalar
, called an eigenvalue, such that
For instance, the function defined by
is an eigenvector of the transformation
. Note that we do not assume that
is finite-dimensional, at least not yet in our discussion.
Right away, we can make several useful deductions from Definition 1. Let denote
-fold composition of
.
Lemma 1. The scalar is an eigenvalue of
if and only if the linear transformation
is not injective. Equivalently, the eigenspace
contains at least one nonzero element. In particular,
is injective if and only if
is not an eigenvalue of
.
Lemma 2. Let be an eigenvector of
with eigenvalue
.
- For any
,
.
- If
is bijective, then
.
- For any linear transformation
, if
is also an eigenvector of
with eigenvalue
, then
is an eigenvector of
and
with eigenvalue
.
- If
is an eigenvector of
with a different eigenvalue
, then
is linearly independent.
All of the results in Lemma 1 and Lemma 2 can be proven by applying Definition 1.
Perhaps before yapping more about eigenstuff, why should we care, really? We could discuss real-world applications in operations research and even some applied-mathematical uses like Markov chains, but since we are discussing mathematics in general, let’s use a pure mathematical example.
Consider the following sequence of Fibonacci numbers:
The sequence of terms in this case are . The question is a simple one: can you derive a formula
that yields the
-th Fibonacci number? For instance,
and
. In fact,
is the only number
such that
.
Since this discussion is set in the context of linear algebra, let’s use linear algebra to tackle this problem. Let’s first convert our problem into vector notation:
Something interesting that happens is that by letting , we have the equation
If we applied the formula on the right-hand side more times, we get
Hence, if we can find a simple expression for , we can use our recipe-ingredient analogy to obtain
. Then, the
-th Fibonacci number would correspond to the first entry in
. Re-index to
and we are done.
Now, this is where we run into problems. How do we evaluate ? For instance, do we compute
matrix multiplications to compute
? Clearly, we will need a better strategy.
Well, treating as a linear transformation, Lemma 2 hints at a plausible strategy. If we can find an eigenvector
with eigenvalue
, then
Ideally, if we can convert the vector into some variant of
, we can take advantage of the special property
then appropriately “re-convert” back into .
So herein still lies the question: How do we find the eigenvector(s) of ? Well, Lemma 1 suggests that we obtain the eigenvalues first. We require
to not be injective (i.e. be singular), and hence, be singular. This is where our detour into determinants will serve our purposes well. A matrix is singular if and only if its determinant is
, and thus we need to solve the equation
We can compute the determinant of a matrix, so that we need to solve the polynomial:
Now, we need to solve this equation, and we should take a pause here: not all polynomials in can be solved by elements in
. For instance, the equation
does not have roots in
, even though its coefficients
are all rational. Sometimes, we get solutions. Sometimes, we don’t. In this equation, if we use the field
, the quadratic equation yields the roots
If instead we had the equation , then we would need to seek our solutions in the complex numbers
. It turns out that the complex numbers are all that we need, known as the fundamental theorem of algebra. However, we would need to develop complex analysis to actually prove that result properly, and so we will omit its proof. We will accept this result whenever we discuss
.
Anyway, we don’t get one eigenvalue, but two: . The positive variant
has a name called the golden ratio, and its negative variant can be shown to equal
. Why did we care about our eigenvalues in the first place? To find eigenvectors
, then take advantage of the property
or
. To do that, we need to find an eigenvector
such that
In other words, we want to compute . We had a whole other detour into Gaussian elimination just so that we can compute such kernels! Perform Gaussian elimination (with relevant row operations that I’m too lazy to list out in detail) to reduce
into
For both of the roots that satisfy the equation
, we have
We call the eigenspace associated with the eigenvalue
. In this example, we have the eigenspaces
This is the key step: using the eigenvector property, we have
Can we find constants such that
If so, then using the linearity of ,
Therefore, the choice of will directly let us obtain the solution
Re-indexing then yields the desired answer
So it behooves us to compute . Interestingly though, what we need to do, effectively, is solve the matrix equation
This means we need to do another round of equation-solving, either using Gaussian elimination or computing the inverse matrix. Since the matrix is and thus sufficiently small, let’s just compute the inverse matrix using the standard formula:
Then performing usual matrix multiplication,
Therefore, though complicated, the constants are given by
Therefore, the -th Fibonacci number is given by
Oh, just for completeness, let’s actually simplify these constants. By expressing in terms of
, we observe that
Similarly,
Therefore, we have the following result.
Theorem 1. The -th Fibonacci number is given by Binet’s formula:
Perhaps the rather surprising fact is that in order to describe the integers , we had to use an irrational number
. Before this discussion takes yet another tangent, let’s raise more more theoretical linear-algebraic questions: under what circumstances can we actually perform such calculations? This is commonly known as diagonalisation, and it is worth wondering where this name arises from. We will explore these ideas in the next post.
—Joel Kindiak, 6 Mar 25, 1953H
Leave a comment