Previously, we wanted to find a best approximation to the usually unsolvable equation . Using the idea of projections, we know that
will give us the best approximation if and only if
With a bit more work, we show that this equation holds if and only if
But where do we go from here?
Now, let’s recall the usual inner product on . Given
, the usual inner product is defined by
Thus, we have the definition if and only if
In this case, . To adapt to more general settings, we adopt the left-hand definition, called an adjoint.
Henceforth, let be an inner product space over a field
.
Definition 1. For any , define its adjoint by
.
Given , a reasonable definition for
would be
In particular, by writing in terms of the usual standard basis,
where we adopted the inner product notation for aesthetic purposes. When , we denote
.
Theorem 1. We have . If
, then
. Thus, we call
the conjugate transpose of
, and
the transpose of
.
Proof. Let . By definition,
To generalise this idea to the linear transformation between two inner product spaces would take considerably much more effort, but there are instances where we might be able to make a sufficiently reasonable definition.
In particular, if and
form bases for
and
respectively (this is certainly true in the finite-dimensional setting), then assuming that
is finite, we can make the definition
where the sum of the right-hand side is finite and thus well-defined. In particular, for each ,
If furthermore that is orthonormal, then applying
on both sides,
Extending by linearity, we obtain the crucial identity
that serves as a working definition of , which we have properly constructed at least in the case
is finite-dimensional.
Let be an inner product space over
.
Lemma 1. Let be a linear transformation such that there exists a linear transformation
that satisfies the crucial property
Then is unique. We call
the adjoint of
.
Proof. We first remark that for any ,
implies
Now suppose there exist two such transformations that satisfy the crucial property. Fix
. Then linearity yields
Since is arbitrary, we have
, as required.
Corollary 1. For any ,
, and
,
Furthermore, the map is an isomorphism.
Corollary 2. Let be inner product spaces over
and
,
where their adjoints exist. Then
.
Proof. Fix . Then
By uniqueness, .
Corollary 3. For and
. Then
. If
, then
.
Corollary 4. Define for appropriately sized matrices. For
,
, even when
. Hence, for
,
.
Finally, we can return to our original problem in . We have derived the following identity using best approximations:
In the context ,
, so that taking adjoints,
Since is arbitrary, by the injectivity of
,
. Therefore, we can solve our best approximation problem. In fact, we can solve this problem even if
.
Theorem 2. Let be inner product spaces over
,
a linear transformation.
- If
is finite-dimensional, then for any
, the equation
has a unique best approximation.
- If the equation
has a unique best approximation then
is the best approximation if and only if
.
- Furthermore, if
exists, then
is the best approximation if and only if
.
Corollary 5. Fix and
. The equation
has a unique best approximation. Furthermore,
is the best approximation if and only if
. In particular, if
,
must satisfy the equation
.
And so I used my graphing calculator to solve the equation , and with 5 minutes of somewhat tedious typing, computed the theoretically perfect best-fit line (up to a precision of 3 significant figures). Then 3 months later, I learned that the calculator had an in-built function to compute this line in 3 seconds. Bummer.
In our next post, we will examine the adjoint transformation a bit more, and discover something peculiar about matrices that satisfy the innocent-looking equality
. These tools will help us explore the spectral theorems, which will empower us with one very useful concrete application—singular value decomposition.
—Joel Kindiak, 13 Mar 25, 1716H
Leave a comment