and A = (aij )n x n . Define
If A is diagonalizable, A = TDT - 1 where D = diag (
1 , ...,
n ), then
| (1) | f(A) | = bk TDkT - 1 + bk - 1 TDk - 1T - 1 + ··· + b1 TDT - 1 + b0 TT - 1| = T [ bk Dk + bk - 1 Dk - 1 + ··· + b1 D + b0 I ] T - 1 | = Tf(D)T - 1. | |
If p is the characteristic polynomial of A, then p(D) = 0, because
1 k, ...,
n k) , k = 0, 1, ..., n
| => p(D) = |
| p( 1 ) | 0 |
| = 0n x n |
| ·. | ||
| 0 | p( n ) |
It now follows from (1) that p(A) = 0 (when A is diagonalizable).
More generally,
Example 1: Cayley-Hamilton theorem
Let p be the characteristic polynomial of A.
(2)
) = ( - 1)n
n + bn - 1
n - 1 + ··· + b1
+ b0 = 0
(3)
where we can substitute An
from the second last equation. It follows that all the powers
Ak, k
n,
can be written in terms of
A0, A1, ..., An-1.
If A - 1 exists, that is, if
= 0 is not an eigenvalue of A
=> det A = p(0) = b0
0, it follows from (3) that
and thus
(4)
Next consider functions f (A) of matrices, where f : C -> C can be expressed as a power series
(5)
ck zk.
We have already mentioned the functions eA, sin A and cos A before. (Section 5.1 and exercises) Define
(6)
ck Ak (A0 = I)
if the series converges. The following result can be shown:
Proposition 1. If the radius of convergence of the power series (5) is r ( f can be expressed as
(5) when z, | z | < r ),
then the matrix series (6) converges if the
spectral radius r (A) of A satisfies r (A) < r. (r(A) = max{|
1 | , ..., |
n |}).
Since all the powers Ak, k
n, can be written in terms of A0, A1, ...,
An-1, it follows from (6) that
(7)
The coefficients d0 , ..., dn - 1 depend from A. They satisfy the equation
i ) = d0 + d1
i + d2
2i + ··· + dn - 1
i n - 1, i = 1, ..., n
where
i 's are the eigenvalues of A.
i ) =
ck
i k , p (
i ) = 0 =>
f(
i ) = d0 + d1
i + ··· + dn - 1
i n - 1
Note. If q(
) = 0, where q is a polynomial,
it is not necessarily true that q(A) = 0.
There exists a polynomial of minimal degree, called the minimal polynomial of A, such that m(A) = 0.
If the multiplicity of an eigenvalue
i
is bigger than one, we may also use the equations
i ) = d1 + 2d2
i + ··· + (n - 1)dn - 1
i n - 2
If A is diagonalizable, then
| f(A) | = ck Ak = ck TDkT - 1 = T [ ck diag ( i k, ..., n k) ] T - 1 = T [ | diag (ck i k, ..., ck n k) ] T -1 = T diag ( | ck 1 k, ..., ck n k)T - 1 = T diag (f ( | 1 ), ..., f ( n ))T - 1 |
and thus
(8)
1 ), ..., f (
n ))T - 1.