Section7.2Properties of Eigenvalues and Eigenvectors
In this section weβll explore how the eigenvalues and eigenvectors of a matrix relate to other properties of that matrix. This section is essentially a hodgepodge of interesting facts about eigenvalues; the goal here is not to memorize various facts about matrix algebra, but to again be amazed at the many connections between mathematical concepts.
Since our matrix is triangular, the determinant is easy to compute; it is just the product of the diagonal elements. Therefore, we found (and factored) our characteristic polynomial very easily, and we see that we have eigenvalues of \(\lambda = 1, 4\text{,}\) and 6.
This examples demonstrates a wonderful fact for us: the eigenvalues of a triangular matrix are simply the entries on the diagonal. Finding the corresponding eigenvectors still takes some work, but finding the eigenvalues is easy.
With that fact in the backs of our minds, let us proceed to the next example where we will come across some more interesting facts about eigenvalues and eigenvectors.
Finding the eigenvalues and eigenvectors of these matrices is not terribly hard, but it is not βeasy,β either. Therefore, we omit showing the intermediate steps and go right to the conclusions.
Of course, computing the transpose of \(A\) and \(B\) is easy; computing their eigenvalues and eigenvectors takes more work. Again, we omit the intermediate steps.
Now that we have completed the βgrunt work,β letβs analyze the results of the previous example. We are looking for any patterns or relationships that we can find.
The eigenvalues and eigenvectors of \(A\) and \(A^{-1}\).
In our example, we found that the eigenvalues of \(A\) are \(-6\) and 12; the eigenvalues of \(A^{-1}\) are \(-1/6\) and \(1/12\text{.}\) Also, the eigenvalues of \(B\) are \(-1\text{,}\) 2 and 3, whereas the eigenvalues of \(B^{-1}\) are \(-1\text{,}\)\(1/2\) and \(1/3\text{.}\) There is an obvious relationship here; it seems that if \(\lambda \) is an eigenvalue of \(A\text{,}\) then \(1/\lambda \) will be an eigenvalue of \(A^{-1}\text{.}\) We can also note that the corresponding eigenvectors matched, too.
Why is this the case? Consider an invertible matrix \(A\) with eigenvalue \(\lambda \neq 0\) and eigenvector\(\vx\text{.}\) Then, by definition, we know that \(A\vx = \lambda \vx\text{.}\) Now multiply both sides by \(A^{-1}\text{:}\)
We have just shown that \(A^{-1}\vx = 1/\lambda \vx\text{;}\) this, by definition, shows that \(\vx\) is an eigenvector of \(A^{-1}\) with eigenvalue \(1/\lambda \text{.}\) This explains the result we saw above. Of course, this all falls apart if \(\lambda = 0\text{,}\) but this is impossible for an invertible matrix: see TheoremΒ 7.2.5 below.
The eigenvalues and eigenvectors of \(A\) and \(A^T\).
Our example showed that \(A\) and \(A^T\) had the same eigenvalues but different (but somehow similar) eigenvectors; it also showed that \(B\) and \(B^T\) had the same eigenvalues but unrelated eigenvectors. Why is this?
We can answer the eigenvalue question relatively easily; it follows from the properties of the determinant and the transpose. Recall the following two facts:
We find the eigenvalues of a matrix by computing the characteristic polynomial; that is, we find \(\det{A - \lambda I}\text{.}\) What is the characteristic polynomial of \(A^T\text{?}\) Consider:
The eigenvalues and eigenvectors of \(A\) and the trace.
Note that the eigenvalues of \(A\) are \(-6\) and 12, and the trace is 6; the eigenvalues of \(B\) are \(-1\text{,}\) 2 and 3, and the trace of \(B\) is 4. Do we notice any relationship?
The answer to this is a bit out of the scope of this text; we can justify part of this fact, and another part weβll just state as being true without justification.
Suppose \(A\) is an \(n\times n\) matrix with no complex eigenvalues (that is, the characteristic polynomial can be completely factored over the real numbers). When this is the case, it turns out that we can find a square matrix \(P\) such that \(P^{-1}AP\) is an upper triangular matrix with the eigenvalues of \(A\) on the diagonal. (We unfortunately do not have the time to prove this, or to explain how the matrix \(P\) is determined.)
Now, recall from TheoremΒ 6.2.3 that \(\tr(AB) = \tr(BA)\text{.}\) Since \(P^{-1}AP\) is upper-triangular, we know that \(\tr(P^{-1}AP)\) is the sum of the eigenvalues; also, using our TheoremΒ 6.2.3, we know that \(\tr(P^{-1}AP) = \tr(P^{-1}PA) = \tr(A)\text{.}\) Thus the trace of \(A\) is the sum of the eigenvalues. It turns out that this result remains true when \(A\) has complex eigenvalues, and the proof is similar, except that the entries of the matrix \(P\) will be complex numbers.
The eigenvalues and eigenvectors of \(A\) and the determinant.
Again, the eigenvalues of \(A\) are \(-6\) and 12, and the determinant of \(A\) is \(-72\text{.}\) The eigenvalues of \(B\) are \(-1\text{,}\) 2 and 3; the determinant of \(B\) is \(-6\text{.}\) It seems as though the product of the eigenvalues is the determinant.
This is indeed true; we defend this with our argument from above. We know that the determinant of a triangular matrix is the product of the diagonal elements. Therefore, given a matrix \(A\text{,}\) we can find \(P\) such that \(P^{-1}AP\) is upper triangular with the eigenvalues of \(A\) on the diagonal. Thus \(\det{P^{-1}AP}\) is the product of the eigenvalues. Using TheoremΒ 6.4.12, we know that \(\det{P^{-1}AP} = \det{P^{-1}PA} = \det{A}\text{.}\) Thus the determinant of \(A\) is the product of the eigenvalues.
If is an eigenvalue of \(A\) with eigenvector\(\vx\text{,}\) then \(\frac{1}{\lambda }\) is an eigenvalue of \(A^{-1}\) with eigenvector \(\vx\text{.}\)
One interesting thing about the above example is that we see that 0 is an eigenvalue of \(A\text{;}\) we have not officially encountered this before. Does this mean anything significant?
Think about what an eigenvalue of 0 means: there exists an nonzero vector \(\vx\) where \(A\vx = 0\vx = \zero\text{.}\) That is, we have a nontrivial solution to \(A\vx = \zero\text{.}\) We know this only happens when \(A\) is not invertible.
So if \(A\) is invertible, there is no nontrivial solution to \(A\vx=\zero\text{,}\) and hence 0 is not an eigenvalue of \(A\text{.}\) If \(A\) is not invertible, then there is a nontrivial solution to \(A\vx=\zero\text{,}\) and hence 0 is an eigenvalue of \(A\text{.}\) This leads us to our final addition to the Invertible Matrix Theorem.
This section is about the properties of eigenvalues and eigenvectors. Of course, we have not investigated all of the numerous properties of eigenvalues and eigenvectors; we have just surveyed some of the most common (and most important) concepts. One of the more important topics β which we will describe briefly in our final section β is the question of diagonalization. For some \(n\times n\) matrices \(A\text{,}\) it is possible to find an invertible matrix \(P\) such that \(P^{-1}AP\) is a diagonal matrix, and the diagonal entries of this matrix are precisely the eigenvalues of \(A\text{.}\) When is this possible? When is it not? To a large extent, this comes down to the question of multiplicity: when we have a repeated eigenvalue, how many independent eigenvectors are associated with it?
Finally, we have found the eigenvalues of matrices by finding the roots of the characteristic polynomial. We have limited our examples to quadratic and cubic polynomials; one would expect for larger sized matrices that a computer would be used to factor the characteristic polynomials. However, in general, this is not how the eigenvalues are found. Factoring high order polynomials is too unreliable, even with a computer β round off errors can cause unpredictable results. Also, to even compute the characteristic polynomial, one needs to compute the determinant, which is also expensive (as discussed in the previous chapter).
So how are eigenvalues found? There are iterative processes that can progressively transform a matrix \(A\) into another matrix that is almost an upper triangular matrix (the entries below the diagonal are almost zero) where the entries on the diagonal are the eigenvalues. The more iterations one performs, the better the approximation is.
Most textbooks on Linear Algebra will provide direction on exploring the above topics and give further insight to what is going on. We have mentioned all the eigenvalue and eigenvector properties in this section for the same reasons we gave in the previous section. First, knowing these properties helps us solve numerous real world problems, and second, it is fascinating to see how rich and deep the theory of matrices is.