In the previous sections we have learned about the determinant, but we haven’t given a really good reason why we would want to compute it. This section shows two applications of the determinant: solving systems of linear equations and computing the inverse of a matrix.
Subsection6.5.1Cramer’s Rule
Theorem6.5.1.Cramer’s Rule.
Let \(\tta\) be an \(n\times n\) matrix with \(\det(A)\neq 0\) and let be an \(n\times 1\) column vector. Then the linear system
Since \(\det(A)\neq 0\text{,}\) we can apply Cramer’s Rule. Following Theorem 6.5.1, we compute \(\det(\tta_1(\vb))\text{,}\)\(\det(\tta_2(\vb))\) and \(\det(\tta_3(\vb))\text{.}\)
We learned in Section 6.4 that when considering a linear system \(\ttaxb\) where \(\tta\) is square, if \(\det(A)\neq 0\) then \(\tta\) is invertible and \(\ttaxb\) has exactly one solution. We also stated in Key Idea 4.5.2 that if \(\det(A) = 0\text{,}\) then \(\tta\) is not invertible and so therefore either \(\ttaxb\) has no solution or infinitely many solutions. Our method of figuring out which of these cases applied was to form the augmented matrix \(\bbm \tta \amp \vb \ebm\text{,}\) put it into reduced row echelon form, and then interpret the results.
Cramer’s Rule specifies that \(\det(A)\neq 0\) (so we are guaranteed a solution). When \(\det(A)=0\) we are not able to discern whether infinitely many solutions or no solution exists for a given vector\(\vb\text{.}\) Cramer’s Rule is only applicable to the case when exactly one solution exists.
We end this section with a practical consideration. We have mentioned before that finding determinants is a computationally intensive operation. To solve a linear system with 3 equations and 3 unknowns, we need to compute 4 determinants. Just think: with 10 equations and 10 unknowns, we’d need to compute 11 really hard determinants of \(10\times 10\) matrices! That is a lot of work!
The upshot of this is that Cramer’s Rule makes for a poor choice in solving numerical linear systems. It simply is not done in practice; it is hard to beat Gaussian elimination.
So why include it? Because its truth is amazing. The determinant is a very strange operation; it produces a number in a very odd way. It should seem incredible to the reader that by manipulating determinants in a particular way, we can solve linear systems.
Subsection6.5.2The Adjugate Formula
Recall that Theorem 4.4.10 in Section 4.4 gave us a “shortcut” for computing the inverse of a \(2\times 2\) matrix \(A=\bbm a\amp b\\c\amp d\ebm\text{:}\) as long as \(\det(A)\neq 0\text{,}\) we have
\begin{equation*}
A^{-1} = \frac{1}{\det(A)}\bbm d \amp -b\\-c \amp a\ebm\text{.}
\end{equation*}
This result can be easily verified by checking that \(AA^{-1}=I_2\) as required. The reader may have wondered if there is a similar formula for \(A^{-1}\) for a general \(n\times n\) matrix \(A\text{,}\) and whether or not such a formula would still constitute a “shortcut”. The results here are mixed. Yes, there’s a formula, and we will present it shortly. However, as with Cramer’s rule, it is not a shortcut. The reasons are the same as those we just mentioned for Cramer’s rule: as long as we’re dealing with a matrix whose entries are numbers, computing the inverse using row operations is vastly more efficient.
We begin with a definition.
Definition6.5.4.The adjugate of a matrix.
Let \(A\) be an \(n\times n\) matrix.
The matrix of cofactors of \(A\) is the \(n\times n\) matrix
Thus to obtain the matrix of cofactors for \(A\text{,}\) we replace each entry of \(A\) by the corresponding cofactor. Taking the transpose of this matrix produces the adjugate of \(A\text{.}\)
Why do we care about the adjugate matrix? Consider the product \(A\cdot \operatorname{adj}(A)\text{:}\)
Notice that the indices for \(\operatorname{adj}(A)\) are reversed, since we took the transpose of the cofactor matrix. What is the \((i,j)\) entry of this product? Consider first the case where \(i=j\text{.}\) We find that the \((i,i)\)-entry is
But this is just the cofactor expansion of \(\det(A)\) along the \(i^{\text{th}}\) row! Thus, the \((i,i)\) entry of \(A\cdot \operatorname{adj}(A)\) is simply \(\det(A)\text{.}\) This tells us what the diagonal is. What about the off-diagonal entries?
When \(i\neq j\text{,}\) we have the \((i,j)\)-entry
This is no longer a cofactor expansion for the determinant of \(A\text{,}\) since we’re taking entries from one row of \(A\text{,}\) and cofactors from another. This is, however, a cofactor expansion for the determinant of the matrix \(B\) that we obtain if we replace Row \(j\) of \(A\) with another copy of Row \(i\text{.}\) (Take a moment to think about why this is true.) But this means that the matrix \(B\) has two identical rows, and using Theorem 6.4.6, we can see that we must have \(\det(B)=0\text{.}\) This means that all of the off-diagonal entries of our product are zero! We have
Now, we know that \(A\) is invertible if and only if \(\det(A)\neq 0\text{,}\) and as long as \(\det(A)\neq 0\text{,}\) we can multiply both sides of the above equation by \(\dfrac{1}{\det(A)}\text{.}\) With a bit of rearranging, we find
But we know that if we can find any matrix \(B\) such that \(AB=I_n\text{,}\) then \(B\) is necessarily the inverse of \(A\text{.}\) We have established the following theorem.
Theorem6.5.5.The adjugate formula for the inverse.
Let \(A\) be an \(n\times n\) matrix. If \(\det(A)\neq 0\text{,}\) then \(A\) is invertible, and
Let us repeat our words of caution from the beginning of this discussion. Just because we have a formula for the inverse does not mean we need to use it! Consider the case of a \(5\times 5\) matrix (remember that this is a relatively small matrix by practical standards). Would you want to use Theorem 6.5.5 to compute the inverse? What would this require? Well, we’d need to compute \(\det(A)\text{,}\) since that appears in the formula, so there’s already a \(5\times 5\) determinant to deal with. But don’t forget what \(\operatorname{adj}(A)\) is: a matrix of cofactors. In this case, \(\operatorname{adj}(A)\) would consist of twenty-five different \(4\times 4\) determinants that would all need to be computed. What do you think would be less work? Computing one \(5\times 5\) determinant and 25 \(4\times 4\) determinants, or using row operations? Now consider doing this for \(10\times 10\text{,}\) or \(100\times 100\) matrices. Sometimes the first method is also the best!
Let’s do one example to see that even for a \(3\times 3\) matrix, there’s a fair amount of work involved.
Example6.5.6.Using the adjugate formula.
Use Theorem 6.5.5 to compute the inverse of the matrix
We begin by computing \(\det(A)\text{,}\) to make sure that the inverse exists. Using the \(-1\) in the first row to create a zero in the \((3,2)\) spot below it, we have
The reader should verify that \(AA^{-1}=I\) to make sure that we haven’t made any mistakes. (The author made two mistakes that were caught doing this verification!)
In the next chapter we’ll see another use for the determinant. Meanwhile, try to develop a deeper appreciation of math: odd, complicated things that seem completely unrelated often are intricately tied together. Mathematicians see these connections and describe them as “beautiful.”
Exercises6.5.3Exercises
Exercise Group.
Matrices \(\tta\) and \(\vb\) are given.
Give \(\det(\tta)\) and \(\det(\tta_i)\) for all \(i\text{.}\)
Use Cramer’s Rule to solve \(\ttaxb\text{.}\) If Cramer’s Rule cannot be used to find the solution, then state whether or not a solution exists.