So far, whenever we have solved a system of linear equations, we have always found exactly one solution. This is not always the case; we will find in this section that some systems do not have a solution, and others have more than one.
We start with a very simple example. Consider the following linear system:
\begin{equation*}
x-y=0\text{.}
\end{equation*}
There are obviously infinitely many solutions to this system; as long as \(x=y\text{,}\) we have a solution. We can picture all of these solutions by thinking of the graph of the equation \(y=x\) on the traditional \(x,y\) coordinate plane.
Let’s continue this visual aspect of considering solutions to linear systems. Consider the system
Each of these equations can be viewed as lines in the coordinate plane, and since their slopes are different, we know they will intersect somewhere (see Figure 3.4.1.(a)). In this example, they intersect at the point \((1,1)\) — that is, when \(x=1\) and \(y=1\text{,}\) both equations are satisfied and we have a solution to our linear system. Since this is the only place the two lines intersect, this is the only solution.
It is clear that while we have two equations, they are essentially the same equation; the second is just a multiple of the first. Therefore, when we graph the two equations, we are graphing the same line twice (see Figure 3.4.1.(b); the thicker line is used to represent drawing the line twice). In this case, we have an infinite solution set, just as if we only had the one equation \(x+y=1\text{.}\) We often write the solution as \(x=1-y\) to demonstrate that \(y\) can be any real number, and \(x\) is determined once we pick a value for \(y\text{.}\)
We should immediately spot a problem with this system; if the sum of \(x\) and \(y\) is 1, how can it also be 2? There is no solution to such a problem; this linear system has no solution. We can visualize this situation in Figure 3.4.1.(c); the two lines are parallel and never intersect.
If we were to consider a linear system with three equations and two unknowns, we could visualize the solution by graphing the corresponding three lines. We can picture that perhaps all three lines would meet at one point, giving exactly 1 solution; perhaps all three equations describe the same line, giving an infinite number of solutions; perhaps we have different lines, but they do not all meet at the same point, giving no solution. We further visualize similar situations with, say, 20 equations with two variables.
On the other hand, if we increase the number of variables to three, then we know from Section 2.6 that the correct geometric visualization of the situation involves planes. A single equation such as
\begin{equation*}
2x-3y+4z=7
\end{equation*}
describes a plane in \(\mathbb{R}^3\) with normal vector \(\vec n = \bbm 2\\-3\\4\ebm\text{.}\) (By inspection, we can also choose a point on the plane. Since \(3(-1)+4(1)=7\text{,}\) we know that the point \((0,-1,1)\) lies on the plane.)
If we have two equations in three unknowns, three possibilities arise. The second equation could be a multiple of the first, in which case we still have our original plane. The second equation could also describe a parallel plane; for example, we might have
In this case we know that there is no possible solution to the system, since there can be no point in common to these two parallel planes. The last possibility is that the planes are non-parallel, in which case they intersect in a line. For example, the system
From here, we can go on to consider three or more equations in three variables; there might be no solution, or the planes might intersect in a single point, or along a common line, or they might even all describe the same plane.
While it becomes harder to visualize when we add variables, no matter how many equations and variables we have, solutions to linear equations always come in one of three forms: exactly one solution, infinitely many solutions, or no solution. This is a fact that we will not prove here, but it deserves to be stated.
Theorem3.4.2.Solution Forms of Linear Systems.
Every linear system of equations has exactly one solution, infinitely many solutions, or no solution.
This leads us to a definition. Here we don’t differentiate between having one solution and infinitely many solutions, but rather just whether or not a solution exists.
Definition3.4.3.Consistent and Inconsistent Linear Systems.
A system of linear equations is consistent if it has a solution (perhaps more than one). A linear system is inconsistent if it does not have a solution.
How can we tell what kind of solution (if one exists) a given system of linear equations has? The answer to this question lies with properly understanding the reduced row echelon form of a matrix. To discover what the solution is to a linear system, we first put the matrix into reduced row echelon form and then interpret that form properly.
Before we start with a simple example, let us make a note about finding the reduced row echelon form of a matrix.
As a general rule, when we are learning a new technique, it is best to not use technology to aid us. This helps us learn not only the technique but some of its “inner workings.” We can then use technology once we have mastered the technique and are now learning how to use it to solve problems.
From here on out, in our examples, when we need the reduced row echelon form of a matrix, we will not show the steps involved. Rather, we will give the initial matrix, then immediately give the reduced row echelon form of the matrix. We trust that the reader can verify the accuracy of this form by both performing the necessary steps by hand or utilizing some technology to do it for them.
Our first example explores officially a quick example used in the introduction of this section.
\begin{align*}
x_1 \amp =1-x_2\\
x_2 \amp \text{ is a } \textit{free parameter}\text{.}
\end{align*}
We have just introduced a new term, free parameter, or free variable, or simply, parameter. It is used to stress that idea that \(x_2\) can take on any value; we are “free” to choose any value for \(x_2\text{.}\) Once this value is chosen, the value of \(x_1\) is determined. We have infinitely many choices for the value of \(x_2\text{,}\) so therefore we have infinitely many solutions. The variable \(x_1\) is also known as a leading variable, since the column corresponding to \(x_1\) contains a leading 1 in the reduced row echelon form of our augmented matrix.
For example, if we set \(x_2 = 0\text{,}\) then \(x_1 = 1\text{;}\) if we set \(x_2 = 5\text{,}\) then \(x_1 = -4\text{.}\) Common practice is to assign a new name to our free parameter; \(t\) is always a good choice. Doing so, we can write the solution to our system as follows:
where \(t\) can be any real number. (Writing the solution in this way is meant to be reminiscent of the parametric equations for a line in Section 2.5.)
Let’s try another example, one that uses more variables.
These two equations tell us that the values of \(x_1\) and \(x_2\) depend on what \(t=x_3\) is. As we saw before, there is no restriction on what \(t\) must be; it is “free” to take on the value of any real number. Once \(t\) is chosen, we obtain a solution \((x_1,x_2,x_3)\) to our system. Since we have infinitely many choices for the value of \(t\text{,}\) we have infinitely many solutions.
As examples, \(x_1 = 2\text{,}\)\(x_2 = 3\text{,}\)\(x_3 = 0\) is one solution; \(x_1 = -2\text{,}\)\(x_2 = 5\text{,}\)\(x_3 = 2\) is another solution. Try plugging these values back into the original equations to verify that these indeed are solutions. (By the way, since infinitely many solutions exist, this system of equations is consistent.)
Examples such as these are known as particular solutions to our system of equations; they correspond to a particular choice for the parameter \(t\text{.}\) The solution written in terms of the parameter \(t\) is known as the general solutionto the system of equations. Note that we’re not restricted to verifying particular solutions: we can go whole hog and plug the general solution into our system to make sure it works. For example, in the first equation, we have
so the second equation is satisfied. The reader can just as easily verify that our solution works in the third equation as well.
In the two previous examples we have used the word “free” to describe certain variables. What exactly is a free variable? How do we recognize which variables are free and which are not?
Look back to the reduced matrix in Example 3.4.4. Notice that there is only one leading 1 in that matrix, and that leading 1 corresponded to the \(x_1\) variable. That told us that \(x_1\) was not a free variable; since \(x_2\)did not correspond to a leading 1, it was a free variable.
Look also at the reduced matrix in Example 3.4.5. There were two leading 1s in that matrix; one corresponded to \(x_1\) and the other to \(x_2\text{.}\) This meant that \(x_1\) and \(x_2\) were not free variables; since there was not a leading 1 that corresponded to \(x_3\text{,}\) it was a free variable.
We formally define this and a few other terms in this following definition.
Definition3.4.6.Dependent and Independent Variables.
Consider the reduced row echelon form of an augmented matrix of a linear system of equations. Then:
A variable that corresponds to a leading 1 is a leading, or dependent, variable
A variable that does not correspond to a leading 1 is a non-leading, or free, or independent, variable. Free variables are also known as parameters.
These definitions help us understand when a consistent system of linear equations will have infinitely many solutions. If there are no free variables, then there is exactly one solution; if there are any free variables, there are infinitely many solutions.
Key Idea3.4.7.Consistent Solution Types.
A consistent linear system of equations will have exactly one solution if and only if there is a leading 1 for each variable in the system.
If a consistent linear system of equations has a free variable, it has infinitely many solutions.
If a consistent linear system has more variables than leading 1s, then the system will have infinitely many solutions.
A consistent linear system with more variables than equations will always have infinitely many solutions.
Note:Key Idea 3.4.7 applies only to consistent systems. If a system is inconsistent, then no solution exists and talking about free and basic variables is meaningless.
When a consistent system has only one solution, each equation that comes from the reduced row echelon form of the corresponding augmented matrix will contain exactly one variable. If the consistent system has infinitely many solutions, then there will be at least one equation coming from the reduced row echelon form that contains more than one variable. The “first” variable will be the basic (or dependent) variable; all others will be free variables.
We have now seen examples of consistent systems with exactly one solution and others with infinitely many solutions. How will we recognize that a system is inconsistent? Let’s find out through an example.
or, more concisely, \(0=1\text{.}\) Obviously, this is not true; we have reached a contradiction. Therefore, no solution exists; this system is inconsistent.
In previous sections we have only encountered linear systems with unique solutions (exactly one solution). Now we have seen three more examples with different solution types. The first two examples in this section had infinitely many solutions, and the third had no solution. How can we tell if a system is inconsistent?
A linear system will be inconsistent only when it implies that 0 equals 1. We can tell if a linear system implies this by putting its corresponding augmented matrix into reduced row echelon form. If we have any row where all entries are 0 except for the entry in the last column, then the system implies \(0=1\text{.}\) More succinctly, if we have a leading 1 in the last column of an augmented matrix, then the linear system has no solution.
Key Idea3.4.9.Inconsistent Systems of Linear Equations.
A system of linear equations is inconsistent if the reduced row echelon form of its corresponding augmented matrix has a leading 1 in the last column.
Example3.4.10.Verifying that a system is inconsistent.
We can verify that this system has no solution in two ways. First, let’s just think about it. If \(x+y=0\text{,}\) then it stands to reason, by multiplying both sides of this equation by 2, that \(2x+2y = 0\text{.}\) However, the second equation of our system says that \(2x+2y= 4\text{.}\) Since \(0\neq 4\text{,}\) we have a contradiction and hence our system has no solution. (We cannot possibly pick values for \(x\) and \(y\) so that \(2x+2y\) equals both 0 and 4.)
Now let us confirm this using the prescribed technique from above. The reduced row echelon form of the corresponding augmented matrix is
We have a leading 1 in the last column, so therefore the system is inconsistent.
Let’s summarize what we have learned up to this point. Consider the reduced row echelon form of the augmented matrix of a system of linear equations. (That sure seems like a mouthful in and of itself. However, it boils down to “look at the reduced form of the usual matrix.”) If there is a leading 1 in the last column, the system has no solution. Otherwise, if there is a leading 1 for each variable, then there is exactly one solution; otherwise there are infinitely many solutions. (i.e., there are free variables.)
Systems with exactly one solution or no solution are the easiest to deal with; systems with infinitely many solutions are a bit harder to deal with. Therefore, we’ll do a little more practice. First, a definition: if there are infinitely many solutions, what do we call one of those infinitely many solutions?
Definition3.4.11.Particular Solution.
Consider a linear system of equations with infinitely many solutions. A particular solution is one solution out of the infinitely many set of possible solutions.
The easiest way to find a particular solution is to pick values for the free variables which then determines the values of the dependent variables. Again, more practice is called for.
Example3.4.12.Finding general and particular solutions.
Give the general solution to a linear system whose augmented matrix in reduced row echelon form is
Notice how the variables \(x_1\) and \(x_3\) correspond to the leading 1s of the given matrix. Therefore \(x_1\) and \(x_3\) are dependent variables; all other variables (in this case, \(x_2\) and \(x_4\)) are free variables.
We generally write our solution with the dependent variables on the left and independent variables and constants on the right. It is also a good practice to acknowledge the fact that our free variables are, in fact, free. So our final solution would look something like
\begin{align*}
x_1 \amp = 4 + s - 2t \\
x_2 \amp = s \text{ is free} \\
x_3 \amp = 7+3t \\
x_4 \amp = t \text{ is free}\text{.}
\end{align*}
Note that in this case we have two free variables, \(x_2\) and \(x_4\text{,}\) so we introduce two parameters, \(s\) and \(t\text{,}\) in our general solution. While we would be correct in saying that this system has infinitely many solutions, we can be more precise about the nature of those solutions. Here, we say that we have a two-parameter family of solutions, to indicate the fact that two different parameters are required to describe the most general solution.
To find particular solutions, choose values for our free variables. There is no “right” way of doing this; we are “free” to choose whatever we wish.
By setting \(x_2 = 0 = x_4\text{,}\) we have the solution \(x_1 = 4\text{,}\)\(x_2 = 0\text{,}\)\(x_3 = 7\text{,}\)\(x_4 = 0\text{.}\) By setting \(x_2 = 1\) and \(x_4 = -5\text{,}\) we have the solution \(x_1 = 15\text{,}\)\(x_2 = 1\text{,}\)\(x_3 = -8\text{,}\)\(x_4 = -5\text{.}\) It is easier to read this when are variables are listed vertically, so we repeat these solutions:
We see that \(x_1\) and \(x_2\) are our dependent variables, for they correspond to the leading 1s. Therefore, \(x_3\) and \(x_4\) are independent variables. This situation feels a little unusual, for \(x_3\) doesn’t appear in any of the equations above, but cannot overlook it; it is still a free variable since there is not a leading 1 that corresponds to it. We write our solution as:
\begin{align*}
x_1 \amp = 3-2t \\
x_2 \amp =5-4t \\
x_3 \amp = s \text{ is free} \\
x_4 \amp = t \text{ is free}\text{.}
\end{align*}
To find two particular solutions, we pick values for our free variables. Again, there is no “right” way of doing this (in fact, there are \(\ldots\) infinitely many ways of doing this) so we give only an example here.
\begin{align*}
x_1 \amp = 4-t\\
x_2 \amp =1\\
x_3 \amp = t \text{ is free}.
\end{align*}
Once again, we get a bit of an “unusual” solution; while \(x_2\) is a dependent variable, it does not depend on any free variable; instead, it is always 1. (We can think of it as depending on the value of 1.) By picking two values for \(x_3\text{,}\) we get two particular solutions.
The constants and coefficients of a matrix work together to determine whether a given system of linear equations has one, infinitely many, or no solution. The concept will be fleshed out more in later chapters, but in short, the coefficients determine whether a matrix will have exactly one solution or not. In the “or not” case, the constants determine whether or not infinitely many solutions or no solution exists. (So if a given linear system has exactly one solution, it will always have exactly one solution even if the constants are changed.) Let’s look at an example to get an idea of how the values of constants and coefficients work together to determine the solution type.
Example3.4.15.Solving a system with a variable coefficient.
For what values of \(k\) will the given system have exactly one solution, infinitely many solutions, or no solution?
We answer this question by forming the augmented matrix and starting the process of putting it into reduced row echelon form. Below we see the augmented matrix and one elementary row operation that starts the Gaussian elimination process.
This is as far as we need to go. In looking at the second row, we see that if \(k=6\text{,}\) then that row contains only zeros and \(x_2\) is a free variable; we have infinitely many solutions. If \(k\neq 6\text{,}\) then our next step would be to make that second row, second column entry a leading one. We don’t particularly care about the solution, only that we would have exactly one as both \(x_1\) and \(x_2\) would correspond to a leading one and hence be dependent variables.
Our final analysis is then this. If \(k\neq 6\text{,}\) there is exactly one solution; if \(k=6\text{,}\) there are infinitely many solutions. In this example, it is not possible to have no solutions.
As an extension of the previous example, consider the similar augmented matrix where the constant 9 is replaced with a 10. Performing the same elementary row operation gives
As in the previous example, if \(k\neq6\text{,}\) we can make the second row, second column entry a leading one and hence we have one solution. However, if \(k=6\text{,}\) then our last row is \([0\ 0\ 1]\text{,}\) meaning we have no solution.
We have been studying the solutions to linear systems mostly in an “academic” setting; we have been solving systems for the sake of solving systems. In the next section, we’ll look at situations which create linear systems that need solving (i.e., “word problems”).
ExercisesExercises
Exercise Group.
Find the solution to the given linear system. If the system has infinitely many solutions, give 2 particular solutions. Then give a geometric description of the system and its solution in terms of points, lines, and planes.