APEX Partial Derivatives

Section 13.3 Partial Derivatives

Let \(y\) be a function of \(x\text{.}\) We have studied in great detail the derivative of \(y\) with respect to \(x\text{,}\) that is, \(\frac{dy}{dx}\text{,}\) which measures the rate at which \(y\) changes with respect to \(x\text{.}\) Consider now \(z=f(x,y)\text{.}\) It makes sense to want to know how \(z\) changes with respect to \(x\) and/or \(y\text{.}\) This section begins our investigation into these rates of change.

🔗

Subsection 13.3.1 First-order partial derivatives

Consider the function \(f(x,y) = x^2+2y^2\text{,}\) as graphed in Figure 13.3.1.(a). By fixing \(y=2\text{,}\) we focus our attention to all points on the surface where the \(y\)-value is 2, shown in both Figure 13.3.1.(a) and Figure 13.3.1.(b). These points form a curve in the plane \(y=2\text{:}\) \(z = f(x,2) = x^2+8\) which defines \(z\) as a function of just one variable. We can take the derivative of \(z\) with respect to \(x\) along this curve and find equations of tangent lines, etc.

🔗

Link to full-sized image

The graph \(z=x^2+2y^2\) is an elliptic paraboloid, opening upward, with its vertex at the origin. The plot uses a rectangular domain, so we see peaks at the corners of the domain. Mesh curves corresponding to the traces where either \(x\) or \(y\) are constant are shown, and one of these curves, the trace \(y=2\text{,}\) is highlighted.

🔗

(a)

🔗

Link to full-sized image

The curve given by the trace \(y=2\) in the surface \(z=x^2+2y^2\) is plotted in three dimensions. This is the same curve shown in Figure 13.3.1.(a), but this time only the curve is plotted. The surface has been removed, to help visualize the curve.

🔗

The curve itself has the shape of a parabola, opening upwards. It is hanging in space, with its vertex above the mark on the \(y\) axis for \(y=2\text{.}\) The curve is viewed in perspective but is clearly a parabola.

🔗

(b)

🔗

Figure 13.3.1. By fixing \(y=2\text{,}\) the surface \(z=f(x,y) = x^2+2y^2\) is a curve in space

🔗

The key notion to extract from this example is: by treating \(y\) as constant (it does not vary) we can consider how \(z\) changes with respect to \(x\text{.}\) In a similar fashion, we can hold \(x\) constant and consider how \(z\) changes with respect to \(y\text{.}\) This is the underlying principle of partial derivatives. We state the formal, limit-based definition first, then show how to compute these partial derivatives without directly taking limits.

🔗

Aside

Definition 13.3.2. Partial Derivative.

Let \(z=f(x,y)\) be a continuous function on a set \(S\) in \(\mathbb{R}^2\text{.}\)

🔗

The partial derivative of \(f\) with respect to \(x\) is:

\begin{equation*} f_x(x,y) = \lim_{h\to 0} \frac{f(x+h,y) - f(x,y)}h\text{.} \end{equation*}

🔗

🔗
The partial derivative of \(f\) with respect to \(y\) is:

\begin{equation*} f_y(x,y) = \lim_{h\to 0} \frac{f(x,y+h) - f(x,y)}h\text{.} \end{equation*}

🔗

🔗

🔗

Example 13.3.3. Computing partial derivatives with the limit definition.

Let \(f(x,y) = x^2y + 2x+y^3\text{.}\) Find \(f_x(x,y)\) using the limit definition.

🔗

Solution.

Using Definition 13.3.2, we have:

\begin{align*} f_x(x,y) \amp = \lim_{h\to 0} \frac{f(x+h,y) - f(x,y)}{h}\\ \amp = \lim_{h\to 0} \frac{(x+h)^2y+2(x+h)+y^3 - (x^2y+2x+y^3)}{h}\\ \amp = \lim_{h\to 0} \frac{x^2y+2xhy+h^2y+2x+2h+y^3-(x^2y+2x+y^3)}{h}\\ \amp = \lim_{h\to 0} \frac{2xhy+h^2y+2h}{h}\\ \amp =\lim_{h\to 0} 2xy+hy+2\\ \amp = 2xy+2\text{.} \end{align*}

🔗

We have found \(f_x(x,y) = 2xy+2\text{.}\)

🔗

Example 13.3.3 found a partial derivative using the formal, limit-based definition. Using limits is not necessary, though, as we can rely on our previous knowledge of derivatives to compute partial derivatives easily. When computing \(f_x(x,y)\text{,}\) we hold \(y\) fixed — it does not vary. Therefore we can compute the derivative with respect to \(x\) by treating \(y\) as a constant or coefficient.

🔗

Just as \(\frac{d}{dx}\big(5x^2\big) = 10x\text{,}\) we compute \(\frac{\partial}{\px}\big(x^2y\big) = 2xy\text{.}\) Here we are treating \(y\) as a coefficient.

🔗

Just as \(\frac{d}{dx}\big(5^3\big) = 0\text{,}\) we compute \(\frac{\partial}{\px}\big(y^3\big) = 0\text{.}\) Here we are treating \(y\) as a constant. More examples will help make this clear.

🔗

Example 13.3.4. Finding partial derivatives.

Find \(f_x(x,y)\) and \(f_y(x,y)\) in each of the following.

🔗

\(\displaystyle f(x,y) = x^3y^2+ 5y^2-x+7\)

🔗
\(\displaystyle f(x,y) = \cos(xy^2)+\sin(x)\)

🔗
\(\displaystyle f(x,y) = e^{x^2y^3}\sqrt{x^2+1}\)

🔗

🔗

Solution.

We have \(f(x,y) = x^3y^2+ 5y^2-x+7\text{.}\) Begin with \(f_x(x,y)\text{.}\) Keep \(y\) fixed, treating it as a constant or coefficient, as appropriate:

\begin{equation*} f_x(x,y) = 3x^2y^2-1\text{.} \end{equation*}

Note how the \(5y^2\) and \(7\) terms go to zero. To compute \(f_y(x,y)\text{,}\) we hold \(x\) fixed:

\begin{equation*} f_y(x,y) = 2x^3y+10y\text{.} \end{equation*}

Note how the \(-x\) and \(7\) terms go to zero.

🔗

🔗
We have \(f(x,y) = \cos(xy^2)+\sin(x)\text{.}\) Begin with \(f_x(x,y)\text{.}\) We need to apply the Chain Rule with the cosine term; \(y^2\) is the coefficient of the \(x\)-term inside the cosine function.

\begin{equation*} f_x(x,y) = -\sin(xy^2)(y^2)+\cos(x) = -y^2\sin(xy^2)+\cos(x)\text{.} \end{equation*}

To find \(f_y(x,y)\text{,}\) note that \(x\) is the coefficient of the \(y^2\) term inside of the cosine term; also note that since \(x\) is fixed, \(\sin(x)\) is also fixed, and we treat it as a constant.

\begin{equation*} f_y(x,y) = -\sin(xy^2)(2xy) = -2xy\sin(xy^2)\text{.} \end{equation*}

🔗

🔗
We have \(f(x,y) = e^{x^2y^3}\sqrt{x^2+1}\text{.}\) Beginning with \(f_x(x,y)\text{,}\) note how we need to apply the Product Rule.

\begin{align*} f_x(x,y) \amp = e^{x^2y^3}(2xy^3)\sqrt{x^2+1} + e^{x^2y^3}\frac12\big(x^2+1\big)^{-1/2}(2x)\\ \amp = 2xy^3e^{x^2y^3}\sqrt{x^2+1}+\frac{xe^{x^2y^3}}{\sqrt{x^2+1}}\text{.} \end{align*}

Note that when finding \(f_y(x,y)\) we do not have to apply the Product Rule; since \(\sqrt{x^2+1}\) does not contain \(y\text{,}\) we treat it as fixed and hence becomes a coefficient of the \(e^{x^2y^3}\) term.

\begin{equation*} f_y(x,y) = e^{x^2y^3}(3x^2y^2)\sqrt{x^2+1} = 3x^2y^2e^{x^2y^3}\sqrt{x^2+1}\text{.} \end{equation*}

🔗

🔗

🔗

We have shown how to compute a partial derivative, but it may still not be clear what a partial derivative means. Given \(z=f(x,y)\text{,}\) \(f_x(x,y)\) measures the rate at which \(z\) changes as only \(x\) varies: \(y\) is held constant.

🔗

Imagine standing in a rolling meadow, then beginning to walk due east. Depending on your location, you might walk up, sharply down, or perhaps not change elevation at all. This is similar to measuring \(z_x\text{:}\) you are moving only east (in the “\(x\)”-direction) and not north/south at all. Going back to your original location, imagine now walking due north (in the “\(y\)”-direction). Perhaps walking due north does not change your elevation at all. This is analogous to \(z_y=0\text{:}\) \(z\) does not change with respect to \(y\text{.}\) We can see that \(z_x\) and \(z_y\) do not have to be the same, or even similar, as it is easy to imagine circumstances where walking east means you walk downhill, though walking north makes you walk uphill.

🔗

The following example helps us visualize this more.

🔗

Example 13.3.5. Evaluating partial derivatives.

Let \(z=f(x,y)=-x^2-\frac12y^2+xy+10\text{.}\) Find \(f_x(2,1)\) and \(f_y(2,1)\) and interpret their meaning.

🔗

Solution.

We begin by computing \(f_x(x,y) = -2x+y\) and \(f_y(x,y) = -y+x\text{.}\) Thus

\begin{equation*} f_x(2,1) = -3 \text{ and } f_y(2,1) = 1\text{.} \end{equation*}

🔗

It is also useful to note that \(f(2,1) = 7.5\text{.}\) What does each of these numbers mean?

🔗

Consider \(f_x(2,1)=-3\text{,}\) along with Figure 13.3.6.(a). If one “stands” on the surface at the point \((2,1,7.5)\) and moves parallel to the \(x\)-axis (i.e., only the \(x\)-value changes, not the \(y\)-value), then the instantaneous rate of change is \(-3\text{.}\) Increasing the \(x\)-value will decrease the \(z\)-value; decreasing the \(x\)-value will increase the \(z\)-value.

🔗

Link to full-sized image

The surface given by the graph \(z=f(x,y)\) is shown, for \(f(x,y) = -x^2-\frac12 y^2+xy+10\text{.}\) It is a portion of an elliptic paraboloid, opening downward.

🔗

Along the surface a curve is drawn, corresponding to the trace \(y=1\text{.}\) This is a curve that moves along the surface as \(x\) varies, while \(y\) is held constant. At the point \((2,1,f(2,1))\) on the surface, a line is drawn, tangent to the curve. This is a line in space, but its “slope”, relative to \(x\text{,}\) is given by the partial derivative \(f_x(2,1)\text{.}\)

🔗

(a)

🔗

Link to full-sized image

The surface given by the graph \(z=f(x,y)\) is shown, for \(f(x,y) = -x^2-\frac12 y^2+xy+10\text{.}\) It is a portion of an elliptic paraboloid, opening downward.

🔗

Along the surface a curve is drawn, corresponding to the trace \(x=\text{.}\) This is a curve that moves along the surface as \(y\) varies, while \(x\) is held constant. At the point \((2,1,f(2,1))\) on the surface, a line is drawn, tangent to the trace \(x=2\text{.}\) This tangent line is perpendicular to the one drawn in Figure 13.3.6.(b). It is a line in space, but its “slope”, relative to \(y\text{,}\) is given by the partial derivative \(f_y(2,1)\text{.}\)

🔗

(b)

🔗

Figure 13.3.6. Illustrating the meaning of partial derivatives

🔗

Now consider \(f_y(2,1)=1\text{,}\) illustrated in Figure 13.3.6.(b). Moving along the curve drawn on the surface, i.e., parallel to the \(y\)-axis and not changing the \(x\)-values, increases the \(z\)-value instantaneously at a rate of 1. Increasing the \(y\)-value by 1 would increase the \(z\)-value by approximately 1.

🔗

Since the magnitude of \(f_x\) is greater than the magnitude of \(f_y\) at \((2,1)\text{,}\) it is “steeper” in the \(x\)-direction than in the \(y\)-direction.

🔗

Subsection 13.3.2 Tangent Planes

Another way to interpret partial derivatives is in terms of the tangent plane. Consider the graph of a function \(f(x,y)\text{,}\) such as the one in Figure 13.3.1. Setting \(x=a\text{,}\) \(y=b\) defines a point \((a,b,f(a,b))\) on the graph. Through the point \((a,b)\text{,}\) we have the lines \(x=a+s, y=b\text{,}\) and \(x=a, y=b+t\text{,}\) parallel to the \(x\) and \(y\) axes, respectively (where \(s,t\) are parameters).

🔗

Using the function \(f(x,y)\) we define two vector-valued functions:

\begin{align*} \vec{r}_1(s) \amp = \la a+s, b, f(a+s,b)\ra\\ \vec{r}_2(t) \amp = \la a, b+t, f(a,b+t)\ra\text{.} \end{align*}

Both vector-valued functions define space curves that lie on the surface \(z=f(x,y)\text{,}\) and these curves intersect at the point \((a,b,f(a,b))\text{,}\) when \(s=t=0\text{.}\)

🔗

Now consider computing \(\vec{r}_1'(s)\text{.}\) The first two components of this derivative are found in a straightforward manner: they are \(1\) and \(0\text{,}\) respectively. To find the third component of the derivative, notice that in \(\vec{r}_1(s)\) we vary the \(x\)-component of \(f\) while holding the \(y\)-component constant. Using the Chain Rule and Definition 13.3.2, we find that the third component is \(f_x(a+s,b)\text{.}\) Altogether, we have

\begin{equation*} \vec{r}_1'(s) = \la 1,0,f_x(a+s,b)\ra\text{.} \end{equation*}

Evaluating this at \(s=0\) gives

\begin{equation*} \vec{v}=\vec{r}_1'(0) = \la 1,0,f_x(a,b)\ra\text{.} \end{equation*}

We can perform a similar process with \(\vec{r}_2(t)\text{,}\) ultimately leading to

\begin{equation*} \vec{w}=\vec{r}_2'(0) = \la 0,1,f_y(a,b)\ra\text{.} \end{equation*}

From Section 12.2, we know that \(\vec{r}_1'(0)\) defines a tangent vector to the curve \(\vec{r}_1(s)\) when \(s=0\text{,}\) and similarly, \(\vec{r}_2'(0)\) defines a tangent vector to the curve \(\vec{r}_2(t)\) when \(t=0\text{.}\)

🔗

It seems reasonable that any vector that is tangent to these curves, which lie on our surface, should also be considered tangent to that surface. The vectors \(\vec{v}\) and \(\vec{w}\) are therefore tangent to \(z=f(x,y)\) at \((a,b,f(a,b))\text{,}\) and they are definitely not parallel. From Section 11.6 we know that any two non-parallel vectors at a point define a plane through that point. We also know that taking the cross product of these two vectors gives us a normal vector: the cross product gives us

\begin{equation*} \vec{n}=\vec{v}\times\vec{w}=\la -f_x(a,b), -f_y(a,b), 1\ra\text{.} \end{equation*}

🔗

The equation of the plane through \((a,b,f(a,b))\) with normal vector \(\vec{n}=\la -f_x(a,b),-f_y(a,b),1\ra\) is

\begin{equation*} -f_x(a,b)(x-a)-f_y(a,b)(y-b)+(z-f(a,b))=0\text{.} \end{equation*}

It is customary to solve for \(z\) in this equation and make the following definition.

🔗

Definition 13.3.7.

Let \(f(x,y)\) be a function whose first-order partial derivatives exist at \((a,b)\text{.}\) The tangent plane to the surface \(z=f(x,y)\) at the point \((a,b,f(a,b))\) is the plane defined by the equation

\begin{equation*} z = f(a,b)+f_x(a,b)(x-a)+f_y(a,b)(y-b)\text{.} \end{equation*}

🔗

Example 13.3.8. Finding a tangent plane equation.

Find the equation of tangent plane to the surface \(z=x^2+3y^2\) at \((x,y)=(1,-1)\text{.}\)

🔗

Solution.

Our function is \(f(x,y)=x^2+3y^2\text{,}\) and we have \(f(1,-1)=4\text{,}\) so the point on the surface is \((1,-1,4)\text{.}\) The partial derivatives are \(f_x(x,y)=2x\) and \(f_y(x,y)=6y\text{,}\) so \(f_x(1,-1)=2\text{,}\) \(f_y(1,-1)=-6\text{.}\) Using Definition 13.3.7, our plane is given by

\begin{equation*} z = 4+2(x-1)-6(y+1)\text{.} \end{equation*}

🔗

Notice the similarity between the tangent plane equation in Definition 13.3.7 and the single variable tangent line equation \(y = f(c)+f'(c)(x-c)\text{.}\) As with functions of one variable, this suggests a connection between derivatives and linear approximation. We explore this connection in Section 13.4, where we’ll see that Definition 13.3.7 should be strengthed to require that the partial derivatives of \(f\) be continuous.

🔗

Subsection 13.3.3 Second-order partial derivatives

Let \(z=f(x,y)\text{.}\) We have learned to find the partial derivatives \(f_x(x,y)\) and \(f_y(x,y)\text{,}\) which are each functions of \(x\) and \(y\text{.}\) Therefore we can take partial derivatives of them, each with respect to \(x\) and \(y\text{.}\) We define these “second partials” along with the notation, give examples, then discuss their meaning.

🔗

Definition 13.3.9. Second Partial Derivative, Mixed Partial Derivative.

Let \(z=f(x,y)\) be continuous on a set \(S\text{.}\)

🔗

The second partial derivative of \(f\) with respect to \(x\) then \(x\) is

\begin{equation*} \frac{\partial}{\partial x}\left(\frac{\partial f}{\px}\right) = \frac{\partial^2 f}{\px^2} = \big(\,f_x\,\big)_x = f_{xx} \end{equation*}

🔗

🔗
The second partial derivative of \(f\) with respect to \(x\) then \(y\) is

\begin{equation*} \frac{\partial}{\partial y}\left(\frac{\partial f}{\px}\right) = \frac{\partial^2f}{\py\px} = \big(\,f_x\,\big)_y = f_{xy} \end{equation*}

🔗

🔗

🔗

Similar definitions hold for \(\frac{\partial^2f}{\py^2} = f_{yy}\) and \(\frac{\partial^2f}{\px\py} = f_{yx}\text{.}\)

🔗

The second partial derivatives \(f_{xy}\) and \(f_{yx}\) are mixed partial derivatives.

🔗

The notation of second partial derivatives gives some insight into the notation of the second derivative of a function of a single variable. If \(y=f(x)\text{,}\) then \(\fp'(x) = \frac{d^2 y}{dx^2}\text{.}\) The “\(d^2y\)” portion means “take the derivative of \(y\) twice,” while “\(dx^2\)” means “with respect to \(x\) both times.” When we only know of functions of a single variable, this latter phrase seems silly: there is only one variable to take the derivative with respect to. Now that we understand functions of multiple variables, we see the importance of specifying which variables we are referring to.

🔗

Aside

Example 13.3.10. Second partial derivatives.

For each of the following, find all six first and second partial derivatives. That is, find

\begin{equation*} f_x, f_y, f_{xx}, f_{yy}, f_{xy} \text{ and } f_{yx}\,\text{.} \end{equation*}

🔗

\(\displaystyle f(x,y) = x^3y^2 + 2xy^3+\cos(x)\)

🔗
\(\displaystyle f(x,y) = \frac{x^3}{y^2}\)

🔗
\(\displaystyle f(x,y)=e^{x}\sin(x^2y)\)

🔗

🔗

Solution.

In each, we give \(f_x\) and \(f_y\) immediately and then spend time deriving the second partial derivatives.

🔗

\begin{align*} f(x,y) \amp = x^3y^2+2xy^3+\cos(x)\\ f_x(x,y) \amp = 3x^2y^2+2y^3-\sin(x)\\ f_y(x,y) \amp = 2x^3y+6xy^2\\ f_{xx}(x,y) \amp = \frac{\partial}{\px}\big(f_x\big) = \frac{\partial}{\px}\big(3x^2y^2+2y^3-\sin(x) \big)\\ \amp = 6xy^2-\cos(x)\\ f_{yy}(x,y) \amp = \frac{\partial}{\py}\big(f_y\big) = \frac{\partial}{\py}\big(2x^3y+6xy^2\big)\\ \amp = 2x^3+12xy\\ f_{xy}(x,y) \amp = \frac{\partial}{\py}\big(f_x\big) = \frac{\partial}{\py}\big(3x^2y^2+2y^3-\sin(x) \big)\\ \amp = 6x^2y+6y^2\\ f_{yx}(x,y) \amp = \frac{\partial}{\px}\big(f_x\big) = \frac{\partial}{\px}\big(2x^3y+6xy^2\big)\\ \amp = 6x^2y+6y^2 \end{align*}

🔗

🔗
\begin{align*} f(x,y) \amp = \frac{x^3}{y^2} = x^3y^{-2}\\ f_x(x,y) \amp = \frac{3x^2}{y^2}\\ f_y(x,y) \amp = -\frac{2x^3}{y^3}\\ f_{xx}(x,y) \amp = \frac{\partial}{\px}\big(f_x\big) = \frac{\partial}{\px}\big(\frac{3x^2}{y^2}\big)\\ \amp = \frac{6x}{y^2}\\ f_{yy}(x,y) \amp = \frac{\partial}{\py}\big(f_y\big) = \frac{\partial}{\py}\big(-\frac{2x^3}{y^3}\big)\\ \amp = \frac{6x^3}{y^4}\\ f_{xy}(x,y) \amp = \frac{\partial}{\py}\big(f_x\big) = \frac{\partial}{\py}\big(\frac{3x^2}{y^2}\big)\\ \amp = -\frac{6x^2}{y^3}\\ f_{yx}(x,y) \amp = \frac{\partial}{\px}\big(f_x\big) = \frac{\partial}{\px}\big(-\frac{2x^3}{y^3}\big)\\ \amp = -\frac{6x^2}{y^3} \end{align*}

🔗

🔗
\(f(x,y) = e^x\sin(x^2y)\) Because the following partial derivatives get rather long, we omit the extra notation and just give the results. In several cases, multiple applications of the Product and Chain Rules will be necessary, followed by some basic combination of like terms.

\begin{align*} f_x(x,y) \amp = e^x\sin(x^2y) + 2xye^x\cos(x^2y)\\ f_y(x,y) \amp = x^2e^x\cos(x^2y)\\ f_{xx}(x,y) \amp = e^x\sin(x^2y)+4xye^x\cos(x^2y)+2ye^x\cos(x^2y)-4x^2y^2e^x\sin(x^2y)\\ f_{yy}(x,y) \amp = -x^4e^x\sin(x^2y)\\ f_{xy}(x,y) \amp = x^2e^x\cos(x^2y)+2xe^x\cos(x^2y)-2x^3ye^x\sin(x^2y)\\ f_{yx}(x,y) \amp = x^2e^x\cos(x^2y)+2xe^x\cos(x^2y)-2x^3ye^x\sin(x^2y) \end{align*}

🔗

🔗

🔗

Notice how in each of the three functions in Example 13.3.10, \(f_{xy} = f_{yx}\text{.}\) Due to the complexity of the examples, this likely is not a coincidence. The following theorem states that it is not.

🔗

Theorem 13.3.11. Mixed Partial Derivatives.

Let \(f\) be defined such that \(f_{xy}\) and \(f_{yx}\) are continuous on a set \(S\text{.}\) Then for each point \((x,y)\) in \(S\text{,}\) \(f_{xy}(x,y) = f_{yx}(x,y)\text{.}\)

🔗

Finding \(f_{xy}\) and \(f_{yx}\) independently and comparing the results provides a convenient way of checking our work.

🔗

Subsection 13.3.4 Understanding Second Partial Derivatives

Now that we know how to find second partials, we investigate what they tell us.

🔗

Again we refer back to a function \(y=f(x)\) of a single variable. The second derivative of \(f\) is “the derivative of the derivative,” or “the rate of change of the rate of change.” The second derivative measures how much the derivative is changing. If \(\fp'(x)\lt 0\text{,}\) then the derivative is getting smaller (so the graph of \(f\) is concave down); if \(\fp'(x) \gt 0\text{,}\) then the derivative is growing, making the graph of \(f\) concave up.

🔗

Now consider \(z=f(x,y)\text{.}\) Similar statements can be made about \(f_{xx}\) and \(f_{yy}\) as could be made about \(\fp'(x)\) above. When taking derivatives with respect to \(x\) twice, we measure how much \(f_x\) changes with respect to \(x\text{.}\) If \(f_{xx}(x,y)\lt 0\text{,}\) it means that as \(x\) increases, \(f_x\) decreases, and the graph of \(f\) will be concave down in the \(x\)-direction. Using the analogy of standing in the rolling meadow used earlier in this section, \(f_{xx}\) measures whether one’s path is concave up/down when walking due east.

🔗

Similarly, \(f_{yy}\) measures the concavity in the \(y\)-direction. If \(f_{yy}(x,y) \gt 0\text{,}\) then \(f_y\) is increasing with respect to \(y\) and the graph of \(f\) will be concave up in the \(y\)-direction. Appealing to the rolling meadow analogy again, \(f_{yy}\) measures whether one’s path is concave up/down when walking due north.

🔗

We now consider the mixed partials \(f_{xy}\) and \(f_{yx}\text{.}\) The mixed partial \(f_{xy}\) measures how much \(f_x\) changes with respect to \(y\text{.}\) Once again using the rolling meadow analogy, \(f_{x}\) measures the slope if one walks due east. Looking east, begin walking north (side-stepping). Is the path towards the east getting steeper? If so, \(f_{xy} \gt 0\text{.}\) Is the path towards the east not changing in steepness? If so, then \(f_{xy}=0\text{.}\) A similar thing can be said about \(f_{yx}\text{:}\) consider the steepness of paths heading north while side-stepping to the east.

🔗

The following example examines these ideas with concrete numbers and graphs.

🔗

Example 13.3.12. Understanding second partial derivatives.

Let \(z=x^2-y^2+xy\text{.}\) Evaluate the 6 first and second partial derivatives at \((-1/2,1/2)\) and interpret what each of these numbers mean.

🔗

Solution.

We find that:

🔗

\(f_x(x,y) = 2x+y\text{,}\)\(f_y(x,y) = -2y+x\text{,}\)\(f_{xx}(x,y) = 2\text{,}\) \(f_{yy}(x,y) = -2\) and \(f_{xy}(x,y) = f_{yx}(x,y) = 1\text{.}\) Thus at \((-1/2,1/2)\) we have

\begin{equation*} f_x(-1/2,1/2) = -1/2,\qquad f_y(-1/2,1/2) = -3/2\text{.} \end{equation*}

🔗

The slope of the tangent line at \((-1/2, 1/2, -1/4)\) in the direction of \(x\) is \(-1/2\text{:}\) if one moves from that point parallel to the \(x\)-axis, the instantaneous rate of change will be \(-1/2\text{.}\) The slope of the tangent line at this point in the direction of \(y\) is \(-3/2\text{:}\) if one moves from this point parallel to the \(y\)-axis, the instantaneous rate of change will be \(-3/2\text{.}\) These tangents lines are graphed in Figure 13.3.13.(a) and Figure 13.3.13.(b), respectively, where the tangent lines are drawn in a solid line.

🔗

Link to full-sized image

The surface \(z=x^2-y^2+xy\) is a hyperbolic paraboloid, or saddle surface. It is plotted along with the trace \(x=-\frac12\text{.}\) This is a curve lying on the surface that has the shape of a downward-opening parabola. The fact that this curve is concave down, when viewed along the \(x\) axis, corresponds to the fact that \(f_{yy}\) is negative.

🔗

At three points along this trace, tangent lines are drawn. These lines are tangent to curves on the surface given by traces of constant \(y\text{,}\) for three different values of \(y\text{.}\) Each one has negative slope, relative to \(x\text{.}\) For larger values of \(y\text{,}\) the slope of these lines, relative to \(x\text{,}\) becomes less negative. This suggests that the slope, given by \(f_x\) is increasing with \(y\text{,}\) and this corresponds to the fact that \(f_{xy}\) is positive.

🔗

(a)

🔗

Link to full-sized image

The surface \(z=x^2-y^2+xy\) is a hyperbolic paraboloid, or saddle surface. It is plotted along with the trace \(y=\frac12\text{.}\) This is a curve lying on the surface that has the shape of an upward-opening parabola. The fact that this curve is concave up, when viewed along the \(y\) axis, corresponds to the fact that \(f_{xx}\) is positive.

🔗

At three points along this trace, tangent lines are drawn. These lines are tangent to curves on the surface given by traces of constant \(x\text{,}\) for three different values of \(x\text{.}\) Each one has negative slope, relative to \(y\text{.}\) For larger values of \(x\text{,}\) the slope of these lines, relative to \(y\text{,}\) becomes less negative. This suggests that the slope, given by \(f_y\) is increasing with \(x\text{,}\) and this corresponds to the fact that \(f_{yx}\) is positive.

🔗

(b)

🔗

Figure 13.3.13. Understanding the second partial derivatives in Example 13.3.12

🔗

Now consider only Figure 13.3.13.(a). Three directed tangent lines are drawn (two are dashed), each in the direction of \(x\text{;}\) that is, each has a slope determined by \(f_x\text{.}\) Note how as \(y\) increases, the slope of these lines get closer to \(0\text{.}\) Since the slopes are all negative, getting closer to 0 means the slopes are increasing. The slopes given by \(f_x\) are increasing as \(y\) increases, meaning \(f_{xy}\) must be positive.

🔗

Since \(f_{xy}=f_{yx}\text{,}\) we also expect \(f_y\) to increase as \(x\) increases. Consider Figure 13.3.13.(b) where again three directed tangent lines are drawn, this time each in the direction of \(y\) with slopes determined by \(f_y\text{.}\) As \(x\) increases, the slopes become less steep (closer to 0). Since these are negative slopes, this means the slopes are increasing.

🔗

Thus far we have a visual understanding of \(f_x\text{,}\) \(f_y\text{,}\) and \(f_{xy}=f_{yx}\text{.}\) We now interpret \(f_{xx}\) and \(f_{yy}\text{.}\) In Figure 13.3.13.(a), we see a curve drawn where \(x\) is held constant at \(x=-1/2\text{:}\) only \(y\) varies. This curve is clearly concave down, corresponding to the fact that \(f_{yy}\lt 0\text{.}\) In part Figure 13.3.13.(b) of the figure, we see a similar curve where \(y\) is constant and only \(x\) varies. This curve is concave up, corresponding to the fact that \(f_{xx} \gt 0\text{.}\)

🔗

Subsection 13.3.5 Partial Derivatives and Functions of Three Variables

The concepts underlying partial derivatives can be easily extend to more than two variables. We give some definitions and examples in the case of three variables and trust the reader can extend these definitions to more variables if needed.

🔗

Definition 13.3.14. Partial Derivatives with Three Variables.

Let \(w=f(x,y,z)\) be a continuous function on a set \(D\) in \(\mathbb{R}^3\text{.}\)

🔗

The partial derivative of \(f\) with respect to \(x\) is:

\begin{equation*} f_x(x,y,z) = \lim_{h\to 0} \frac{f(x+h,y,z)-f(x,y,z)}{h}\text{.} \end{equation*}

🔗

Similar definitions hold for \(f_y(x,y,z)\) and \(f_z(x,y,z)\text{.}\)

🔗

By taking partial derivatives of partial derivatives, we can find second partial derivatives of \(f\) with respect to \(z\) then \(y\text{,}\) for instance, just as before.

🔗

Example 13.3.15. Partial derivatives of functions of three variables.

For each of the following, find \(f_x\text{,}\) \(f_y\text{,}\) \(f_z\text{,}\) \(f_{xz}\text{,}\) \(f_{yz}\text{,}\) and \(f_{zz}\text{.}\)

🔗

\(\displaystyle f(x,y,z) = x^2y^3z^4+x^2y^2+x^3z^3+y^4z^4\)

🔗
\(\displaystyle f(x,y,z) = x\sin(yz)\)

🔗

🔗

Solution.

\begin{align*} f_x(x,y,z) \amp = 2xy^3z^4+2xy^2+3x^2z^3\\ f_y(x,y,z) \amp = 3x^2y^2z^4+2x^2y+4y^3z^4\\ f_z(x,y,z) \amp = 4x^2y^3z^3+3x^3z^2+4y^4z^3\\ f_{xz}(x,y,z) \amp = 8xy^3z^3+9x^2z^2\\ f_{yz}(x,y,z) \amp = 12x^2y^2z^3+16y^3z^3\\ f_{zz}(x,y,z) \amp = 12x^2y^3z^2+6x^3z+12y^4z^2 \end{align*}

🔗

🔗
\(f_x = \sin(yz)\text{;}\) \(f_y = xz\cos(yz)\text{;}\) \(f_z = xy\cos(yz)\text{,}\) and

\begin{align*} f_{xz}(x,y,z) \amp = y\cos(yz)\\ f_{yz}(x,y,z) \amp = x\cos(yz) - xyz\sin(yz)\\ f_{zz}(x,y,z) \amp = -xy^2\sin(yz) \end{align*}

🔗

🔗

🔗

Subsection 13.3.6 Higher Order Partial Derivatives

We can continue taking partial derivatives of partial derivatives of partial derivatives of …; we do not have to stop with second partial derivatives. These higher order partial derivatives do not have a tidy graphical interpretation; nevertheless they are not hard to compute and worthy of some practice.

🔗

We do not formally define each higher order derivative, but rather give just a few examples of the notation.

\begin{align*} f_{xyx}(x,y) \amp = \frac{\partial}{\px}\left(\frac{\partial}{\py}\left(\frac{\pf}{\px}\right)\right) \text{ and }\\ f_{xyz}(x,y,z) \amp =\frac{\partial}{\partial z}\left(\frac{\partial}{\py}\left(\frac{\pf}{\px}\right)\right)\text{.} \end{align*}

🔗

Example 13.3.16. Higher order partial derivatives.

Let \(f(x,y) = x^2y^2+\sin(xy)\text{.}\) Find \(f_{xxy}\) and \(f_{yxx}\text{.}\)
🔗

🔗
Let \(f(x,y,z) = x^3e^{xy}+\cos(z)\text{.}\) Find \(f_{xyz}\text{.}\)
🔗

🔗

🔗

Solution.

To find \(f_{xxy}\text{,}\) we first find \(f_x\text{,}\) then \(f_{xx}\text{,}\) then \(f_{xxy}\text{:}\)

\begin{align*} f_x(x,y) \amp = 2xy^2+y\cos(xy)\\ f_{xx}(x,y) \amp = 2y^2-y^2\sin(xy)\\ f_{xxy}(x,y) \amp = 4y-2y\sin(xy) - xy^2\cos(xy)\text{.} \end{align*}

To find \(f_{yxx}\text{,}\) we first find \(f_y\text{,}\) then \(f_{yx}\text{,}\) then \(f_{yxx}\text{:}\)

\begin{align*} f_y(x,y) \amp = 2x^2y+x\cos(xy)\\ f_{yx}(x,y) \amp = 4xy + \cos(xy) - xy\sin(xy)\\ f_{yxx}(x,y) \amp = 4y-y\sin(xy) - \big(y\sin(xy) + xy^2\cos(xy)\big)\\ \amp = 4y-2y\sin(xy)-xy^2\cos(xy)\text{.} \end{align*}

Note how \(f_{xxy} = f_{yxx}\text{.}\)

🔗

🔗
To find \(f_{xyz}\text{,}\) we find \(f_x\text{,}\) then \(f_{xy}\text{,}\) then \(f_{xyz}\text{:}\)

\begin{align*} f_x(x,y,z) \amp = 3x^2e^{xy}+ x^3ye^{xy}\\ f_{xy}(x,y,z) \amp= 3x^3e^{xy}+x^3e^{xy}+x^4ye^{xy}\\ \amp = 4x^3e^{xy}+x^4ye^{xy}\\ f_{xyz}(x,y,z) \amp = 0\text{.} \end{align*}

🔗

🔗

🔗

In the previous example we saw that \(f_{xxy} = f_{yxx}\text{;}\) this is not a coincidence. While we do not state this as a formal theorem, as long as each partial derivative is continuous, it does not matter the order in which the partial derivatives are taken. For instance, \(f_{xxy} = f_{xyx} = f_{yxx}\text{.}\)

🔗

This can be useful at times. Had we known this, the second part of Example 13.3.16 would have been much simpler to compute. Instead of computing \(f_{xyz}\) in the \(x\text{,}\) \(y\) then \(z\) orders, we could have applied the \(z\text{,}\) then \(x\) then \(y\) order (as \(f_{xyz} = f_{zxy}\)). It is easy to see that \(f_z = -\sin(z)\text{;}\) then \(f_{zx}\) and \(f_{zxy}\) are clearly 0 as \(f_z\) does not contain an \(x\) or \(y\text{.}\)

🔗

A brief review of this section: partial derivatives measure the instantaneous rate of change of a multivariable function with respect to one variable. With \(z=f(x,y)\text{,}\) the partial derivatives \(f_x\) and \(f_y\) measure the instantaneous rate of change of \(z\) when moving parallel to the \(x\)- and \(y\)-axes, respectively. How do we measure the rate of change at a point when we do not move parallel to one of these axes? What if we move in the direction given by the vector \(\la 2,1\ra\text{?}\) Can we measure that rate of change? The answer is, of course, yes, we can. This is the topic of Section 13.6. First, we need to define what it means for a function of two variables to be differentiable.

🔗

Exercises 13.3.7 Exercises

Terms and Concepts

1.

What is the difference between a constant and a coefficient?

🔗

2.

Given a function \(f(x,y)\text{,}\) explain in your own words how to compute \(f_x\text{.}\)

🔗

3.

In the mixed partial fraction \(f_{xy}\text{,}\) which is computed first, \(f_x\) or \(f_y\text{?}\)

🔗

\(f_x\)
\(f_y\)

🔗

4.

In the mixed partial fraction \(\frac{\partial^2f}{\partial x\partial y}\text{,}\) which is computed first, \(f_x\) or \(f_y\text{?}\)

🔗

\(f_x\)
\(f_y\)

🔗

Problems

Exercise Group.

In the following exercises, evaluate \(f_x(x,y)\) and \(f_y(x,y)\) at the indicated point.

🔗

5.

\(f(x,y) = x^2y-x+2y+3\) at \((1,2)\)

🔗

6.

\(f(x,y) = x^3-3x+y^2-6y\) at \((-1,3)\text{.}\)

🔗

7.

\(f(x,y) = \sin(y) \cos(x)\) at \((\pi/3,\pi/3)\)

🔗

8.

\(f(x,y) = \ln(xy)\) at \((-2,-3)\) Find:

🔗

Exercise Group.

In the following exercises, find \(f_x\text{,}\) \(f_y\text{,}\) \(f_{xx}\text{,}\) \(f_{yy}\text{,}\) \(f_{xy}\) and \(f_{yx}\text{.}\)

🔗

9.

\(f(x,y) = x^2y+3x^2+4y-5\)

🔗

10.

\(f(x,y) = y^3+3xy^2+3x^2y+x^3\)

🔗

11.

\(\ds f(x,y) = \frac xy\)

🔗

12.

\(f(x,y) = \frac{4}{xy}\)

🔗

13.

\(\ds f(x,y) = e^{x^2+y^2}\)

🔗

14.

\(f(x,y) = e^{x+2y}\)

🔗

15.

\(\ds f(x,y) = \sin(x) \cos(y)\)

🔗

16.

\(f(x,y) = (x+y)^3\)

🔗

17.

\(f(x,y) = \cos(5xy^3)\)

🔗

18.

\(f(x,y) = \sin\mathopen{}\left(5x^2+2y^3\right)\mathclose{}\)

🔗

19.

\(f(x,y) = \sqrt{4xy^2+1}\)

🔗

20.

\(\ds f(x,y) =(2x+5y)\sqrt{y}\)

🔗

21.

\(\ds f(x,y) =\frac{1}{x^2+y^2+1}\)

🔗

22.

\(f(x,y) = 5x-17y\)

🔗

23.

\(\ds f(x,y) =3x^2+1\)

🔗

24.

\(f(x,y) = \ln(x^2+y)\)

🔗

25.

\(\ds f(x,y) =\frac{\ln(x) }{4y}\)

🔗

26.

\(f(x,y) = 5e^x\sin(y)+9\)

🔗

Exercise Group.

In the following exercises, form a function \(f(x,y)\) such that \(f_x\) and \(f_y\) match those given.

🔗

27.

\(f_x = \sin(y) +1\text{,}\)\(f_y = x\cos(y)\)

🔗

28.

\(f_x = x+y\) and \(f_y = x+y\)

🔗

29.

\(f_x = 6xy-4y^2\text{,}\)\(f_y = 3x^2-8xy+2\)

🔗

30.

\(f_x = \frac{2x}{x^2+y^2}\) and \(f_y = \frac{2y}{x^2+y^2}\)

🔗

Exercise Group.

In the following exercises, find \(f_x\text{,}\) \(f_y\text{,}\) \(f_z\text{,}\) \(f_{yz}\) and \(f_{zy}\text{.}\)

🔗

31.

\(\ds f(x,y,z) = x^2e^{2y-3z}\)

🔗

32.

\(f(x,y,z) = x^3y^2+x^3z+y^2z\)

🔗

33.

\(\ds f(x,y,z) = \frac{3x}{7y^2z}\)

🔗

34.

\(f(x,y,z) = \ln(xyz)\)

🔗

Prev Top Next