APEX Directional Derivatives

Section 13.6 Directional Derivatives

Partial derivatives give us an understanding of how a surface changes when we move in the \(x\) and \(y\) directions. We made the comparison to standing in a rolling meadow and heading due east: the amount of rise/fall in doing so is comparable to \(f_x\text{.}\) Likewise, the rise/fall in moving due north is comparable to \(f_y\text{.}\) The steeper the slope, the greater in magnitude \(f_y\text{.}\)

🔗

But what if we didn’t move due north or east? What if we needed to move northeast and wanted to measure the amount of rise/fall? Partial derivatives alone cannot measure this. This section investigates directional derivatives, which do measure this rate of change.

🔗

Figure 13.6.1. Introducing directional derivatives

🔗

Subsection 13.6.1 Functions of Two Variables

We begin with a definition.

🔗

Definition 13.6.2. Directional Derivatives.

Let \(z=f(x,y)\) be continuous on a set \(S\) and let \(\vec u = \la u_1,u_2\ra\) be a unit vector. For all points \((x,y)\text{,}\) the directional derivative of \(f\) at \((x,y)\) in the direction of \(\vec u\) is

\begin{equation*} D_{\vec u\,}f(x,y) = \lim_{h\to 0} \frac{f(x+hu_1,y+hu_2) - f(x,y)}h\text{.} \end{equation*}

🔗

The partial derivatives \(f_x\) and \(f_y\) are defined with similar limits, but only \(x\) or \(y\) varies with \(h\text{,}\) not both. Here both \(x\) and \(y\) vary with a weighted \(h\text{,}\) determined by a particular unit vector \(\vec u\text{.}\) This may look a bit intimidating but in reality it is not too difficult to deal with; it often just requires extra algebra. However, the following theorem reduces this algebraic load.

🔗

Theorem 13.6.3. Directional Derivatives.

Let \(z=f(x,y)\) be differentiable on a set \(S\) containing \((x_0,y_0)\text{,}\) and let \(\vec u = \la u_1,u_2\ra\) be a unit vector. The directional derivative of \(f\) at \((x_0,y_0)\) in the direction of \(\vec u\) is

\begin{equation*} D_{\vec u\,}f(x_0,y_0)=f_x(x_0,y_0)u_1 + f_y(x_0,y_0)u_2\text{.} \end{equation*}

🔗

Example 13.6.4. Computing directional derivatives.

Let \(z= 14-x^2-y^2\) and let \(P=(1,2)\text{.}\) Find the directional derivative of \(f\text{,}\) at \(P\text{,}\) in the following directions:

🔗

toward the point \(Q=(3,4)\text{,}\)
🔗

🔗
in the direction of \(\la 2,-1\ra\text{,}\) and
🔗

🔗
toward the origin.
🔗

🔗

🔗

Solution.

The surface is plotted in Figure 13.6.5, where the point \(P=(1,2)\) is indicated in the \(x,y\)-plane as well as the point \((1,2,9)\) which lies on the surface of \(f\text{.}\) We find that \(f_x(x,y) = -2x\) and \(f_x(1,2) = -2\text{;}\) \(f_y(x,y) = -2y\) and \(f_y(1,2) = -4\text{.}\)

🔗

Link to full-sized image

Figure 13.6.5. Understanding the directional derivative in Example 13.6.4

🔗

Let \(\vec u_1\) be the unit vector that points from the point \((1,2)\) to the point \(Q=(3,4)\text{,}\) as shown in the figure. The vector \(\overrightarrow{PQ} = \la 2,2\ra\text{;}\) the unit vector in this direction is \(\vec u_1=\la 1/\sqrt{2}, 1/\sqrt{2}\ra\text{.}\) Thus the directional derivative of \(f\) at \((1,2)\) in the direction of \(\vec u_1\) is

\begin{equation*} D_{\vec u_1}f(1,2) = -2(1/\sqrt{2}) +(-4)(1/\sqrt{2}) = -6/\sqrt{2}\approx -4.24\text{.} \end{equation*}

Thus the instantaneous rate of change in moving from the point \((1,2,9)\) on the surface in the direction of \(\vec u_1\) (which points toward the point \(Q\)) is about \(-4.24\text{.}\) Moving in this direction moves one steeply downward.

🔗

🔗
We seek the directional derivative in the direction of \(\la 2,-1\ra\text{.}\) The unit vector in this direction is \(\vec u_2 = \la 2/\sqrt{5},-1/\sqrt{5}\ra\text{.}\) Thus the directional derivative of \(f\) at \((1,2)\) in the direction of \(\vec u_2\) is

\begin{equation*} D_{\vec u_2}f(1,2) = -2(2/\sqrt{5})+(-4)(-1/\sqrt{5}) = 0\text{.} \end{equation*}

Starting on the surface of \(f\) at \((1,2)\) and moving in the direction of \(\la 2,-1\ra\) (or \(\vec u_2\)) results in no instantaneous change in \(z\)-value. This is analogous to standing on the side of a hill and choosing a direction to walk that does not change the elevation. One neither walks up nor down, rather just “along the side” of the hill. Finding these directions of “no elevation change” is important.

🔗

🔗
At \(P=(1,2)\text{,}\) the direction towards the origin is given by the vector \(\la -1,-2\ra\text{;}\) the unit vector in this direction is \(\vec u_3=\la -1/\sqrt{5},-2/\sqrt{5}\ra\text{.}\) The directional derivative of \(f\) at \(P\) in the direction of the origin is

\begin{equation*} D_{\vec u_3}f(1,2) = -2(-1/\sqrt{5}) + (-4)(-2/\sqrt{5}) = 10/\sqrt{5} \approx 4.47\text{.} \end{equation*}

Moving towards the origin means “walking uphill” quite steeply, with an initial slope of about \(4.47\text{.}\)

🔗

🔗

🔗

Figure 13.6.6. Computing directional derivatives

🔗

As we study directional derivatives, it will help to make an important connection between the unit vector \(\vec u = \la u_1,u_2\ra\) that describes the direction and the partial derivatives \(f_x\) and \(f_y\text{.}\) We start with a definition and follow this with a Key Idea.

🔗

Definition 13.6.7. Gradient.

Let \(z=f(x,y)\) be differentiable on a set \(S\) that contains the point \((x_0,y_0)\text{.}\)

The gradient of \(f\) is \(\nabla f(x,y) = \la f_x(x,y),f_y(x,y)\ra\text{.}\)
🔗

🔗
The gradient of \(f\) at \((x_0,y_0)\) is \(\nabla f(x_0,y_0) = \la f_x(x_0,y_0),f_y(x_0,y_0)\ra\text{.}\)
🔗

🔗

🔗

To simplify notation, we often express the gradient as \(\nabla f = \la f_x, f_y\ra\text{.}\) The gradient allows us to compute directional derivatives in terms of a dot product.

🔗

Key Idea 13.6.8. The Gradient and Directional Derivatives.

The directional derivative of \(z=f(x,y)\) in the direction of \(\vec u\) is

\begin{equation*} D_{\vec u\,}f = \nabla f\cdot \vec u\text{.} \end{equation*}

🔗

The properties of the dot product previously studied allow us to investigate the properties of the directional derivative. Given that the directional derivative gives the instantaneous rate of change of \(z\) when moving in the direction of \(\vec u\text{,}\) three questions naturally arise:

🔗

In what direction(s) is the change in \(z\) the greatest (i.e., the “steepest uphill”)?
🔗

🔗
In what direction(s) is the change in \(z\) the least (i.e., the “steepest downhill”)?
🔗

🔗
In what direction(s) is there no change in \(z\text{?}\)
🔗

🔗

🔗

Using the key property of the dot product, we have

\begin{equation} \nabla f\cdot \vec u = \norm{\nabla f}\,\vnorm u \cos(\theta) = \norm{\nabla f}\cos(\theta)\text{,}\tag{13.6.1} \end{equation}

where \(\theta\) is the angle between the gradient and \(\vec u\text{.}\) (Since \(\vec u\) is a unit vector, \(\vnorm{u} = 1\text{.}\)) This equation allows us to answer the three questions stated previously.

🔗

Equation (13.6.1) is maximized when \(\cos(\theta) =1\text{,}\) i.e., when the gradient and \(\vec u\) have the same direction. We conclude the gradient points in the direction of greatest \(z\) change.
🔗

🔗
Equation (13.6.1) is minimized when \(\cos(\theta) = -1\text{,}\) i.e., when the gradient and \(\vec u\) have opposite directions. We conclude the gradient points in the opposite direction of the least \(z\) change.
🔗

🔗
Equation (13.6.1) is 0 when \(\cos(\theta) = 0\text{,}\) i.e., when the gradient and \(\vec u\) are orthogonal to each other. We conclude the gradient is orthogonal to directions of no \(z\) change.
🔗

🔗

🔗

This result is rather amazing. Once again imagine standing in a rolling meadow and face the direction that leads you steepest uphill. Then the direction that leads steepest downhill is directly behind you, and side-stepping either left or right (i.e., moving perpendicularly to the direction you face) does not change your elevation at all.

🔗

Recall that a level curve is defined as a curve in the \(xy\)-plane along which the \(z\)-values of a function do not change. Let a surface \(z=f(x,y)\) be given, and let’s represent one such level curve as a vector-valued function, \(\vrt = \la x(t), y(t)\ra\text{.}\) As the output of \(f\) does not change along this curve, \(f\big(x(t),y(t)\big) = c\) for all \(t\text{,}\) for some constant \(c\text{.}\)

🔗

Since \(f\) is constant for all \(t\text{,}\) \(\frac{df}{dt} = 0\text{.}\) By the Multivariable Chain Rule, we also know

\begin{align*} \frac{df}{dt} \amp = f_x(x,y)x'(t) + f_y(x,y)\yp(t)\\ \amp = \la f_x(x,y),f_y(x,y)\ra \cdot \la x'(t),\yp(t)\ra\\ \amp = \nabla f \cdot \vrp(t)\\ \amp =0\text{.} \end{align*}

🔗

This last equality states \(\nabla f \cdot \vrp(t) = 0\text{:}\) the gradient is orthogonal to the derivative of \(\vec r\text{,}\) meaning the gradient is orthogonal to the graph of \(\vec r\text{.}\) Our conclusion: at any point on a surface, the gradient at that point is orthogonal to the level curve that passes through that point.

🔗

We restate these ideas in a theorem, then use them in an example.

🔗

Theorem 13.6.9. The Gradient and Directional Derivatives.

Let \(z=f(x,y)\) be differentiable on a set \(S\) with gradient \(\nabla f\text{,}\) let \(P=(x_0,y_0)\) be a point in \(S\) and let \(\vec u\) be a unit vector.

🔗

The maximum value of \(D_{\vec u\,}f(x_0,y_0)\) is \(\norm{\nabla f(x_0,y_0)}\text{;}\) the direction of maximal \(z\) increase is \(\nabla f(x_0,y_0)\text{.}\)
🔗

🔗
The minimum value of \(D_{\vec u\,}f(x_0,y_0)\) is \(-\norm{\nabla f(x_0,y_0)}\text{;}\) the direction of minimal \(z\) increase is \(-\nabla f(x_0,y_0)\text{.}\)
🔗

🔗
At \(P\text{,}\) \(\nabla f(x_0,y_0)\) is orthogonal to the level curve passing through \(\big(x_0,y_0\big)\text{.}\)
🔗

🔗

🔗

Example 13.6.10. Finding directions of maximal and minimal increase.

Let \(f(x,y) = \sin(x) \cos(y)\) and let \(P=(\pi/3,\pi/3)\text{.}\) Find the directions of maximal/minimal increase, and find a direction where the instantaneous rate of \(z\) change is 0.

🔗

Solution.

We begin by finding the gradient. \(f_x = \cos(x) \cos(y)\) and \(f_y = -\sin(x) \sin(y)\text{,}\) thus

\begin{equation*} \nabla f = \la \cos(x) \cos(y) ,-\sin(x) \sin(y) \ra \text{ and, at \(P\), } \nabla f\left(\frac{\pi}3,\frac{\pi}3\right) = \la\frac14,-\frac34\ra\text{.} \end{equation*}

🔗

Thus the direction of maximal increase is \(\la 1/4, -3/4\ra\text{.}\) In this direction, the instantaneous rate of \(z\) change is \(\norm{\la 1/4,-3/4\ra} = \sqrt{10}/4 \approx 0.79\text{.}\)

🔗

Figure 13.6.11 shows the surface plotted from two different perspectives. In each, the gradient is drawn at \(P\) with a dashed line (because of the nature of this surface, the gradient points “into” the surface). Let \(\vec u = \la u_1, u_2\ra\) be the unit vector in the direction of \(\nabla f\) at \(P\text{.}\) Each graph of the figure also contains the vector \(\la u_1, u_2, \norm{\nabla f\,}\ra\text{.}\) This vector has a “run” of 1 (because in the \(xy\)-plane it moves 1 unit) and a “rise” of \(\norm{\nabla f}\text{,}\) hence we can think of it as a vector with slope of \(\norm{\nabla f}\) in the direction of \(\nabla f\text{,}\) helping us visualize how “steep” the surface is in its steepest direction.

🔗

Link to full-sized image

(a)

🔗

Link to full-sized image

(b)

🔗

Figure 13.6.11. Graphing the surface and important directions in Example 13.6.10

🔗

The direction of minimal increase is \(\la -1/4,3/4\ra\text{;}\) in this direction the instantaneous rate of \(z\) change is \(-\sqrt{10}/4 \approx -0.79\text{.}\)

🔗

Any direction orthogonal to \(\nabla f\) is a direction of no \(z\) change. We have two choices: the direction of \(\la 3,1\ra\) and the direction of \(\la -3,-1\ra\text{.}\) The unit vector in the direction of \(\la 3,1\ra\) is shown in each graph of the figure as well. The level curve at \(z=\sqrt{3}/4\) is drawn: recall that along this curve the \(z\)-values do not change. Since \(\la 3,1\ra\) is a direction of no \(z\)-change, this vector is tangent to the level curve at \(P\text{.}\)

🔗

Example 13.6.12. Understanding when \(\nabla f = \vec 0\).

Let \(f(x,y) = -x^2+2x-y^2+2y+1\text{.}\) Find the directional derivative of \(f\) in any direction at \(P=(1,1)\text{.}\)

🔗

Solution.

We find \(\nabla f = \la -2x+2, -2y+2\ra\text{.}\) At \(P\text{,}\) we have \(\nabla f(1,1) = \la 0,0\ra\text{.}\) According to Theorem 13.6.9, this is the direction of maximal increase. However, \(\la 0,0\ra\) is directionless; it has no displacement. And regardless of the unit vector \(\vec u\) chosen, \(D_{\vec u\,}f = 0\text{.}\)

🔗

Figure 13.6.13 helps us understand what this means. We can see that \(P\) lies at the top of a paraboloid. In all directions, the instantaneous rate of change is 0.

🔗

So what is the direction of maximal increase? It is fine to give an answer of \(\vec 0 = \la 0,0\ra\text{,}\) as this indicates that all directional derivatives are 0.

🔗

Link to full-sized image

Figure 13.6.13. At the top of a paraboloid, all directional derivatives are 0

🔗

The fact that the gradient of a surface always points in the direction of steepest increase/decrease is very useful, as illustrated in the following example.

🔗

Example 13.6.14. The flow of water downhill.

Consider the surface given by the graph of \(f(x,y)= 20-x^2-2y^2\text{.}\) Water is poured on the surface at \((1,1/4)\text{.}\) What path does it take as it flows downhill?

🔗

Solution.

Let \(\vrt = \la x(t), y(t)\ra\) be the vector-valued function describing the path of the water in the \(xy\)-plane; we seek \(x(t)\) and \(y(t)\text{.}\) We know that water will always flow downhill in the steepest direction; therefore, at any point on its path, it will be moving in the direction of \(-\nabla f\text{.}\) (We ignore the physical effects of momentum on the water.) Thus \(\vrp(t)\) will be parallel to \(\nabla f\text{,}\) and there is some constant \(c\) such that \(c\nabla f = \vrp(t) = \la x'(t), \yp(t)\ra\text{.}\)

🔗

We find \(\nabla f = \la -2x, -4y\ra\) and write \(x'(t)\) as \(\frac{dx}{dt}\) and \(\yp(t)\) as \(\frac{dy}{dt}\text{.}\) Then

\begin{align*} c\nabla f \amp = \la x'(t), \yp(t)\ra\\ \la -2cx, -4cy \ra \amp = \la \frac{dx}{dt}, \frac{dy}{dt}\ra\text{.} \end{align*}

🔗

This implies

\begin{equation*} -2cx = \frac{dx}{dt} \text{ and } -4cy =\frac{dy}{dt}, \text{i.e.,} \end{equation*}

\begin{equation*} c = -\frac{1}{2x}\frac{dx}{dt} \text{ and } c =-\frac{1}{4y}\frac{dy}{dt}\text{.} \end{equation*}

🔗

As \(c\) equals both expressions, we have

\begin{equation*} \frac{1}{2x}\frac{dx}{dt} =\frac{1}{4y}\frac{dy}{dt}\text{.} \end{equation*}

🔗

To find an explicit relationship between \(x\) and \(y\text{,}\) we can integrate both sides with respect to \(t\text{.}\) Recall from our study of differentials that \(\frac{dx}{dt}dt = dx\text{.}\) Thus:

\begin{align*} \int \frac{1}{2x}\frac{dx}{dt}\,dt \amp =\int \frac{1}{4y}\frac{dy}{dt}\,dt\\ \int \frac{1}{2x}\,dx \amp =\int\frac{1}{4y}\,dy\\ \frac 12\ln\abs{x} \amp = \frac14\ln\abs{y} +C_1\\ 2\ln\abs{x} \amp = \ln\abs{y} +C_1\\ \ln\abs{x^2} \amp = \ln\abs{y}+C_1\\ \end{align*}

Now raise both sides as a power of \(e\text{:}\)

\begin{align*} x^2 \amp = e^{\ln\abs{y}+C_1}\\ x^2 \amp = e^{\ln\abs{y}}e^{C_1}\qquad \text{(Note that \(e^{C_1}\) is just a constant.)}\\ x^2 \amp = yC_2\\ \frac1{C_2}x^2 \amp =y \qquad \text{ (Note that \(1/C_2\) is just a constant.) }\\ Cx^2 \amp = y\text{.} \end{align*}

🔗

As the water started at the point \((1,1/4)\text{,}\) we can solve for \(C\text{:}\)

\begin{equation*} C(1)^2 = \frac14 \Rightarrow C = \frac14\text{.} \end{equation*}

🔗

A contour plot of the paraboloid in this example, and the corresponding trajectory in the plane. — (a)
🔗

Thus the water follows the curve \(y=x^2/4\) in the \(xy\)-plane. The surface and the path of the water is graphed in Figure 13.6.15.(a). In Figure 13.6.15.(b), the level curves of the surface are plotted in the \(xy\)-plane, along with the curve \(y=x^2/4\text{.}\) Notice how the path intersects the level curves at right angles. As the path follows the gradient downhill, this reinforces the fact that the gradient is orthogonal to level curves.

🔗

Figure 13.6.16. Discussing the significance of directional derivatives

🔗

Subsection 13.6.2 Functions of Three Variables

The concepts of directional derivatives and the gradient are easily extended to three (and more) variables. We combine the concepts behind Definitions 13.6.2 and 13.6.7 and Theorem 13.6.3 into one set of definitions.

🔗

Definition 13.6.17. Directional Derivatives and Gradient with Three Variables.

Let \(w=F(x,y,z)\) be differentiable on a set \(D\) and let \(\vec u\) be a unit vector in \(\mathbb{R}^3\text{.}\)

The gradient of \(F\) is \(\nabla F = \la F_x,F_y,F_z\ra\text{.}\)
🔗

🔗
The directional derivative of \(F\) in the direction of \(\vec u\) is

\begin{equation*} D_{\vec u\,}F=\nabla F\cdot \vec u\text{.} \end{equation*}

🔗

🔗

🔗

The same properties of the gradient given in Theorem 13.6.9, when \(f\) is a function of two variables, hold for \(F\text{,}\) a function of three variables.

🔗

Theorem 13.6.18. The Gradient and Directional Derivatives with Three Variables.

Let \(w=F(x,y,z)\) be differentiable on a set \(D\text{,}\) let \(\nabla F\) be the gradient of \(F\text{,}\) and let \(\vec u\) be a unit vector.

The maximum value of \(D_{\vec u\,}F\) is \(\norm{\nabla F}\text{,}\) obtained when the angle between \(\nabla F\) and \(\vec u\) is 0, i.e., the direction of maximal increase is \(\nabla F\text{.}\)
🔗

🔗
The minimum value of \(D_{\vec u\,}F\) is \(-\norm{\nabla F}\text{,}\) obtained when the angle between \(\nabla F\) and \(\vec u\) is \(\pi\text{,}\) i.e., the direction of minimal increase is \(-\nabla F\text{.}\)
🔗

🔗
\(D_{\vec u\,}F = 0\) when \(\nabla F\) and \(\vec u\) are orthogonal.
🔗

🔗

🔗

We interpret the third statement of the theorem as “the gradient is orthogonal to level surfaces,” the three-variable analogue to level curves.

🔗

Example 13.6.19. Finding directional derivatives with functions of three variables.

If a point source \(S\) is radiating energy, the intensity \(I\) at a given point \(P\) in space is inversely proportional to the square of the distance between \(S\) and \(P\text{.}\) That is, when \(S=(0,0,0)\text{,}\) \(I(x,y,z) = \frac{k}{x^2+y^2+z^2}\) for some constant \(k\text{.}\)

🔗

Let \(k=1\text{,}\) let \(\vec u = \la 2/3, 2/3, 1/3\ra\) be a unit vector, and let \(P = (2,5,3)\text{.}\) Measure distances in inches. Find the directional derivative of \(I\) at \(P\) in the direction of \(\vec u\text{,}\) and find the direction of greatest intensity increase at \(P\text{.}\)

🔗

Solution.

We need the gradient \(\nabla I\text{,}\) meaning we need \(I_x\text{,}\) \(I_y\) and \(I_z\text{.}\) Each partial derivative requires a simple application of the Quotient Rule, giving

\begin{align*} \nabla I \amp = \la \frac{-2x}{(x^2+y^2+z^2)^2},\frac{-2y}{(x^2+y^2+z^2)^2},\frac{-2z}{(x^2+y^2+z^2)^2}\ra\\ \nabla I(2,5,3) \amp = \la \frac{-4}{1444},\frac{-10}{1444},\frac{-6}{1444}\ra \approx \la -0.003,-0.007,-0.004\ra\\ D_{\vec u\,}I \amp = \nabla I(2,5,3)\cdot \vec u\\ \amp = -\frac{17}{2166} \approx -0.0078\text{.} \end{align*}

🔗

The directional derivative tells us that moving in the direction of \(\vec u\) from \(P\) results in a decrease in intensity of about \(-0.008\) units per inch. (The intensity is decreasing as \(\vec u\) moves one farther from the origin than \(P\text{.}\))

🔗

The gradient gives the direction of greatest intensity increase. Notice that

\begin{align*} \nabla I(2,5,3) \amp = \la \frac{-4}{1444},\frac{-10}{1444},\frac{-6}{1444}\ra\\ \amp = \frac{2}{1444}\la -2,-5,-3\ra\text{.} \end{align*}

🔗

That is, the gradient at \((2,5,3)\) is pointing in the direction of \(\la -2,-5,-3\ra\text{,}\) that is, towards the origin. That should make intuitive sense: the greatest increase in intensity is found by moving towards to source of the energy.

🔗

The directional derivative allows us to find the instantaneous rate of \(z\) change in any direction at a point. We can use these instantaneous rates of change to define lines and planes that are tangent to a surface at a point, which is the topic of the next section.

🔗

Exercises 13.6.3 Exercises

Terms and Concepts

1.

What is the difference between a directional derivative and a partial derivative?

🔗

2.

For \(f(x,y)\text{,}\) for what \(\vec u\) is \(D_{\vec u}\, f = f_x\text{?}\)

🔗

\(\veci\)
🔗
\(\vecj\)
🔗
\(\veck\)
🔗

🔗

3.

For \(f(x,y)\text{,}\) for what \(\vec u\) is \(D_{\vec u}\, f = f_y\text{?}\)

🔗

\(\veci\)
🔗
\(\vecj\)
🔗
\(\veck\)
🔗

Consider \(f(x,y) = x^2y^3-2x\text{,}\) at \(P = (1,1)\text{.}\)

🔗

(a)

Find the directional derivative in the direction of \(\vec v=\la 3,3\ra\text{.}\)

🔗

(b)

Find the directional derivative in the direction toward the point \(Q = (1,2)\text{.}\)

🔗

Exercise Group.

In the following exercises, a function \(f(x,y)\) and a point \(P\) are given. Investigate the directions of maximal increase and decrease, as indicated.

🔗

🔗

Prev Top Next