Skip to main content

Section 6.2 Erdős-Szekeres Problem of Convex Polygons - Part Two

All generalizations are false, including this one. — Mark Twain, American author, 1835 — 1910

Reminder. For any integer \(n\geq 3\text{,}\) determine the smallest positive integer \(N(n)\) such that any set of at least \(N(n)\) points in general position in the plane (i.e. no three of the points are on a line) contains \(n\) points that are the vertices of a convex \(n\)-gon.

  1. \(\displaystyle N(3)=\)

  2. \(\displaystyle N(4)=\)

  3. \(\displaystyle N(5)=\)

Question 6.2.1.
  1. Does \(N(n)\) exist for any \(n\geq 3\text{?}\)

  2. Which values of \(N(n)\) are known?

  3. Are there any bounds for the size of \(N(n)\text{?}\)

Pattern?

  1. \(\displaystyle N(3)=3=2^1+1=2^{3-2}+1\)

  2. \(\displaystyle N(4)=5=2^2+1=2^{4-2}+1\)

  3. \(\displaystyle N(5)=9=2^3+1=2^{5-2}+1\)

Well… Szekeres and Peters, 2006:

\begin{equation*} N(6)=17=2^4+1=2^{6-2}+1\text{.} \end{equation*}

Not long before his death in 1996, Erdős wrote that he would pay $500 for a proof of this conjecture.

Still… How do we prove that \(N(n)\) exists for all \(n\geq 3\text{?}\)

Two Theorems:

Recall Theorem 2.3.19:

Ramsey's Theorem. For any natural numbers \(k,r, l_1, \ldots,l_r\) there exists the least natural number \(m_0=R(k;l_1,l_2,\ldots,l_r)\) such that for any \(m\geq m_0\text{,}\) if the set of all \(k\)-element subset of the set \(S_m\text{,}\) where \(|S_m|=m\text{,}\) is \(r\)-coloured then there exists \(i\in [1,r]\) and the \(l_i\)-element subset \(\Delta_{l_i} \subseteq S_m\) such that all its \(k\)-element subsets have the colour \(i\text{.}\)

(Proof by induction.)

We already know that \(N(3)=3\text{,}\) \(N(4)=5\text{,}\) and \(N(5)=9\text{.}\)

Let \(n\geq 4\text{.}\) Let \(m>R(4;n,5)\) and let \(S_m\) be a set of \(m\) points in the plane in general position. Let

\begin{equation*} S_m^{(4)}=\{ \{A,B,C,D\}: A,B,C,D\in S_m\}\text{,} \end{equation*}

i.e., let \(S_m^{(4)}\) be the set of all four-element subsets of \(S_m\text{.}\)

We define a 2-colouring

\begin{equation*} c:S_m^{(4)}\to \{ \color{red}{\bullet}, \color{blue}{\bullet}\} \end{equation*}

in the following way. For \(T\in S_m^{(4)}\)

\begin{equation*} c(T)=\left\{ \begin{array}{ll} \color{red}{\bullet}\amp \mbox{if } T\mbox{ forms a concave quadrilateral} \\ \color{blue}{\bullet}\amp \mbox{if } T\mbox{ forms a convex quadrilateral.}\\ \end{array} \right. \end{equation*}

See Figure 6.2.5.

Figure 6.2.5. A convex quadrilateral and a non-convex quadrilateral

By Ramsey's theorem (and our choice of \(m>R(4;n,5)\)) there is an \(n\)-element set \(\Delta_n\subset S_m\) such that all of its four-element subsets are coloured blue or a \(5\)-element set \(\Delta_5\subset S_m\) such that all of its four-element subsets are coloured red.

Since \(N(4)=5\text{,}\) any set of five points in the plane in general position contains a convex quadrilateral. This implies that it is impossible to find a \(5\)-element set \(\Delta_5\subset S_m\) such that all of its four-element subsets are coloured red.

Hence there must be an \(n\)-element set \(\Delta_n\subset S_m\) such that all of its four-element subsets are coloured blue. But then, by Lemma, the set \(\Delta_n\) forms a convex \(n\)-gon.

Therefore… For any \(n\geq 4\text{,}\) \(N(n)\leq R(4;n,5)\text{.}\)

Cups and Caps. See Figure 6.2.6.

Figure 6.2.6. Cups and caps
Observation 6.2.7.

Note that in a \(k\)-cup, the sequence of slopes is increasing and that in an \(l\)-cap, the sequence of slopes is decreasing.

Observation 6.2.8.

It is clear that if we find a cup or a cap in a set \(S\) in some system of coordinates, then we will also find a convex polygon. See Figure 6.2.9.

Figure 6.2.9. Convex polygons from cups and caps
Observation 6.2.10.

The expression “in some system of coordinates” can be substituted for the expression “in any system of coordinates, in which there are no two points in \(S\) that belong to the vertical line”. Let us call any system of coordinates with such property right for \(S\). In what follows we will always assume that, for a given set \(S\) of points in general position, we have chosen a coordinate system that is right for \(S\text{.}\) See Figure 6.2.11.

Figure 6.2.11. Making a right coordinate system
Definition 6.2.12.

For \(k,l\geq 3\) we define \(f(k,l)\) to be the least positive integers such that any set \(S\) of points in the plane (with a given coordinate system that is ‘right’ for \(S\)) in general position such that

\begin{equation*} |S|\geq f(k,l) \end{equation*}

contains either a \(k\)-cup or an \(l\)-cap.

We prove the theorem by induction on \(m=k+l\) if \(m\geq 6\text{.}\)

  1. Base Case: For any \(k\geq 3\text{,}\)

    \begin{equation*} f(k,3)=f(3,k)=k\text{.} \end{equation*}

    Take a set \(S\) with \(k\) points in the plane in general position. Let

    \begin{equation*} S=\{ A_1,A_2,\ldots, A_k\} \end{equation*}

    where for \(i\lt j\text{,}\) the \(x\)-coordinate of \(A_i\) is less than the the \(x\)-coordinate of \(A_j\text{.}\)

    Let \(s_i\) be the slope of the line segment \(\overline{A_iA_{i+1}}\text{,}\) for \(i\in [1,k-1]\text{.}\) If

    \begin{equation*} s_1\lt s_2\lt \cdots \lt s_{k-1} \end{equation*}

    then \(S\) contains a \(k\)-cup.

    If, for some \(i\in [1,k-2]\text{,}\)

    \begin{equation*} s_i>s_{i+1} \end{equation*}

    then the set \(S\) contains a 3-cap \(\{A_i,A_{i+1},A_{i+2}\}\text{.}\)

    Figure 6.2.14. \(f(k,3)=f(3,k)=k\)
  2. Note that we have proved that \(f(3,3)=3\) and \(f(4,3)=f(3,4)=4\text{.}\) In particular this means that if \(k,l\geq3\) are such that \(k+l\leq 7\) then \(f(k,l)\) exists.

  3. Inductive Step: Let \(m\geq 7\) and \(k,l\geq 3\) be such whenever \(k+l=m\) then \(f(k,l)\) exists. We choose \(k,l\geq 3\) such that

    \begin{equation*} k+l=m+1\text{.} \end{equation*}

    This implies that

    \begin{equation*} (k-1)+l=k+(l-1)=m \end{equation*}

    and, hence \(f(k-1,l)\) and \(f(k,l-1)\) exist.

    Let

    \begin{equation*} n=f(k-1,l)+f(k,l-1)-1\text{.} \end{equation*}

    Let us fix a set \(S\) of cardinality \(n\) and any right for \(S\) system of coordinates. We have to prove that \(S\) contains either a \(k\)-cup or an \(l\)-cap. Let \(L\) be the set of all points that are the left ends of \((k-1)\)-cups in \(S\text{.}\)

    Figure 6.2.15. For \(k-1=4\text{,}\) the top left point belongs to \(L\text{.}\)
    1. Let us assume first that the set \(S\backslash L\) has at least \(f(k-1, l)\) points. Then it contains either a \((k-1)\)-cup or an \(l\)-cap. But taking the set \(L\) out of \(S\) destroys all \((k-1)\)-cups in \(S\text{.}\) Hence \(S\backslash L\) does not contain any \((k-1)\)-cups and therefore must contain an \(l\)-cap.

    2. Suppose then that \(|S\backslash L| \leq f(k-1, l) -1\text{.}\) It follows

      \begin{equation*} |L| = |S| - |S\backslash L| \geq n - f(k-1, l) + 1 = f(k, l-1)\text{.} \end{equation*}

      Therefore, there exists a \(k\)-cup in \(L\) (and everything is alright) or there exists a \((l-1)\)-cap in \(L\text{.}\)

      Let us consider the point \(Y\text{,}\) the right end of that cap.

      Let \(X\) be the point that is immediately to the left of \(Y\) in the \((l-1)\)-cap in \(L\text{.}\) Since \(Y\in L\text{,}\) the point \(Y\) is a left end of some (\(k-1)\)-cup in \(S\text{.}\)

      Let \(Z\) be the point that is immediately to the right of \(Y\) in the \((k-1)\)-cup in \(S\text{.}\) If

      \begin{equation*} \text{ Slope of } \overline{XY}>\text{ Slope of } \overline{YZ} \end{equation*}

      then adding the point \(Z\) to the \((l-1)\)-cap in \(L\) makes an \(l\)-cap in \(S\text{.}\) (See Figure 6.2.16.)

      Figure 6.2.16. Getting an \(l\)-cap in \(S\) from an \((l-1)\)-cap in \(L\text{.}\)

      If

      \begin{equation*} \text{ Slope of } \overline{XY}\lt \text{ Slope of } \overline{YZ} \end{equation*}

      then adding the point \(X\) to the \((k-1)\)-cup in \(S\) makes an \(k\)-cup in \(S\text{.}\) (See Figure 6.2.17.)

      Figure 6.2.17. Getting a \(k\)-cup in \(S\) from an \(l\)-cap in \(L\text{.}\)

      Therefore, any set \(S\) such that

      \begin{equation*} |S|=n=f(k-1,l)+f(k,l-1)-1 \end{equation*}

      contains either a \(k\)-cup or an \(l\)-cup which implies that \(f(k,l)\) exists and that

      \begin{equation*} f(k,l)\leq f(k-1,l)+f(k,l-1)-1\text{.} \end{equation*}

    By the Principle of Mathematical Induction, \(f(k,l)\) exists for any \(k,l\geq 3\text{.}\)

How big is \(f(k,l)\text{?}\)

\begin{equation*} f(k,l)\le \binom{k+l-4}{k-2}+1\text{.} \end{equation*}

Proof via induction on \(k+l\text{:}\) If \(k+l=6\) then \(k=l=3\) and

\begin{equation*} f(3,3)=3 \text{ and } \binom{3+3-4}{3-2}+1=2+1=3\text{.} \end{equation*}

If \(k+l=7\) then \(k=4\) and \(l=3\) or \(k=3\) and \(l=4\text{.}\) From

\begin{equation*} f(4,3)=f(3,4)=4 \text{ and } \binom{4+3-4}{4-2}+1=3+1=4 \text{ and } \binom{3+4-4}{3-2}+1=3+1=4 \end{equation*}

we conclude that in the case that \(k+l=7\) we have

\begin{equation*} f(k,l)\le \binom{k+l-4}{k-2}+1\text{.} \end{equation*}

Suppose that \(m\geq 7\) is such that whenever \(k,l\geq 3\) are such such that \(k+l=m\) then

\begin{equation*} f(k,l)\le \binom{k+l-4}{k-2}+1\text{.} \end{equation*}

Actually…

\begin{equation*} f(k,l)= \binom{k+l-4}{k-2}+1\text{.} \end{equation*}

Back to \(N(n)\text{:}\)

\begin{equation*} N(n)\leq f(n,n)\leq \binom{2n-4}{n-2}+1\text{.} \end{equation*}

Also…

\begin{equation*} N(n)\geq 2^{n-2}+1\text{.} \end{equation*}

What is known?

  1. The next step is if not to prove the hypothesis, then at least to improve the estimation a little bit.

  2. The inequality

    \begin{equation*} N(n) \leq \binom{2n-4}{n-2}+1 \end{equation*}

    was proved by Erdős and Szekeres in 1935.

  3. And in 1998 there were three improvements at once!

    1. The first of them was made by F. Chung and R. Graham:

      \begin{equation*} N(n) \leq \binom{2n-4}{n-2}\text{.} \end{equation*}
    2. D. Kleitman and L. Pachtler showed that it is true that

      \begin{equation*} N(n) \leq \binom{2n-4}{n-2}+7-2n \end{equation*}
    3. The third improvement was achieved by G. Tot and P. Vultr:

      \begin{equation*} N(n) \leq \binom{2n-5}{n-2}+2\text{.} \end{equation*}

    The last result is approximately twice as good as the result of Erdős - Szekeres.

  4. The current record also belongs to Tot and Vultr (2005).

    \begin{equation*} N(n) \leq \binom{2n-5}{n-2}+1, \ n\geq 5\text{.} \end{equation*}
  5. And that was it, no more improvement. This is a classic problem, and crowds of people are working on it.

  6. Chung and Graham offered $100 for the first proof that

    \begin{equation*} g(n) \leq c^n\text{,} \end{equation*}

    where \(c\lt 4\) is a constant.

Resources.

  1. Happy Ending Problem - Wikipedia

  2. Happy Ending Problem by Ron Graham

  3. Happy Ending Problem by D. Harvey

  4. Erdős, P.; Szekeres, G. (1935), “A combinatorial problem in geometry”, Compositio Math 2: 463-470.

  5. The Erdos-Szekeres problem on points in convex position - a survey by W. Morris and V. Soltan

  6. A Puzzle of Clever Connections Nears a Happy End by Kevin Hartnett