What pedagogical motivation is there for the existence/approximation of pi?

In summary, the conversation is discussing the topic of finding a mathematical proof or derivation of the constant pi from a few first principles. The goal is to carefully illustrate how definitions can lead to a useful definition and approximation of pi. The conversation also touches on the use of measure theory and the idea of Archimedes to achieve this goal. Finally, the conversation ends with a suggestion to use the exponential function and trigonometric functions to define and approximate pi.
  • #1
Dr. Seafood
121
0
This is being discussed in micromass's "Math stuff that hasn't been proven" thread, but I want to be particular about this topic.

Essentially, I think I'm looking for a "proof" or derivation of pi from few first principles. Honestly I have no idea which is the "purest", most motivating question to ask that reveals the number [itex]\pi[/itex] as a constant.

After defining some (relatively) intuitive axioms, I think it's okay to forsake some rigor (proofs of existence/uniqueness) so that we can really understand the meat of the origins of [itex]\pi[/itex]. Probably, a few first questions are: What is a circle? Why is the ratio of a circle's circumference to its diameter a constant; moreover, why is it a number between 3 and 4? Which "first principles" should we accept for the derivation of this number? How can we use a limiting expression to evaluate a decimal approximation for pi?
The goal is to carefully illustrate how we can begin with definitions, and arrive at a useful definition and approximation for pi.

To make analogy: teaching elementary calculus a few weeks ago, I tried to develop the use of e. I'm pretty much looking for a similar derivation of pi. Obviously it would rely more on a geometrically motivated problem, instead of a beginner's calculus one. Here is a quick recap of how I did this. Sorry for tl;dr.

Begin by defining exponentiation and exponential function, and making continuity arguments to extend "repeated multiplication" to entire real domain. The "motivating question" is "What is [itex]{d \over dx} {b^x}, \quad b > 1[/itex]"?

Using Newton's quotient (first principles) to differentiate, after some algebra which should be familiar to you all, we get
[tex]{{d \over dx} {b^x}} = {b^x} {\lim_{\Delta{x} \rightarrow 0} {{{b^{\Delta{x}} - 1} \over {\Delta{x}}}}} \qquad \forall x \in ℝ[/tex]
Now we make an assumption, for the sake of teaching, that the limit involved here exists. Let [itex]M = {\lim_{\Delta{x} \rightarrow 0} {{{b^{\Delta{x}} - 1} \over {\Delta{x}}}}}[/itex] so that we can write [itex]{{d \over dx} {b^x}} = M{b^x}[/itex]. Note that since M is just the definition of the derivative (Newton's quotient) of bx at x = 0, geometrically this limit is the slope of the exponential function at x = 0. Another assumption, again for pedagogy instead of rigor: we assert that a unique number [itex]e[/itex] exists so that [itex]M = {\lim_{\Delta{x} \rightarrow 0} {{{e^{\Delta{x}} - 1} \over {\Delta{x}}}}} = 1[/itex]. The derivative of ex at x = 0 is exactly 1. We persuade by claiming k ∈ ℝ can be chosen so that y = bkx = (bk)x has slope 1 at x = 0, i.e. bk = e: choose k = 1/M; we can always "stretch" an exponential function so that its slope at x = 0 is 1, or so that the base of the function is e. Obviously the coolest result here is that [itex]{d \over dx} e^x = e^x[/itex]!

Now define ln x to be the function such that [itex]\ln(e^x) = x[/itex] and [itex]e^{\ln x} = x[/itex]. Note then by this definition, ln 1 = 0 and ln e = 1. To find [itex]{d \over dx} \ln x[/itex], let y = ln x so x = ey, and differentiate implicitly to yield [itex]{{d \over dx} \ln x} = {1 \over x}[/itex].

Observe that b = eln b holds, so (using the chain rule):
[tex]{{d \over dx} b^x} = {{d \over dx} e^{{x}{\ln b}}} = ({\ln b})e^{{x}{\ln b}} = ({\ln b})b^{x}[/tex]
We can see that the limit M = ln b, solving the problem of finding the derivative of a general exponential function. Now we want to achieve a decimal approximation for the "magic exponential number" e. Using the continuity of ln (limit of log is log of limit), we evaluate the following limit:
[tex]{\lim_{x \rightarrow \infty} (1 + {1 \over x})^x}[/tex]
Let [itex]{\Delta{x}} = {1 \over x}[/itex]. Take the log of the limiting expression to get
[tex]{\ln({(1 + {1 \over x})^x})} = x\ln(1 + {1 \over x})
= {1 \over \Delta{x}}{\ln(1 + \Delta{x})} = {1 \over \Delta{x}}({\ln(1 + \Delta{x}) - \ln 1})[/tex]
The far right-side is Newton's quotient for ln x at 0, but we know this is just 1. The result follows from continuity of log:
[tex]\ln {\lim_{x \rightarrow \infty} (1 + {1 \over x})^x} = {\lim_{x \rightarrow \infty} \ln(1 + {1 \over x})^x} = {\lim_{\Delta{x} \rightarrow 0} \ln(1 + \Delta{x})^{1 \over \Delta{x}}} = 1 \implies {\lim_{x \rightarrow \infty} (1 + {1 \over x})^x} = e^{\lim_{\Delta{x} \rightarrow 0} \ln(1 + \Delta{x})^{1 \over \Delta{x}}} = e^1 = e[/tex]
We can evaluate the limiting expression for large values of x to yield approximations to e, namely [itex]e \approx 2.71828[/itex].
 
Last edited:
Mathematics news on Phys.org
  • #2
I think that a pure derivation from first principles would inevitably involve measure theory. Why?? Because arc length and area are measure theoretic principles.

I'll think about it, but did you see measure theory already?? It would be rather difficult if you didn't...
 
  • #3
Couldn't you use the idea of Archimedes himself?

Proscribe a polygon in a circle and find an expression of the length of all the sides as a function of the radius and the number of sides used to approximate the perimeter.
 
  • #4
OK, here is something that should work.
Firstly, I'll assume that the exponential function [itex]e^x[/itex] is well defined on [itex]\mathbb{R}[/itex] (you did this already). We can extend the exponential function to complex numbers by

[tex]e^x=\sum{\frac{x^n}{n!}}[/tex]

We now define

[tex]\cos(x)=\frac{e^{ix}+e^{-ix}}{2}~\text{and}~\sin(x)=\frac{e^{ix}-e^{-ix}}{2i}[/tex]

From the properties of the exponential, we can show that

  • [itex]\overline{e^x}=e^{\overline{x}}[/itex], thus cos(x) and sin(x) are real if x is real.
  • [itex]e^{ix}=\cos(x)+i\sin(x)[/itex]
  • cos(0)=1, sin(0)=0
  • [itex]|e^{ix}|=1[/itex] if x is real.
  • [itex](\sin(x))^\prime=\cos(x),~(\cos(x))^\prime=\sin(x)[/itex]

Lemma: There exists a positive number x such that cos(x)=0
Proof: Suppose that this is not the case. Since cos(0)=1, it follows that cos(x)>0 for all positive x. Thus sin(x) is strictly increasing. And since sin(0)=0, it follows that sin(x)>0 for all positive x. Thus if 0<x<y, we have
[tex]\sin(x)(x-y)\leq \int_x^y{\sin(t)dt}=\cos(x)-\cos(y)\leq 2[/tex]
But since sin(x)>0, we have that the above cannot be true for large y. This is a contradiction.

Recall the the zero's of continuous functions form a closed set. Thus the following makes sense:

Definition: We define pi as the least number x such that cos(x/2)=0.

It can now be shown that

  • the exponential function is periodic with period 2pi i
  • sine and cosine are periodic with period 2pi
  • Polar Decomposition: If z is a nonzero complex number, then there exists a unique r and a unique [itex]t\in [0,2\pi[[/itex] such that z=r(cos(t)+isin(t)).

The proofs of all of this can be found in Rudin, page 182.

In the following post, I'll make clear how to relate our pi to the circle.
 
  • #5
The idea here is from Billingsley's "probability and measure". Exercise 12.6

Let [itex]\pi[/itex] be defined in my last post. And let [itex]\pi_0[/itex] be the surface area of the closed disk. The idea is to show [itex]\pi=\pi_0[/tex].

(a) Let [itex]T_\phi[/itex] be the linear map with matrix

[tex]\left(\begin{array}{cc} \cos(\phi) & -\sin(\phi)\\ \sin(\phi) & \cos(phi) \end{array} \right)[/tex]

This matrix rotates the plane through the angle [itex]\phi[/itex] (by the addition formula's). It also preserves area.

(b) Let [itex]S_{\alpha,\beta}[/itex] be the sector consisting of all the points in the unit disk with polar coordinates between alpha and beta. It can be shown that
[tex]area(S_{\alpha,\beta})=\frac{\pi_0(\theta_2-\theta_1)}{2\pi}[/tex]

(c) by Fubini's theorem, it follows for [itex]0<\theta<\pi/2[/itex] that

[tex]area(S_{0,\theta})= \frac{1}{2}\sin(\theta)\cos(\theta) + \int_{\cos(theta)}^1{\sqrt{1-x^2}dx}[/tex]

(d) equate (b) and (c) and differentiate both sides. This concludes that [itex]\pi=\pi_0[/itex].
 
  • #6
Euclid showed, in his "Elements", that all "circles are similar". That is, if you take two circles, of radius r, circumference c, and radius R, circumference C, then r/R= c/C. From that, it follows that c/r= C/R.

He did it, if I recall correctly, by inscribing regular polygons with 3, 4, ..., n sides in the circle, showing that the ratio of the perimeter of a polygon to circumradius (distance from center to a vertex) is independent of the radius, then arguing that, since, as n gets larger and larger, the polygon approaches the circle, the same must be true for the circle. That argument can be made rigorous using limits.
 
  • #7
micromass, as much sense as that all makes, you delved into calculus (series, def integrals), arithmetic with complex numbers, and even some linear algebra to give a geometric result. Unfortunately not exactly what I'm looking for. I guess it doesn't have the 'beauty" I'm expecting, also that to get an approximation for [itex]\pi[/itex] in that manner you'd probably have to prove one of those crazy approximating series that converge to [itex]\pi[/itex]. But defining [itex]\pi[/itex] to be the least number [itex]\theta[/itex] such that [itex]\cos({\theta \over 2}) = 0[/itex] seems like a good idea that doesn't stray from plane geometry (we define [itex]\cos(\theta)[/itex] to be the x-coordinate of the point (1, 0) after rotation along the unit circle by [itex]\theta[/itex] radians). Still, I suppose I don't really have a good idea of what exactly I'm looking for here.

HallsofIvy, I think I'm going to read Elements. I spend a lot of time reading about math but I've never read it. MUST READ
 
Last edited:
  • #8
··· = 1/Δx(ln(1 + Δx) - ln1)

The far right-side is Newton's quotient for ln x at 0, but we know this is just 1.


Could you explain what you mean by Newton's quotient for ln x at 0? Shouldn't it be ln(1 + x)?

I really like your derivation of e; I've been wanting to see something like that for a while.
 
  • #9
^ Woops, sorry, that is Newton's quotient for ln x at 1. My mistake, I'll fix it now.

EDIT: ... Except I can't edit the OP post anymore ... Well, this stands corrected.

micromass said:
I think that a pure derivation from first principles would inevitably involve measure theory. Why?? Because arc length and area are measure theoretic principles.

I'll think about it, but did you see measure theory already?? It would be rather difficult if you didn't...

Well, measure theory is a bit abstract for a specific problem like this one, I think. When I say "motivation" here, I mean for a person who wants to learn geometry with a full understanding of all the concepts while taking as little for granted as possible, but also without straying far from geometry.

chiro said:
Couldn't you use the idea of Archimedes himself?

Proscribe a polygon in a circle and find an expression of the length of all the sides as a function of the radius and the number of sides used to approximate the perimeter.

I've definitely been reading about this a lot in my pi quest here. I think the approximation to pi will be hiding in a limiting expression, similar to my "derivation" of e above. I'll read more about Archimedes and Euclid's work on this topic.
 
Last edited:
  • #10
I think we ought to start out as follows, with these definitions. Define a circle geometrically as the collection of points a fixed distance r (radius) from a fixed center (origin). By Pythagoras's theorem (which we will take for granted), it follows that all points (x, y) on the circle are such that x2 + y2 = r2 (sum of squares of coordinates is equal to square of radius). Suppose an angle is subtended by an arc of a circle of radius r, and that this arc has length L. Define this angle to have radian measure L/r. Then one radian is the angle subtended by an arc of length r. It follows that a "half-rotation", or the angle subtended by a semi-circle, is (C/2)/r = C/2r where C is the circumference of the circle and r is its radius. Define [itex]\pi[/itex] as the half-rotation in radians (it would be so much nicer to define [itex]\tau[/itex] as the full rotation, but that's a story for a later day). Geometrically speaking, clearly [itex]\pi[/itex] both exists and is unique, and is a real number. We get [itex]C = 2\pi r[/itex]. Our job is to approximate [itex]\pi[/itex]. We can prove the lemma that all circles are similar to one another, i.e. ratios between dimensions are constant -- so we can proceed without loss of generality using the unit circle as our primary object of focus. Also, similarity of circles ensures that radians are well-defined. Note [itex]\pi \over 2[/itex] is a quarter rotation, [itex]2\pi[/itex] is a complete rotation, etc.

Define the ray (x, y) as the line segment from the origin to (x, y). Draw the ray from the origin to a point (x, y) on the circle, and measure the angle [itex]\theta[/itex] between the rays (1, 0) and (x, y) in radians. Note that [itex]\theta[/itex] is also the ratio of the arc length from (1, 0) to (x, y) to the radius of the circle. If the circle is the unit circle, then [itex]\theta[/itex] is simply the arc length. Define the functions "sine" and "cosine" to be such that, in this context, [itex]x = \cos\theta[/itex] and [itex]y = \sin\theta[/itex]. Essentially, if we rotate the point (1, 0) along the circumference of a circle by an angle [itex]\theta[/itex], its new coordinates are [itex](\cos\theta, \sin\theta)[/itex]. The Pythagorean identity immediately follows, as do the symmetrical, periodic, and bounded natures of sine and cosine (coordinates change sign roughly after each quarter rotation (plane quadrant); rotated coordinates "repeat" after a full rotation, i.e. [itex]2\pi[/itex] radians, etc). Perhaps it is nontrivial that sine and cosine are both bounded by [-1, 1], in which case a brief (geometric, since this is all we really have to work with here) proof should be provided.

It also follows that [itex]\cos(\pi) = -1, \sin(\pi) = 0[/itex]. Sine and cosine achieve these respective values (and any other values) for infinitely many arguments, by their periodicity; but note that [itex]\pi[/itex] is the least positive value for [itex]\theta[/itex] such that both [itex]\cos(\theta) = -1, \sin(\theta) = 0[/itex] hold.

Brb, reading Elements.

EDIT: Also observe that we have defined [itex]\pi[/itex] to be the number such that a circle with radius r has circumference [itex]C = 2\pi r[/itex] (it followed immediately from definition). Then the task here is not to prove this circumference formula -- it is to approximate our magic "half-rotation number".

Do we need sines and cosines? Do we make Archimedes's argument using limits of inscribed regular polygons?
 
Last edited:

Related to What pedagogical motivation is there for the existence/approximation of pi?

1. What is the concept of "pedagogical motivation" in relation to pi?

Pedagogical motivation refers to the educational reasons for teaching or learning a particular concept or topic. In the case of pi, the pedagogical motivation is the understanding and application of this mathematical constant in various real-world situations.

2. Why is it important for students to learn about pi?

Pi is a fundamental mathematical concept that is used in many fields such as engineering, physics, and architecture. It is also essential for understanding geometric shapes and their properties. Teaching students about pi helps them develop critical thinking and problem-solving skills that are valuable in many academic and professional pursuits.

3. How does the approximation of pi help in understanding the concept?

The approximation of pi, which is typically represented as 3.14, helps students understand that pi is an irrational number with an infinite number of decimal places. This concept is difficult to grasp, but using approximations can aid in visualizing how the number is derived and its significance in mathematics.

4. What are some practical applications of pi in the real world?

Pi is used in various real-world applications, such as calculating the circumference and area of circles, determining the volume of cylinders and spheres, and even in GPS technology for accurate location tracking. It is also used in physics and engineering equations, such as calculating the force of gravity and determining the size and shape of objects in space.

5. How can teachers motivate students to learn about pi?

Teachers can motivate students to learn about pi by showing real-world examples of its applications and how it is used in various fields. They can also incorporate fun and interactive activities, such as measuring the circumference of objects and exploring the relationship between the diameter and circumference of a circle. Additionally, teachers can emphasize the importance of pi in problem-solving and critical thinking, and how it can open doors to various career paths.

Similar threads

Replies
12
Views
1K
  • General Math
Replies
2
Views
758
Replies
2
Views
2K
  • General Math
Replies
3
Views
822
  • General Math
Replies
0
Views
716
  • General Math
Replies
3
Views
1K
Replies
4
Views
463
Replies
4
Views
1K
Replies
3
Views
274
Back
Top