Republic of Mathematics blog

Central ideas of school mathematics: Algorithmic thinking

Posted by: Gary Ernest Davis on: September 22, 2010

This is the second in a series of posts on central ideas of school mathematics, prompted by a joint project with my colleague Robert Hunting. Issues discussed here are ideas that I see as central to school mathematics if that subject is to avoid total disconnection with the rapidly changing world of professional and applied mathematics. [First post, on patterns]

Algorithmic thinking

I use the phrase “algorithmic thinking” rather than “algorithms” because I want to emphasize the cognitive aspects of engaging with algorithms, their implementation, and analysis.

Brief history

The word algorithm refers, in broad terms, to a mechanical procedure to carry out a process.

The origin of the word is an Anglicization of the Latin form of the name of the Persian scholar Abū ʿAbdallāh Muḥammad ibn Mūsā al-Khwārizmī.

Al-KhwārizmÄ« wrote on methods for manipulating fractions that had been invented by Indian mathematicians. About 300 years after he wrote this work it was translated into Latin in the 12^{th} century as  Algoritmi de numero Indorum. This title means “Algoritmi on the numbers of the Indians” and “Algoritmi” was the Latin version of Al-KhwārizmÄ«’s Arabic name.

Al-KhwārizmÄ«’s account of the Hindu method of manipulating fractions was regarded as making simple and mechanical what was previously a relatively difficult calculation with ratios.

Long before Al-KhwārizmÄ«, the Greek mathematician Euclid described a mechanical procedure, which if followed faithfully, would find the greatest common divisor of two positive whole numbers. We now call this Euclid’s algorithm (or the Euclidean algorithm).

Algorithms have become increasingly important in modern life. Algorithms are used extensively in mathematics, engineering and science. The systematic study of algorithms in general (not necessarily numerical or geometrical algorithms) is what computer science is largely about.

Comprehending and using algorithms

In school mathematics algorithms are usually taught as something for students to comprehend and practice. At best this gets students familiar with an algorithm. At worst, it has students behaving like machines, instead of thinking human beings.

A very simple algorithm is the bisection algorithm for finding a zero (or root) of a function. This can be used to approximate specific numbers. For example, suppose we want to get a decimal approximation to \sqrt{2}. One way to do this is to approximate a positive number where the function f(x):=x^2-2 takes the value 0.

Plot of the function f(x)=x^2-2

We can see visually that the number we seek to approximate, \sqrt{2}, lies between 1 and 2.

We can also figure this numerically because f(1)=1^2-2=-1<0 and f(2)=2^2-2=2>0.

The bisection method works, iteratively, as follows:

  1. At each step we have an interval of numbers [L,R] such that f(L)=L^2-2<0 and f(R)=R^2-2>0;
  2. We check the midpoint M=\frac{L+R}{2} to see if f(M) is  zero, negative or positive;
  3. If f(M)=0 we are done, and M is the number we seek [This cannot happen with \sqrt{2} starting from L=1 and R=2 because \sqrt{2} is irrational.]
  4. If f(M)<0  we set L_{new}:=M, R_{new}:=R and begin again with the new interval of numbers [L_{new},R_{new}].
  5. If f(M)>0  we set L_{new}:=L, R_{new}:=M and begin again with the new interval of numbers [L_{new},R_{new}].

We continue this process until we find an exact answer (which, as explained above in the case of \sqrt{2} we will not) or until we are satisfied with the degree of approximation.

When students in school are introduced to the bisection algorithm they will typically get a teacher led explanation of the mechanics of the algorithm and then get to practice how it is implemented in several examples. This will probably be followed with homework and a quiz or test of the same type of activity.

So a student would calculate (but usually not write as expansively about the process) as follows:

  1. Begin with the interval [1,2] where f(1)<0 and f(2)>0.
  2. The midpoint is \frac{1+2}{2}=\frac{3}{2} and f(\frac{3}{2})= \frac{3}{2}^2-2=\frac{1}{4}>0 so the new interval is [1,\frac{3}{2}].
  3. The new midpoint is {1+3/2}{2}= \frac{5}{4} and f(\frac{5}{4})=\frac{5}{4}^2-2=-\frac{7}{16}<0 so the new interval is [\frac{5}{4},\frac{3}{2}].
  4. \ldots \ldots

When we repeat this process a number of times we get the following sequence of intervals, whose endpoints (left or right)are successively  better approximations to \sqrt{2}:

[1, 2]

[1, 3/2]

[5/4, 3/2]

[11/8, 3/2]

[11/8, 23/16]

[45/32, 23/16]

[45/32, 91/64]

[181/128, 91/64]

[181/128, 363/256]

[181/128, 725/512]

[181/128, 1449/1024]

[181/128, 2897/2048]

[181/128, 5793/4096]

[11585/8192, 5793/4096]

[11585/8192, 23171/16384]

[11585/8192, 46341/32768]

On the other hand it is hard to see by visual inspection alone that these exactly calculated fractions are getting closer together, and so approximating  \sqrt{2}. One way to get a better visual  feel this is to  approximate these fractions by decimals:

[1, 2]

[1.000000000, 1.500000000]

[1.250000000, 1.500000000]

[1.375000000, 1.500000000]

[1.375000000, 1.437500000]

[1.406250000, 1.437500000]

[1.406250000, 1.421875000]

[1.414062500, 1.421875000]

[1.414062500, 1.417968750]

[1.414062500, 1.416015625]

[1.414062500, 1.415039063]

[1.414062500, 1.414550781]

[1.414062500, 1.414306641]

[1.414184570, 1.414306641]

[1.414184570, 1.414245605]

[1.414184570, 1.414215088]

Now we can calculate the squares of the endpoints of the intervals to see if they are getting closer to 2:

[1, 4 ]

[1.000000000, 2.250000000 ]

[1.562500000, 2.250000000 ]

[1.890625000, 2.250000000 ]

[1.890625000, 2.066406250 ]

[1.977539063, 2.066406250 ]

[1.977539063, 2.021728516 ]

[1.999572754, 2.021728516 ]

[1.999572754, 2.010635376 ]

[1.999572754, 2.005100250 ]

[1.999572754, 2.002335548 ]

[1.999572754, 2.000953913 ]

[1.999572754, 2.000263274 ]

[1.999917999, 2.000263274 ]

[1.999917999, 2.000090633 ]

[1.999917999, 2.000004315 ]

So we can see that after 14 or so iterations, the estimates look accurate to about 4 decimal places.

Students, will, of course, generally find such calculation very tedious to do accurately, so it’s time to think about another important aspect of algorithms:

Machine implementation

Algorithms by their very nature are designed to be implemented on machines, to avoid excessive calculation by humans.  A critical aspect of engagement with algorithms in school mathematics is how to implement an algorithm on a machine.

The numerical results above were calculated in Mathematica. Typically school mathematics classrooms do not have access to Mathematica. What’s even more limiting, many, if not most, mathematics teachers and classes do not have easy access to computers.

What mathematics teachers do usually have access to is calculators. If those calculators are programmable then the bisection algorithm, above, can be programmed into the calculator, which then does all the numerical calculations. If student calculators are not programmable. students can use calculators to get decimal approximations step by step. However, the very act of programming a machine to carry out the algorithm already leads to higher order thinking about the algorithm and helps embed into student memory what it is that the algorithm does.

Here is how I implemented the bisection algorithm to approximate \sqrt{2} by decimals, to 10 decimal places, in Mathematica:

f[x_] := x^2 – 2;

L = {{1, 2}};

Do[

m = (L[[-1]][[1]] + L[[-1]][[2]])/2;

If[f[m] < 0, L = Append[L, {N[m, 10], N[L[[-1]][[2]], 10]}],

L = Append[L, {N[L[[-1]][[1]], 10], N[m, 10]}]],

{k, 1, 15}];

L

Whichever way the bisection method is implemented it will contain a loop ( a “do” loop or a “while” loop) and an “if” statement. With access to computers, students can implement the bisection algorithm in a spreadsheet.

Machine errors

Computers and calculators do not represent decimal numbers exactly. They cannot: even a number as simple as \frac{1}{3} cannot be represented exactly as a decimal number on a machine.

As an iterative process such as the bisection method proceeds, the decimal numbers it uses are only approximations to the exact fractions it could use. Calculation machine errors can, and will, spread as the iteration proceeds.

To address this issue with students, some sort of check as to the accuracy of the results needs to be carried out.

In the calculations above we squared the decimals to see if the result was close to 2. This squaring process again involves approximations and errors, but at the very least, it can provide reassurance that the errors are not spreading too badly.

Discussion of, and reflection on, possible machine errors in calculation is an essential part of a modern way of thinking about calculations. It is one part of the central idea of algorithmic thinking.

Speed

Another modern focus in algorithmic thinking is speed: if an algorithm such as the bisection method to approximate \sqrt{2} seems to converge on a solution, how fast does it do that? Another way to ask the same question is how many steps would be needed to get a preassigned degree of accuracy?

Students can get a visual feel for the speed of convergence to a solution by plotting successive left and right hand approximations:

Plot of left hand (bottom) and right hand (top) approximations to square root of 2

We can expand this plot around 1.4 (from 1.35 to 1.45):

The plots give visual reinforcement to the feeling that the bisection method to approximate \sqrt{2} seems to converge relatively quickly (though, as it turns out, it is far from a really quick algorithm).

Why do we worry about speed of convergence?

Basically for two reasons:

  • The time the algorithm takes to run to get a desired degree of accuracy might be excessively long, on the order of days or months for some algorithms, compared with minutes or seconds.
  • The longer an iterative algorithm runs the more it runs the risk of spreading machine errors to the point it is highly inaccurate.

The theoretical error – as opposed to machine calculated error – in using the midpoint of the previous interval as an approximation to \sqrt{2} halves at each step. So if we start with an interval [L,R] then after n steps the theoretical error in using the midpoint as an approximation to \sqrt{2} is at most \frac{R-L}{2^n}. With a starting interval [1,2] this means the theoretical error after n steps is at most \frac{1}{2^n}.

This is a good motivation for base 2 logarithms, because if we want an error of no more than size \epsilon>0 then we need to carry out at least \log_2(\frac{1}{\epsilon})=-\log_2(\epsilon) steps of the algorithm. For example to get 10 decimal place accuracy (\epsilon = 10^{-10}) we will need to carry out the algorithm for at least \log_2(\frac{1}{10^{10}})\approx 33.2193 steps i.e. for at least 34 steps.

Correctness

How do we know an algorithm does what we expect it to do?

This is an obvious and important question that is central to modern thinking about algorithms, yet is  probably never addressed in school mathematics, and only rarely in university or college (usually in more advanced discrete mathematics courses, or computer science courses).

Checking that an algorithm is correct is not often an easy task, and sometimes it is quite hard. Yet the issue is relatively simple: what reasoning can we bring to bear on why  a particular algorithm does what it is supposed to do?

A school student answer might not be as sophisticated as a college student, or as a graduate student or as a professor of mathematics or computer science. Yet that is no reason to avoid addressing the issue of correctness.

How might we, informally at least begin to address the correctness of the bisection method?

This is an unusual activity for school mathematics (although a central one, I claim) because it involves thinking.

  • At each step of the bisection algorithm to approximate \sqrt{2} we take an interval [L,R] where we know the function f(x):=x^2-2 is negative at the left endpoint and positive at the right endpoint.
  • From this interval we calculate the midpoint \frac{L+R}{2} and check the value f(M) for positivity or negativity.
  • If f(M)<0 we replace L by M; if f(M)>0 we replace R by M.
  • The new interval is contained in the previous interval.
  • The size of the interval goes down by \frac{1}{2} at each step.
  • So we construct a decreasing sequence of closed intervals whose size halves at each step and therefore  approaches 0 as the number of steps increases.
  • The monotone convergence theorem – an intuitively obvious statement about a geometric interpretation of decimal numbers -  says that the left hand points of the intervals converge to a number L_{\infty} and the right hand endpoints converge to a number R_{\infty}.
  • Since the distance between the left hand and right hand endpoints approaches 0, L_{\infty}=R_{\infty}
  • Because f(x)=x^2-2 is a continuous function – it operates on nearby numbers to produce nearby numbers -  f(L_{\infty})=0, that is L_{\infty}^2=2.

Fragments of this – informal – argument can be used to heighten a student’s sense of proving a reason why the algorithm does what it is supposed to do, through utilizing geometric properties of real numbers as decimals.

Moral

Algorithmic thinking, as distinct from students simply practicing hand implementation of algorithms, is a central idea of school mathematics because of the centrality of algorithmic thinking in mathematics, engineering and science.

Important aspects of engagement with algorithmic thinking are:

  • Comprehending an algorithm
  • Hand implementing an algorithm
  • Machine implementing an algorithm
  • Reflecting on and estimating machine errors
  • Reflecting on and estimating theoretical errors
  • Reasoning about correctness

And after all that we have not yet touched on another important aspect of algorithmic thinking, one that the National Council of Teachers of Mathematics stresses:

  • Generation of new algorithms

Algorithmic thinking needs to be critically addressed in school mathematics at the highest policy levels if school mathematics is not to become increasingly disconnected from, and irrelevant to, modern mathematical practice and concerns.

Postscript

There’s an interesting discussion at reddit.com on what algorithm blows your mind.

 

3 Responses to "Central ideas of school mathematics: Algorithmic thinking"

Super-Duper site! I am loving it!! Will come back again – taking you feeds also, Thanks.

Thanks so much. It’s a pleasure to get comments from you.

Thanks so much for your positive comments. Good news is always welcome. Let me know if there are particular mathematics issues or topics you would like to see addressed.

Leave a Reply to Gary DavisCancel reply