Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Letters to the Editor
The Mathematical Intelligencer
encourages comments about the
material in this issue. Letters
to the editor should be sent to either of
the editors-in-chief, Chandler Davis or
Marjorie Senechal.
Nonegenarian Fibonacci Devotee
Please let me take this opportunity to make one more obeisance to the Fibonacci sequence.Fibonacci tended to take over my mathematical life from the time, many years ago, when I found that the occurrence of the numbers in leaf patterns needed more explaining. One thing led to another, decade after decade, paper after paper.1 I lived comfortably among these numbers-until midnight of April 26, 2003. At that instant, I ceased to be 89 years old; and there seems little prospect of my ever again having a Fibonacci number as my age. To be sure, my rural route address is now Box 532, Route 1, a concatenation of Fibonacci numbers in reverse order, but that is small consolation.
Something more is needed to reaffirm my allegiance. Here is my offering.
I will prove that the Fibonacci numbers with odd index can be generated iteratively from the quadratic equation
(la) x2 + y2 = 3xy - 1
in the following way. Put x equal to any Fibonacci number with odd index;:::: 1, and solve (1a) for y; the larger root will be the Fibonacci number with the next larger odd index. The Fibonacci numbers with even index are generated by
exactly the same procedure from the equation
(lb) x2+y2=3xy+l.
To prove these, I will use an immediate consequence of the defining iter
ation Fn+l = Fn + Fn-t:
(2) Fn-2 + Fn+2 = 3F,.
I will also use the identity
(3) Fn-2Fn+2 = Fn2 + ( -1)n+l,
which is a special case of an identity in Hoggatt.2
Now I set x = Fn (n odd) in (la)
(4) Fr/ + y2 = 3FnY- 1,
and I am able to show that the larger root for y is F11+2. Substituting (3) on the left and (2) on the right of ( 4) reduces it to
which does indeed have F,+2 as its larger root. Similarly for the assertion for even n.
Irving Adler
297 Cold Spring Road
North Bennington, VT 05257
USA
e-mail: [email protected]
'For instance. my articles in J. Theor. Bioi. 45 (1g74), 1-7g: and J. Algebra 205 (1ggs), 227-243.
2V. E. Hoggatt, Jr. Fibonacci and Lucas Numbers (Houghton Mifflin, 1969). See p. 59.
4 THE MATHEMATICAL INTELLIGENCER © 2006 Springer Science+ Bus1ness Med1a, Inc
BARRY KOREN
Computationa F uid Dynamics· Science and Too
The year 2003 marked the 1 OOth anniversary of both the birth of John von Neumann
and the first manned flight with a powered plane-both events of great importance for computational fluid dynamics.
he science of flows of gases and liquids is fluid dynamics, a subdiscipline of physics.
No courses in fluid dynamics are given in high school, as it requires too much math-
ematical background. Fluid dynamics is taught at university and at engineering col-
leges, for one cannot ignore fluid dynamics if one wants to design an aircraft, a rocket,
a combustion engine, or an artificial heart.
Particularly for aircraft design, knowledge and under
standing of fluid dynamics-aerodynamics in this case-is
of major importance. Except for the dangerous gravity, all
forces acting on a flying plane are forces exerted by air. To
fly an aircraft safely (tanked up with fuel and with pas
sengers on board), a precise knowledge, understanding,
and control of these aerodynamic forces is a matter of life
and death. Moreover, flying must not only be safe but also
fuel-efficient and quiet. For aerospace engineering, aero
dynamics is indispensable.
Nowadays, both experimental and theoretical means are
available for investigating fluid flows. Wind tunnels are the
canonical tool for experimental aerodynamics. The Wright
brothers, who made the first manned flight with a powered
plane (Fig. 1), had at their disposal a wind tunnel, one they
themselves had made. Wind-tunnel testing has many dis
advantages, but it is deemed trustworthy because real air
is used and not the virtual air of theoretical aerodynamics.
A Brief History of Computational Fluid Dynamics
Nowadays, the technological relevance of theoretical aero
dynamics, of theoretical fluid dynamics in general, is widely
appreciated. However, in the past it was mainly an acade-
mic activity, with results that strongly differed from ex
perimental observations. The technical applications of fluid
dynamics developed independent of theory. Theoretical
and technological breakthroughs have since closed the gap
between theory and practice, and today we see a fruitful
interaction between the two. The airplane has played a very
stimulating role in this development.
I proceed by highlighting some key developments from
the history of theoretical fluid dynamics with an eye toward
computational fluid dynamics.
Revolutionary innovations
Theoretical fluid dynamics has an illustrious history [1, 2,
3]. In the course of centuries, many great names have con
tributed to the understanding of fluid flow and have helped
in building up theoretical fluid dynamics, step by step. The
oretical fluid dynamics goes back to Aristotle (384-322 BC),
who introduced the concept of a continuous medium. In
my opinion, though, it actually began 2000 years later, when
Leonhard Euler published his equations of motion for the
flow of liquids and gases, on the basis of Newton's second
law of motion [4, 5].
Euler's idea to describe the motion of liquids and gases
© 2006 Springer Science+Business Media, Inc., Volume 28. Number 1, 2006 5
Figure 1 . Glass-plate photo of the first manned flight with a powered
plane (flight distance: 37 meters, flight time: 12 seconds), Kitty Hawk,
North Carolina, 1 0.35 h., December 17, 1 903. Prone on the lower wing:
Orville Wright. Running along with the plane to balance it by hand if
necessary: Wilbur Wright. Visible in the foreground: the rail from
which the plane took off and the bench on which the wing rested.
The very same day, the Wright brothers made a flight of 260 meters
and 59 seconds!
in the form of partial differential equations was a revolu
tionary innovation. However, his equations, known today
as the Euler equations, were still unsuited for practical ap
plications, because they neglect friction forces: only pres
sure forces were taken into account. It was almost a cen
tury later, in 1845, that George Stokes proposed fluid-flow
equations which also consider friction [6]; equations which,
for an incompressible flow, had already been found by
Claude Navier [7] and are now known as the Navier-Stokes
equations. With the introduction of the Navier-Stokes equa
tions, the problem of understanding and controlling a large
class of fluid flows seemed to be within reach, as it had
been reduced to the integration of a handful of fundamen
tal differential equations.
Although formulating the Navier-Stokes equations con
stituted great progress, the analytical solution of the com
plete equations was not feasible. (It remains one of the out
standing open mathematical problems of the 21st century.)
One developed instead a large number of simplified equa
tions, derived from the Navier-Stokes equations for special
cases, equations that could be handled analytically. More
over, a gap continued between experimental and theoreti
cal fluid dynamics. The former developed greatly during the
Industrial Revolution, independent of the latter. It was para
doxical that the introduction of the Navier-Stokes equa
tions led to a further fragmentation into different flow mod
els, all of which described the flow of the same fluid (air
in our case)-a theoretically highly undesirable situation.
Theoretical fluid dynamics stagnated along a front of non
linear problems. This barrier was finally broken in the sec
ond half of the 20th century, with numerical mathematics,
at the expense of much-often very much-computational
work A key role was played in this by a Hungarian-born
mathematician, John von Neumann.
6 THE MATHEMATICAL INTELLIGENCER
Trail-blazing ideas from Princeton
In the tens and twenties of the past century, Budapest was a
fruitful breeding ground for scientific talent. It saw in 1903
the birth of John von Neumann (Fig. 2). In his early years von
Neumann received a private education; at the age of 10 he
went to school for the first time, directly to high school.
There, his great talent for mathematics was discovered. He
received extra lessons from mathematicians of the Univer
sity of Budapest, among them Michael Fekete, with whom
von Neumann wrote his first mathematics paper, at the age
of 18. By then he was already a professional mathematician.
Von Neumann studied at the ETH ZUrich and the University
of Budapest; he obtained his PhD degree at the age of 22.
Next he moved to Gem1any, where he lectured at universi
ties in Berlin and Hamburg. There he was particularly active
in pure mathematics: in set theory, algebra, measure theory,
topology, and group theory. He contributed to existing theo
ries: the sure way to quick recognition. From the mid 1930s,
von Neumann chose a riskier way of working: breaking new
ground. He turned to applied mathematics in the sense of
mathematicians like Hilbert and Courant, i.e., not mathe
matics applied to all kinds of ad hoc problems, but the sys
tematic application of mathematics to other sciences, in par
ticular to physics, with subdisciplines like aerodynamics. The
rapidly deteriorating political situation in Europe, from which
von Neun1ann had already emigrated to Princeton, 1 played a
·-------- -------
1Von Neumann was one of the many scientists who left Europe in the early 1 930s.
For a description of the fall of Gottingen under Nazi pressures. see Richard
Courant's biography [8].
Figure 2. John von Neumann, 1 903-1 957.
role in this decision. War was looming and brought in
creasing demands for answers to questions related to mil
itary engineering.
Whereas von Neumann had worked on a mathematical
basis for the equations of quantum mechanics before the
war, during the war he "lowered" himself to developing nu
merical solution methods for the Euler equations. His idea
to compute possible discontinuities in solutions of the
Euler equations without explicitly imposing jump relations
was very original. Instead, von Neumann proposed the in
troduction of artificial (numerical) diffusion, in such a way
that the discontinuities automatically appear in a physically
correct way: shock capturing, nowadays a standard tech
nique. He also came up with an original method for ana
lyzing the stability of numerical calculations: a Fourier
method, now a standard technique as well.
In 1944, the urgent need arose to apply von Neumann's
numerical methods on automatic calculators, computers,
beyond the scope of the machines of that time. This moti
vated von Neumann to start working also on the develop
ment of the computer. In 1944 and 1945 he did trail-blazing
work, writing his numerical methods for computing a fluid
flow problem in a set of instructions for a still non-existent
computer. These instructions were not to be put into the
computer by changing its hardware or its wiring. Instead,
von Neumann proposed to equip computers with hardware
as general as possible, and to store the computing instruc
tions in the computer, together with the other data involved
(input data, intermediate results, and output data). In 1949,
the first computer was realized which completely fulfilled
von Neumann's internal programming and memory princi
ples: the EDSAC (Electronic Delay Storage Automatic Cal
culator), by M. V. Wilkes, at Cambridge University. Today,
the two principles are still generally applied.
L. F. Richardson and Richard Courant and colleagues had
combined theoretical fluid dynamics and numerical mathe
matics before von Neumann [9, 10], but still without clear
ideas about computers-without computer science. Compu
tational fluid dynamics (abbreviated CFD) is a combination
of three disciplines: theoretical fluid dynamics, numerical
mathematics, and computer science. Because von Neumann
brought in this last discipline, he can be considered the found
ing father of CFD. A detailed description of von Neumann's
contributions to scientific computing is given by Aspray [11).
A good overview of his other pioneering work can be found
in the scientific biography written by Ulam [12].
Traveling from place to place as an honored mathe
matician with many social and political obligations, von
Neumann must have had very little time to write down his
scientific ideas. He published only one paper about both
shock capturing and the aforementioned stability analysis,
and that not until 1950 [13].
On his many travels, von Neumann visited the Nether
lands. In 1954, he was an invited speaker during the Inter
national Congress of Mathematicians held in Amsterdam.
A tea party with Queen Juliana of the Netherlands was
arranged for a select group of participants, among them
John von Neumann (Fig. 3).
Figure 3. John von Neumann and colleagues at Soestdijk Palace. Above: all together, John von Neumann front row, far left. Queen Juliana,
with white handbag, is flanked by the two new recipients of the Fields Medal: Jean-Pierre Serre (with Herman Weyl's hands on his shoulders)
and Kunihiko Kodaira. At the right of von Neumann: Mary Cartwright.
© 2CXJ6 Springer Science+ Business Media, lnc., Volume 28, Number 1, 2006 7
Pioneering work in Amsterdam
In the third quarter of the 20th century, computer science
was a new and growing discipline. Initially, the Netherlands
played no significant role in the development of computer
science, but the country was quickly moving forward. In
1946, the Mathematisch Centrum (MC) was founded in Am
sterdam. The mission of this new institute was to do pure
and applied mathematics research in order to increase "the
level of prosperity and culture in the Netherlands and the
contributions of the Netherlands to international culture."
(Not at all the pure ivory tower.) The foundation of the MC
did not proceed without struggle. The most prominent
Dutch mathematician of the day, L. E. J. Brouwer of the
University of Amsterdam, was of the opinion that mathe
matics should be indifferent towards the physical sciences
and even rejecting of technology; an odd point of view, con
sidering the work of mathematicians like Hilbert, Courant,
and von Neumann. With the MC, Brouwer wanted to tum
Amsterdam into the new Gottingen of pure mathematics.
It did not work out that way. His biographer feels that
Brouwer was sacrificed to the foundation of the MC ([14],
p. 479).
The founders of the MC had heard about von Neumann's
ideas about machines which should be able to perform a
series of calculations as independently as possible. They
wanted the MC to have a computing department in order
to develop a computer and to execute advanced comput
ing work. Aad van Wijngaarden (Fig. 4), former student of
J. M. Burgers of the Delft University of Technology, was
appointed as the first staff member of the MC, in 194 7. That
year he made a study tour to visit von Neumann in Prince
ton. Van Wijngaarden and his co-workers designed and con
structed the first Dutch computer: the ARRA I (Automa
tische Relais Rekenmachine Amsterdam I, Fig. 5). New
Figure 4. Aad van Wijngaarden, 1 916-1987 (photo courtesy of CWI).
8 THE MATHEMATICAL INTELLIGENCER
computers were designed and built (one per design only),
in 1955 exclusively for the Fokker aircraft industries:
the FERT A (Fokker Elektronische Rekenmachine Type
ARRA). Much human labor was required to perform com
putations on these early computers. At the MC, this was
done by young women (Fig. 6), schooled in mathematics
by Van Wijngaarden. A highlight was the project for the de
velopment of the Fokker Friendship airplane, a numerical
project on which Van Wijngaarden and his "computing
girls" worked from 1949 until 1951. The computations con
cerned oscillations of the airplane's wing in subsonic flow:
flutter. The first computing work was still very "external"
and machine-dependent; for each computation, cables had
to be plugged into the computer. With the accomplishment
of internal programming as proposed by von Neumann, at
tention shifted entirely to the invention of algorithms and
their coding as computer programs. Edsger Dijkstra, a later
Turing Award recipient, was appointed at the MC as the
first Dutch computer programmer. Van Wijngaarden and
Dijkstra left an international mark on computer science
with their contributions to the development of the pro
gramming language Algol 60 [15], and Van Wijngaarden
later added to that reputation with Algol 68 [16]. In 1979,
Van Wijngaarden was awarded an honorary doctorate for
his pioneering work by the Delft University of Technology,
and the MC grew into the present CWI (Centrum voor
Wiskunde en Informatica), which celebrates its 60th an
niversary in 2006.
Computational fluid dynamics research on the basis of
the Euler or Navier-Stokes equations, of the same funda
mental character as that established by von Neumann, was
not done in the Netherlands of 1945-1960. For this funda
mental work, we have to go to the United States and the
Soviet Union of the 1950s.
Figure 5. The first computer at the MC and in the Netherlands, the ARRA I in its final set-up. From left to right: power frame and the three
arithmetic registers. On the table in the middle: the punch-tape reader. Some 1200 relays are at the back of the machine {photo courtesy of
CWI).
A continuous flow of CFD from New York
In early December 1941, a passenger ship carried a 15-year
old Hungarian boy from Europe to the United States. The
boy, along with his parents, was escaping the tragic fate
threatening European Jews. (It was to be the last passen
ger ship from Europe's mainland to the United States for
years to come. During the voyage, the United States was
drawn into the Second World War by the attack on Pearl
Harbor.) The young ship passenger carried with him two
letters of recommendation from his teachers. It seems
likely that von Neumann saw those letters brought by his
young fellow-countryman, for the boy, Peter Lax (Fig. 7),
had a meteroric rise to success. In 1945, while still a
teenager, he became involved in the Manhattan Project. In
1949, he received his PhD degree from New York Univer
sity, with Richard Courant as his thesis advisor, and in 1951,
Figure 6. Female arithmeticians at the MC, the "girls of Van Wijngaarden." In the foreground: Ria Debets, later the spouse of Edsger Dijk
stra. {photo courtesy of CWI).
© 2CX>6 Springer Science+ Business Media, Inc., Volume 28, Number 1, 2006 9
Figure 7. Peter D. Lax.
he became assistant professor there. His work in mathematics continues to this day and has led to many honors and awards, among them the 2005 Abel Prize in mathematics [ 17].
Like von Neumann, Lax is a homo universalis in mathematics. He has performed ground-breaking research, and has been a productive and versatile author of mathematics books. His books deal with such diverse topics as partial differential equations, scattering theory, linear algebra, and functional analysis. Above all, he is known for his research on numerical methods for partial differential equations, in particular for hyperbolic systems of conservation laws, such as those arising in fluid dynamics. Lax's name has been given to several mathematical discoveries of importance to CFD:
• the Lax equivalence theorem [18], stating that consistency and stability of a finite-difference discretization of a well-posed initial-boundary-value problem are necessary and sufficient for the convergence of that discretization,
• the Lax-Friedrichs scheme [ 19], a stabilized central finitedifference scheme for hyperbolic partial differential equations,
• the Lax-Wendroff scheme [20], a more accurate but equally stable version of the Lax-Friedrichs scheme,
• the Lax entropy condition [21], a principle for selecting the unique physically correct shock-wave solution of nonlinear hyperbolic partial differential equations that allow multiple shock-wave solutions, and
• the Harten-Lax-Van Leer scheme [22], a very efficient numerical method for solving the Riemann problem.
Like von Neumann, Lax was (and still is) a strong proponent of the use of computers in mathematics. A quote
1 0 THE MATHEMATICAL INTELLIGENCER
from Lax: "The impact of computers on mathematics (both applied and pure) is comparable to the roles of telescopes in astronomy and microscopes in biology."
Despite the Second World War and the Cold War, Lax has always had very good connections with scientists worldwide. One such relation is with a famous Russian mathematician, mentioned in the next section.
A brilliant idea from Moscow
A substantial part of the Euler and Navier-Stokes software used worldwide is based on a single journal paper [23], distilled by the then-young Russian mathematician Sergei Konstantinovich Godunov (Fig. 8) from his PhD thesis.
Godunov proposed the following. Suppose one has a tube and in it a membrane separating a gas on the left with uniformly constant pressure, from a gas on the right with a likewise uniformly constant but lower pressure (Fig. 9, top). If the membrane is instantaneously removed-the traffic light changes from red to green-then the yellow gas will push the blue gas to the right; the interface between the two gases, the contact discontinuity, runs to the right. At the same time, two pressure waves start running through the tube: a compression wave running ahead of the contact discontinuity and an expansion wave running to the left (Fig. 9, bottom). In the 19th century, the Euler flow in this tube, a shock tube, had already been computed by Riemann, with "pencil and paper" [24]. (For this old work of Riemann, Duivesteijn has written a nice, interactive Java applet [25].) For the computation of the flow in a tube in the case of an initial condition which has more spatial variation, Godunov proposed to decompose the tube into virtual cells (Fig. 10, above), with a uniformly constant gas state in each cell, and with each individual cell wall to be considered as the aforementioned membrane (traffic light). To know the interac-
Figure 8. Sergei Konstantinovich Godunov.
rarefaction wave contact discontinuity shockwave .....
Figure 9. Shock tube. Top: condition of rest in left and right part: high and low pressure, respectively. Bottom: condition of motion with shock
wave and contact discontinuity running to the right and rarefaction wave running to the left. (drawing: Tobias Baanders, CWI).
tion between the gas states in two neighboring cells, one instantaneously 'removes' the cell wall separating the two cells, and computes the Riemann solution locally there, and hence the local mass, momentum, and energy flux (Fig. 10, bottom). This is done at all cell faces. With this, the net transport for each cell is known and a time step can be made. A plain method and a very simple flow problem, so it seems. If one can do this well, the flow around a complete aircraft or spacecraft can be computed. The remarkable property of the method is that at the lowest discrete level, that of cell faces, a lot of physics has been built into it, not just numerical mathematics.
The more cells, the better the accuracy, yet also the more expensive the computation. Godunov did not have access to computers, but to "computing girls," who called Godunov and his fellow PhD students "that science," and who received payment on the basis of the number of computations they performed, right or wrong. No real CFD there either!
In 1997, Godunov received an honorary doctorate from the University of Michigan, and a symposium was organized
for him at the university's Department of Aerospace Engineering. At that symposium, in a one-and-a-half-hour lecture, Godunov gave insight into his earlier research, whose strategic importance was not appreciated in the Soviet Union at the time. This historic lecture has since been published [26, 27].
A second important result in Godunov's classical paper from 1959 [23] is his proof that it is impossible to devise a linear method which is more than first-order accurate, without being plagued by physically incorrect oscillations in the solution: wiggles (Fig. 11). With a first-order-accurate method, the solution becomes twice as accurate and remains free of wiggles when the cells are taken twice as small. With a second-order-accurate method, the solution becomes four times more accurate then, but-unfortunately-possibly wiggle-ridden.
Wiggles can be very troublesome in practice. For example, a simple speed-of-sound calculation in a single cell only may break down the entire flow computation, because of a possibly negative pressure. The wiggle problem does not occur only with Godunov's method; it is a general prob-
Figure 10. Shock tube divided into small cells. Top: cells. Bottom: wave propagation over all cell faces. (drawing: Tobias Baanders, CWI).
© 2006 Springer Science+Bus1ness Media, Inc., Volume 28, Number 1, 2006 1 1
pressure pressure
1
0
Figure 1 1 . Right and wrong pressure distribution. Left: without wiggles. Right: with wiggles. (drawing: Tobias Baanders, CWI).
lem. A drawback of Godunov's method is that it is computing-intensive; at each cell face, the intricate Riemann problem is solved exactly.
Technology pushes from Lelden
It took about two decades before good remedies were found for the wiggles of higher-order methods and the high cost of the Godunov algorithm. The aid came from an astronomer. In space, large clouds of hydrogen are found. Simulation of the flow of this hydrogen provides models of the development of galaxies. The literally astronomical
Figure 12. Bram van Leer (photo: Michigan Engineering).
1 2 THE MATHEMATICAL INTELUGENCER
speeds and pressures which may arise in these computations impose high demands on the accuracy, and particularly the robustness, of the computational methods to be applied. While still in Leiden, in the 1970s, the astronomer Bram van Leer (Fig. 12) published a series of papers in which he proposed methods which are second-order accurate and do not allow wiggles. The fifth and last paper in this series is [28). Furthermore, Van Leer introduced a computationally efficient alternative to the Godunov algorithm [29): two technology pushes, not only for astronomy but also for aerospace engineering, as well as for other dis-
ciplines. In 1990, Van Leer was awarded an honorary doctorate for this work by the Free University of Brussels.
Efficient solution algorithms from Rehovot and
other places
Broadly speaking, how does an Euler- or Navier-Stokesflow computation around an aircraft work? The airspace out to a large distance from the aircraft, may be divided into (say) small hexahedra, small 3D cells. Just as in the 1D shock tube example, one can then compute for each cell the net inflow of mass, momentum, and energy, using at each cell face the Godunov alternative ala Van Leer or other alternatives, like the Roe scheme [30] or the Osher scheme [31]. The finer the mesh of cells around the aircraft, the grid (Fig. 13), the more accurate the solution, but also the higher the computing cost. A grid of one million cells for an Euler- or Navier-Stokes-flow computation is not unusual. Suppose that we want to simulate a steady flow. We then have to solve, per cell, five coupled nonlinear partial differential equations. The cells themselves are coupled as well: what flows out of a cell flows into a neighboring cell (or across a boundary of the computational domain). In Navier-Stokes-flow computations, the flow solution in a single cell may influence the flow solutions in all other cells. In our modest example, we may have to solve a system of five million coupled nonlinear algebraic equations. Efficient solution of these millions of equations is an art in itself. Many efficient solution algorithms have been devel-
oped, the most efficient of which are the multigrid algorithms. Multigrid methods were invented at different locations and by several people. A leading role has been played by Achi Brandt from the Weizmann Institute of Science in Rehovot, Israel [32]. Multigrid algorithms have a linear increase of the computing time with the number of cells. This may seem expensive-2, 3, or 4 times higher computing cost for a grid with 2, 3, or 4 times more cells, respectivelybut it is not. In numerical mathematics, no bulk discount is given. For many solution algorithms the rule is 22,32,42, . .. times higher computing cost for a grid with 2,3,4, . .. times more cells! For the interested reader, a book on multigrid methods is [33].
Present State of the Art in CFD
An example
A quick impression will now be given of what can be done with CFD by looking at a standard flow problem. It concerns the recent MSc work of Jeroen Wackers. From scratch, he developed 2D Euler software in which the grid is automatically adapted to the flow, and what follows describes one of his results. Consider the channel depicted in Figure 14, and in it a uniformly constant supersonic air flow (from left to right) at three times the speed of sound. One may consider the channel to be a stylized engine inlet of a supersonic aircraft. In fact it is just a benchmark geometry [34, 35]. The red vertical valve at the bottom of the channel instantaneously snaps up, so that, together with the red
Figure 13. Cross sections of a hexahedral grid around the Space SHuttle.
© 2006 Spnnger Science+ Business Med1a, Inc .. Volume 28, Number 1. 2006 1 3
' \ ' \ flow speed ' \ 7 _) ..J
speed of sound
I Figure 1 4. 20, parallel channel. In it, a parallel plate and a vertical valve which is still open.
horizontal plate, it forms a step which suddenly chokes part of the channel.
Figure 15 shows a computational result. We see how the uniformly constant initial solution and the grid have developed after some time. The computational method highly satisfies the often conflicting requirements of numerical stability, accuracy, and monotonicity on the one hand, and computing and memory efficiency on the other [36].
Books, journals, and software
Twenty years ago, textbooks on CFD were rare, but several are available now (see, e.g., [37, 38, 39, 40]). There are
>
>
X
X
also scientific journals dedicated to CFD. Moreover, offthe-shelf CFD software can be purchased these days. Each issue of, e.g., the monthly Aerospace America contains colorful, full-page advertisements for CFD software. A practical overview of the CFD literature, software, and also vacancies can be found on the Web site of CFD Online [41].
Today, CFD's role is about as important as that of experimental fluid dynamics. And CFD continues to grow. It is fed by improvements in both computer science and numerical mathematics. In addition, CFD itself stimulates research in computer science and numerical mathematics: a fruitful interaction.
Figure 15. Computational result some time after instanteously closing the lower part of the channel. Top: iso-lines of density. When the ver
tical valve is still in the open position, the density in the entire channel is constant (everywhere the same blue color as at the inlet). Bottom:
computational grid automatically adapted to flow solution.
14 THE MATHEMATICAL INTELLIGENCER
At present, CFD enters into full cooperation with other disciplines, such as structural mechanics (computational fluid-structure interactions) and electromagnetism (computational magnetohydrodynamics ).
Outlook
The fact that commercial CFD software is a success is proof of the practical importance of the theoretical fluid-dynamics work since Euler. The growing availability of CFD software may seem to be a threat for CFD research; CFD researchers seem to make themselves redundant by their own success. Yet, this growing software availability may also be considered a good development. Not everyone has to write his/her own Euler or Navier-Stokes code. Coding such software from scratch gives the best insight and is pleasing work, but it may easily take too much time.
Education
A new question arises: How to teach CFD, now that it has become more and more important as an easily available, automatic tool? Not just factual knowledge but also understanding of the mathematical and physical principles of CFD remains indispensable, not only when practicing it as a science, but also when using it as a tool. The CFD-tool user must know and understand these principles well in (1) posing the computational problem, (2) choosing the numerical method to solve that problem, and (3) interpreting the computational results. The user must know the possibilities and limitations of computational methods and should be able to assess whether the computational results obtained fulfill the expectations or not. If not, it should be found out why. Thus CFD is not solving flow problems by blind numerical force. On the contrary, stimulated by the growing potential of CFD, still more complicated flow problems will be considered, problems which will require even more knowledge and understanding of flow physics and numerical mathematics.
Research
As CFD becomes more and more mature, it also becomes more difficult to contribute fundamental research to it. In recent decades a PhD student can hardly do such fundamental work as Godunov did. Students will have to acquire an evergrowing knowledge and understanding of CFD before they can start working in it themselves. On the other hand, thanks to the availability of CFD tools, the possibilities for application of CFD are far greater now than in Godunov's era. Just how CFD will develop remains unpredictable, and this is part of what makes it an exciting and attractive discipline.
In CFD plenty of research questions remain. New fluidflow problems will continue to arise, and there will certainly be times when we may say with Orville Wright, "Isn't it astonishing that all these secrets have been preserved for so many years just so that we could discover them!"
REFERENCES
[1] T. von Karman, Aerodynamics, McGraw-Hi l l , New York, 1963.
[2] H . Rouse and S. lnce, History of Hydraulics, Dover, New York,
1963.
[3] J. D . Anderson, A History of Aerodynamics, Cambridge University
Press, Cambridge, 1997.
[4] L. Euler, Principes gf!meraux du mouvement des fluides, Memoires
de I 'Academie des Sciences de Berlin, 11 (1755), pp. 274-31 5 .
[5] M . D. Salas, Leonhard Euler and his contributions to fluid me
chanics, AIM-paper 88-3564, AIM, Reston, VA, 1988.
[6] G. G. Stokes, On the theories of the internal friction of fluids in mo
tion, and of the equilibrium and motion of elastic solids, Transac
tions of the Cambridge Philosophical Society, 8 (1845), pp. 287.
[7] C. L. M . H. Navier, Memoire sur les lois du mouvernent des f/uides,
Memoires de I'Academie des Sciences, 6 (1822), pp. 389-440.
[8] C. Reid, Courant in Gottingen and New York. The Story of an Im
probable Mathematician, Springer-Verlag, New York, 1976.
[9] L. F. Richardson, Weather Prediction by Numencal Process, Cam
bridge University Press, Cambridge, 1922.
[1 0] R. Courant, K. 0. Friedrichs, and H. Lewy, Ober die partie/len Dif
ferenzgleichungen der mathematischen Physik, Mathematische
Annalen, 1 00 (1 928), pp. 32-74.
[11] W. Aspray, John von Neumann and the Origins of Modern Com
puting, MIT Press, Cambridge, Massachusetts, 1990.
[1 2] S. Ulam, John von Neumann, 1903-1957, Bulletin of the Ameri
can Mathematical Society, 64 (1958), pp. 1-49.
[13] J. von Neumann and R. D. Richtmyer, A method for the numeri
cal calculation of shocks, Journal of Applied Physics, 21 (1950),
pp. 232-237.
[14] D . van Dalen, L. E. J . Brouwer, 1 881-1966, Het Heldere Licht van
de Wiskunde, Bert Bakker, Amsterdam, 2002.
[15] P. Nauer (ed .) , Revised Report on the Algorithmic Language Algol
60 (available for download from http://www.masswerk.at/
algol60/report.htm).
[16] A. van Wijngaarden et al., Revised Report on the Algorithmic Lan
guage Algol 68, Springer-Verlag, Berl in, 1 976.
[17] http://www.abelprisen.no/en/.
[18] P. D. Lax and R. D. Richtmyer, Survey of the stability of l inear fi
nite difference equations, Communications on Pure and Applied
Mathematics, 9 (1 956), pp. 267-293.
[19] P. D. Lax, Weak solutions of nonlinear hyperbolic equations and
their numerical computation, Communications on Pure and Ap
plied Mathematics, 7 (1954), pp. 159-1 93.
[20] P. D. Lax and B. Wendroff, Systems of conservation laws, Commu
nications on Pure and Applied Mathematics, 13 (1960) , pp. 217-237.
[21] P. D. Lax, Hyperbolic Systems of Conservation Laws and the Math
ematical Theory of Shock Waves, SIAM, Philadelphia, 1973.
[22] A. Harten, P. D. Lax, and B. van Leer, On upstream differencing
and Godunov-type schemes for hyperbolic conservation laws,
SIAM Review, 25 (1 983), pp. 35-61.
[23] S. K. Godunov, Finite difference method for the numerical com
putation of discontinuous solutions of the equations of fluid dy
namics, Mathemat1cheskfi"Sborn1k, 47 (1959), pp. 271-306. Trans
lated from Russian at the Cornell Aeronautical Laboratory.
[24] G. F. B. Riemann, Ober die Fortpflanzung ebener Luftwellen von
endlicher Schwingungsweite, in : Gesammelte Werke, Leipzig,
1876. Reprint: Dover, New York, 1953.
[25] G. F. Duivesteijn , Visual shock tube solver (to be downloaded from
http://www. piteon.ni/cfd/) .
[26] B. van Leer, An Introduction to the article "Reminiscences about
difference schemes", by S. K. Godunov, Journal of Computational
Physics, 153 (1999), pp. 1-5.
© 2006 Spnnger Science+ Business Media, Inc., Volume 28, Number 1, 2006 15
[27] S. K. Gudunov, Reminiscences about difference schemes, Jour
nal of Computational Physics, 1 53 (1 999), pp. 6-25.
[28] B. van Leer, Towards the ultimate conseNative difference scheme.
V. A second-order sequel to Godunov's method, Journal of Com
putational Physics, 32 (1 979), pp. 1 01 -1 36. Reprint: Journal of
Computational Physics, 135 (1997}, pp. 229-248.
[29] B. van Leer, Flux-vector splitting for the Euler equations, in Lec
ture Notes in Physics, Vol. 170, Springer-Verlag , Berl in , 1 982, pp.
507-5 1 2 .
[30] P . L. Roe, Approximate Riemann solvers, parameter vectors, and
differences schemes, Journal of Computational Physics, 43 ( 1 981 ) .
pp. 357-372.
[31] S. Osher and F. Solomon, Upwind difference schemes for hyper
bolic systems of conseNation laws, Mathematics of Computation,
38 (1 982), pp. 339-374
[32] A Brandt , Multi-level adaptive solutions to boundary-value prob
lems, Mathematics of Computation, 31 (1 977), pp. 333-390.
[33] U. Trottenberg, C. W Oosterlee, and A Schuller, Multigrid, Aca
demic Press, New York, 2001 .
[34] A F. Emery, An evaluation of several differencing methods tor in
viscid fluid flow problems, Journal of Computational Physics, 2
(1 968), pp. 306--331 .
[35] P. R. Woodward and P. Colella, The numerical simulation of two
dimensional fluid flow with strong shocks, Journal of Computa
tional Physics, 54 (1 984), pp. 1 1 5-1 73.
[36] J. Wackers and B. Koren, A simple and efficient space-time adap
tive grid technique for unsteady compressible flows, in Proceed
ings 1 6th AIM CFD Conference (CD-ROM), AIM-paper 2003-
3825, American Institute of Aeronautics and Astronautics. Reston,
VA, 2003.
[37] P. Wesseling, Principles of Computational Fluid Dynamics.
Springer-Verlag, Berl in , 2001
[38] P. J. Roache, Fundamentals of Computational Fluid Dynamics,
Hermosa, Albuquerque. NM, 1 998.
[39] Ch. Hirsch, Numerical Computation of Internal and External Flows.
Vol. 1 Fundamentals of Numerical Discretization, Vol. 2 Computa-
AUTHOR
BARRY KOREN
CWI P.O. Box 94079
1 090 GB Amsterdam
The Netherlands
e-mail: [email protected]
Barry Koren studied Aerospace Engineering at the Delft Insti
tute of Technology, and Computational Fluid Dynamics at the
Von Karman Institute for Fluid Dynamics in Belgium. He is now
leader of the research group in Computing and Control at the
Dutch Centre for Mathematics and Computer Science (CWI)
in Amsterdam, and also professor of Computational Fluid Dy
namics at the Delft Institute of Techno logy. More information
can be found at http://homepages.cwi.nV-barry/.
He is married and the father of three children.
tiona/ Methods for lnviscid and Viscous Flows, Wiley, Chichester,
1 988-1 990.
[40] C. A J. Fletcher, Computational Techniques for Fluid Dynamics,
Vol. 1 Fundamental and General Techniques, Vol. 2 Specific Tech
niques for Different Flow Categories, Springer-Verlag, Berl in, 1 988.
[41] http://www cfd-online.com
ROCKY MOUNTAIN MATHEMATICA Join us this summer for a Mathematica workshop in the scenic and cool mountain environment of
Frisco, Colorado, 9100 feet above sea level. From July 9-14,2006, Ed Packel (Lake Forest College)
and Stan Wagon (Macalester College), both accredited Wolfram Research instructors with much
experience, teach an introductory course (for those with little or no Mathematica background) and an
intermediate course on the use of Mathemalica. Anyone who uses mathematics in teaching, research,
or industry can benefit from these courses.
The introductory course will cover topics that include getting comfortable with the front end, 2- and
3-dimensional graphics, symbolic and numerical computing, and the basics of Mathematica programming. Pedagogical issues relating to calculus and other undergraduate courses are discussed.
The intermediate course covers programming, graphics, symbolics, the front end, and the solution
of numerical problems related to integration, equation-solving, differential equations, and
linear algebra.
Comments from participant Paul Grant, Univ. of California at Davis:
The instructors had boundless enthusiasm for and an encyclopedic knowledge of Mathematica. I was able to complete an animation project that I will be showing to my students. I highly recommend these courses
to anyone wanting to learn Mathematica from the ground up or enhance their knowledge of the program.
For more info, contact packel©lakcforcst.edu or [email protected], or visit rmm.lfc.edu.
16 THE MATHEMATICAL INTELLIGENCER
M a them a tic a l l y Bent
The proof is in the pudding.
Opening a copy of The Mathematical
Intelligencer you may ask yourself
uneasily, "What is this anyway-a
mathematical journal, or what?" Or
you may ask, "Where am !?" Or even
"Who am !?" This sense of disorienta
tion is at its most acute when you
open to Colin Adams's column.
Relax. Breathe regularly. It's
mathematical, it's a humor column,
and it may even be harmless.
Column editor's address: Colin Adams,
Department of Mathematics, Bronfman
Science Center, Williams College,
Williamstown, MA 01 267 USA
e-mail: [email protected]
C o l i n Ada m s , Editor
The Theorem B laster Colin Adams
I s your theorem overweight? Need to trim the pages of your paper? Want
to fit into that volume on semi-simple Lie Algebras? Then you're in luck! Because we are offering to the public, for the first time ever, the amazing Theorem Blaster®. You've heard about it in the Math Lounge. You've seen it on Math TV. But now, due to a special purchase agreement with the American Mathematical Society, we can offer it to you at drastically reduced prices.
Overweight theorems can be dangerous in the long term. Sluggish and impenetrable, they cause colleagues to lose interest in your work, leading to publication rate slowdown and severe career trajectory tailspin.
But now you can have the theorem you always dreamed of. Yes, if you want hot-looking theorems, you want the Theorem Blaster.
Here are just a few of the many tes-timonials from our satisfied customers:
"My proofs were ponderous and unwieldy. Many mathematicians refused to read them. But now I can't keep my preprints on the shelf. Colleagues can't keep their hands off them."-John Conrad, Princeton
"Little children teased me. 'You're hiding your lack of mathematical insight inside layers of impenetrable notation,' they chanted in their barely audible sing-song voices as they played. I hated it, and I was intimidated, avoiding all contact with children. But now I walk past the playground with pride, my threepage preprint held high for all the toddlers to see."-Robert Lazarus, Yale
"For many years, I could only submit to journals with no upper page limit, like the Transactions. Now I'm publishing two-page notes in the Proceedings of the AMS. Thank you, Theorem Blaster."-Karen Hammond, Duke University
How does it work? By isolating the logical components of the core theorem and honing the fundamental connective tissue, the patented triple-action technology of the Theorem Blaster focuses its energies exactly where you need help the most. It burns off undesirable symbols and equations. It removes unsightly constants. It squeezes out the excess verbiage.
As a special bonus, the Theorem Blaster comes with our special video "Five Days to Beautiful Proofs and Theorems." Watch as our Ph.D.-trained personnel guide you in the use of your Theorem Blaster. In five days, you can say with pride, "Hey, there, random person walking down the street! Take a look at this proof!"
Always lacked the confidence to show your proofs off at the Mathematical Sciences Research Institute in Berkeley? Afraid to take that long bus ride up the hill? In the pit of your stomach was the fear that seminar participants might point and laugh at the circuitous route you took to prove your theorem. Or even worse, they might get up and leave ·in the middle of your· talk.
Well, live in fear no longer. Go ahead, get on the bus! Because they will be begging you for more theorems by the time you are done!
Well-known math trainer Steven Krantz has spent years teaching mathematicians to write clearly and concisely.
"I was amazed by the Theorem Blaster," he said. "It's like liposuction for math. My book A Primer of Mathematical Writing is superfluous in the face of this amazing invention."
Do you have a closet filled with cumbersome machines, each bought on the promise of sudden size reduc-
© 2006 Springer Science+ Business Media, Inc . . Volume 28. Number 1 . 2006 1 7
tion? The Corollary Cutter, the Lemma Loser, the Conjecture Cruncher? Well, throw them all away. You won't need them anymore!
And what about Mathersize@? Or n-ladies®? All those reduction fads are out the window! That's right. You don't need them anymore!
A top university released the results of a study demonstrating that the Theorem Blaster® is the fastest and safest method to reduce papers to a reasonable size. They compared it to other reduction methods and found that the theorem survival rate with the Theorem Blaster® approaches 95%, way
outperforming the competition. And, believe it or not, as a special
deal for those ordering today, in addition to the Theorem Blaster, and the video "Five Days to Beautiful Proofs and Theorems," you will also receive instructions for the world-famous Analysis Diet®. That's right. The diet where you just drop analysis from your mathematical diet and watch the bloat come off.
Epsilons and deltas, gone! cr-alge
bms, gone! Radon-Nikodym deriv
atives, gone!
Your proofs will shrink at a dra-
matic rate. Watch as lemmas and corollaries are exposed to view for the first time. It will bring out definition, in fact, lots of definitions.
This opportunity is not available in any math department. Don't delay! Call our toll-free number today. Why not order one for a friend as well? He will appreciate the help. We all will!
There is no guarantee that the bloat will
stay off. Persons with heart conditions
or difficulty digesting de Rham coho
molgy should consult their Ph.D. advi
sor before using the Theorem Blaster.
ScientificWorkP iace· Animate, Rof4te, Zoom, and Ry New in Version 5.5
Mathematical Word Processing • LAlEX Typesetting • Computer Algebra
Animated Tube Plot • Totnlkt ananlmattdt�o�btplot 1 Typt en expreuion in one Ot two varitbles 2 With tht lnnrtion point in the t)ll),..ssion, 3 Open 1M Plot ProperUe$ dialog end, on
tht umt varleblts
The n.lll animlllion shows a knot being dr�WR. � Plot 3D Animated + Tube r -IO<OU7-2C�l!t} + ljl' Ur) l
-1.5ro.{Ur} .. IOama -l*(.5U)
1 8 THE MATHEMATICAL INTELLIGENCER
'U7H n•.,. ' ' '
• Compute and plot using the MuPAJY' 3 computer algebra engine
• Animate 2D and 3D plots using MuPAD s VCAM
• Rotate, move, zoom in and out, and fly through 3D plots with new OpenGL e 3D graphics
• Label 2D and 3D plots so that the label moves when you rotate or zoom a plot
• Create 3D implicit plots
• Import !HEX files produced by other programs
• Use many new l.i\TE)C packages
• Get help from the new, extensive troubleshooting section of 'ljlpesetting Documents with Scientific WorkPlace and Scientific Word, Third Edition
The Gold Standard for Mathematical Publishing Scientific WorkPlace 5.5 makes writing,
sharing, and doing mathematics easier. A click of a button allows you to typeset your documents in f.!\ EX- Scientific WorkPlace enables both professionals and support staff to produce stunning books and articles.
MacKichan tOinrAU,I,.I:.
www.mackichan.com/mi • Email: info@mad<icl1an.com Toll-free: 877·724·9673 • Fax: 360-394·6039
David E . Row e , Editor l
Ancient Egypt ian Mathemat ics : New Perspectives on O ld Sources Annette lmhausen
Send submissions to David E. Rowe,
Fachbereich 1 7 -Mathematik,
Johannes Gutenberg University,
055099 Mainz, Germany.
Pro captu lectoris
habent suafata libelli
(Terentianus Maurus)
I f books, in general, have their own special fates-which depend on their
readers-the same is true for the mathematical "books" from ancient Egypt. Indeed, modem editors and subsequent readers have strongly influenced the way we view them today. And even now, readers of the third millennium can alter the fate of these early texts by their careful (or careless) reading. 1
Sources and Early Historiography
For the past fifty years, the reputation of Egyptian mathematics has been rather poor. This has been due in part to the very limited number of available primary sources, particularly when compared with the vast collections of cuneiform mathematical texts produced in Mesopotamia. In ancient Egypt the production of mathematics (as well as literature) took place in cities. Then, as today, Egyptian cities were located along the Nile, and hence close to water. This circumstance has had significant consequences for contemporary Egyptological research. On the one hand, papyrus, the main writing material in this culture, was dependent on absolute dryness for its preservation, a condition found in the Egyptian desert where most papyrus finds were made. However, in ancient Egyptian cities, where writings concerned with the mundane affairs of daily life were discarded after use, this condition was usually not fulfilled. Therefore, most of the written evidence documenting the role of mathematics in Egyptian social, economic, and cultural life must be assumed lost forever. On the other hand, to the extent that such sources may still be retrievable some day, practical problems stand in the way. The locations of ancient Egyptian cities often coincide with those of modem urban centers.
This makes it next to impossible to excavate at a number of locations where extant remains might still be found.
Among the few known (excavated) cities, the Middle Kingdom town of Lahun (also known as lllahun or Kahun) is exceptional, having yielded the richest findings of Middle Kingdom papyri so far, among which incidentally are a number of mathematical fragments. 2
The two most significant sources, however, the famous Rhind and Moscow mathematical papyri, were bought on the antiquities market, making their provenance uncertain. These and most of the other known mathematical sources were already published by 1930.
The achievements of the earliest researchers who studied these texts, especially those who worked during the first half of the twentieth century, were enormous. As editors, they managed to penetrate a foreign vocabulary of technical terms, which placed them in position to make a first attempt at understanding Egyptian mathematical methods. 3 As was common at that time, ancient sources and achievements were viewed and evaluated by means of direct comparison with modem conventions and results. In many respects, it was found that Egyptian mathematics had little in common with the methods found in modem mathematical textbooks. Nevertheless, with some effort the mathematical content of the ancient texts could be "decoded" and "translated" into modem mathematics.
Unfortunately, this type of reading often entailed a loss of the most striking characteristics of the original sources, a drawback that was little appreciated at the time. Not surprisingly, the "achievements" of Egyptian mathematics, judged in terms of a different mathematical culture (from more than 3000 years later), looked rather crude and simple. One of the early leading authorities on ancient mathematics was Otto Neugebauer, who wrote his dissertation on Egyptian methods of cal-
© 2006 Springer Science+ Business Med1a. Inc., Volume 28, Number 1 , 2006 1 9
culating with fractions.4 Afterward, Neugebauer turned his attention away from Egyptian mathematics to study Mesopotamian mathematics and astronomy, which he believed was a higher level of scientific achievement. As he once expressed this:
Egypt provides us with the exceptional case of a highly sophisticated civilization which flourished for many centuries without making a single contribution to the development of the exact sciences. [ . . . ] It is at this single center (Mesopotamia) that abstract mathematical thought first appeared, affecting, centuries later, neighbouring civilizations, and finally spreading like a contagious disease.5
It was surely in part due to the outstanding quality of the early scholarly contributions that readers accepted so readily this kind of negative assessment of Egyptian mathematics. As indicated already, this situation was compounded by the lack of new source material which-had it been therewould have required those capable of reading Egyptian texts to reflect upon the assessments of their predecessors. Thus, in the case of Mesopotamian mathematics, where new source material is still being uncovered on an almost regular basis, readers' opinions have changed significantly over time. H
Lacking this wealth of textual material, readers of the Egyptian texts seemed to have no basis for questioning the standard views of earlier experts like Neugebauer. Indeed, once the major Egyptian mathematical papyri became available in English or German translation, various historians of mathematics began contributing new ideas based on their own readings of these first translations. Often these involved modem mathematical symbolism, leading to results that had almost nothing in common with the original source text. 7
This once common approach has now been recognized as both anachronistic and misleading. Indeed, for the last 20 years historians of mathematics have started to take up and to rework the subject of ancient mathematics.8 It is now generally accepted that histori-
20 THE MATHEMATICAL INTELUGENCER
ans of mathematics cannot work on a source text without knowing the language in which it is written or the cultural background it comes from. At the same time, it has become obvious that mathematical knowledge is not universal. It is neither independent of the cultures in which it is produced and used, nor has it developed universally from basic beginnings to more and more advanced stages of knowledge. This dependence on cultural background begins already with number systems and number concepts, as has been demonstrated by various scholars working on ethnomathematicsY More advanced mathematical techniques and concepts have also been shown to be dependent on the culture that created them. 10
Current Research
From this description of past research, it follows that the editions of Egyptian mathematical sources are by now outdated. It is to be hoped that new editions can be published before the current ones reach their centenary. Likewise, older studies of Egyptian mathematics, those written more than 30 years ago, must be read with caution, bearing in mind the kind of approach past researches typically took. For an up-to-date introduction to the subject, the reader should consult the articles by Jim Ritter. 11 In the following sketch, I will attempt to give an overview of the state of current research, illustrated with selected examples from the source material.
Although there have been no spectacular new finds of mathematical papyri, extant sources, including the much-studied Rhind and Moscow papyri, still offer many clues about the role of mathematics in Egyptian life. Alongside these, the Lahun mathematical fragments have just been re-edited, including several previously unpublished fragments. 1 2
Other texts are still awaiting proper publication, such as the mathematical fragments of Papyrus Berlin 6619. The earlier publications from 1900 and 1902 only contain facsimiles of the two largest fragments. Moreover, the interpretations of them then given are not without problems. n The Cairo wooden
boards are currently available in two very small and hardly legible photos with a discussion of some of their content. While a number of demotic mathematical texts have been published, no detailed study of Egyptian mathematics in the Graeco-Roman period is available yet. 14
Evidence from the Predynastic Period
Apart from the extant mathematical texts, however, there are further sources available throughout Egyptian history which inform us about aspects and uses of mathematics as it evolved in ancient Egypt in periods from which no mathematical texts are extant. Written evidence exists from as early as around 3000 B.C., the oldest dating from shortly before the unification of Egypt. It comes from the tomb Uj at Abydos15 and consists of writing on pottery as well as on little tags of bone and ivory. These tags all reveal holes, suggesting they were probably once attached to some perishable goods from this grave, thus indicating their provenance and quantity. u; The quantities were rendered using elements and style familiar from the Egyptian number system in later times, i.e., a decimal system without positional notation (see Figure 1). In this system, each power of 10 up to 1 million was represented by a different sign. In order to write any number, the respective signs, written as often as needed, were juxtaposed in a symmetric way. Note that the hieroglyphic writing, which is what most people associated with ancient Egypt, was used mostly on stone monuments. For daily life purposes, Egyptian scribes wrote with a reed (dipped in ink) on papyrus or so-called ostraca (limestone or pottery shards). The
Figure 1 . Number representations on the
tags from tomb Uj.
script used in this writing is more cur
sive and abbreviated than hieroglyphic script. Several signs can be combined to form ligatures, whereas the writing
itself can vary a great deal, depending
on the individual scribe Gust like mod
em handwriting).
Mathematics in the Old Kingdom
After the unification of Egypt under a
single king (around 3000 B.C.), the Old
Kingdom (OK; 2686-2160 B.C.) brought
forth the first period of cultural bloom
in Egyptian history. Extant architec
tural remains, like the pyramids, as
well as such artifacts as the scribal
statues, demonstrate a high level of
cultural attainment by this time. There
can be little doubt that mathematical
techniques lay at the heart of this de
velopment as a significant tool for handling organizational and administrative
problems. To achieve something on the
scale of the pyramids, mathematics
was necessary not only for architec
tural planning but also for the organi
zation of labor. The scribal statues,
which depict high officials from this
period, demonstrate the importance of
the administrative system. Despite this,
there is practically no written evidence
for mathematical practices extant from
this time. Many of the monumental hi
eroglyphic inscriptions are still ex
tant-but these, of course, focus on
eternity and tell us little about Egypt
ian daily life and the affairs in which mathematics played an important part. Only very few papyri from this period
have survived, some in a very frag
mentary state.
Nevertheless, there is other direct
evidence of Egyptian mathematical
techniques, for example from the plan
ning and execution of building projects
such as a mastaba from Meidum (see
Figure 2). Around the comers of this
mastaba, beneath the ground level,
four 1-shaped mud-brick walls had
been built. On these walls a series of
diagrams can be found, which indicate
the slope of the sides of the mastaba.
This method of handling sloped surfaces points to the development of a
concept which is well documented in
the mathematical texts. 17 To express
sloped surfaces, such as the sides of a pyramid, the Egyptians used the so-
7 p a I
m s
'seqed'
Figure 2. Indication of a sloped surface at Meidum.
called sqd. This Egyptian term is de
rived from the verb qd, meaning "to
build." The sqd was used to measure
the horizontal displacement of the
sloped face for each vertical drop of
one cubit, that is the length by which
the sloped side had "moved" from the
vertical at the height of one cubit. The
sqd was always indicated in palms, and
if necessary, digits. Although we have
textual evidence for this concept only
from the Middle Kingdom onward,
sketches from the Old Kingdom indi
cate that it was in use during this ear
lier period. Note that the parallel lines
drawn on the mud bricks are spaced at
a distance of one cubit or seven palms. Furthermore there is early evidence
for several metrological systems. While
these units can also be found in later
mathematical texts, their appearance
in administrative papyri as well as in the inscriptions and depictions from
tombs indicates that these systems go
back at least to the Old Kingdom. Some
of these systems changed over time, but the sources from the Old Kingdom
suffice to trace these changes.
Calculations with Unit Fractions
One of the most intriguing aspects of
Egyptian mathematics concerns spe
cial methods for calculating fractions,
which were understood in ancient
Egypt as inverses of integers. 18 Hence,
the Egyptian notation for fractions did
not consist of a numerator and de
nominator, but rather a special symbol was used alongside an integer to des
ignate the corresponding fraction. An exception was the fraction %• which had a special sign. The fractions t. i• and ± were also written by using spe-
cial signs (indicating that these may be
older) rather than by using the general
Egyptian notation. 19 In modem stud
ies, Egyptian fractions are usually
described as unit fractions, and it is
often suggested that the Egyptians
"restricted" themselves to calculations with fractions having a numerator of
one. 20 As explained in the paragraph
above, however, this is a rather anachro
nistic view. Moreover, seen from a mod
em perspective, the Egyptian system
inevitably appears awkward and un
necessarily restrictive.
One of the first to study Egyptian
computations with fractions was Otto
Neugebauer, who devised a notational
system that parallels the Egyptian no
tation. Fractions, as inverses of inte
gers, are rendered by the value of
the integer with an overbar: thus, +. would be written as 5, i as 6, etc. Th� exceptional fraction % was rendered by Neugebauer as 3, whereas _1_, �. and _I_ - 2 ,J 4 appeared as 2, etc. This notational sys-
tem, which closely mirrors the Egyptian
concept of fraction, has become the
standard way of writing Egyptian fractions in modem textbooks.
Following this concept of fractions
as inverses of integers, the next step
consequently-was to express those
parts that correspond to a multitude of
inverses. This was done by (additive)
juxtaposition of different inverses.
Thus, � was written in the Egyptian sys-4 - -
tern as 2 4, whereas a general fraction
was given as a sum of different in
verses written in descending order ac
cording to their size. (Note that this no
tation enables one to be as accurate as necessary by considering only ele
ments up to a certain size.)
© 2006 Springer Science+Business Media, Inc., Volume 28, Number 1, 2006 21
Egyptian techniques of multiplication and division (see below for a more detailed description) frequently involved the doubling of a number. This could be done very easily if the number to be doubled was an integer or the inverse of an even integer. However, to double an odd Egyptian fraction (when the result is supposed to be a series of different inverses only) can be quite difficult to accomplish. Consequently, it proved useful to prepare tables giving the results for doubling the inverses of odd numbers. These can be found in the so-called 2 7 N tables still extant in two sources: at the beginning of the Rhind Mathematical Papyrus (for odd N = 1 - 101) and in the Lahun fragment UC 32159 (for odd N = 1 - 21).
Figure 3 shows the fragment UC 32159 in which the numbers are arranged in two columns. The first column shows (what we call) the divisor N, except for the first entry which shows both the dividend 2 and the divisor 3. This is followed by a second column that altematingly shows fractions of the divisor and their value (as a series of inverses). Thus, the second line starts with the divisor 5 in the first column: it is 2 7 5 that shall be expressed as unit fractions. This � foJ lowed in the second column by 3, 1 3, - - -15, a.!ld 3. This has to be read � 3 of 5
- - - - -is 1 3, and 15 of 5 is 3. Since 1 3 and 3 added equal 2, the series of unit fractions needed to represent 2 7 5 is 3 15.
The 2 7 N table in the Rhind papyrus shows the same arrangement of numbers; however, the solutions there are marked by the use of red ink.
Obviously, the representation of 2 7
N as a series of unit fractions is not unique. However, the Egyptian 2 7 N Table uses for each N only one of the theoretically possible representations. Those we find in the Lahun fragment, for example, are identical to the ones found in the table of the Rhind papyrus. And whenever an odd fraction is doubled within the mathematical texts, it is this same representation that we find used.
This circumstance has fascinated a number of experts on additive number theory. In fact, there have been several attempts to crack the puzzle posed by the 2 7 N Table by finding the criteria that led the Egyptians to employ just these particular representations. Yet, while it is possible to describe some of the general tendencies-e.g. , representations with fewer elements are favored as are also representations with larger inverses, etc.-it has not been possible to establish strict mathematical rules that explain the choices the Egyptians mathematicians made. Rather than criticizing them for their lack of insight-or blaming them for not having followed strict rules that would comply with a different mathematical concept of fractions devised by another culture several thousand years
2 3
5
7
9
11
13
15
17
19
21
3 2
3 1 3 15 3 - - - -
4 1 2 4 28 4
6 1 2 18 2 - - - -
6 1 3 6 66 6 - - - - - -
8 1 2 8 52 4 104 8
10 1 2 30 2 - - - - - -
12 1 3 12 51 3 69 4 - - - - - - -12 1 2 12 75 4 114 [6]
14 1 2 42 2
later-it seems more appropriate to recognize that mathematics is, indeed, culturally dependent; our modem point of view may not afford us the best picture of past achievements. Thus, instead of trying to concoct an explanation of the Egyptian solutions by using modem mathematics, it may be more rewarding simply to "accept" the Egyptian table and examine its use and usefulness within the mathematical environment that employed it.
Mathematical Problem Texts
from the Middle Kingdom
Apart from tables, the mathematical texts also include special procedures articulated within problem texts. As these names indicate, such texts set out a problem and then give instructions showing how to solve it. Procedure texts derive from an educational setting. They may have been written by a teacher, who was compiling a handbook, or perhaps by a student engaged in practicing mathematical techniques. An appreciation of this context is important for understanding these texts, which were intended to prepare scribes for the mathematical tasks they would later have to execute as part of their daily work.21 Given that these texts were written for this type of mathematical education, it should not be expected that we can learn how the Egyptians developed their mathematical knowledge from sources of this nature.
The extant hieratic mathematical texts contain roughly one hundred problems. Furthermore, in the largest of these texts, the Rhind Mathematical Papyrus (see Figure 4), we can discern an arrangement of these problems according to their rising level of difficulty. This is not to be judged by purely mathematical aspects alone but also by additional knowledge (often from a practical background) which is necessary to solve the problems. This can be seen, for example, in pRhind, problems 31-34 and those immediately following, problems 35-38. Mathematically, both groups teach a procedure for determining an "unknown" number if its
Figure 3. Fragment UC 32159: 2 .;- N table (Copyright Petrie Museum of Egyptian Archaeol- sum with fractions of itself is given. ogy, University College London). The procedure for solving the prob-
22 THE MATHEMATICAL INTELLIGENCER
lems in both groups is roughly the same. However, in the second group (pRhind, problems 35-38), the "unknown" number is not an abstract number but a quantity of grain. Therefore the result, which is determined in the same way as in the preceding problems, needs to be transformed afterwards into the respective metrological units.22
The style of Egyptian mathematical problem texts can best be appreciated by looking at an actual example, like problem 56 of the Rhind Mathematical Papyrus:
Method of calculating a pyramid, 360 is its base, 250 is its height. You shall let me know its inclination. You calculate half of 360. It results as 180. You divide 180 by 250. 2 5 50 of a cubit results. 1 cubit is 7 palms. 23 You multiply with 7.
7 \ 2 3 2 \ 5 1 3 15 \ 50 10 25 1ts inclination: 5 25
palms
Problem 56, like the other four examples of pyramid problems found in the Rhind Papyrus (nos. 57, 58, 59, and 59b ), teaches the relation between the base, height, and inclination of the sides. This example complements the OK sketch found on the walls around the mastaba with sloping sides, which was discussed above. In fact, the technical term sqd-the number of palms the slope of a slanted plane recedes per vertical difference of one cubit-is explicitly indicated in the problem text. Thus, the base, height, and inclination of a pyramid are linked by the relation:
1 /2 base inclination = 7 palms X
height
The problem above presents a pyramid with base (360) and height (250); its inclination is to be calculated. The procedure calls for calculating half of the base and dividing this by the height. The result is then multiplied by 7 to obtain the inclination in palms. Having grasped "what is going on" in this problem, let us now take a second, closer look at the Egyptian text and its means of structure.
The text begins-as is typical for mathematical problem texts-with a title "Method of calculating a pyramid." Note that the beginning of the title is written in red ink (rendered in my translation in bold). This use of red ink helps the reader recognize at a glance the beginnings of individual problems. The title of mathematical problems is very often given as "Method of . . . " followed by a key word which indicates the type of problem. In our example, the key word is the Egyptian mr, "pyramid."
After this title, the given data are introduced, and they are always specific numerical values. This statement of the data is generally followed by a question or command, outlining the problem that the scribe shall solve. In this example: "You shall let me know its inclination." Next, we see a sequence of instructions, followed by intermediate results. This procedure then leads to the numerical solution of the problem. Each instruction usually consists of one arithmetic operation. The Egyptian mathematical language distinguishes addition, subtraction, multiplication, division, halving, inverting, squaring, and the extraction of square roots. These individual mathematical operations are expressed without any use of mathematical symbols. The instructions themselves are always given as complete sentences.
Furthermore, in this part of the text, a special verb form is used, the socalled sd.m.IJr=f. The name consists
of the Egyptian verb "to hear" (sd.m), which is used in Egyptian grammars to demonstrate different conjugations, its characteristic morphological element (IJr) and the suffix pronoun of the third person singular (f). Its function is to express a "general truth" which results as a necessary sequence from previously stated conditions.24 In the mathematical texts, the sd.m.IJr=f is used for both instructions and announcing intermediate results. As for the latter, the verb form expresses "mathematical facts"-if 2 and 2 are added, the result will necessarily be 4. The use of the sd.m.IJr=f in the instructions underlines the specific procedural character of the text: the sequence of instructions necessarily has to be followed to solve the problem. The last i�s.!_ruction given, the multiplication of (2 5 50) by 7 is followed by a scheme of numbers. This carries out the actual multiplication in the Egyptian manner, which may now be described.
Multiplication (and division) are executed following a scheme that uses two columns of numbers. 25 Each multiplication begins with the initialization which is found in the first line of the scheme: a dot is placed in the first column and the number to be multiplied in the second column. The multiplication is carried out by subsequent operations in both columns using a variety of techniques, depending on the numerical values of the numbers that shall be multiplied. The aim is to find the multiplier as a combination of entries in lines of the first column. The respective lines of the second column will then be the result of the multiplication.
Figure 4. Rhind Mathematical Papyrus, No. 56 Copyright The British Museum.
Problem 56 of the Rhind Papyrus shows the notation used to compute 7 X 2 5 50. The initialization is followed by three more lines, each of which indicates
_ o�e
.!!_f the required fractional
parts (2, 5, 50) of the multiplier. How the individual entires of the second column were found is not obvious. It is possible that there may have been tables for fractional parts of 7, as this was a number that leads to complicated calculations, but which came up frequently due to the metrological conventions.26
© 2006 Springer Science+Business Media, Inc., Volume 28, Number I, 2006 23
Finally, the result of the problem is announced. Next to the text of the problem there is a sketch indicating characteristic measurements for this problem, i.e., the values of base and height (see Figure 5). This step-by-step layout can be found in virtually all Egyptian problem texts. This being the case, one can easily see that the formal aspect of phrasing mathematics in the form of procedures will be completely lost if a problem is "translated" into a modem algebraic equation (in this case: inclination = (� base/height) X 7 palms). While this formula has the advantage of informing a modem reader at a single glance how an ancient measure was defined, it conveys nothing whatsoever about the procedural character of Egyptian mathematics. Moreover, algebraic formulae played no part in Egyptian mathematics so that the above formulation for the sqd is anachronistic, at best, as it is foreign to the methods actually found in Egyptian texts.
Analyzing Egyptian Problem Texts
As it happens, a closer analysis of the problem texts reveals many hitherto unnoticed methodological features of Egyptian mathematics. Indeed, the procedural format can be used as a key to analyze not only individual problems but also various types of problems as found in the mathematical papyri. To get beyond a superficial understanding of Egyptian mathematics, however, a method was needed that enabled a reader to analyze and compare the Egyptian procedures. Such an approach was first proposed by Jim Ritter.27 In my dissertation I have adapted this method to analyze the various procedures used in all hieratic mathematical problems. 28
The analysis of a specific problem text can be carried out by rewriting it in two stages. In the first, one keeps the numerical values indicated in the source text but rewrites the instructions by replacing the rhetoric formulations with modem symbols that indicate the respective arithmetic operations. The data are noted at the beginning of the scheme by their numerical values. Thus, for the example cited above (pRhind, problem 56), the text would be rewritten as follows:
24 THE MATHEMATICAL INTELLIGENCER
Figure 5. Sketch at the end of Rhind Mathe·
matical Papyrus, No. 56.
Method of calculating a pyramid, 360 is its base, 360 250 is its height. 250
You shall let me know its inclination. You calculate half of 360.
(1) 2 X 360 It results as 180. = 180 You divide 180 by 250.
(2) 180 -7- 250 2 5 50 of a cubit results.
1 cubit is 7 palms. You multiply with 7.
= 2 5 50
(3) 2 5 50 X 7
The result allows one to see at a glance whether the arithmetic operations to be carried out were simplified by the choice of data. For example, in problem 43 of the Rhind papyrus, the calculation of the volume of a granary with circular base, the diameter of the granary is given as 9. This greatly facilitates the calculational procedure, the first step of which is to determine � of the diameter. In the values of our 9 problem, the given data were 360 and 250. While the first step, halving 360, is fairly straightforward, the second, the division of the result of the first step by the second datum results in a fraction of three parts, which then has to be multiplied by 7. Thus, by comparison, the data in problems 58 and 59 result in easier calculations.
This first stage of rewriting is especially helpful when dealing with a corrupt text, as the modem reader is forced to follow the source text and identify the procedure in a step-by-step fashion. It then becomes immediately apparent where specific difficulties arise in the source.
To further analyze the text so as to reveal how its procedures are related
to those used in other problems, it is necessary to distinguish between different types of numbers that can appear throughout the procedure. The first numbers a reader encounters are the data of the given problem. From the second instruction on, three types of numbers are possible: data, intermediate results, and constants. To distinguish these, and also to get a clearer view of the structure of the procedure, a second stage of rewriting is required. In this stage the data are indicated by symbols D;, whereas intermediate results are specified by a number in parentheses (x) which specifies the step in the procedure that leads to the given result. The only actual numbers that now appear in the rewritten text are constants. Thus, for our example, the result of this second rewriting is as follows:
Method of calculating a pyramid, 360 is its base, D1
250 is its height. D2 You shall let me know its inclination. You calculate half of 360.
(1) 2 X Dl It results as 180. You divide 180 by 250.
(2) (1) -7- D2 2 5 50 of a cubit results. 1 cubit is 7 palms. You multiply with 7. (3) (2) X 7
In my dissertation I have analyzed the procedures of all hieratic mathematical problems by rewriting the procedure in the form of a symbolic algorithm. This makes it possible to compare the various procedures used and analyze their respective complexity. The analysis of problems by means of their procedures or algorithms thus constitutes a powerful tool for comparing the structure of individual mathematical problem texts. From this, one can learn a great deal about Egyptian mathematical techniques. Within the Rhind Mathematical Papyrus, for example, one fmds groups of problems with similar procedures (pRhind, No. 24-27), as well as a progression within one group from basic procedures to more elaborate ones (pRhind, No. 69-78).
Identifying an unambiguous symbolic algorithm can sometimes be
straightforward, as in the example above. Unfortunately this is not the case with all problems. Individual instructions may be missing-sometimes they are replaced by a written calculation, or several steps are summarized in one instruction only. These types of difficulties can sometimes be overcome by taking into account all of the available source material. If-as in the Rhind Papyrus-several problems of the same kind are available and their procedures are identical insofar as they are explicitly stated, then those problems which lack certain instructions can occasionally be reconstructed by means of the more detailed problems.
I would like to stress in this context that both types of rewriting are merely tools for analyzing specific aspects of the procedures found in the problem texts, whereas the source texts themselves remain central and should never be neglected in any analysis. Taking the three versions of the procedure together, however, enables one to form a more complete analysis that includes not only the various procedures but also technical mathematical vocabulary, as well as the relation of drawings and calculations carried out in writing connected with the procedure, and others.
Mathematics within the Context
of Egyptian Culture
Another integral part of the reassessment of Egyptian mathematics concerns its role within Egyptian culture. Mathematics was one of the key elements of scribal training in pharaonic Egypt. It provided the scribes with a crucial tool they needed to fulfil their administrative tasks as well as to plan and carry out construction projects. Consequently, many of the mathematical problems they dealt with were related to practical matters, e.g., the distribution of rations, the volume of granaries, or the amount of produce to be delivered by a worker. Our understanding of mathematical problems of this kind is at least partially dependent on our appreciation for these larger contexts.
This can be demonstrated with the so-called bread and beer problems, which appear against the background of economic activity, baking and brewing, under the control of a local au-
thority (state or temple). A quantity of grain is taken from a granary and then given to workers who produce bread and/or beer from it. Obviously it was necessary to ascertain the quantity-in loaves of bread or vessels of beer-of a given quality (in this case measured by grain content) that was equivalent to the amount of grain initially given to the workers. The mathematical side of this control is represented by the bread and beer problems. 29 The terminology used in these problems is taken from the respective technological language. Thus the bread and beer problems evolve around the psw, a unit which measures how many loaves of bread have been made from one 1}1;,3.t of grain. Apart from the psw, there are two additional standard phrases indicating the use of specific kinds of grain products and their quantities. Obviously, this has further consequences for the respective calculations. Similar observations can be made for other groups of practical problems as well. These generally involve not only the "basic" mathematical terminology but also further knowledge related to the technological or administrative background. This usually makes them not only more difficult to understand but also less likely to be "mirrored" by a familiar problem in modem mathematics. Thus, early historical research often neglected this area of Egyptian mathematics.
However, as is obvious from the ordering of the problems found in the Rhind Papyrus, it was precisely these practical problems that were considered more advanced. After all, the aim of the mathematical handbooks was to prepare scribes for their daily administrative work. So if we want to obtain insights into Egyptian mathematics, we must consider these problems and try to understand them. The setting of the individual problem may help to point to further sources (not only textual) which may be useful to understand the additional terminology and practice. Furthermore, it is this type of problem that indicates other possibilities of gaining knowledge about Egyptian mathematics apart from the restricted corpus of mathematical texts. The actual output of the scribes in doing their daily work provides us with numerous
documents that prove the use of mathematical techniques. Thus Michel Guillemot has used a ration text from Kahun to analyze mathematical practices. 30 These can be linked to techniques taught in mathematical papyri.31 It is to be hoped that this example can be followed for other texts as well.
The most promising sources still to be explored in this respect are the Reisner Papyri. This set of four papyrus rolls contains calculations for the building of a sanctuary, including ration tables, actual building calculations, as well as the administration of workshops. 32 They not only enlarge the meagre set of seven problems related to architecture which are known from the Moscow (problem 14) and Rhind (problems 56-60) papyri, but they also demonstrate that the amount of work done was linked to a specific number of workers (and rations) per day.
Evidence of Mathematics
in the New Kingdom
While the mathematical texts date almost exclusively from the Middle Kingdom, other sources are available from all periods of Egyptian history. The Wilbour Papyrus, a text from the New Kingdom, is an official record of measurements and assessments of fields over a distance of 90 miles along the Nile. The fields are given by localization and acreage, their assessments referring to taxes specified in amounts of grain.
Another major opportunity to find relevant sources of mathematics for the New Kingdom is provided by the excavation of Deir el Medina. Deir el Medina is the modem name of an ancient Egyptian village on the West Bank of the Nile opposite Luxor. The village was inhabited by workmen who were responsible for the construction and decoration of the tombs in the Valley of the Kings. Deir el Medina has
yielded a huge quantity of artifacts and texts relating to daily life in the New Kingdom-similar to the findings at Lahun for the Middle Kingdom. Among the sources are ration texts, building plans, as well as texts for the education of scribes. The ostracon in Figure 6 shows a fragment of an exercise in the multiplicative writing of large numbers. It shows in the first column
© 2006 Springer Science+Bus1ness Media, Inc., Volume 28, Number I , 2006 25
Figure 6. Deir el Medina: Remains and Ostracon with Number Exercise.
(on the right) the numbers 600,000, 700,000, and 800,000 and in the second (middle) column the numbers 5,000,000, 6,000,000, and 7,000,000 written by the sign for the number 100,000 (or 1 ,000,000) with the respective multiplicative factors (6, 7, and 8 and 5, 6, and 7) below. The third column (left) shows again the sign for 1,000,000 and two illegible signs below.
Conclusions
Although Egyptian mathematics will probably never have the vast number of sources that still can be found in other cultures like India or Mesopotamia, there is more available than has been used so far. 33 The analysis of all the available mathematical texts, taken along with the additional material from administrative economic and literary contexts related to Egyptian mathematics, is certain to provide a better foundation for understanding its role within Egyptian culture. This integrated approach represents an important advance beyond the early studies that relied exclusively on an internal analysis of a small corpus of mathematical texts, which served for several decades as the sole basis for assessing nearly three millennia of mathematical life in ancient Egypt. By carefully rereading these classical mathematical texts while according the new sources a serious first reading, we may anticipate that the fate of Egyptian mathematics faces an exciting future.
NOTES
1 . I thank David Rowe for his comments on
previous versions of this article and for his
corrections of my English. I also thank
Richard Parkinson of the British Museum
and Stephen Quirke of the Petrie Museum
26 THE MATHEMATICAL INTELLIGENCER
for permission to include photographs of
sources.
2. See Annette lmhausen and Jim Ritter,
"Mathematical Papyri , " in : Mark Collier and
Stephen Quirke (eds.), The UCL Lahun Pa
pyri: Religious, Literary, Legal, Mathematical
and Medical, Oxford: Arcaheopress 2004.
3. Among the early editions, the most note
worthy are still Thomas E. Peel, The Rhind
Mathematical Papyrus. British Museum
10057 and 10058, London: Hodder and
Stoughton 1 923, and Wasili W. Struve,
Mathematischer Papyrus des Staatlichen
Museums der Sch6nen Kunste in Moskau
(Quellen und Studien zur Geschichte der
Mathematik, Abteilung A: Quellen, Vol. 1 ) ,
Heidelberg: Springer 1 930.
4. Otto Neugebauer, Die Grundlagen der
agyptischen Bruchrechnung, Berl in: Julius
Springer 1 926.
5 . Otto Neugebauer, A History of Ancient
Mathematical Astronomy (Part Two). Berlin,
Heidelberg, New York: Springer 1 975: 559.
6. See for example the interpretations of
Plimpton 322, e.g. , compare Joran Friberg ,
"Methods and traditions of Babylonian
mathematics: Plimpton 322 , Pythagorean
triples and the Babylonian triangle param
eter equations," Historia Mathematica 8
(1 98 1 ): 277-318 and the recent reassess
ment by Eleanor Robson (Eleanor Robson,
"Neither Sherlock Holmes nor Babylon : a
reassessment of Plimpton 322 , " Historia
Mathematica 28 (200 1 ) ; 1 67-206 and
Eleanor Robson, "Words and pictures: new
light on Plimpton 322," American Mathe
matical Monthly 1 09 (2002): 1 05-1 20).
7. An extreme example of this is Richard
Gill ings, "The Volume of a Truncated Pyra
mid in Ancient Egyptian Papyri , " The Math
ematics Teacher 57 (1964): 552-555.
8. For Egyptian mathematics, see for exam
ple James Ritter, "Chacun sa verite: les
mathematiques en Egypte et en Me
sopotamie , " in : Michel Serres (ed.) , Ele-
ments d'histoire des sciences: 39-61 ,
Paris: Bordas 1 989: James Ritter, "Egyp
tian Mathematics," in: Helaine Selin (ed.),
Mathematics Across Cultures: The History
of Non-Western Mathematics : 1 1 5- 1 36,
Dordrecht, Boston, London: Kluwer 2000,
as well as Annette lmhausen, Agyptische
Algorithmen: Eine Untersuchung zu den
mittelagyptischen mathematischen Auf
gabentexten, Wiesbaden: Otto Harras
sowitz 2003. For Greek Mathematics, cf.
Serafina Cuomo, Ancient Mathematics,
London, New York: Routledge 2001 ,
Michael N. Fried and Sabetai Unguru,
Apollonius of Perga 's Conica. Text, Con
text, Subtext, Leiden: Brill 2001 , as well as
David Fowler, The Mathematics of Plato's
Academy: A New Reconstruction (Second
Edition), Oxford: Clarendon Press 1 999,
and Reviel Netz, The Shaping of Deduc
tion in Greek Mathematics: A Study of
Cognitive History (Ideas in Context 5 1 ) , Cambridge: Cambridge University Press
1999. For Mesopotamian mathematics,
see most recently Jens H0yrup, Lengths,
Widths, Surfaces. A Portrait of Old Baby
lonian Algebra and its Kin, New York:
Springer 2002, and Eleanor Robson,
Mesopotamian Mathematics, 2 1 00-1 600
BC: Technical Constants in Bureaucracy
and Education (Oxford Editions of
Cuneiform Texts XIV), Oxford: Clarendon
Press 1 999.
9 . See Gary Urton, The Social Life of Numbers.
A Quechua Ontology of Numbers and Phi
losophy of Arithmetic, Austin , Texas: Uni
versity of Texas Press 1 997, and Marcia As
cher, Mathematics Elsewhere. An
Exploration of Ideas across Cultures, Prince
ton, N.J . : Princeton University Press 2002.
1 0. See, for example, for Mesopotamia Jens
H0yrup, Lengths, Widths, Surfaces. A Por
trait of Old Babylonian Algebra and its Kin,
New York: Springer 2002.
1 1 See note 8.
1 2 . See Annette lmhausen and Jim Ritter,
"Mathematical Papyri , " in: Mark Collier and
Stephen Quirke (eds ) , The UCL Lahun Pa
pyri: Religious, Literary, Legal, Mathematical
and Medical, Oxford: Arcaheopress 2004.
Another mathematical fragment will be pub
lished in the next volume of that series.
1 3 . See Oleg Berlev, "Review of William Kelly
Simpson: Papyrus Reisner I l l : The Records
of a Building Project in the Early Twelfth
Dynasty, Boston : Museum of Fine Arts
1 969," Bibliotheca Orienta/is 28 (1 97 1 ):
324-326, esp. p. 325.
1 4 . Richard Parker, "A Demotic Mathematical
Papyrus Fragment," Journal of Near East
ern Studies 1 8 (1 959): 275-279; Richard
Parker, Demotic Mathematical Papyri,
Providence, R . I . : Brown University Press
1 972; Richard Parker, "A Mathematical Ex
ercise-P. Dem. Heidelberg 663 , " Journal
of Egyptian Archaeology 61 (1 975):
1 89-1 96. A list of Demotic mathematical
ostraca can be found in Jim R itter, "Egypt
ian Mathematics , " in: Helaine Selin (ed.),
Mathematics across Cultures. The History
of Non-Western Mathematics , Dordrecht:
Kluwer 2000: 1 34, note 27.
1 5 . See Gunter Dreyer, Umm ei-Qaab I . Das
pradynastische Konigsgrab U-j und seine
fruhen Schriftzeugnisse, Mainz: Von Zabern
1 998
1 6 . For a discussion of the inscriptions on these
tags, see Gunter Dreyer, Umm ei-Qaab I.
Das pradynastische Konigsgrab U-j und
seine fruhen Schriftzeugnisse, Mainz: Von
Zabern 1 998, pp. 1 37-145, and John
Baines, "The Earliest Egyptian Writing: De
velopment, Context, Purpose," in: Stephen
D. Houston, The First Writing. Script Inven
tion as History and Process, Cambridge:
Cambridge University Press 2004: 1 50-1 89.
1 7 . See problems 56-60 of the Rhind Mathe
matical Papyrus.
1 8. Jim Ritter, "Mathematics in Egypt, " in :
Helaine Selin (ed.), Encyclopedia of the
History of Science, Technology and Med
icine in Non-Western Cultures, Dordrecht,
Boston, London: Kluwer 1 997, p. 631 .
1 9. For the prehistory of Egyptian fractions and
their development see Jim Ritter, "Metrol
ogy and the Prehistory of Fractions," in:
Paul Benoit, Karine Chernla, Jim Ritter
(eds.) , Histoire de fractions, fractions d'his
toire: 1 9-34, Basel, Boston, Berlin:
Birkhauser 1 992.
20. See, for example, the description of Cou
choud: " . . . i l ne semble avoir connu que
les fractions unitaires, c'est a dire celles
dans lesquelles le numerateur est toujours
equivalent a ! 'unite, . . . " (Sylvia Couchoud,
Mathematiques Egyptiennes. Recherches
sur les connaissances mathematiques de
I 'Egypte pharaonique, Paris: Le Leopard
d'Or 1 993, p. 2 1 ) or that of Gill ings: "When
the Egyptian scribe needed to compute
with fractions he was confronted with
many difficulties arising from the restriction
of his notation. His method of writing num
bers did not allow him to write such sim
ple fractions as % or % because all fractions
had to have unity for their numerators (with
one exception)." (Richard J. Gill ings, Math
ematics in the Time of the Pharaohs, Cam
bridge, Mass . : MIT Press 1 972, p. 20).
2 1 . See Jim Ritter, "Egyptian Mathematics , " in:
Helaine Sel in (ed.), Mathematics across
Cultures. The History of Non-Western
Mathematics, Dordrecht, Boston, London:
Kluwer 2000, p. 1 20.
22. For a discussion of the use of an abstract
number system and conversions into
metrological systems, see Jim Ritter,
"Egyptian Mathematics," in : Helaine Selin
(ed.), Mathematics across Cultures. The
History of Non-Western Mathematics,
Dordrecht, Boston, London: Kluwer 2000,
pp, 1 2 1 - 1 22.
23. The cubit was the Egyptian standard mea
sure of length. 1 cubit consisted of 7
palms; each palm, of 4 digits.
24. An English example for its use would be
the statement "If I have a stone in my hand,
and let it drop, then the stone falls to the
ground." The last part of this statement
"then the stone falls to the ground" is
where the sqm.l)r==f is used in an Egypt
ian text.
25. Examples can be found in Annette
lrnhausen and Jim Ritter, "Mathematical
Fragments: UC321 1 4, UC321 1 8, UC321 34,
UC321 59-UC32 1 62, " in : Mark Collier and
Stephen Quirke, The UCL Lahun Papyri:
Religious, Literary, Legal, Mathematical
and Medical (British Archaeological Re
ports International Series 1 209): 7 1 -96,
Oxford; Archaeopress 2004, esp. pp. 85-86.
26. The New Kingdom Ostracon Senmut 1 53
may be interpreted as a table of + in this
way, see David Fowler, The Mathematics
of Plato's Academy: A New Reconstruc
tion (Second Edition), Oxford: Clarendon
Press 1 999, p. 269.
27. Jim Ritter, "Chacun sa verite: les rnathe
rnathiques en Egypte et en Mesopotamie,"
in : Michel Serres (ed.), Elements d'histoire
des sciences: 39-61 , Paris: Bordas 1 989
(English edition: Jim Ritter, "Measure for
Measure: Mathematics in Egypt and
Mesopotamia," in: Michel Serres (ed.) , A
History of Scientific Thought. Elements of
a History of Science: 44-72, Oxford:
Blackwell 1 995).
28. Annette lrnhausen, Agyptische Algorith
men. Eine Untersuchung zu den mittel
agyptischen mathematischen Aufgaben
texten (Agyptologische Abhandlungen 65).
Wiesbaden: Otto Harrassowitz 2003.
29. For a detailed discussion of these prob
lems see Annette lmhausen, "Egyptian
Mathematical Texts and their Contexts, "
Science in Context 1 6, 2003: 367-389.
30. Michel Guillemot, "Les notations et les pra
tiques operatoires permettent-elles de par
ler de 'fractions egyptiennes'?", in: Paul
Benoit, Karina Chemla, Jim Ritter (eds.), His
loire de fractions, fractions d'histoire, Basel,
Boston, Berlin: Birkhauser 1 992: 53-69.
31 . Annette lmhausen, "Calculating the Daily
Bread: Rations in Theory and Practice,"
Historia Mathernatica 30 (2003): 3-1 6.
32. A first attempt to analyze the mathemati
cal content of some parts of the Reisner
Papyri has been made by Richard J .
Gil l ings, Mathematics in the Time of the
Pharaohs, Cambridge, Mass. MIT Press
1 972, pp. 2 1 8-231 .
33. A variety of architectural sources (with
mathematical implications) can be found in
Corinna Rossi, Architecture and Mathe
matics in Ancient Egypt, Cambridge: Cam
bridge University Press 2004.
AUTHOR
ANNETTE I MHAUSEN
Trinity Hall
Cambridge University
Trinity Lane, Cambridge CB2 1 T J UK
e-mail: [email protected]
Annette lmhausen studied mathe
matics, history of mathematics, and
Egyptology and received her PhD
from Mainz University (Germany) un
der David E. Rowe. She has held fel
lowships at the Dibner Institute for the
History of Science and Technology
(Cambridge, Mass.) and at Trinity Hall
(Cambridge University, England). She
is currently completing a book that
outlines the historical development of
ancient Egyptian mathematics, situ
ating it within the larger social, eco
nomic, and cultural background. In
her spare time she enjoys running 1 /2
and 2/3 marathons.
© 2006 Springer Science t Business Media. Inc., Volume 28. Number 1, 2006 27
M?tffii•i§u6hl%1i@i§4fii.J .. t§.id M i c hael Kleber and Ravi Vaki l , Ed itors
This column is a place for those bits of
contagious mathematics that travel
from person to person in the
community, because they are so
elegant, suprising, or appealing that
one has an urge to pass them on.
Contributions are most welcome.
Please send all submissions to the
Mathematical Entertainments Editor,
Ravi Vakil, Stanford University,
Department of Mathematics, Bldg. 380,
Stanford, CA 94305-21 25, USA
e-mail: [email protected] .edu
The Locker Puzzle Eugene Curtin and Max Warshauer
Suppose I take the wallets from you and ninety-nine of your closest
friends. We play the following game with them: I randomly place the wallets inside one hundred lockers in a locker room, one wallet in each locker, and then I let you and your friends inside, one at a time. Each of you is allowed to open and look inside of up to fifty of the lockers. You may inspect the wallets you find there, even checking the driver's license to see whose it is, in an attempt to find your wallet. Whether you succeed or not, you leave all hundred wallets exactly where you found them, and leave all hundred lockers closed, just as they were when you entered the room. You exit through a different door, and never communicate in any way with the other people waiting to enter the room. Your team of 100 players wins only if every team
member finds his or her own wallet. If you discuss your strategy beforehand, can you win with a probability that isn't vanishingly small?
We develop a more mathematical formulation to facilitate a precise discussion of the problem. This consists of numbering our players, and replacing wallets by player numbers! Our game is played between a single Player A against a Team B with 100 members, B1, B2, . . . , B100. Player A places the numbers 1, 2, . . . , 100 randomly in lockers 1, 2, . . . , 100 with one number per locker. The members of Team B are admitted to the locker room one at a time. Each team member is allowed to open and examine the contents of exactly 50 lockers. Team B wins if every team member discovers the locker containing his own number. Team B is allowed
28 THE MATHEMATICAL INTELLIGENCER © 2006 Spnnger Sc1ence+Bus1ness Med1a. Inc.
an initial strategy meeting. No communication is allowed after the initial meeting, and each team member must leave the locker room exactly as he found it. It is important to realize that the solution does not involve some trick to pass information from one player to another. We could equally well make 100 copies of the room and make an identical distribution of numbers into lockers for each room, then ask the members of Team B to perform their searches simultaneously, with one person per room.
Each individual will succeed in finding his own number with probability 1/2. If they act independently, they must get lucky 100 times in a row, and the team will win with probability only e/z)100. Team B needs some help! Amazingly there is a strategy which gives significant probability of success for Team B. Even if we give the problem with 2n players on Team B each of whom can examine n out of 2n lockers, Team B can apply the strategy to succeed with probability over 30% regardless of how large a value we take for n. Your problem is to find this strategy.
Searching For Ideas
Let's play with some ideas using a more manageable number of players. To be as concrete as possible, let's switch to the case of 10 players on Team B, each of whom can examine 5 out of 10 lockers. Here random guessing by each player is already somewhat hopeless and succeeds with probability (l/z)10 = 1�24 . A first try to improve the probability of success is to search for a clever way to assign a set of lockers for each person to examine. Certainly we can improve over random guessing in this manner. For example if team members 1-5 examined lockers 1-5, and team members 6-10 examined lockers 6-10, they would succeed provided numbers 1-5 are placed in lockers 1-5. Number 1 is placed somewhere in the first 5 lockers with probability 5/10, then given
that number 1 is so placed, number 2 is also in the first 5 with probability 4/9 and so on. Following this plan, Team B will succeed with probability
5 4 3 2 1 _ 1 10 9 8 7 6 - 242 "
While this is an improvement over random guessing, it still leaves Team B with slim chances. Although the scheme fails, it is worth noticing that if B1 finds his number in this scheme, then B6 will find his number with probability 5/9 (as he will look in 5 lockers not including the one containing the number 1 ), but B2 will find his with probability only 4/9. The success or failure of B1 can influence the probabilities of success of the other members. This is the first clue!
An ideal strategy would be one where if B1 succeeds then everyone else does too. Note that this would allow the whole team to succeed half the time even though each individual member fails half the time. This ideal is not attainable, but perhaps you can find a strategy where if B1 succeeds, then everyone else is more likely to succeed. No method of preassigning lockers will accomplish this, as if B1 finds his number in locker k anyone with locker k in their preassigned set has his chances reduced. This suggests that the locker choices will have to depend on information not available at the initial meeting. The only such information available is the numbers a player finds inside the lockers he opens. With this further hint try one more time to find a good strategy before we proceed to the solution!
Developing the Solution
Once we realize that the locker B1 opens at any stage can depend on what he has found inside the lockers he has already opened, the number of possible strategies to consider is enormous, even in the 10-player case. The strategy must tell B1 which locker to open first (10 choices), which locker to open next if he is not lucky on the first try (9 choices for each of the possible 9 numbers he may see), which to open third if he is not lucky on his second attempt either (8 choices for each of the 9 X 8 possible sequences of 2 numbers he has seen so far), and so
on. So B1 alone has 10 X 99 X 89X8 X 79xsx7 x 69xsx7x6 possible strategies. To compute the number of strategy choices for the whole team, we raise this to the lOth power and get a number 28,537 digits long! How are we to choose one?
In this section we will show that one very simple strategy lets the team win with remarkably high probability. The strategy for any one player is entirely unremarkable; the magic arises from the fact that the chances of the different players winning are highly correlated. Moreover, in the next section, we will show that the strategy is in fact optimal.
Fortunately the good strategy is simple to implement and the choice of the next locker does not depend on the entire sequence of numbers seen but only on the most recent number. The good strategy has player Bi start by opening locker i. Then if he finds number k at any stage and k =I= i, he opens locker k next. Notice that player Bi, never opens a locker (other than locker i) without first finding its number, so each time he opens a new locker he must find either his own number or the number of another unopened locker.
Again let's look at a particular case with 10 players and suppose, for example, that the numbers are distributed in the order 6,8,9,7,2,4, 1,5, 10,3. Player B1 first examines locker 1 and finds the number 6. So he looks in locker 6 fmding the number 4, then locker 4 finding the number 7, then finally in locker 7 finding his number. When he finds his number, B1 will now know that B6, B4, and B7 will look in exactly the same lockers in the same cyclic order, each finding his number on the 4th try! He also knows that none of the other players will waste any tries on these lockers.
We may represent any permutation of numbers into lockers by listing the cycles. The permutation 6,8,9, 7,2,4, 1 ,5, 10,3 gives the cycles (6, 4, 7, 1) (8, 5, 2) (9, 10, 3), and Team B succeeds because there is no long cycle. To find the probability that Team B wins, we count the number of permutations of 10 numbers with a cycle of length 6 or longer. First let's count how many have a 6-cycle. Choose which 6 elements go into the 6-cycle, arrange them in cyclic order, and then pick an arbitrary permu-
tation of the remaining 4 elements. The number of ways to do this is (10) ' ' -
10!5!4! -
10! 6
5.4. - 6!4! - 6
So 116 of the 10! permutations have a 6-cycle, and a random permutation has a 6-cycle with probability 1/6. The same argument can be used to find the probability of a permutation of 1-10 having a cycle of any length longer than 6. (We warn that the argument does not work for counting the number of permutations of 1-10 with a 5-cycle (or shorter) as the permutation could have two 5-cycles.) A permutation of 10 numbers has a 7-cycle with probability 117 and so on, and the probability of a cycle oflength 6 or larger is 1/6 + 117 + 1/8 + 1 /9 + 1 / 1 0 = 1 62 7/2 5 2 0 =
0.645635. This gives the probability that Team B will fail, so of course Team B wins with probability 1 - 1627/ 2520 = 893/2520 = 0.354365. Over 35% of the time, all 10 members of Team B find their own wallets!
Will this idea be good enough for the initial version with 100 players? We can do the analogous computation and see that this pointer-following strategy works with probability 1 - ( 1151 + 1152 + . . . + 1/100) = .31 1828.
Notice that while our strategy has still performed remarkably well for 100 players, the probability of success was still less than in the 10-player version. As we increase the number of players, does the success rate decrease to zero, or does it always stay above a certain positive number? With 2n players and 2n lockers, Team B will win provided that the permutation of numbers in lockers has no cycle of length n + 1 or longer. The probability of such a long cycle is IJ:�1 -1-. By viewing this ex-
n+k pression as an upper Riemann sum for
f2n -1- dx and a lower Riemann sum n x + 1 for f2n l dx we obtain n X ( 1 ) l2n 1 ln 2 - n + 1 = n x + 1
dx
n 1 12n 1 s L -- s - dx = ln 2.
k � l n + k n X So II:� 1 -1- ---+ ln 2 as n ---+ oo; moreover n+k the sum increases monotonically with n. So the expression 1 - II:� 1 -1- givn+k ing the probability of success is
© 2006 Springer Science+Business Media, Inc., Volume 28, Number 1 , 2006 29
monotonically decreasing to 1 - ln 2 = 0.306853. Team B wins with the pointerfollowing strategy with probability exceeding 30%, regardless of the number of players and lockers. Now that we have found a good strategy, we turn our attention to whether it provides the best possible solution.
Is Pointer-Following Optimal?
We establish the optimality of pointerfollowing by comparing the game considered above (Game 1) with a new game (Game 2) between the same adversaries, Player A and Team B. For simplicity we give the argument in terms of the 10-player versions. Recall that in Game 1 we are allowing each player to examine 5 lockers. We first modify this rule and say that each player must continue examining lockers until he has opened the locker containing his number, and then he is not allowed to open any further lockers. Team B wins if no player opens more than 5 lockers. This change makes no difference to who wins in Game 1, but it will clarifY the comparison with Game 2.
In Game 2, Player A again distributes the 10 numbers at random in the 10 lockers. Then all of team B is invited into the locker room together. Team member B1 is required by the rules to start opening lockers and continue until she reveals the number 1 . Once she has opened the locker containing the number 1 , she may not open any further lockers; then, the lowest-numbered member of Team B whose number has not yet been revealed is required to take over opening lockers until he finds his number and so on. Team B continues until all lockers are opened. Again Team B wins if no individual team member opens more than 5 lockers. Before proceeding, we invite you to consider the following questions: With what probability can Team B win Game 2? What strategy should the team members employ? Does their choice of strategy even matter?
Let's sit in the locker room and observe Team B in the process of playing Game 2. We record the progress, listing the numbers in the order in which they are revealed. Our list of numbers is sufficient to determine how many lockers were opened by each player.
30 THE MATHEMATICAL INTELLIGENCER
For example, if we record the list 2,6, 1 , 4,9,7, 10,8,3,5, we know that player B1 revealed the numbers 2, 6, and 1. Then player B3 was required to take over, and he opened the lockers containing the numbers 4, 9, 7, 10, 8, and 3, in that order. Then player BR opened the remaining locker containing the number 5. In this example Team B lost, as player B3 opened 6 lockers. Notice that we will record any given ordering of the numbers 1-10 with probability 1110!. The first number revealed is 2 with probability 1110, no matter which locker is opened, given that the first is 2 the second will be 6 with probability 119, and so on. What strategy is Team B following here? It makes absolutely no difference! Team B can choose lockers at random or follow the most sophisticated plan; we still get probability 1/10! for each of the 10! possible orders in which the numbers could be revealed. In Game 2 Team B's probability of success is completely independent of strategy.
To find the probability that Team B wins, we must count how many of the 10! possible orders of the numbers 1-10 represent wins. We employ a version of the classical records-to-cycles bijection [6, p17] to assign a permutation written in cycle notation to each ordering. The first cycle of our permutation consists of the numbers opened by B1 in order; the second cycle, the numbers opened by the second locker opener; and so on. So, for example, 2,6, 1 ,4,9, 7, 10,8,3,5 ----> (2,6, 1)( 4,9, 7, 10,8,3) (5). Furthermore we see that each permutation arises in this manner from a unique ordering of the numbers 1-10. We first write the permutation in cycle notation, rotate each cycle so that the lowest number in the cycle is written last, and then order the cycles so that their last numbers are in ascending order. For example (9, 7,8)(1,3, 10,5) (2 ,4 ,6) = (3, 10 ,5 , 1 ) (4 ,6,2)(8 ,9 , 7) ----> 3, 10,5, 1,4,6,2,8,9,7. We have established a one-to-one correspondence between lists for which Team B wins and the permutations of 1-10 with no cycles of length greater than 5. Thus the probability that Team B wins Game 2 is the probability that a random permutation of 1-10 has no cycle of length greater than 5, and we have already computed
this as 893/2520 = 0.354365. This is exactly the probability of success for Team B in Game 1 using pointer following!
Our analysis has a significant consequence for Game 1. Team B can take any Game 1 strategy and adapt it to Game 2 as follows: If player Bi is opening lockers in Game 2, he can use his Game 1 strategy for choosing lockers to open, simply observing the contents without wasting a turn if the indicated locker is already open. Thus if a strategy succeeds in Game 1 for a particular distribution of numbers into lockers it will also succeed in Game 2. If there were a better strategy for Game 1 we could apply it in Game 2 and get a better chance to win this game also. But this is impossible, as all strategies for Game 2 lead to the same probability of success.
We have one final small puzzle: Happy with their optimal strategy for Game 1 , Team B began a sequence of matches with Player A, but they soon found themselves down 10 to 0. What do you suspect Player A is doing? (It seems that Player A subscribes to the Intelligencer and has devised a plan to defeat Team B.) What can Team B do to counter Player A's plan?
History of The Locker Puzzle
Our problem was initially considered by Peter Bro Miltersen, and it appeared in his paper [ 4] with Anna Gil, which won a best paper award at the ICALP conference in 2003. Miltersen says of the problem, "I think it started spreading when I presented it to several people at Complexity 2003, which was held in Aarhus, where I was a local organizer." In their version there is one numbered slip of paper for each player on the team. Player A then colors each slip either red or blue. Each member of Team B may examine up to half the lockers. He is then required to state or guess the color of the slip of paper with his number. Again every team member must state or guess his color correctly for the team to win. Initially Miltersen expected that Team B's probability of success would approach zero rapidly as the number of players increased. However, Sven Skyum, a colleague of Miltersen's at the University of Aarhus,
brought his attention to the beautiful pointer-following strategy. Finding this is left as an exercise in the paper.
Miltersen and G:il originally considered the case where there are n team members and 2n lockers, half of them empty; each team member still gets to open up to half of the lockers. This is a more difficult problem. Clearly simple pointer-following will not work as empty lockers do not point anywhere. It is an open question whether the winning probability must tend to zero for large n.
In [5] Navin Goyal and Michael Saks build on Skyum's pointer-following to devise a strategy for Team B in a more general setting, varying both the proportion of empty lockers and the fraction of lockers each team member may open. As the number of players increases, their probability of success for Team B approaches zero less rapidly than conjectured in [4]. And fixing the number of players and fraction of lockers each may open, their probability of
winning remains nonzero even as more empty lockers are added.
The problem also appeared in Joe Buhler and Elwyn Berlekamp's puzzle column in the Spring, 2004 issue of The Emissary [3] , with lockers replaced by ROM locations and colored numbers replaced by signed numbers. Here it is pointed out that the team benefits from the members carefully planning their guessing strategy as well as their locker searching strategy. For example, if there are 2n lockers and the longest cycle has length n + 1 , the team members caught in the n + 1 cycle can guess in such a manner that they all guess correctly or all guess incorrectly. The trick is the same as that employed in the hat problem of Todd Ebert [2]. Variations of the hat problem are described in Joe Buhler's article in this column [ 1 ] and in Peter Winkler's book [7, p66, p120]. The locker problem will be discussed in a future edition of Winkler's book also.
We thank Joel Spencer for introducing us to the problem, and we thank
Ravi Vakil and Michael Kleber for encouraging us to write this note and providing many useful suggestions.
REFERENCES
[ 1 } Joe Buhler, Hat tricks. Math. lntelligencer
24 (2002), no. 4, 44-49.
[2} Todd Ebert, http://www.cecs.csulb.edu/
�ebert/
[3} The Emissary, http://www.msri .org/publi
cations/emissary/
[4] Anna Gal and Peter Bro Miltersen. The Cell
Probe Complexity of Succinct Data Struc
tures, Proceedings of 30th International
Colloquium on Automata, Languages and
Programming (ICALP) 2003, 332-344.
[5} Navin Goyal and Michael Saks, A Parallel
Search Game, to appear in Random Struc
tures and Algorithms.
[6} Richard Stanley, Enumerative Combina
torics Vol 1 . Cambridge Studies in Ad
vanced Mathematics, 49, Cambridge Uni
versity Press, Cambridge, 1 999
[7} Peter Winkler, Mathematical Puzzles: A
Connoisseur's Collection, A K Peters, Ltd . ,
Natick, MA, 2004.
MAX WARSHAUER Department of Mathematics
Texas State Un iversity-San Marcos
San Marcos, TX 78666-4945 e-mai l : [email protected]
EUGENE CURTIN
Department of Mathematics
Texas State University-8an Marcos
San Marcos, TX 78666-4945 e-mail: ec01 @txstate.edu
Max Warshauer received his Ph.D. at LSU under Pierre Conner
and is now Professor in the Mathematics Department at Texas
State, where he founded and directs Mathworks" a center for Math
ematics and Mathematics Education, one of five programs in Texas
to receive the 2001 Texas Higher Education Star Award for Clos
ing the Gaps. Max was one of 10 individuals in the country to re
ceive the 2001 Presidential Award for Excellence in Science, Math
ematics, and Engineering Mentoring. His hobbies include playing
chess, go, Ping-Pong , and bicycling. His research interests have
included quadratic forms, analysis of algorithms, and mathematics
education.
Eugene Curtin, a native of Ireland , received a Ph .D. from Brown
University in 1 988 under Thomas F. Banchoff. He is current ly a
professor at Texas State University. His research interests have in
cluded differential geometry, r ing theory, combinatorics , and graph
theory. Eugene has been a faculty member with the Honors Sum
mer Math Camp under Max Warshauer's directorship every sum
mer since 1 992. He enjoys mathematical games and puzzles, and
likes to play chess, go, and backgammon. He was chess cham
pion of Ireland in 1 984 and 1 985, and chess champ ion of Texas
in 1 99 1 , 1 992, and 1 998
© 2006 Springer Science+Business Media, Inc., Volume 28, Number 1 , 2006 31
RAJENDRA BHATIA AND JOHN HOLBROOK
Noncom m utat ive
Geometric Means
For, in fact, what is man in nature? A Nothing in comparison with the Infinite, an All in comparison with the Nothing, a mean between nothing and everything.
-Blaise Pascal
A veraging operations entered mathematics rather early. .rt.Fascinated as they were by geometric proportions, the ancient Greeks defined as many as eleven different means. The arithmetic, geometric, and harmonic means are the three best-known ones. If Pascal had one of these in mind when he composed his Pensees [P] , he would soon have realised that mixing zero and infinity is a source of as many problems as mixing mathematics and divinity.
For centuries, mathematicians perfom1ed their operations either on numbers or on geometrical figures. Then in 1855 Arthur Cayley introduced new objects called ma trices, and soon afterwards he gave the laws of their algebra. Seventy years later, W emer Heisenberg found that the noncommutativity of matrix multiplication offers just the right conceptual framework for describing the laws of atomic mechanics. Matrices were found to be useful in the description of classical vibrating systems and electrical networks as well. For mathematicians, analysis of linear operators was a subject of intense study throughout the twentieth century and into the twenty-first century.
Many quantities of basic interest such as states of quantum mechanical systems and impedances of electrical networks are defined in terms of matrices. Mixing of the underlying systems in various ways leads to corresponding operations on the matrices representing the systems. Not surprisingly, some of these are averaging operations or means.
Of the three most familiar means, the geometric mean combines the operations of multiplication and square roots. When we replace positive numbers by positive definite matrices, both of these operations involve new subtleties. In this article we introduce the reader to some of them.
0 0 0
32 THE MATHEMATICAL INTELLIGENCt:R © 2006 Spnnger Sc1ence t- Bus1ness Media, Inc
Let IR+ be the set of all positive real numbers. Given a and b in IR + a mean m(a,b) could be defined in different ways. It is reasonable to expect that the binary operation m on IR+ has the following properties:
(i) m(a,b) = m(b,a).
(ii) min(a,b) :S m(a,b) :S max(a,b).
(iii) m( aa,ab) = am( a,b) for all a > 0. (iv) m(a,b) is an increasing function of a and b. (v) m(a,b) is a continuous function of a and b.
The three familiar means, arithmetic, geometric, and harmonic, satisfy all these requirements. Other examples of means include the binomial means, also called the power means, defined as ( aP + bJ! )lip
mp(a,b) = 2 , - oc :S p :s; x.
Here, it is understood that for the special values p = 0 and ±: oc we define mp(a,b) as the limits
mo(a,b) = limp __, omp(a,b) = v;;:b, mx( a,b) = limp --> xmp( a,b) = max( a,b ) ,
m-x(a,b) = limp __, -xmp(a,b) = min(a,b) .
The arithmetic and the harmonic means correspond to the cases p = 1 and - 1 , respectively. Inequalities between means have been studied for a long time. See the classic [HLP] , and the more recent [BMV]. A sample result here is that for fixed a and b, mp(a,b) is an increasing function of p. This includes, as a special case, the inequality between the three familiar means.
There exists a fairly well-developed theory of means for positive definite matrices. Let MnCO be the set of all n X
n complex matrices, §n the collection of all self-adjoint elements of Mn(IC), and iJ=Dn that of all positive definite matrices. The space §n is a real vector space and iJ=D n is an
open cone within it. This gives rise to a natural order on
§n. We say that A 2: B if A - B is positive definite or pos
itive semidefinite. Two elements of §n are not always com
parable in this order. Every element X of GLn (the group
of invertible matrices) has a natural action on iJ=D n· This is
given by the map r x(A) = X* AX. We say that A and B are
congruent if B = r x(A) for some X E GLn. In the special
case when X is unitary, we say that A and B are unitarily
equivalent. The group of unitary matrices is denoted by QJn· Now we have enough structure to lay down conditions
that a mean M(A,B) of two positive definite matrices A and
B should satisfy. Imitate the properties (i)-(v) for means
of numbers. This suggests the following natural conditions:
(I) M(A,B) = M(B,A).
(II) If A :::::: B, then A :::::: M(A,B) :::::: B.
(III) M(X*AX,X*BX) = X*M(A,B)X, for all X E GLn. (IV) M(A,B) is an increasing function of A and B; i.e., if
A1 2: A2 and B1 2: B2, then M(A1,B1) 2: M(A2,B2).
(V) M(A,B) is a continuous function of A and B.
The monotonicity condition (IV) is a source of many in
triguing problems in constructing matrix means. This is be
cause the order A 2: B is somewhat subtle. For example, if
A = [� �1 and
then A 2: B but A2 ;t B2.
What functions of positive numbers, when lifted to pos
itive definite matrices, preserve order? This is the subject
of an elegant and richly applicable theory developed by
Charles Loewner. Letf be a real-valued function on IR+. If
A is a positive definite matrix and A = 'i.Aiuiui is its spec
tral resolution, thenf(A) is the self-adjoint matrix defined as j{A) = 'i.J{Ai)uiui. We say that j is a matrix monotone junction if for all n = 1 , 2, . . . , the inequality A 2: B in IJ=Dn implies j{A) 2:j{B). One of the theorems of Loewner says
thatfis matrix monotone if and only if it has an analytic con
tinuation to a mapping of the upper half-plane into itself. As a consequence, the functionj{x) = xP is matrix monotone if
and only if 0 :::::: p :::::: 1. The function f(x) = log x is matrix
monotone, but j{x) = exp x is not. We refer the reader to
Chapter V of [B] for an exposition of Loewner's theory.
Returning to means, the arithmetic and the harmonic
means of A and B are defined, in the obvious way, as
teA + B) and [ tCA - 1 + B- 1) ] - 1, respectively. It is easy to
see that they satisfy the conditions (I)-(V) above.
The notion of geometric mean in this context is more
elusive, even treacherous. Every positive definite matrix A
has a unique positive definite square root A 112. However, if
A and B are positive definite, then unless A and B com
mute, the product A 112 B112 is not self-adjoint, let alone pos
itive definite. This rules out using A112Bl12 as our geomet
ric mean of A and B, except in the trivial case when AB =
EA. We should look for other good expressions in A and B
that reduce to A 112B112 when A and B commute. One plau
sible choice is the quantity
(1) ( log A + log B ) _ . ( AP + BP )lip exp 2 - hmp __. o 2 .
The equality of the two sides of (1) was noted by Bhag
wat and Subramanian [BS], who studied in detail the
"power means" occurring on the right-hand side. This
too is not monotone in A and B, as can be seen by choos
ing positive defmite matrices X and Y, for which X 2: Y but exp X ;t exp Y, and then choosing A and B such that
X = t (log A + log B) and Y = t log B.
The condition (III), sometimes called the transformer
equation, is not innocuous either. Our failed candidates fail
on this count too.
The noncommutative analogue of v;;J; with all desirable
properties turns out to be the expression
(2) A#B = A 112 (A - 1/2 BA - 112)v2 Avz,
that was introduced by Pusz and Woronowicz [PW] in 1975. At the outset it does not appear to be symmetric in A and
B; but it is, as we will soon see. The monotonicity in B is
assured by the facts that congruence preserves order (B1 2:
B2 implies X*B1X 2: X*B,y() and the square root function is
matrix monotone.
Symmetry in A and B is apparent more easily from an al
ternative characterisation ofA#B due to T. Ando [A]. We have
(3) A#B = max{x : X = X* and [� �1 2: o}. Among its other characterisations, one describes A#B as
the unique positive definite solution of the Riccati equation
(4) XA- 1X = B.
We call A#B the geometric mean of A and B. It has the de
sired properties (I)-(V) expected of a mean M(A,B) : property (III) may be verified easily from (3) or ( 4). It satisfies the expected inequality
( A - 1 + B- 1 ) - 1 A + B (5) 2
:::::: A#B :::::: -2- ,
and has other pleasing properties. Many of these were de
rived by Ando [A].
Two positive definite matrices A and B can be diago
nalised simultaneously by a unitary conjugation r u if and
only if they commute. In the absence of commutativity, A
and B can be diagonalised simultaneously by a congruence
in two steps:
(A,B) rA-"' (I ,A - 112 BA - 112) � (I,D),
where U is a unitary such that U* (A - 112BA - 112) U is a di
agonal matrix D. This takes some of the mystery out of the formula (2). In fact, any mean m(a,b) of positive numbers
leads to a mean M(A,B) of positive definite matrices by the
procedure M(A,B) = r AJt2(m(I,D)). To ensure that M is an
increasing function of A and B, we have to assume that the
functionf(x) = m(l,x) is matrix monotone. The formula (2) corresponds to the case when m(a,b) = (ab)112.
© 2006 Spnnger Science+ Business Media, Inc., Volume 28, Number 1, 2006 33
The indirect argument we have used to deduce the symmetry of the geometric mean is not necessary. Let m( a,b) be any mean, letf(x) = m(l,x), and
(6) M(A,B) = A112j (A- 112BA- 112) A112.
Though this expression seems to be asymmetric in A and B, in fact M(A,B) = M(B,A). For this we need to prove
f(A - 112BA - 112) = A - 112B112j (B- 112AB- li2)B112A - 112. Using the polar decomposition A - 112B112 = PU, where P is positive definite and U unitary, this statement reduces to
This, in tum, is equivalent to saying that for every eigenvalue A of P, we have
But that is a consequence of properties (i) and (iii) of the mean m. A similar argument verifies (III).
A simple corollary of this construction is the persistence of inequalities like (5) when one passes from positive numbers to positive definite matrices. Kubo and Ando [KA] developed a general theory of matrix means and established a correspondence between such means and matrix monotone functions.
What happens when we have three positive definite matrices instead of two? The arithmetic and the harmonic means present no problems. Plainly, they should be defined as �(A + B + C) and [�(A - 1 + B-1 + c-1)] - 1, respectively. The geometric mean, once again, raises interesting problems.
We would like to have a geometric mean G(A,B,C) that reduces to A 113 B113C113 when A, B, and C commute with each other. In addition it should have the following properties.
(a) G(A,B,C) = G(1r(A,B,C)) for any permutation 1r of the triple (A,B,C).
(/3) G(X*AX,X*BX,X*CX) = X*G(A,B,C)X for all X E GLn. ( y) G(A,B,C) is an increasing function of A, B, and C. (8) G(A,B,C) is a continuous function of A, B, and C.
None of the procedures presented above for two matrices extends readily to three. The expressions (2), (3), and ( 4) have no obvious generalisations that work The idea of simultaneous diagonalisation does not help either: while two positive definite matrices can be diagonalised simultaneously by a congruence, generally three can not be. Defining a suitable geometric mean of three positive definite matrices has been a ticklish problem for many years. Recently some progress has been made in this direction, and we describe it now.
0 0 0
One geometry cannot be more true than another; it can only be more convenient.
-Henri Poincare [Po}
While the geometric mean A#B has been much studied in connection with problems of matrix analysis, mathematical physics, and electrical engineering, a deeper under-
34 THE MATHEMATICAL INTELLIGENCER
standing of it is achieved by linking it with some standard constructions in Riemannian geometry.
The space Mn(C) has a natural inner product (A,B) = tr A*B. The associated norm I IAib = (tr A*A)112 is called the Frobenius, or the Hilbert-Schmidt, norm. If A is a matrix with eigenvalues A 1, . . . , An, we write A (A) for the vector (A 1, . . . , An) or for the diagonal matrix diag(A 1 , . . . , An)·
The set IP n is an open subset of §n and thus is a differentiable manifold. The exponential is a bijection from §n onto IP n· The Riemannian metric on the manifold IP n is constructed as follows. The element of arc length is the differential
(7) ds = IIA - 112 dA A - 112l lz.
This gives the prescription for computing the length of a differentiable curve in IP n· If y : [ a,b] � IP n is such a curve, then its length, obtained by integrating the formula (7), is
(8) L(y) = r � �y- 112(t)y'(t)y- 112(t)lb dt. a
If A and B are two elements of IP n, then among all curves y joining A and B there is a unique one of minimum length. This is called the geodesic joining A and B. We write this curve as [A ,B], and denote its length, as defined by (8), by the symbol 82(A,B). This gives a metric on 1Pn called the Riemannian metric.
From the invariance of trace under similarities, it is easy to see that for every X in GLn the map r X : IP n � IP n is a bijective isometry on the metric space (IP n,82).
An important feature of this metric is the exponential met
ric increasing property (EMI). This says that the map exp from the metric space (§11,l l · l lz) to (IPnh) increases distances. More precisely, if H and K are Hermitian matrices, then
(9)
To prove this, one uses the formula (8) and an infinitesimal version of (9):
(10)
for all H, K E §n· Here Defi (I() is the derivative of the map exp at the point H evaluated at K, i.e.,
efi+tK - elf (11) Defi(K) = limt--.o t
There is a well-known formula due to Daleckii and Krein (see [B], chapter V, for example) giving an expression for this derivative. Choose an orthonormal basis in which H = diag(A b . . . , An). Then [ fl'; - eAi J Defi(K) =
A · _ A · kij . '· J
(The notation here is that [Xij] stands for a matrix with entries Xij·) From this, one sees that the (iJ) entry of e-H12Defi(K)e-H12 is
( 12) sinh(Ai - Ai)/2
(A.; - Aj)/2 kij·
Since sinh X 2: 1, the inequality ( 10) follows from this. X
In the special case when H and K commute, a calculation shows that there is equality in (9). In this case the function exp maps the line segment [H,K] in the Euclidean space §n isometrically onto the geodesic segment [ef/,ef<] in !f1> n· If A = eH and B = eK, this says that the geodesic segment joining A and B is the path
y(t) = eC1-t)H+tK = eC1-t)HetK = A1-t Bt, 0 :::; t :::; 1 .
Further, o2(A,y(t)) = to2(A,B) for each t in [0, 1 ] . The case of noncommuting A and B can b e reduced
to the commuting case using the fact that r A-lt2 is an isometry on the space (ifl>n,82). The geodesic segment [I ,A - 1!2 BA - 112) is parametrised by y0(t) = (A - 112BA - 112)1, by what we said about the commuting case. So, the geodesic (A,B) = !fA'"{J),fAv.{A- 112BA-112)) is parametrised by
(13) y(t) = A112 (A- 112 BA- 112)1 A112, 0 :::; t :::; 1.
This shows that the geometric mean A#B defined by the fonnula (2) is nothing but the midpoint of the geodesic joining A and B in the Riemannian manifold !f1> n· Thus while (2), (3), and (4) might have ap,Reared as over-imaginative noncommutative variants of �. very natural geometric considerations lead to the same notion of mean as is given by (2). Note that for each t, y(t) defmed by ( 13) is a mean of A and B corresponding to the functionf(x) = :xf in the formula (6). Those means are not symmetric, however: (I) fails unless t = 1/2.
This discussion also gives an explicit formula for the metric 82. We have o2(A,B) = 82 (I ,A - 112 BA - 112) =
2 2. 5 2 1 . 5 0. 5
lllog I - log (A - 112BA - 112) 1 12 = lllog(A - 112BA - 112) l lz. The matrices A - 112 BA - l/2 and A - 1 B have the same eigenvalues. So, this can be expressed as
The inequality (9) captures an essential feature of lfl>n : it is a manifold of nonpositive curvature. To understand this, consider a triangle with three vertices 0, H, and K in §n· Under the exponential map, this is mapped to a "triangle" with vertices I, exp H and exp K in !f1> n· The lengths of the two sides [O,H) and [O,.K] measured by the norm l l · l lz are equal to the lengths of their images [I, exp H) and [I, exp K] measured by the metric 82 . By the EMI (9), the length of the third side [ exp H, exp K] of the triangle in !f1> n is larger than (or equal to) IIH - Kllz. The general case of a geodesic triangle with vertices exp A, exp B, exp C in !f1> n may be reduced to the special case by applying the congruence fexp(-A/2) to all points and thus changing one of the vertices to I. This is often described by saying that two geodesics emanating from a point in !f1> n spread out faster than their pre-images (under the exponential map) in §n·
It is instructive here to compare the situation with that of IUn, a compact manifold of non-negative curvature (Figure 1 ). In this case the real vector space i§n consisting of skew-Hermitian matrices is mapped by the exponential onto IUn. The map is not injective; it is a local diffeomorphism.
Using the formula (11) with H and K in i§n, we reduce
exp(iA)
0 0.5 1 .5 2 2.5 Figure 1. Three curvatures, showing a comparison of a Euclidean (curvature zero) triangle in §2 with its images under exp(-) in P2 (nonposi
tive curvature) and exp(i·) in Q.J2 (non-negative curvature). The colours indicate matching vertices. Note that the geodesics emanating from
exp(A) spread out faster than Euclidean ones (compare the straight lines at A), whereas those emanating from exp(iA) spread more slowly.
© 2006 Springer Science+Business Media, Inc., Volume 28. Number 1 , 2006 35
A#C
B c B#C
Figure 2. Geodesic distance from A#B to A#C is no more than half
that from B to C. Joining the midpoints of the sides of a geodesic
triangle in IP'n results in a triangle with sides no more than half as
long. Iterating this procedure leads to the construction of Ando, Li,
and Mathias, described in the text.
H to diag( iA 1 o • • • , iAn) with A1 real. Instead of (12) we have now
sin(Ai - AD/2 (Ai - AJ)/2 kii·
Since [ sin x [ :S 1, the inequality (10) is reversed in this case, X as is its consequence (9), provided elf and eK are close to each other.
expA
Returning to IP n and the geometric mean, it is not difficult to derive from the information at our disposal the fact that given any three points A, B, and C in 1Pn we have
1 (15) o2(A#B,A#C) :S 2 o2(B,C).
This inequality says that in every geodesic triangle in IP n with vertices A, B, and C, the length of the geodesic joining the midpoints of two sides is at most half the length of the third side. (If the geometry were Euclidean, the two sides of (15) would have been equal.) Figure 2 illustrates (15).
We saw that the geometric mean A#B is the midpoint of the geodesic [A,B]. This suggests that we may possibly define the geometric mean of three positive definite matrices A, B, and C as the "centroid" of the geodesic triangle Ll(A,B,C) in 1Pn.
In a Euclidean space �. the centroid x of a triangle with vertices x1, x2, X3 is the point x = �(x1 + Xz + x3). This is the arithmetic mean of the vectors x1, x2, and x3. This point may be characterised by several other properties. Three of them are:
(M1) x is the unique point of intersection of the three medians of the triangle ll(x1,x2,x3), as in Figure 3;
(M2) x is the unique point in � at which the function
attains its minimum; (M3) x is the unique point of intersection of the nested
sequence of triangles [lln} in which ll1 = Ll and ll1+ 1 is the triangle obtained by joining the mid-
�--__,. expC
A � c
Figure 3. In the hyperbolic geometry medians may not meet. While the medians of a Euclidean triangle intersect at the centroid, the corre
sponding median geodesics of a triangle in IP'n may not intersect at all. A 3-D wire model would make it clear that, generically, the medians
do not even intersect in pairs.
36 THE MATHEMATICAL INTELLIGENCER
points of the three sides of t:.J (Figure 2 mimics this
construction in the non-Euclidean setting of IP' n).
To define a geometric mean of A, B, and C in IP' n we may
try to imitate one of these definitions, now modified to suit
the geometry of IP' n· Here fundamental differences between
Euclidean and hyperbolic geometry come to the fore, and
(Ml), (M2), and (M3) lead to three different results.
The first definition using (Ml) fails. The triangle
t:.(A,B,C) may be defined as the "convex set" generated by
A, B, and C. (It is clear what that should mean: replace line
segments in the definition of convexity by geodesic seg
ments.) It turns out that this is not a 2-dimensional object
as in ordinary Euclidean geometry (see Figure 4). So, the medians of a triangle may not intersect at all in some cases
(again, see Figure 3).
With (M2) as our motivation, we may ask whether there
exists a point X0 in IP' n at which the function
J(X) = 8�(A,X) + 8�(B,X) + 8�(C,X)
attains a minimum. It was shown by Elie Cartan (see, for
example, section 6. 1 .5 of [Be]) that given A, B, and C in IP' n•
there is a unique point Xo at which f has a minimum. Let
G2(A,B,C) = X0, and think of it as a geometric mean of A,
B, and C. This mean has been studied in two recent papers
by Bhatia and Holbrook [BH] and Moakher [M].
In another recent paper [ALM], Ando, Li, and Mathias
define a geometric mean G3(A,B,C) by an iterative proce
dure. This iterative procedure has a nice geometric inter
pretation: it amounts to reaching the centroid of the geo
desic triangle 11(A,B,C) in IP'n by a process akin to (M3).
0.25
0.2
0 . 15
0.1
0.05
0.2
0 0.2 0.3 0.4
Starting with /11 as the triangle 11(A,B,C) one defines /12 to
be 11(A#B,A#C,B#C), and then iterates this process. Figure 2 shows the beginning of this process. The inequality (15)
guarantees that the diameters of these nested triangles de
scend to zero as 112n. It can then be seen that there is a
unique point in the intersection of this decreasing sequence
of triangles. This point, represented by G3(A,B,C), is the
geometric mean proposed by Ando, Li, and Mathias.
It turns out that the two objects G2(A,B,C) and G3(A,B,C) are not always equal (Figure 5 illustrates this phenome
non). Thus we have (at least) two competing notions of the
centroid of 11(A,B,C). How do they do as geometric means?
The mean G3(A,B,C) has all of the four desirable properties (a)-(8) that we listed for a mean G(A,B,C). Properties (a), (/3), and (8) are almost obvious from the construction. Prop
erty ( y)-monotonicity-is a consequence of the fact that
the geometric mean A#B is monotone in A and B. So mo
notonicity is preserved at each iteration step. The mean
G2(A,B,C) does have the desirable properties (a), (/3), and
(8). Property (/3) follows from the fact that r X is an isome
try of (IP' n,82) for every X in GLn. However, we have not been
able to prove that G2(A,B,C) is monotone in A, B, and C. We
have an unresolved question: Given positive definite matrices A, B, C, and A' with A 2: A', is G2(A,B,C) 2: G2(A' ,B,C)?
An answer to this question may lead to better understanding of the geometry of IP'n, the best-known example
of a manifold of nonpositive curvature. Certainly this is of
interest in matrix analysis. Computer experiments suggest
an affirmative answer to the question.
Finally, we make a brief mention of two related matters.
The Frobenius norm is one of a large class of norms called
0.5 0.6 0.7 0.8 0.9
Figure 4. Conv (A,B,C) is not two-dimensional. In the hyperbolic (nonpositive curvature) geometry of l?m the convex hull of a triangle (formed
by successively adjoining the geodesics between points that are already in the object) is not a surface but rather a "fatter" object.
© 2006 Springer Science+Business Media, Inc., Volume 28, Number 1 , 2006 37
unitarily invariant norms or Schatten-von Neumann
norms. These norms 1 1 · 1 1"' have the invariance property I IU A VII"' = I IAII"' for all unitary U and V. Each of these norms corresponds to a symmetric norm <P on IR11; that is, a norm <P that is invariant under permutations and sign changes of coordinates. The correspondence is given by I IA I I''' = <P (s,(A), . . . , Sn(A)), where s,(A) 2: · · · 2: sn(A) are the singular values of A. Common examples are the Holder norms <Pp(X) = o:lxj:V)11P and the corresponding Schatten
norms I IAIIv = (� s}(A))11P, 1 ::::: p ::::: oc. The Frobenius norm is the special case p = 2.
For each of these norms we may define a metric o,v on 1P'11 as in the formula (14). The EMI in the form (9) or (10) remains true (see [B2]). The import of this remark is that, with any of these metrics, IP'n is a Finsler manifold of nonpositive curvature; the special Frobenius norm arises from an inner product and gives rise to a Riemannian structure. In recent years metric spaces of non positive curvature have been studied in great detail; see the comprehensive book by Bridson and Haefliger [BrHa] . The spaces IP' n with norms 1 1 · 11''' are interesting and natural examples of such spaces.
0 0 0 But the whole wondrous complications of interference, waves, and all, result from the little fact that :i:p - px is not quite zero.
-Richard Feynman [ FLS j
The generalised version of EMI has a fascinating connection with yet another subject: inequalities for
�he matrix exponential function discovered by physicists and mathematicians. Many such in
equalities compare eigenvalues of the matrices ef!+K and eHeK, and are much used in
1uantum statistical mechanics and lately in quantum information theory. In [S] I.
Segal proved for any two Hermitian matrices H and K the inequality
Figure 5. The "Cartan surface" contains G2(A,8,C) but not G3(A,8,C).
The Cartan surface consists of points minimizing the convex combi
nations all�(A.xJ + bll�(B,x) + cll�(C,x); here the colours of the points
shown are chosen to reflect the relative strengths of the weights a,b,c.
Thus G2(A,8,C) corresponds to 1 /3, 1/3, 1/3 (see yellow dot on sur
face). The small black circle locates G3(A,8,C), which is not on the
surface in general. Thanks to J.-P. Shoch for computing this picture
of a Cartan surface.
38 THE MATHEMATICAL INTELLIGENCER
Here A 1 (X) is the largest eigenvalue of a matrix X with real eigenvalues. In a similar vein, we have the famous GoldenThompson inequality
(17)
The matrices ef!+K and ef112eKeH12 are positive definite. So, the inequalities ( 16) and (17) say
llefl+K]IP ::::: l lefll2eKeH12I IP, for p = 1 ,oc.
The EMI (9) generalised to all unitarily invariant norms is the inequality
By well-known properties of the matrix exponential, this implies
(19)
This inequality, called the generalised Golden-Thompson inequality, includes in it the inequalities (16) and (17) . The origins of these inequalities and their connections with quantum statistical mechanics are explained in Simon [Si] (page 94). Still more general versions have been discovered by Lieb and Thirring, and by Araki, again in connection with problems of quantum physics. See Chapter IX of [B] . Generalisations in a different direction were opened up by Kostant [K], where the matrix exponential is replaced by the exponential map in more abstract Lie groups.
A common thread running between matrix analysis, Riemannian and Finsler geometry, and physics! Pascal would have approved.
REFERENCES
We have included some articles that are related to our theme but not specifically mentioned in the text. [A] T. Ando, Topics on Operator Inequalities, Lecture Notes, Hokkaido
University, Sapporo, 1 978.
[ALM] T. Ando, C . -K. Li, and R. Mathias, Geometric means, Linear Al
gebra Appl. 385(2004), 305-334.
[Be] M. Berger, A Panoramic View of Riemannian Geometry, Springer
Verlag, 2003.
[B] R. Bhatia, Matrix Analysis , Springer-Verlag, 1 997.
[B2] R . Bhatia, On the exponential metric increasing property, Linear
Algebra Appl. 375(2003), 2 1 1 -220.
[BH] R . Bhatia and J . Holbrook, Riemannian geometry and
matrix geometric means, to appear in Linear Algebra Appl.
[BrHa] M . Bridson and A. Haefl iger, Metric Spaces of Non
positive Curvature, Springer-Verlag, 1 999.
[BMV] P S. Bullen, D . S. Mitrinovic, and P. M . Vasic, Means and Their
Inequalities, D. Reidel, Dordrecht, 1 988.
[BS] K. V. Bhagwat and R. Subramanian, Inequalities between means of
positive operators, Math. Proc. Camb. Phil. Soc. 83(1 978), 393-401 .
[CPR] G. Corach, H . Porta, and L. Recht, Geodesics and operator
means in the space of positive operators, Int. J. Math. 4(1 993),
1 93-202.
[FLS] R. Feynman, R. Leighton, and M. Sands, The Feynman Lectures
on Physics, volume 3, page 20-1 7, Addison-Wesley, 1 965.
A U T H OR S
JOHN HOLBROOK
Department of Mathematics and Statistics
University of Guelph
Guelph, Ontario N 1 G 2W1
Canada
e-mail: [email protected]
RAJENDRA BHATIA
Indian Statistical Institute
7, S. J. S. Sansanwal Marg
New Delhi 1 1 0016
India
e-mail: [email protected]
Rajendra Bhatia did his doctoral studies at lSI Delhi with Kalyan Mukherjee. He has been based there most of the quarter-century since,
along with his wife lrpinder and their son Gautam.
John Holbrook is now Professor Emeritus at Guelph. He and his wife Catherine divide their time between Guelph and Fowke Lake
(farther north), generally in the company of children, grandchildren, and cats.
This photograph of the authors (courtesy of Peter Semrl) shows them in yet another continent, Europe: in the beautiful Alps of Slove
nia. The photo may also serve as encouraging evidence that it is possible to collaborate on mathematical projects and remain on good
terms!
[HLP] G. H. Hardy, J. E. Littlewood, and G. P61ya, Inequalities, Cam
bridge University Press, 1 934.
[K] B. Kostant, On convexity, the Weyl group and the lwasawa de
composition, Ann. Sc. E. N. S. 6(1 973), 4 1 3-455.
[KA] F. Kubo and T. Ando, Means of positive linear operators, Math.
Ann. 246(1 980), 205-224.
[LL] J . D. Lawson and Y. Lim, The geometric mean, matrices, metrics,
and more, Amer. Math. Monthly 1 08(200 1 ), 797-81 2.
[M] M . Moakher, A differential geometric approach to the geometric
mean of symmetric positive-definite matrices, SIAM J. Matrix Anal.
App/. 26(2005) , 735-747.
[P] B. Pascal, Pensees, translation by W. F. Trotter, excerpt from item
72, Encyclopaedia Britannica, Great Books 33, 1 952.
[Po] H. Poincare, Science and Hypothesis , from page 50 of the Dover
reprint, Dover Publications, 1 952.
[PW) W. Pusz and S. L. Woronowicz, Functional calculus for sesquil in
ear forms and the purification map, Reports Math. Phys. 8(1 975),
1 59-170
[S] I . Segal. Notes towards the construction of nonlinear relativistic quan
tum fields I l l , Bull. Amer. Math. Soc. 75(1 969), 1 390-1 395.
[Si] B . Simon, Trace Ideals and Their Applications, Cambridge Univer
sity Press, 1 979.
© 2006 Springer Science+ Business Media, I n c . , Volume 2 8 , Number 1 , 2006 39
Numerica l Properties of Rudolph M ichael Sch indler's Houses in the Los Angeles Area Jin-Ho Park
Does your hometown have any
mathematical tourist attractions such
as statues, plaques, graves, the caje
where the famous conjecture was made,
the desk where the famous initials
are scratched, birthplaces, houses, or
memorials? Have you encountered
a mathematical sight on your travels?
Jj so, we invite you to submit to this
column a picture, a description of its
mathematical significance, and either
a map or directions so that others
may follow in your tracks.
Please send all submissions to
Mathematical Tourist Editor,
Dirk Huylebrouck, Aartshertogstraat 42,
8400 Oostende, Belgium
e-mail: [email protected]
The architecture of Los Angeles mir
rors the diversity of cultures rep
resented within the city and includes
the work of many eminent contempo
rary architects, such as Frank 0. Gehry. Any serious study of Los Ange
les architecture invokes the name
of Rudolph Michael Schindler as a ma
jor inspiration. Schindler's numerous
buildings in Los Angeles, built over
thirty years, from the 1920s to the
1950s, are recognized as icons of twen
tieth-century design.
Mathematicians interested in ex
ploring and discovering Los Angeles ar
chitecture will find Schindler's build
ings (Fig. 1) of particular interest. To
make the most of your visit, you should
understand the underlying principle of
Schindler's architecture: his propor
tional system (he called it "reference
frames in space") and its unique nu
merical properties.
Pacific Ocean
Schindler's primary concern in ar
chitecture is the assembly and delin
eation of spaces. He crystallized his
idea of space in his mind, rather than
by visualizing it through a physical
model, or bodily movement combined
with the perceptual process, or other
methods. As a result, once you enter a
Schindler house, you find yourself in a
space-form that provides a full array of
new spatial experiences, and, at times,
surprises that arise from the complex
ity of the spatial flow.
It is hard to predict Schindler's
space-forms by examining his plans,
elevations, and sections. For Schindler,
these were just two-dimensional nota
tional forms, like scales in music. His
approach was not one of figure and
ground. Rather, the interior and the ex
terior spaces are intertwined. In the in
terior, the spaces flow into each other
and, on the exterior, simple and recti-
Figure 1 . Map of Los Angeles, and Schindler's buildings discussed in the present paper.
40 THE MATHEMATICAL INTELLIGENCER © 2006 Springer Sc>ence- Bus>ness Media, Inc
Figure 2. R. M. Schindler: The McAimon House, 1 935, 271 7-2721 Waverly Drive, Los Angeles; The Mackey Duplex, 1 939, 1 1 37 South Cochran
Avenue, Los Angeles.
linear space-forms are highly inter
locked. Their union creates a rhythm
of complex combinations. Thus it is of
ten hard to distinguish where the build
ings start or end; frequently they lack
a clear main fa<;ade. Therefore, to ap
preciate Schindler's space-forms, one
must visit the houses in person, and ex
perience them through the movement
of the body in space.
Architecture, for Schindler, began
with thinking about and reasoning with
the building, shaping and feeling it as
a structure in the mind (Schindler,
1946). Here, the proportional system
plays a significant role in composing a
specific compositional sensibility with
the practicalities of physical construc
tion.
The purposes of his system were,
first, to provide a mental structure for
conceptualization before pencil met pa
per and second, to communicate to the
builders a map locating elements, al
lowing them to easily scale off dimen
sions. By doing so, the location of every
element of the building is identified ac
curately in the convenience of composi
tion and construction. Thus, no obscure
or arbitrarily unrelated measurements
are involved in the system.
Usually, Schindler recommended a
basic unit of 4 feet ( 48 inches), to be
used with simple multiples and with 1l2, 1h and 114 subdivisions. In many cases,
he used 114 and 1l2 subdivisions for
plans and 113 for vertical subdivision.
The 114 and 112 adjustments "humanize"
the coarseness of the space reference
frame itself. The architect must think
in 48-inch units, but after ± 12", ±24" re
fmements, results always lie within a 12-
inch grid. A variety of L-shapes and other
arrangements are easily accommo
dated. Such a unit is useful to measure
dimensional relationship of a build
ing's heights, widths, and lengths. Verti
cally, s11ch rectangles are elevated in 16-
inch steps to create spatial elements.
There are two reasons for his
choice of this unit. First, it had to be
related to the human figure to satisfy
all the necessary requirements for
rooms, doors, and ceiling heights; sec
ond, for practical reasons, the 48-inch
module fit the standard dimensions of
materials and common construction
methods available in California at that
time.
Architectural drawings are the
strategic transfer of the architect's in
tention and vision, accurately and pre
cisely. Schindler utilized his space ref
erence frame in a grid pattern in
three-dimensional space. Numbers and
letters are laid out on the grid on the
floor plans in sequence, and the verti
cal module is identified with an eleva
tion grade. This pattern was original to
Schindler. In his earlier designs, the
grid was present on drawings and in
the house; later, the grids disappeared
from the house and, at times, from the
drawings. This doesn't mean he aban
doned his system; on the contrary, it
remains embedded in the designs as
the underlying principle.
Schindler's choice of 48-inch units
has two significant mathematical as
pects. With the unit system, a space is
understood in the mind as a continu
ous quantity in three dimensions. Aris
totle bisects "quantity" into two parts:
magnitude and multitude. For Aristo
tle, magnitude is that which can be
measured: it has indivisible, continu
ous attributes. Multitude, which can be
counted with number, has divisible or
discrete attributes. Schindler's unit
system can be interpreted as measured
space replaced by counted space.
Thus, for Schindler, a space can be
measured numerically by the unit sys
tem in the mind-with subdivisions
and multiples of the unit. It is a fully
rationalized method of creating a spa
tial structure. He wrote:
[The space architect] must establish
a unit system which he can easily
carry in his mind and which gives
him the size values of his forms di
rectly without having to resort to
mathematical computations. Mea
surements in figures express ab
stract size relations void of any hu
man connotation and offering no
assistance for the imagination.
Thus, the unit modules replace mea-
surements. The designer can play with
them, in his mind, freely and accu
rately, without any mechnical measur
ing devices. Accordingly, space forms
are successively conceived from the
mental play of the unit system.
Divisibility is another essential fea
ture of the unit system. The number 48
is divisible by 10 factors (1, 2, 3, 4, 6,
8, 12, 16, 24, 48), which form a sub
module group and relate organically to
the unit system. This group forms a
© 2006 Springer Science+ Business Media. Inc., Volume 28, Number 1 , 2006 41
-----
-----
----
---..,.. -
Figure 3. Schindler's three-dimensional grid of space cubes.
whole numbers on drawings are com
monly presented. These numbers are in
crements of unit multiples with subdivi
sions. At times, various rooms are not
rectangular in shape but interlocked,
overlapped, and even zigzagged. Despite
their shape, Schindler labeled room di
mensions in terms of a X b on the draw
ings. It is certain that he approximated
measurements of the room in a rational
manner with his space reference frame.
The notation would be the standard
room and component sizing notation:
a X b with a, the width, greater than the
length, b. Here there is no temptation to
reduce the terms by common factors or
ratios, since the repetition of the same
ratio throughout the parts is seen as pro
ducing a confusion of scale. Schindler is
concerned about scale precisely be
cause it would imply a change of the unit
of measurement within the same work
He wrote, " 'Scale' denotes a consistent
dimensional relationship of parts of a
structure to each other and to a basic
unit." Thus, with such a notation, scale
is always retained.
family, so that the measurements of the
whole building are in relationship to
one another. Thus, all the parts of the
dimensions harmonize through unifor
mity of scale and rhythm. In other
words, unrelated dimensions in the
whole scale can be excluded. In
Schindler's words, " . . . only coarse
ness allows him to break that rhythm
by introducing arbitrary unrelated di
mensions into his layout." The 48-inch
unit is characterized as a "highly
composite number" by Lionel March
(1994a), who accounts for various
composite numbers, such as 36, 48, 60,
180, with a lattice diagram, showing a
hierarchical structure.
Through a process of abstraction,
Schindler began to construct and ex
plore a variety of spaces and space
forms in his mind. He defined spaces
as abstract geometric forms whose po
sitions could be uniquely determined
by a given number of coordinates.
Here, Schindler's space reference
frame helps to measure the exact dis
tance between points in space, and to
identify space-forms of a design in
precise locations, allowing for formal
and spatial coherence in his architec
ture. It is easy to see how Schindler
worked with those unit dimensions in
his spatial computation. Because the
space architect can conceive of 4 X 4 x 4 foot "units" of space, it seems
quite reasonable to imagine such a
space element as being slightly less or
slightly more, :±:: 1/4, or simply halfway
42 THE MATHEMATICAL INTELLIGENCER
between, :±:: 1h in the mind. In the de
sign process, Schindler is able to ad
just the dimension bit by bit.
Schindler roughed in his designs us
ing the 4 X 4 X 4 foot space reference
frame and located spatial elements rel
ative to one another, introducing re
finements between elements-taking a
little here, adding a little there. As he
explained, "It is not necessary that the
designer be completely enslaved by the
grid. I have found that occasionally a
space-form may be improved by devi
ating slightly from the unit." Nor
was Schindler obsessed with particu
lar numbers, such as the Fibonacci se
quence or ratios. Design can be imag
ined in the mind without recourse to
pencil and paper. This represents a ma
jor distinction from the architect Le
Corbusier, who used regulating lines in
his early work, transforming them into
Le Modular, and was noted for the per
sistence of the golden section in his
work after World War II.
Schindler also used the space refer
ence frame to measure room sizes. In his
preliminary sketches, room sizes with
Among Schindler's 200 houses in
the Los Angeles area, three are dis
cussed in this article-the Kings Road
House of 1920; the Lovell House
of 1923; and the How House of 1925.
They represent exquisite examples of
Schindler's finest work Most of his
houses are private residences and are
not accessible to the public. However,
the Kings Road house is open; since
1994, the house has been used as the
MAK Center for Art and Architecture
in Los Angeles, a satellite of the Mu
seum for Applied Art in Vienna.
Kings Road House, 1 92 1 -22 835 North Kings Road,
West Hol lywood
The Kings Road house is the best
"representation of Schindler's "or
ganic" type of dwelling.
180
Figure 4. The lattice diagram of highly composite numbers such as 36, 48, 60, and 180 are
shown (after Lionel March).
6 Bath
b B � B I Living R'M §
I I
Kilclk!=n l:>our Living R�l Living RM L.iving RM
0 0
8 ( o Patio r
J 0 L?gr,...._...._•r-- -.,..-
L.iving RM B<dR ·I B<dRM
IA IB K:: ID � IF IG
Patio Front Elevation
As opposed to imposing modem structures, the simplicity of its volume is in tune with its environment, creating a refined refuge from the bustle of Los Angeles. The house lies in a densely wooded grove. Three L-shape units are arranged in a pattern on a 200 X 100-
foot lot, forming a courtyard with enclosed patios and outdoor fireplaces. Each pair of units opens to outdoor living patios. The third one is formed by the kitchen, guest studio, and garage. The impression is of primitive boxes resting in their natural place. Schindler wrote, "The shape of rooms, their relation to the patios and the alternating roof levels, create an entirely new spatial interlocking between the interior and the garden." There is almost no difference in level between the ground floor and the garden, suggesting an infinite extension to the open ground in accordance with the character of the land. Schindler's "organic" building type is fully realized in this house; the house and the outdoors unite in perfect rapport to embrace their
Figure 5. Pueblo Ribera Court. Plan: elevations,
window, and furniture.
The structural components of the house are simple: concrete walls on one side and two wooden posts from the other side support all ceilings. "All partitions and patio walls are non-supporting screens composed of a wooden skeleton filled in with glass or with removable 'insulate' partition. These basic materials are used in a lucid way to form "the cave-
tent shelter of concrete, wood and canvas" which relates the project to the climate, the region, and the surroundings. Schindler appears to have erected a modern hut, expressing the immemorial relationship between humanity and nature. It is a true sustainable aesthetic.
On the drawings, the dimensions and placements of various spatial
extraordinary surroundings. Figure 6. The Kings Road House, 1 921-22. 1/4 scale model constructed with basswood.
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number 1 , 20C>6 43
Figure 7. The Kings Road House, 1 921-22. Courtyard view.
forms and details of the house are con
trolled by Schindler's unit system. The
48-inch unit system is clearly identi
fied in the plan. A 12-inch ( 1/4 unit
module) vertical module is employed.
Thus the heights of all window and
door mullions are based on a 12-inch
module. The rafters of the horizontal
wooden structures are regulated ac
cording to a 24-inch distance-half the
unit module.
Most of the major rooms in the con
struction drawings are approximately
measured in whole numbers, although
rooms are frequently not simple rec
tangles. Unlike the complexity of the
space-forms, these involve surprisingly
few dimensions and corresponding ra
tios. In the house there are six differ
ent ratios of dimensions: 2 : 1 , 3:2, 4:3,
6:5, and 9:8. The noteworthy character
of the ratio is that most of the fractions
vary by adding 1 to both numerator and
denominator. This is equivalent to the
classical sequence of subsuperparticu
lar numbers (March, 1998). Interest
ingly, these room ratios are also found
in musical ratios within an octave: oc
tave (2: 1 ), second (9:8), fourth ( 4:3), a
minor third (3:2), and a major third
(6:5). Schindler himself briefly referred
to musical ratios in his early discussion
of proportion in his unpublished 1917
Emma Church School lecture notes
(Park 2003; March 2003). In addition,
March (1994b) used a musical analogy
to examine the proportional design of
the Schindler house: in his paper "Dr.
How's Magical Musical Box" March re
calls "architecture as frozen music."
44 THE MATHEMATICAL INTELLIGENCER
According to March, any analysis of ra
tios and proportions of Schindler's
houses will inevitably correspond to
musical intervals. It does not mean the
musicality is intentional but it is a nec
essary consequence of the relations of
small numbers.
Lovell House, 1 922-26 1 242 Ocean Avenue,
Newport Beach, California
Built for Dr. Philip Lovell, this house is
the most revealing project in Schind
ler's oeuvre, displaying dramatic spa
tial complexity as well as structural
resolution. Schindler uniquely devised
ingenious structures to control and
9 : 8
raise the upper-level sleeping quarters
above a flat beach site. In the house, he
used five reinforced concrete columns,
opening the ground level for play and
recreation. Liberating the ground floor
recalls one of Le Corbusier's five points,
plan libre. Piloti, or columns, in con
crete and steel, carry the structural load,
lifting the box into space; interior walls
are then freely arranged according to cir
culation or other functional require
ments (Park and March 2003).
The 48-inch unit system is clearly
identified in the plan with numbers and
letters, and the 16-inch vertical module
in elevation with grades. The grid is
marked along the bottom with num-
2 : 1
6 : 5 4 : 3 3 : 2
2 : 1
2 : 1
Figure 8. The Kings Road House, 1 921-22. Ratios of each room.
bers from 1 to 16, and up the right-hand
side with letters A to J. Its vertical mod
ule is based on a 16-inch unit system,
which controls not only the height of
the room but other elements as well,
including built-in furniture, chairs, ta
bles, windows, doors, and clerestory,
providing uniformity of scale and proportional beauty. In Schindler's words,
"Floor plans and elevations were de
signed by a scheme of unit lines to as
sure uniformity of scale. All woodwork
including concrete forms and furni
ture, was built of eight inch boards
with wide joints."
Sarnitz (1986) provides a simple but
interesting proportional study of the
house. His analysis of the house plans
and elevations is based on the belief
that subdivisions of square and double
square determine the overall propor
tional system of the house.
How House 1 925 931 North Gainsborough Drive,
South Pasadena
The How House stands out from
Schindler's other works of this period
in its conspicuous and transparent play
about a diagonal axis that overlays a
48-inch unit system. The use of both
the modular and symmetrical systems
in the How House is subtle. Schindler
increases the significance of the diag
onal axis by setting the orthogonal
lines of the ground plan at a 45° angle
to the boundaries of the lot and the
road frontage (Park 2000). The lower
portion of the building's volume is built
in concrete with Schindler's own "slabcast" construction system, while the structure of the upper portion is red
wood. The horizontal stratification of
the continuous concrete course as well as the battened boards on the wall of
the house were laid to coincide with a
16-inch vertical module. The module
incorporates the heights of all ele
ments of the main structure, as well as
built-in furniture, chairs, windows, and
doors.
The How House is mainly deter
mined by reflective symmetry about a diagonal axis. As Schindler writes "the rooms form a series of right angle
shapes placed above each other and facing alternately north and south."
Figure 9. The Lovell House, 1 922-26. Five-layered vertical structures of the Lovell house in
tegrate all layered stairs and corridors.
bile-the living room, the dining room,
and Dr How's study-where the spatial
and structural setting of the whole
composition is based on the diagonal
axis. However, the floor plan of the piano nobile does not conform to sym
metry along the diagonal axis in a strict
manner: the symmetry is broken by additional spaces such as the kitchen and
the entrance hall, and also in certain
details such as the principal fireplace
and the stairway to the lower, bedroom
floor. This asymmetry does not depend
on an arbitrariness of personal taste or a rejection of the principles of symme
try in the cause of modernism. Instead,
it is generated from a disciplined un
derstanding of symmetry. The final de
sign displays an abundance of symmetries within the parts while negating
the strict symmetry of the whole.
The locations of the rooms are
clearly arranged according to his sub
divisions and multiplications of the
unit module. The floor plan of the house
demonstrates an interesting propor
tional relation with regard to his system.
The house is set within a 48-inch unit
module, numbered from 10-24 (a 14-
unit module) and lettered from H-W
(again a 14-unit module). All architec
tural elements are disposed within this
14-module square along the diagonal
axis. In the house, Schindler employs a
16-inch (1/3 unit) vertical module. In RM Schindler-Composition and
Construction, Lionel March provided an
in-depth analysis of this house. Accord
ing to March, its principal features follow the classical arrangement known as
ad quadratum. First of all, the layout of the house is placed within a 14-module
The analysis focuses on the piano no- Figure 1 0. R. M. Schindler, the Lovell House, 1 922-26. Facade.
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number 1, 2006 45
Figure 1 1 . R. M. Schindler, the Lovell House, 1 922-26. Corner view.
square. Another square covers the prin
cipal volumetric elements of the house:
the living room, dining room, the study,
and the roof terrace. This square with
comers at V-24 and L-14 is a 10 X 10
module. From the external comer of the
living room a 7 X 7 square defines the
edge of the dropped ceilings in the din
ing room and study. This is clearly visi
ble at the gallery level. The major space
divisions generate a series of propor
tions, such as 7:5, 10:7, and 14:10, which
suggests that the layout of the house
may be ordered from a series of ad quadratum arrangements.
It also turns out that three pairs of
concentric squares define conceptually
the principal "blocks" of the house. First
of all, the living room of the piano nobile is located on the center point of the
14 X 14 square. It lies in 5 X 5 modules.
In the plan of the piano nobile, the main
volume of the house is planned within a
91/4 X 91/4 module square. The center of
Figure 1 2. The Lovell House, 1 922-26. 1/4 scale study model.
this square is the center of the 60-inch
by 60-inch open shaft which provides
light and ventilation for the bedroom
floor from the roof terrace above. The
large 51/2 X 51/2 module square is the
terrace, measured to the outer edge of
the planters. The small % X % module
square locates a built-in light fixture in
the cantilevered porch for the terrace.
The three diagrams, taken together and
superimposed, show a sequence of cen
tering points along the diagonal axis.
There are two structural layers at the
gallery level % module apart; the lower
layer, which has the same herringbone
pattern as the ceiling of the porch ex
tending over the terrace as the pattern
on the living room ceiling, and the upper
layer, which splits into two wings over
the dining room and the study, respec
tively. Both L-shaped layers are set along
the diagonal axis facing each other in op
posing south and north orientations. In particular, the lower part demonstrates
the ingenuity of Schindler's method, al
lowing two partially cantilevered beams
to cross above the open shaft area with
out any support in the center.
The exposed ceiling structure of the
living room is particularly worth men
tioning. It fills a square plan with a span
of 20 feet, or five 48-inch modules. Two
chimney stacks on adjacent sides em-
Figure 1 3. The Lovell House, 1 922-26. Analysis of plan and elevation of the Lovell house (after August Sarnitz).
46 THE MATHEMATICAL INTELLIGENCER
Figure 15. R. M. Schindler, The How House, 1 925. Interior and furniture designs.
Figure 16. R. M. Schindler, The How House, 1 925. Exterior view.
Figure 1 4. R. M. Schindler, The How House,
1 925. Living room roof structure.
© 2006 Springer Science+ Business Media, Inc . . Volume 28, Number 1 , 2006 47
H
\Q 11 12 13 14 15 16 11 18 18 20 21 22 2'� /
0 p 0 R
u
Figure 1 7. The How House, 1 925. Plan analysis of the How House showing the ad quadratum
arrangement.
phasize the diagonal axis (Figure 18). The structure of this ceiling is a system
of redwood beams at 1/2 module (24-
inch) intervals: each joist is secured
in a wall at one end while the other
end is nailed to a longer joist in a her-
Figure 1 8. The How House, 1 925. Axonometric.
48 THE MATHEMATICAL INTELLIGENCER
ringbone pattern along the diagonal
axis. With this arrangement, only one
beam is required to span the whole
space. The structure of the living room
ceiling provides an interesting series of
proportional relations. According to
The ceiling st ructm·c
of the Jh·ing room
The upper structure
of tlw gaUery floor
The lower structure
ur the gallery floor
The piano nobile
March (1994b), the ceiling of the How
House living room is set out in the man
ner of the Greek gnomon: the sequence
of squares (1 x 1, 2 x 2, 3 x 3, . . . , 9 x 9) and the sequence of oblongs (2 X 1 ,
3 X 2, 4 X 3 , . . . , 9 X 8) (Figure 19).
REFERENCES
March, Lionel and Judith Sheine, eds. 1 994.
RM Schindler-composition and construction.
London: Academy Editions.
March, Lionel. 1 994a. "Proportion is an alive
and expressive tool. . . . " pp. 88-1 01 in RM
Schindler: Composition and Construction, L
March and J Sheine, eds. London, Academy
Editions.
March, Lionel. 1 994b. "Dr How's Magical Mu
sic Box." Pp. 1 24-1 45 in RM Schindler:
Composition and Construction, L March and
J Sheine, eds. London, Academy Editions.
March, Lionel. 1 998. Architectonics of Human
ism: Essays on Number in Architecture. Lon
don: Academy Editions, John Wiley & Sons.
lliJ I= ill i= I==
n JL I oo I a
hl b
Il l c
Figure 1 9. The How House, 1925. Ceiling de
sign.
March, Lionel. 2003. "Rudolph M. Schindler:
Space Reference Frame, Modular Coordina
tion, and the 'Row' ," Nexus Network Jour
nal 5, 2: 48-59.
Park, Jin-Ho. 2003. "Rudolph M. Schindler:
Proportion, Scale and the 'Row'," Nexus
Network Journal 5, 2 : 60-72.
Park, Jin-Ho. 2000. "Subsymmetry Analysis of
Architectural Designs: Some Examples," En
vironment and Planning 8- Planning & De
sign 27, 1 : 1 2 1 -1 36.
Park, Jin-Ho and Lionel March. 2003. "Space
Architecture: Schindler's 1 930 Braxton
Shore project," Architectural Research Quar
terly 7 , 1 : 51 -62.
Park, Jin-Ho. 2004. "Symmetry and Subsym
metry of Form-Making: The Schindler Shel
ters of 1 933-42," Journal of Architectural
and Planning Research 2 1 : 24-37.
Sarnitz, August. 1 986. R M Schindler: Architekt
1887-1953. Vienna: Edition Christian Brand
statler.
Schindler, R. M. 1 946. "Reference Frames in
Space." Architect and Engineer 1 65: 1 0 , 40,
44-45.
AUT H OR
JIN-HO PARK
Department of Architecture
lnha University
Nam-gu, lncheon
Korea
e-mail: j [email protected]
Jin-Ho Park earned his BS in architecture from lnha University, Korea, and his MA and
PhD Degrees. also in architecture, from the University of California at Los Angeles. He
has taught at the University of Hawaii (Manoa), where he received the University of Hawaii
Board of Regent's Medal for Excellence in Teaching in 2002 and the ACSA/AIAS New
Faculty Teaching Award in 2003. He is presently associate professor at lnha University.
His articles have appeared in numerous journals, and he currently serves as corre
sponding editor of the online Nexus Network Journal: Architecture and Mathematics.
N E W I N P A P E R B A C K
With an introduction by Thomas Banchoff
F LAT LA N D A Romance of Many Dimensions
Edwin Abbott Abbott Flatland has fascinated generations of readers, becoming a peren nial science-fiction favorile. A first-rate ficlional guide to the concept of multiple dimensions of space, the book wil l also appeal to those who are interested in computer graphics. In his inlroduction, Thomas Banchoff points out that there is no beHer way to begin exploring the problem of understand ing higher-di mensional slicing phenomena than readi ng lhis classic novel of the Victorian era.
Praise for Princeton 's previous edition: "One of the most imaginative, delightful and, yes, touching works of mathematics, this slender 1 884 book purports to be the memoir of A. Square, a citizen of an entirely twodimensional world."- Washington Post Book World Princeton Science librory Paper $9.95 0-69 1 - 1 2366-7
Celebrating 100 Years of Excellence PRINCETON (0800) 243407 U.K.
' ' 800-777-4726 u.s. Umverstty Press math.pupress.princeton.edu
© 2006 Springer Science+ Business Media, Inc . . Volume 28, Number 1 , 2006 49
The Wedding Reuben Hersh
Philosophy: The bride and groom are here, and have
agreed to be wed today, in the presence of several wit
ness s. Let the wedding begin. Logic, speak to Geometry.
Logic: I come to you, Geometry, my beautiful bride-to
be, as your guide and your elder, your counselor and
corrector, your only lord and master, to have and to hold,
for good or for ill, in siclmess and in health, till death
do us part.
Geometry: You come to me, Logic, my groom and my
husband, as my guide and my junior, my counselor and
intem1pter, my self-named lord and master, to have and
to hold, for good or for ill, in siclmess and in health, till the unlmown and unknowable, whatever is to come at
the end of our joint dominion.
L: Do you, Geometry, take me as your guide and your
elder, your counselor and corrector, your only lord and
master, to have and to hold, for good or for ill, in sick
ness and in health, till death do us prut?
G: I do, with a rese1vation.
Philosophy: We will hear your explanation.
G: I am the elder, you are the younger, Lord Logic.
P: Let it be so recorded, Geometry is the elder. Is that
your only reservation?
G: It could be. nless you wish to hear more.
L: You may go on.
G: You bring me strength and control. You bring me your
darling and precious offspring, 1, 2, 3, and many other
beautiful nun1bers. I an1 happy to be yow· bride.
L: This is well explained.
G: I bring you shape, fom1, and continuity. The possi
bility of fruitful intercourse, and offspring uncountabl .
L: You lmow I love you, Geometry.
G: I know you love me, Logic. I do not know if I can love
you. You ru·e hru·d. You are unforgiving. You think you
can be my lord and master.
L: Look, is this a wedding, or what'?
G: It was announced and scheduled as a wedding. TI1e
guests ar here, with gifts. L: Yes, beloved friends and frunily have come. Mechan
ics and Statistics, of course. Even the Stock Market, the
Census Bmeau, and the Atomic Energy Commission are
here.
G: Don't worry about them. They don't understand any
thing, anyhow.
L: Easy for you to say. Who will pay the bills?
G: Oh, Logic, you are hopeless. The bills! Is this love or
business?
L: It' the business of love and the love of business.
G: Another paradox! I hate your silly little paradoxes! I hate them! Why did I ever agree to do this?
L: Because you need me.
G: Yes. It's true. I ne d you. But why and for what?
L: Without me to look after you, what would become of
you?
G: I don't know. What would become of me?
L: Look at your wretched, lo t cousins. umerology.
Weather and Financial Prediction. Old Pythagoras's
great-grand-children. \\lhere ar they now?
G: In the Gutter.
L: Yes. In the Gutter.
G: Why are they all in the Gutter?
L: Because they tried to live without Logic.
G: Will you be good to me? Will you be kind? Will you
ever be kind? L: I will be good for you. I will be as kind as I am able
to be.
P: Let it be ordered and written that on this day, in this
place, in the presence of witnesses both honorable and
dishonorable, Logic and Geometry were lawfully wed
ded, to have and to hold, for good or for ill, in sickness
and in health, till death do them part. And may God some
day forgive us for what we do here today.
� believe � was Hermann Weyl who once said that the angel Algebra and the
devil Topology were struggling for the soul of Mathematics.)
Department of Mathematics and Statistics
University of New Mexico
Albuquerque, NM 871 3 1 USA
e-mail : [email protected]
© 2006 Springer Science+Business Media, Inc., Volume 28, Number 1 , 2006 51
lil§iit§'MJ Osmo Pekonen , Ed itor I
Feel like writing a review for The
Mathematical Intelligencer? You are
welcome to submit an unsolicited
review of a book of your choice; or, if
you would welcome being assigned
a book to review, please write us,
telling us your expertise and your
predilections.
Column Editor: Osmo Pekonen, Agora
Center, P.O. Box 35, University of Jyvaskyla,
Fl-4001 4 Finland
e-mail: [email protected]
Erich Kahler. Mathematische Werke. Mathematica l Works edited by Rolf Berndt and Oswald
Riemenschneider
BERLIN, WALTER DE GRUYTER, 2003. 971 PP. €228.00
ISBN 3- 1 1 -0171 1 8-X
REVIEWED BY ERNST KUNZ
This volume of almost one thousand
pages aims at describing the life,
activities, and convictions of Erich
Kahler (1906-2000), one of the great
and many-sided mathematicians of the
20th century, and a strong and fasci
nating personality. It also deals with
the developments that have grown out
of his work
Nearly 700 pages contain most of
Kahler's original publications in math
ematics and mathematical physics and
three of his philosophical essays. The
remaining 300 pages describe his life
and contain surveys by experts in the
fields in which Kahler was active, de
scribing the impact of his work on
present-day mathematics, physics, eco
nomics, and other areas of intellectual
life. That this part of the book is so
large shows the great influence that
Kahler's work still has. The editors de
serve high appreciation for having col
lected the pieces to compose a rich pic
ture of modem science.
Of Kahler's 399-page book Geometria aritmetica, volume 45 of Annali di Matematica, which was considered
by many of his students, by many math
ematicians, and maybe by Kahler him
self as his main work in mathematics,
only the introduction is reproduced.
But its content is described in the long
article "Kahler Differentials and Some
Applications in Arithmetic Geometry"
by R. Berndt, who has also translated
much of the text from Kahler's unus
ual philosophically motivated language
into modem mathematical terminal-
52 THE MATHEMATICAL INTELLIGENCER © 2006 Springer Scrence+Business Media, Inc
ogy. Kahler had anticipated the publi
cation of the book by surveys given at
conferences in various languages. The
book shows Kahler as a forerunner of
the theory of schemes, as was recog
nized by Grothendieck in his Elements
de geometrie algebrique ("parmi les
travaux qui se rapprochent de notre
point de vue en Geometrie algebrique,
signalons l 'important travail de E.
Kahler"). At a memorial colloquium in
honor of Kahler at the University of
Leipzig, Kahler's former collaborator
H. Schumann reported that Kahler con
sidered the Geometria aritmetica a
first step towards a more comprehen
sive theory but abandoned the project
when the first volumes of Groth
endieck's work appeared. A. Weil ex
pressed concern in Mathematical Reviews that the unconventional style
and the unusual language would pre
vent the book from receiving the at
tention it deserved ("the author seems
to have done everything in his power
to discourage prospective readers and
is only too likely to have succeeded"),
and so it turned out. History has fol
lowed the French school.
Kahler's main objective in Geome
tria aritmetica was to unite number
theory and geometry. He emphasized
that for this purpose it was necessary
to study arbitrary local rings, not only
those containing a field, as was done
in classical algebraic geometry. Con
trary to Grothendieck's more general
point of view, Kahler's schemes (called
''varieties") are contained in "algebraic"
fields, that is, fields which are finitely
generated over their prime field. He also
tried to embed both disciplines into a
philosophical system which was con
ceived by him but not well understood
by others, and in fact hardly under
standable for the reader without further
instruction. An explanation of his
system, a mathematical model of the
world, the universe of all existing things,
is given by him in the essay "Wesen
und Erscheinung als mathematische
Prinzipien der Philosophie" (Essence
and Appearance as Mathematical Prin
ciples of Philosophy) and in other pub
lications which are not contained in
the collected works. In his paper "In
fmitesimal-Arithmetik" which appeared
four years after Geometria aritmetica,
maybe as a reaction to previous critics,
Kahler gave an outline of its contents
in ordinary mathematical language (ex
cept that the local rings having a given
quotient field are called the faces (Seiten) of the field and valuation rings
the perfect faces). The mathematical
discipline to which the book belongs
and in which some of the most spec
tacular successes were achieved in re
cent years has adopted the name of the
book: Arithmetic Geometry. R. Berndt,
and in another essay J. B. Bost ("A Ne
glected Aspect of Kahler's Work in
Arithmetic Geometry: Birational Invari
ants of Algebraic Varieties over Number
Fields") describe some developments in
connection with the mathematical con
cepts which were introduced by Kahler
in what was called by some his opus magnum. In another article by Berndt
there is a discussion of "Kahler's Zeta
Function," and A Deitmar ("A Panorama
of Zeta-Functions") gives an overview of
results and conjectures about zeta func
tions in general.
Much better known and much more
influential than Geometria aritmetica was Kahler's note "Ober eine be
merkenswerte Hermitesche Metrik"
(On a remarkable Hermitian metric)
from 1932, which developed into a
large and important discipline-Kah
lerian geometry, with "Kahler mani
fold" as its main concept-which
had strong influence on many areas of
mathematics (differential geometry,
complex analysis, algebraic geometry)
and mathematical physics (relativity,
field theory). Here many notions have
their origin in this paper of 14 pages,
written by him at the age of 27, and
bear his name as in Kahler metric, Kah
ler-Einstein metric, Kahler potential,
Kahler class, etc. In his report "The Un
abated Vitality of Kahlerian Geometry"
J. P. Bourguignon describes Kahler's
paper and gives a detailed survey of all
developments since the paper was
written. He confesses that he and many
colleagues for a long time considered
Kahler as a figure of the first third of
the 20th century and that they were not
aware that he was still alive and active
in the 1980s. In a "Tribute to Berm
Erich Kahler" S. S. Chern appreciates
the influence that Kahler had on his
own work
In recent decades Kahler manifolds
have entered theoretical physics again
in the construction of models to unify
the fundamental laws of physics. In his
article "Supersymmetry, Kahler Geom
etry and Beyond" H. Nicolai reports on
these developments.
Since early in his career Kahler's
work made extensive use of Cartan's
theory of exterior differential forms,
which he refined and recreated and
generalized in purely algebraic terms.
Many of his publications on the so
called Kahler differential forms are de
voted to this goal. His book Ein
fiihrung in die Theorie der Systeme
von Differentialgleichungen (Introduc
tion to the theory of systems of differ
ential equations) of 1934, which is not
reproduced in the collected works,
uses differential forms. It became the
origin of what is now called Cartan
Kahler theory, with the theorem of Car
tan-Kahler as its fundamental result.
This part of Kahler's work is thor
oughly discussed in a report by R.
Berndt and 0. Riemenschneider, which
contains a much more detailed survey
of all of Kahler's mathematical results
than can be given here. In another es
say by I. Ekeland a problem of eco
nomics is explained which leads to a
system of partial differential equations
of first order and which can at present
be solved only by means of the Cartan
Kahler theorem.
Kahler wrote influential papers on
the singularities of complex functions
in two variables. W. D. Neumann re
views these publications and puts them
in the perspective of the topological
and algebraic classification first of
plane curve singularities and then of
isolated hypersurface singularities.
Kahler's work in mathematical
physics started with three papers on
the 3-body and n-body problem and
one on fluid dynamics. Later work is
devoted to expressing fundamental
equations of physics such as the
Maxwell equations and the Dirac equa
tion and some of their solutions in
terms of differential forms.
As a rule, the sciences select or de
velop the tools from mathematics they
need to describe and to solve their
problems. Kahler's thinking went also
in the other direction. He was con
vinced that the powers of highly de
veloped disciplines such as arithmetic
and algebra were not sufficiently used
in physics ("Number theory is deter
mined to become the leading power of
natural sciences"; "What is the mean
ing for natural sciences of the re
sources collected in algebra and num
ber theory?"). Without being very
specific, he supported his views some
times with bold conjectures and
speculations. It seems that theoretical
physicists have adopted few of these,
an exception being Kahler's (quater
nionic) "new Poincare group" which
according to him should replace the
Lorentz group of classical relativity
theory. It is mentioned in a footnote of
Nicolai's contribution that the new
Poincare group is identical with the de
Sitter group from theoretical physics.
The mathematical properties of the
new Poincare group are discussed in
an article by A Krieg.
Kahler, who was influenced by
Plato's concept of "ideas," Leibniz's
monadology, Hegel, and Nietzsche, gave
university courses in philiosophy. His
friends report that he considered his
texts combining mathematics with phi
losophy and theology as more impor
tant than his mathematical achieve
ments, and that he suffered from the
fact that they did not receive the ap
preciation he had hoped for. For him, at
least in his later years, mathematics was
a language designed to formulate and to
solve not only problems of the sciences
but much more ("mathematics as a
thread of Ariadne which guides us out
of the labyrinth of modem sciences"). I
quote also from the article "The Life of
Erich Kahler" by R. Berndt and A Bohm: "The ingenious mathematician
became a mathematical dreamer who
thought he could solve all problems of
this world by mathematical methods."
I do not feel competent to comment
© 2006 Spnnger Science+Bus1ness Media, Inc., Volume 28, Number 1 , 2006 53
on the philosophy of Kahler. He calls mathematics "an infinite refmement of language." In the collected works there are several attempts to guide the reader to his philosophical thinking. First there are three essays by Kahler himself, two in Italian, and the one which was mentioned above. References to his other philosophical texts are given. Moreover two articles "Kahler's Vision of Mathematics as a Universal Language" by R. Berndt and "An Approach to the Philosophy of Erich Kahler" by K. Maurin serve the same purpose.
Kahler, who had a strong sense of mission, started his independent scientific production at the age of 17 and remained active until shortly before his death at 94. The principal stages of his academic career were:
1928 Doctor's degree at the University of Leipzig under Leon Lichtenstein
1930 Habilitation in Hamburg 1931- As a Rockefeller fellow in 1932 Rome, he met Enriques, Cas-
1936
1948
1958
1964
telnuovo, Levi-Civita, Severi. B. Segre, and A. W eil Ordinary professor in Konigsberg Successor to Koebe in Leipzig After serious political ten-sions with the East German administration Kahler accepted a position at the Technical University of West Berlin Successor to Emil Artin in Hamburg
1974- Professor Emeritus, work-2000 ing mainly on his Mathe
matical Philosophy and occasionally lecturing at physics and mathematics conferences.
The book under review does not contain a list of Kahler's former PhD-students. Here is one, maybe incomplete: Walter Thimm (Konigsberg 1939), Gtinter Hauslein (Leipzig 1955), Gerhard Lustig (Leipzig 1955), Armin Uhlmann (Leipzig 1957), Rolf Berndt (Hamburg 1969). Kahler also initiated the theses of Gtinther Eisenreich (Leipzig 1963) and Horst Schumann (Leipzig 1968), but
54 THE MATHEMATICAL INTELLIGENCER
since he had left East Germany he could not act formally as advisor.
It seems that Kahler's philosophical texts have been widely ignored. As mathematicians, we admire his numerous contributions to our science which have come to bear so much fruit inside and outside mathematics. With his collected works, the editors have given him a worthy monument.
Fakultat fUr Mathematik
U niversitat Regensburg
D-93040 Regensburg
Germany
e-mail:
History and Science of Knots edited by J. C. Turner and
P. van de Griend
SINGAPORE, WORLD SCIENTIFIC, 1 996. 464 PP. US$78,
ISBN 981 -02-2469-9
REVIEWED BY KISHORE B. MARATHE
The book History and Science
of Knots consists of 18 chapters grouped into 5 parts: I. Prehistory and antiquity, II. Non-European traditions, III. Working knots, IV. Towards a science of knots?, V. Decorative knots and other aspects. The editors, Turner and van de Griend, have chosen a wide variety of experts as authors for the individual chapters; more information about them may be found in "About the Authors" on page 419. The construction and use of knots, links, and braids from string-like objects predates known human history. They occur in diverse areas of human activity, from magic tricks and decorative arts to shipping, fishing, and religious and medical practice. Such structures also occur in nature, for example, in polymer chemistry and biology. In spite of the vast range of topics and the time frame, the book gives a representative sampling of knots and knot applications. The historical aspects of each topic are also discussed. Knots are dis-
cussed from this broad perspective in all but one chapter. Only chapter 1 1 is devoted to knots as defined in mathematics. We will discuss the contents and highlights of each part and give a more detailed look at the part dealing with the mathematical theory of knots.
Part I deals with evidence for the use of knots from prehistoric times to the Egyptian civilization. The chronology of the knot technology over this vast period is summarized in Chapter 1 by studying various aspects of human and animal life which may have required the use of knots. Chapter 2 is devoted to some speculations regarding the earliest forms of knots and their origin. Chapters 3 and 4 are written by archaeologists who examine the knotted structures found at many excavation sites in Europe and Egypt. The earliest such structure is a Mesolithic fish-net fragment found in 1913 on the Karelian isthmus (formerly in Finland) which uses a knot type used in Estonia and in the Finnish settlement. It is now called the "Antrea Russian Knot." The evidence of rock art and artifacts using knots in all major ancient civilizations, extends over thousands of years. This and other finds lead us to conclude that our prehistoric ancestors had both the materials and the skills for making complicated knots.
Part II has three chapters which give a sample of knot history in non-European civilizations. The use of knotted cords or qui pus by the Incas is detailed in Chapter 5. There is an interesting discussion here of the construction of knotted cords and their use in representing numbers and storing numerical data. Chapter 6 traces the rich history of knots in China covering a period of some eighteen thousand years. The main theme in this chapter is the description of the decorative use of knots which began in ancient China and continues to this day. Beautiful examples of such knots were much in evidence during ICM 2002 held in Beijing. Knots used by Inuit Eskimos are the subject of Chapter 7. I would like to note that knots and braids made by using strings and organic materials have been (and are now) used in religious functions in India since before the Vedic period. It
would be very interesting to study this, as excellent written sources are readily available.
Part III presents a historical account of two fields where knots have been extensively used. Chapter 8 deals with knots and ropes used by seamen and fishermen. The knowledge and use of knots was substantially affected in the transition of man from a land-dweller to a mariner. There is vast literature dealing with the use of knots and ropes at sea. The author has managed to give a very good presentation of this subject in just a few pages. Chapter 9 discusses what the author calls life-support knots. It covers the use of knots in rock climbing, rescue work etc., with particular attention to their properties in life-support tasks.
Part N is the longest, with five chapters. Behaviour, under load, of single stranded knots tied in fiber rope is studied in Chapter 10. Chapter 12 is devoted to classification of knots by methods different from those used in topology. Various encyclopedias of knots are also briefly described. The work of Mandeville on trambling (i.e. producing sequences of knots by altering one tuck at a time) is dealt with in Chapter 13. The last chapter is devoted to the history and techniques of crochet work. I found the discussion of the CADD (Computer Aided Doily Design) system and its applications developed by the author and her coworkers at the University of Waikato, New Zealand, quite interesting.
Chapter 1 1 should be of greatest interest to the readers of this magazine. To a mathematician, a knot is an embedding of a circle in the three-dimensional Euclidean space R3 or its compactification, the 3-sphere S3. This definition is easily modified to obtain knots in any manifold. In particular, embedding of the standard unknotted circle is called the unknot. A systematic study of knots was begun in the second half of the 19th century by Tait and his followers. They were motivated by Kelvin's theory of atoms modeled on knotted vortex tubes of ether. It was expected that physical and chemical properties of various atoms could be expressed in terms of prop-
erties of knots, such as the knot invariants.
Though Kelvin's theory did not work, the theory of knots grew as a subfield of combinatorial topology. Tait classified the knots in terms of the minimal crossing number of a regular projection. Recall that a r·egular pro
jection of a knot on a plane is an orthogonal projection of the knot such that at any crossing in the projection exactly two strands intersect transversely. Tait made a number of observations about some general properties of knots which have come to be known as the "Tait conjectures." In its simplest form the classification problem for knots can be stated as follows: Given a projection of a knot, is it possible to decide in infinitely many steps if it is equivalent to an unknot. This question was answered affirmatively by W. Haken in 1961 , who proposed an algorithm which could decide if a given projection corresponds to an unknot.
However, because of its complexity it has not been implemented on a computer even after 40 years.
The simplest combinatorial invariant of a knot K is the crossing number c(K),
defmed as the minimum number of crossings in any regular projection of the knot K. The classification of knots up to crossing number 16 is now known [2]. The crossing numbers for some special families of knots are known, but the question of fmding the crossing number of an arbitrary knot is still unanswered. Another combinatorial invariant of a knot K that is easy to define is the un
knotting number u(K), the minimum number of crossing changes in any projection of the knot K which makes it into a projection of the unknot. Upper and lower bounds for u( K) are known for any knot K. An explicit formula for u( K) for a family of knots called torus knots, conjectured by Milnor nearly 40 years ago, has been proved recently by a number of different methods.
One of the earliest investigations in combinatorial knot theory is contained in several unpublished notes written by Gauss between 1825 and 1844 and published posthumously as part of his estate. They deal mostly with his attempts to classify Tracifiguren or
plane closed curves with a finite number of transverse self-intersections. Such figures arise as regular plane projections of knots in R3. However, one fragment deals with a pair of linked knots. In this fragment of a note dated January 22, 1833, Gauss gives an analytic formula for the linking number of a pair of knots. This number is a combinatorial topological invariant. As is quite common in Gauss's work, there is no indication of how he obtained this formula. The title of the note "Zur Electrodynamik" ("On Electrodynamics") and his continuing work with Weber on the properties of electric and magnetic fields leads us to guess that it originated in the study of the magnetic field generated by an electric current flowing in a curved wire. Maxwell knew Gauss's formula for the linking number and its topological significance and its origin in electromagnetic theory. In obtaining a topological invariant by using a physical field theory, Gauss had anticipated Topological Field Theory by almost 150 years. Even the term topol
ogy was not used in his era. It was introduced in 184 7 by J. B. Listing, a student and protege of Gauss, in his essay "Vorstudien zur Topologie" ("Preliminary Studies on Topology"). Gauss's linking number formula can also be interpreted as the equality of topological and analytic degree of a suitable function. Thus it can be considered as an example of an index theorem. Starting with this, a far-reaching generalization of the Gauss integral to higher self-linking integrals can be obtained. This forms a small part of the program initiated by Kontsevich [3] to relate topology of low-dimensional manifolds, homotopical algebras, and non-commutative geometry with topological field theories and Feynman diagrams in physics.
The Alexander polynomial provided a new type of knot invariant. There was an interval of nearly 60 years between the discovery of the Alexander polynomial and the Jones polynomial. Since then, a number of polynomial and other invariants of knots and links have been found. A particularly interesting one is the two-variable polynomial generalizing both the Alexander polynomial and
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number � , 2006 55
the Jones polynomial. This polynomial is called the HOMFLY polynomial (a name formed from the initials of authors of the article [ 1 ]). The author of this chapter gives an excellent account of the history of topological knot theory and related theory of braids, bringing the readers up to the mid-1990s.
Some important recent developments are not included in this chapter. Jones's work in the 1980s was a major advance in knot theory, leading to the resolution of several of the longstanding Tait conjectures. However, it did not resolve the chirality conjecture: If
the crossing number of a knot is odd,
then it is chiral (i.e. , not equivalent
to it,s mirror image). A 15-crossing knot which provides a counter-example to the chirality conjecture is given in [2]. Jones did not provide a geometrical or topological interpretation of his polynomial invariants. Such an interpretation was provided by Witten [6], who applied ideas from Quantum Field Theory (QFT) to the ChemSimons Lagrangian. In fact, Witten's model allows us to consider the knot and link invariants in any compact 3-manifold M. Witten's ideas have led to the creation of a new area called Topological Quantum Field Theory (TQFT) which, at least formally, allows us to express topological invariants of manifolds by considering a QFT with a suitable Lagrangian. An account of several aspects of the geometry and physics of knots may be found in [4] and [5] .
Recently, several topological and geometric invariants of knots and links have been used in polymer chemistry and in studying the mathematical structure of DNA. These early results have led molecular biologists to believe that knot theory may play an increasingly significant role in understanding the geometric and topological properties of DNA and that these in turn may help in resolving some of the riddles encoded in these basic building blocks of life. Understanding the structure and dynamics of DNA, RNA, and proteins, in general, may very well require the forging of new mathematical tools.
The last part, V, deals with decorative knots and their use as symbols in heraldry and love. The history of
two widely practiced crafts, namely macrame and lace is given in Chapters 15 and 16, respectively. Chapter 17 describes the various ways knots have been used in European Heraldry. The final chapter deals with the concept of love knot, its occurrence in literary works and the various forms that it has taken over the last five centuries or so. Indeed the most widely used synonym for marriage is "tying the knot."
I enjoyed browsing through all the chapters. They contain material that a mathematician would not normally come across in his work There is a well-known story about Alexander the Great unraveling the Gordian knot with his sword. Today's problems in knot theory, mathematical or otherwise, will require tools far more sophisticated than a sword.
BIBLIOGRAPHY
[ 1 ] R. Freyd, et a/., A new polynomial in
variant of knots and links. Bulletin of Ameri
can Mathematical Society (New Series,
1 2 :239-246, 1 985.
[2] J . Haste, M . Thistlethwaite, and J . Weeks.
The First 1 ,701 ,936 Knots. Mathematical ln
telligencer, 20(4):33-48, 1 998.
[3] M . Kontsevich. Feynman Diagrams and
Low-Dimensional Topology. In First European
Congress of Mathematics, vol. I I Progress
in Mathematics, 1 20, pages 97-1 2 1 , Berlin,
1 994. Birkhauser. [4] K. B. Marathe, G. Martucci, and M . Fran
caviglia. Gauge Theory, Geometry and
Topology. Seminario di Matematica deii'Uni
versita di Bari, 262 : 1 -90.
[5] Kishore B. Marathe. A Chapter in Physical
Mathematics: Theory of Knots in the Sci
ences. Mathematics Un/imited-2001 and
Beyond, pages 873-888. Springer-Verlag,
Berlin, 2001 .
[6] E. Witten. Quantum Field Theory and the
Jones Polynomial. Communications in Math
ematical Physics, 1 21 :359-399, 1 989.
Max Planck Institute for Mathematics in
the Sciences, Leipzig, and
Department of Mathematics
1 3 1 7b Ingersoll Hall
City University of New York
Brooklyn College
2900 Bedford Ave.
Brooklyn, NY 1 1 21 0-2889, USA
e-mail: [email protected]
Codebreakers: Arne Beurling and the Swedish Crypto Program during World War I I by Bengt Beckman
PROVIDENCE, Rl, AMERICAN MATHEMATICAL SOCIETY,
2003. 259 PP , US $39, ISBN 0·821 8-2889-4
REVIEWED BY HAKAN HEDENMALM
The author of this book, Bengt Beckman, is one of early members of
the Swedish cipher bureau FRA of Forsvarsstaben (Defense Staff Headquarters), which was operational in 1941 but officially founded a year later. By the time Beckman came to FRA as a conscript in 1946, there was much discussion of Arne Beurling (1905-1986), the mathematics professor who had made himself famous at FRA by breaking the German code of the Geheim
schreiber (developed by Siemens in the 1930s) at a time pivotal for Sweden during World War II. This feat is of the same order of magnitude as the British effort to break the Enigma code during the same war. In England as well as in Sweden, mathematicians played a vital role for the intelligence deciphering part of the war effort; perhaps the most famous mathematician working for the British at Bletchley Park was Alan Turing. The Swedish effort to keep a low profile with regard to this intelligence gathering was quite successful; indeed, the importance of Beurling's contribution to Sweden's ability to keep out of the war was, until recently, known only to narrow circles inside Sweden. By now, more than sixty years have passed, and the veil of secrecy has been lifted; Beckman, who stayed with FRA until 1991 , is able to tell the story as he remembers it, from what he picked up a long time ago, as well as from recent in-depth interviews with people more closely involved.
This story first aired in a 1993 Swedish Television documentary G sam i hemlig (G as in secret), produced by Beckman and Olle Hager. In the book, Beckman is able to tell a
© 2006 Springer Sc1ence+Business Media, Inc., Volume 28, Number 1, 2006 57
much more detailed story, of course. With the translation to English, commissioned by the American Mathematical Society, the material is now available to a considerably wider audience. The translator, Kjell-Ove Widman, who himself has been involved in cryptology at the Swiss company Krypto AG, has done a very thorough job and produced a very readable text. However, the style differs a little from the original: Beckman tells a story by the campfire, but Widman's translation sets higher literary standards. Having for the moment assumed a slightly critical stance, let me also mention that the map of Stockholm on the page following xviii places Karlaplan a bit off, somewhere in the forest area north of the Royal Institute of Technology (KTH); the mark should be placed a little bit further southeast. It is also unfortunate that the suggested explanation of Beurling's analysis is marred by cipher typos (on pp. 80, 82).
The book takes a historical perspective on ciphers and begins with an exercise to decipher a coded message from the 18th century. This is quite enjoyable; the cipher is of simple substitution type, and it is just a matter of running a frequency analysis to guess the most common letters of the cipher key. Then the perspective changes a little, and automatic ciphering machines enter the picture, along with the Swedish names Damm, Hagelin, and Glyden. Then, a description of radio signal interception and cryptanalysis before 1939 follows.
The Kingdom of Sweden was poorly prepared for the war that broke out on the European continent on September 1, 1939. The situation became particularly dire on April 9, 1940, when Germany invaded neighboring Denmark and Norway. As Sweden could not afford a massive military build-up, it became imperative to be able to secondguess the German intentions regarding Sweden. Then Sweden's foreign policy could be modified to be more palatable for the Germans, and hence avoid actual invasion. This was done quite successfully-German shipments of materials and supplies as well as of troops were permitted in sealed transit trains through Sweden-and it is commonly
58 THE MATHEMATICAL INTELLIGENCER
believed that this is the reason Sweden was able to remain outside the war.
But this is not the whole story. The invasion of Norway offered Sweden the chance not to second-guess but to actually read the potential enemy's cards. The diplomatic traffic between occupied Oslo and Berlin was transmitted along Swedish telegraph lines, and the Swedes were able to tap the messages. The only problem was that the messages were not in plain text; moreover, the encryption was not of simple substitution type, as could be seen from a simple frequency analysis. Given the sheer volume of encrypted traffic, it was suspected that a machine was doing the encryption automatically. One day in 1940 Beurling, who had already been involved with some simpler cryptanalysis tasks for the Swedish Defense Department, collected the tapped telegraph traffic at the Karlaplan office dated May 25 and May 27, 1940, which he believed to be essentially free of transcription errors. After a couple of weeks, he had more or less cracked the code. This was an impressive feat, especially compared with the British Enigma effort, which was based on the physical capture of an encryption machine from the Germans. If we think of the Geheim
schreiber encryption as a kind of substitution cipher, then the cipher key apparently was changing with each new letter of the message. Also, the initial key settings were altered every few days. The way the Geheimschreiber
was made, it would not begin cyclically repeating its cipher on any given message, for the number of possible encodings was much much bigger than the total quantity of information exchanged over the entire war.
Beurling never revealed how he performed his feat; he would say that a magician never reveals his tricks. Nevertheless, Beckman offers a possible explanation, based on a reconstruction attributed to Carl-Gosta Borelius. The encryption may have been perfect in
theory, but in practice telegraph lines were not 100 percent reliable in those days, so the German telegraphers would frequently rerun parts of the message, using the same code. This allowed Beurling to get a foot in the
door, and using some sound hypotheses regarding the nature of possible codes on teleprinters (where each letter corresponds to a sequence of five Os and Is )-essentially combinations of permutations and transpositionshe was able to complete his task It should be noted that Beurling did this with a rather small data sample, and without actually having seen a Geheimschreiber. Today a Geheimschreiber is on display in the Beurling library of the Mathematics Department at Uppsala University in Sweden, where Beurling worked in the 1940s.
At first, the Swedes carried out the deciphering manually, in accordance with Beurling's instructions, but later, and certainly by 1942, machinescalled Apps-were doing the job. The value of having cracked the Geheimschreiber depreciated toward the end of the war. The Germans sensed that their transmissions were being read, and reacted to it. By that time, however, the risk that Sweden would get dragged into the war was much reduced.
Beurling was a deep mathematician equipped with a difficult temperament. The stories about his disagreements, rows, or even outright fights with colleagues are widely known in mathematical circles in Scandinavia. Some of these stories are retold in this book Beurling was apparently quite charming to the ladies, and this aspect of his life, based on interviews with AnneMarie Yxkull Gyllenband, takes up a chapter. He married twice. His first wife is not mentioned by name in the book, but it is known that she worked as a physician, and Beurling had two children with her. Later, in 1950, he met his second wife, Karin Lindblad, at the party his student Lennart Carleson hosted to celebrate his thesis defense at Varmlands nation, one of the student clubs in Uppsala. As far as I know, Karin was a friend of Carleson's, and was Forste Kurator at Viirmlands at the time, the highest post a student could assume at a student club in Uppsala. Karin and Arne remained together for the rest of their lives.
Beurling worked in three areas of mathematical analysis: potential theory, harmonic analysis, and complex
analysis. His collaborators were few
but well chosen: Ahlfors, Deny, Helson,
and Malliavin. Probably it was his im
pressive collaboration and deep friend
ship with Lars Ahlfors that landed him
the position of permanent member at
the Institute for Advanced Study in
Princeton, New Jersey, in 1954. In
Princeton, he took over the office pre
viously occupied by Albert Einstein. It
is unfortunate that Beurling was not
able to continue on American soil the
extremely productive period he en
joyed in Uppsala in the 1940s, with
students such as Goran Borg, Lennart
Carleson, Yngve Domar, Carl-Gustav
Esseen, and Bo I\jellberg. Had he been
able to wield more influence in Prince
ton, the period of abstraction in math
ematical analysis, which was such a
dominant theme in the 1950s and
1960s, might have been balanced by a
deep and elegant approach that fo
cused not on form but on content.
Bengt Beckman has produced a fas
cinating book that acquaints us with
some of the stars of 20th-century Scan
dinavian mathematics. In addition, we
gain some insights into basic cryptol
ogy.
Department of Mathematics
The Royal Institute of Technology
S-1 0044 Stockholm, Sweden
e-mail: [email protected]
From Newspeak to Cyberspeak. A History of Soviet Cybernetics by Slava Gerovitch
CAMBRIDGE, MA, THE MIT PRESS, 2002. xiv + 369 PP
US $42 ISBN 0-262-07232-7
REVIEWED BY PAUL JOSEPHSON
Most readers know of the impact
of ideological interference in the
practice of Soviet scientists. In the case
of biology, the peasant agronomist
Trofun Lysenko rose to the top of the
scientific establishment with Stalin's
personal endorsement. He advanced
Lamarckian notions of the influence
of acquired characteristics that sup-
planted genetics. After a 1948 national
conference that proclaimed Lysen
koism the only true Soviet biology,
dozens of geneticists lost their jobs
and were exiled from the laboratories
to sheep-breeding farms at the end of
the empire; and genetics was purged
from university curricula. In physics,
too, several scientists joined ideo
logues in an attack on quantum me
chanics and relativity theory. The at
tack nearly had disastrous results for
physicists, many of whom were forced
to abandon research. In both of these
cases, questions of epistemology, class
struggle, and other issues of impor
tance in the official Soviet philosophy
of science, dialectical materialism,
played a role, as did Cold War pres
sures to promote a new, Soviet science
that was different from, and better than
that in the West, particularly in the
United States.
In a superb contribution to the his
tory and philosophy of science, Slava
Gerovitch considers the place of cy
bernetics in Soviet philosophical dis
putes, and the development of what
he calls "cyberspeak" in the postwar
USSR. Gerovitch, a research associate
at the Dibner Institute, has studied the
history of Soviet computing and cy
bernetics for some time, and beyond
From Newspeak to Cyberspeak, is now
focusing on human-machine interaction
in the Soviet space program. Cyberspeak he defines as a universal language of man-machine metaphors described in
such terms as information and feedback
and control, e.g., the organism as an en
tropy-reducing machine, the computer
as a brain, the brain as a computer.
Gerovitch joins several other scholars in
rejecting the notion that cybernetics was
severely damaged by ideological in
terference in the form of such official
pronouncements that cybernetics was
a "reactionary pseudo-science." That in
terference was relatively short-lived,
and scientists learned how to manage it.
Yet the fact that attacks were short
lived and ignorant should not lessen
our appreciation of the way in which
they reflected the dangers of doing sci
ence in the USSR generally. Those who
carried on the anti-cybernetics campaign
employed accepted ways of discourse
and dispute to rescue their careers and
EMI N ENT MATH EMATICIAN S
THE ART OF CONJ ECTU RING, TOGETH ER WITH "LETTER TO A FRI E N D ON SETS IN COU RT TEN N IS" Jacob Bernoulli translated with an introduction ond notes by Edith Dudley Syllo "Bernoulli's The Art of Conjecturing hos always been recognized as one of the outstanding texts in the history of probability, marking a dramatic development in the theory. With Syllo's translation, it becomes clear what a comprehensive and revolutionary work it was." -James Franklin, University of South Wales $70.00 hardcover
ARTH U R CAYLEY Mathematician Laureate of the Victorian Age Tony Cri l ly Arthur Cayley ( 1 82 1 - 1 895) was one of the most prolific and important mathematicians of the Victorian era. His influence
sti I I pervades modern mathematics, in group theory (Cayley's theorem), matrix algebra (the Cayley-Hamilton theorem), and invariant theory, where he mode his most significant contributions. Tony Cril ly, the world's leading a uthority on Cayley, provides the first definitive account of his life. $69.95 hardcover
S I R WI LLIAM ROWAN HAMILTON Thomas L. Hankins "This is an interesting, well-written biography of the great nineteenthcentury mathematician." -Mathematical Reviews $22.50 paperback
TH E JOH N S HOPKI N S U N IVERS ITY PRESS 1 -800-537-5487 • www.press.jhu.edu
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number 1, 2006 59
grind their axes. These activities surely are seen in the West, but not with the fury and lasting costs to people and fields of research. Perhaps, as Gerovitch implies, because cybernetics was needed for radar and rocketry, the Cold War saved it from further attacks-in the way that research on the atomic and hydrogen bombs saved relativity theory and quantum mechanics from ideological interference.
Gerovitch provides a detailed discussion of what made attacks on cybernetics possible in the first place in a discourse that took place between the language of cybernetics and the language of Soviet ideology. He asserts that the ideological language was much more flexible than many historians have gathered. Soviet scholars learned to play by the rules of the establishment-the government bureaucracy and Party apparatus-in their rhetorical styles, modes of thought, and arguments to defend cybernetics. Like their opponents, who saw in cybernetics idealism, "kow-towing" to the West, among other dangers, they turned to quotation-mongering and label sticking (what others have called the "quote and club" method) to put them on the defensive. Gerovitch shows that the postwar ideological campaigns lacked coordination and coherence; they were rarely orchestrated from the top down.
In fact, Soviet cybernetics developed along many of the same lines along which cybernetics developed in the West. Such scientists as Andrei Kolmogorov and later Alexei Liapunov contributed to the foundations of cybernetics. Gerovitch discusses the work of these scientists against the background of comprehensive analysis of the contributions to the field of Norbert Wiener, Arturo Rosenblueth, and Claude Shannon.
Although cyberneticians were adept at disputation, there were significant pitfalls that awaited them until after Stalin's death. Their contributions were measured against the standards of the West, but they were required to avoid showing contamination with Western ideas. Soviet scholars had to avoid falling behind the West in computing and simultaneously following
60 THE MATHEMATICAL INTELLIGENCER
Western trends too closely. At the same time, a Cold War ideological battle against the West led to the cutting off of international contacts. Intellectuals were forced to toe the Party line. In this environment, the boundaries between academic and political disputes disappeared.
An important facet of the history of computer science and the first Soviet computing machines was the connection between cyberneticians and their powerful military and Communist Party patrons. One such example is Mikhail Lavrent'ev, who, as director of the Institute of Computer Technology, received facilities and protection from the new Moscow City Party Chief, Nikita Khrushchev. Later, as Party leader, Khrushchev enabled Lavrent' ev to found the Siberian city of science, Akademgorodok, with its own new Computer Center in the late 1950s. When it came to military purposes, computers were a technology without ideological deviations.
After the death of Stalin, in the mid-1950s Soviet computers were deified and entered the public realm, no longer to be held under wraps of military secrecy. Cyberneticians touted computers as paragons of objectivity based on quantitative knowledge, precise language, and precise concepts. They also commenced an attack against past ideological interferences of the philosophers and their allies among scientists and Party officials. Gerovitch's discussion of the battle against dogmatism and calcification of philosophical discourse under Stalin at this time is engaging. Ultimately, Gerovitch writes, cyberneticians "overturned earlier ideological criticisms of mathematical methods in various disciplines and put forward the goal of the 'cybernetization' of the entire science enterprise" in search of objectivity in the life sciences and social sciences alike. (p. 199) This conclusion reinforces the sense that an excessive scientism developed in the USSR.
In the 1960s cybernetics became a full-fledged science in the Soviet establishment. It found an institutional home in a council of the Academy of Sciences of the USSR, experienced
rapid institutional growth, and saw a new publication, Problemy kiberni
tiki, which was one of the most influential such publications in the world. Its promoters claimed that cybernetics had become a universal, objective language, and as such would break interdisciplinary barriers and legitimize the use of mathematical methods in other sciences. Its power was evident in the fact that even before Lysenko was deposed, cyberneticians promoted genetic research in physics and chemistry institutes, speaking about genes as units of hereditary information. Supporters thought cybernetics might be a panacea for reforming the Soviet economy through the creation of "optimal" models for planning and management. But scientists ultimately realized that central mainframes in large-scale systems to centralize input and output calculations for such a huge economy were simply unfeasible.
Once institutionalized, Gerovitch concludes, cybernetics became part of the Soviet establishment in service of the nation's management, administration, direction, and government purposes. The goal was to control the entire national economy, technological processes, and so on, to ensure optimal governance. An alliance between cybernetics and dialectical materialism followed in the early 1960s. In the end, cyberspeak became so much a part of establishment thinking, so much the mode of dominant discourse, that its supporters grew disillusioned with efforts to apply it willy-nilly. Fissures in the cybernetics community as in Soviet society itself created new disputes. Some scientists had grown increasingly conservative and anti-Semitic, while others joined the dissident movement to protest increasing violations of human rights under Leonid Brezhnev. This suggests that scientism or not, cybernetics, like other sciences in other countries, could not avoid reflecting the social, political, and cultural norms of the nation in which it developed.
Gerovitch only touches on reasons why the USSR failed to embrace the computer revolution, some of which have roots in the debates over "thinking machines" that occurred from 1950
to 1965. He does not consider the decision to build computers based on reverse engineering after so many decades of success in building indigenous machines. In addition, because his focus is on philosophy and intellectual history, some readers will need to seek other sources to gather the impact of the broader context of Soviet history and politics on cybernetics.
Gerovitch's study is based on a thorough use of archival materials and unpublished memoirs and interviews. This book will be of interest to advanced undergraduate students, graduate students, and teachers, as well as to computer scientists, historians, and philosophers. I recommend it highly.
Program in Science, Technology and Society
Colby College
5320 Mayflower Hil l
Waterville, ME 04901 -8853
USA
e-mail: [email protected]
Traditiona l Japanese Mathematics Problems of the 1 8th and 1 9th Centuries by Hidetoshi Fukagawa and
John F. Rigby
SINGAPORE, SCT PUBLISHING, 2002. 1 91 PP. US$50.00
ISBN 981 -04-2759-X
Japanese Temple Geometry Problems San Gaku by Hidetoshi Fukagawa and
Dan Pedoe
WINNIPEG, THE CHARLES BABBAGE RESEARCH
CENTRE, 1 989. 206 PP. US$40.00 ISBN 0-91961 1 -2 1 -4
REVIEWED BY CLARK KIMBERLING
The best thing about these books is their content, which is based on
problem proposals carved and drawn on Japanese wooden tablets dating from a span of isolation from the West. During that time Japanese mathematicians developed their own "traditional
mathematics," which, in the 1850s, began giving way to Western methods. There were also changes in the script in which mathematics was written, and as a result, few people now living know how to interpret the historic tablets. One of these is Hidetoshi Fukagawa, the Japanese author of the two books. The 1989 book opens with these words:
A selection from the hundreds of problems in Euclidean geometry displayed on devotional mathematical tablets (SANGAKU) which were hung under the roofs of shrines or temples in Japan during two centuries of schism from the west, with solutions and answers. Implicit in this description is the def
inition of sangaku (often written San gaku and Sangaku ) . The phrase "with solutions and answers" applies to the books, not the sangaku. Dan Pedoe, co-author of the 1989 book, explains in the Preface:
There were few colleges or universities in Japan during the period of separation from the west, but there were many private schools, and obviously many skilled geometers who wished to thank the god or gods for the discovery of a particularly lovely theorem, and also, it may be guessed, who were not averse to displaying their discoveries to other geometers . . . with the implicit challenge: "See if you can prove this!" The 2002 book continues the col-
lection with additional problems and solutions. For both books, many of the solutions use modem methods. With admirably little overlap in content, the two books give historical descriptions, photographs, figures, calligraphy, solutions, and references, all well focused on sangaku. For a broader context, one may cite Chapter 22 of Yoshio Mikani's The Development of Mathe
matics in China and Japan, second edition, Chelsea, 1974 (originally published in German, 1913).
Mikani places sangaku in the perspective of Seki Kowa, who has been called the Japanese Newton and father of Japanese mathematics. Although Seki's lifetime (1642-1708) preceded sangaku, his influence in algebraic and analytic methods set the stage for san-
gaku. Mikani writes, "The highest development of the Japanese mathematics must of course be looked upon as the invention of . . . 'circular theory' "and it is precisely the enchantment of circle-problems that pervades san
gaku. Indeed, a majority of the problems in the 1989 and 2002 books involve circles.
One of the foremost mathematicians represented in sangaku was Ajima Chokuen (1732-1798). (The family name is Ajima. The given name Chokuen is used in the 1989 book, but the more formal given name Naonobu is used in the 2002 book) The two books contain a number of spinoffs from Ajima's famous problem about three pairwise tangent circles inscribed in a triangle. A view of Ajima's place in the books provides insights into the organization and mathematical tone of the two books and also gives insights into the work of one of the leading representatives of Japanese "traditional mathematics" (as it is called in the 2002 book).
On pages 28-30 of the 1989 book, Example 2.3 and Problems 2.3. 1 to 2.3. 7 are presented under the heading "Three Circles and Triangles," followed by Examples 2.4(1) and 2.4(2) and Problems 2.4.1 to 2.4. 7 under "Four Circles and Triangles." The presentation is in two-column format with figures in the right column. Each problem proposal is labeled "Example" or "Problem." Here are three items from the 1989 book:
Example 2.3: The three circles, 01Cr1), Oz(rz), and 03(r3) have external contact with each other. The triangle ABC is formed by the common tangents to the circles. Find the radius of the incircle of triangle ABC in terms of r1. r2, and r3.
Example 2.4( 1 ): I(r) is the incircle of triangle ABC, and the circles 01Cr1) , Oz(rz), and 03(r3) respectively touch AB and AC, BA and BC,
and CA and CB, and all touch I(r)
externally. Show that
r = Vr)r; + y:;;; + v;;;:;-. Problem 2.4.1: ABC is a triangle, I(r) its incircle. The circle 01(r1) touches AB and AC produced and
© 2006 Springer Science+Business Media, Inc., Volume 28, Number 1 , 2006 61
also I(r) externally, and 02(r2), and 03(r3) are defined similarly. Show that
1 1 1 1 - = -- + -- + --
r v;v; y,;;; � . A footnote refers to page 106, in Part
I, titled "Solutions to Selected Problems and Answers." There, under the heading "The Malfatti Problem," solutions and historical comments are given. The construction of the circles as in Example 2.3 is often attributed to Malfatti, but the historical comments note that Ajima posed and solved the problem about 30 years before Malfatti did.
In modem times, this Ajima-Malfatti construction has received considerable attention. For example, let A' be the touchpoint of circles 02(r2), and 03(r3), and cyclically, let B' = 03(r3) n 01(r1) and C' = 01Cr1) n 02Cr2). Then M'B'C' is perspective to MEG, and the perspector (John Conway's improvement over "center of perspective") is a point known as the AjimaMalfatti point. For a discussion, visit the Encyclopedia of Triangle Cen
ters-ETC:
http://facu1ty.evansvil1e.edu/ck6/ encyclopedia/ETC.htm1
and scroll down to X(179). Peter Yff found remarkable trilinear coordinates for the Ajima-Malfatti point:
(These are respectively proportional to the distances from the point to the sidelines BC, CA, AB.)
The 2002 book states "Ajima's Theorem" at the beginning of Chapter 4 and "Ajima's second theorem" at the beginning of Chapter 5. The second theorem is elsewhere cited in the book on several pages, as is a third theorem attributed to Ajima, labeled "Boushajutu".
Another traditional mathematician represented in both books was Shoto Kenmotu (1790-1871). His configuration in Problem 3.2. 1 of the 2002 book is the earliest known (1840) construction of a point now known as the Kenmotu point. The configuration, involving three congruent isosceles right triangles, extends easily to the one depicted here.
62 THE MATHEMATICAL INTELLIGENCER
A
The three congruent squares meet in the Kenmotu point, indexed as X(371) in ETC. Trilinears were found by John Rigby:
cos(A - 7T/4) : cos(B - 7T/4) : cos(C - 7T/4).
Both books are partitioned into Part 1 (problems) and Part 2 (essentially, solutions and comments). The 1989 book has chapter headings (1) Circles, (2) Circles and Triangles, (3) Circles and Polygons, (4) Polygons, (5) Ellipses (and One Hyperbola), (6) Ellipses and Circles, (7) Ellipses and Polygons, (8) Ellipses, Circles and Quadrilaterals, and (9) Spheres. The total number of problems is 249.
The 2002 book chapter headings are (1) Number theory, (2) Numerical Analysis, (3) Geometry ofPolygons, (4) Geometry of Circles, (5) Geometry of Circles and Triangles, (6) Geometry of Ellipses, (7) Solid Geometry, and (8) Maxima and Minima. There are 287 problems.
In both books, the organization of material is indeed wonderful, when you stop to think that consecutive closely related problems started out in temples scattered across Japan. For example, in the 1989 book, for the nicely sequenced Problems 2.3. 1 to 2.3.5, we read in Part 2 that these originated in various prefectures at various times: Fukusima in 1891, Iwate undated, Iwate in 1842, Miyagi in 1857, and Fukusima in 1901. Regarding such places and dates, Dr. Hiroshi Kotera offers a very attractive website:
http://www.wasan.jp/english/
(There is also a Japanese version.) There you will find a Clickable SANGAKU Map of Japan, showing 24 labeled prefectures in which sangaku are
found. By clicking any of them, you will be able to examine individual tablets on which problems are posed. In particular, you can click Tokyo, then select English, and see not only a very well preserved tablet, but also markers indicating Ohkunitama Shrine, a date of March 1885, the text of the problem, and so on. Another website is also recommended: Tony Rothman's (with the cooperation of Hidetoshi Fukagawa) Japanese Temple Geometry,
http://www2.gol.com/users/ coynerhm/0598rothman.html
Both books have extensive bibliographies. The 1989 book gives 78 items, and the 2002 book gives 109. The 2002 book has an Index.
Fukagawa's coauthor of the 1989 book, Dan Pedoe, is well known as the author of Circles (Pergamon, 1957), in which he wrote, regarding the ninepoint circle, "This circle is the first really exciting one to appear in any course on elementary geometry." (This famous line is quoted at the beginning of a section in Coxeter's Introduction
to Geometry.) The coauthor of the 2002 book, John
F. Rigby, is well-known in geometric circles. For example, Ross Honsberger's Episodes in Nineteenth and
Twentieth Century Euclidean Geometry (Mathematical Association of America, 1995) reserves a page on which special acknowledgment is given to John Rigby for his contributions to the book. Pages 132-136 introduce a Rigby point which serves as a seed for families of points in ETC beginning at X(2677). Two other Rigby points are indexed in ETC as X(1371) and X(1372). For more on both kinds of Rigby points, visit Eric Weisstein's Math World:
http://mathworld. wolfram. com/ RigbyPoints.html.
Physicist Freeman Dyson's Foreword (or "Forward", as it not inappropriately appears) to the 2002 book is a noteworthy piece, reminiscent of his Imagined Worlds (Harvard University Press, 1997). The Forward opens with these words: "One of the most important scientific enterprises of the twentieth century is the search for extraterrestrial intelligent species." Once
contact is made, there will be the problem of communication. Dyson writes, "Eighteenth-century Japan is stranger to me, in language and in historical tradition, than any other past or present culture on this planet. To my delight, I see in Fukagawa's books a collection of mathematical messages that are profoundly strange but none-the-less intelligible. Fukagawa has collected and arranged these messages so that their strange beauty is now accessible to everyone, eastern and western alike."
The 1989 book can be ordered directly from The Charles Babbage Research Centre, P. 0. Box 272, St. Norbert Postal Station, Winnipeg, Canada R3V 1L6. To order the 2002 book, send a check payable to Mathematics
and Informatics to Susan Wildstrom, 10300 Parkwood Drive, Kensington, MD 20895-4040, USA. Include a letter telling whether you wish to receive the book (from Singapore) by surface mail (total $50.00) or by air mail (total $60.00).
Department of Mathematics
University of Evansville
1 899 Lincoln Ave
Evansville, IN 47722-0001 USA
e-mail: [email protected]
Mathematics: A Very Short Introduction by Timothy Gowers
NEW YORK, OXFORD UNIVERSITY PRESS, 1 56 PP.
US $9.95, ISBN 0·1 9·285361 ·9
REVIEWED BY JEAN-MICHEL KANTOR
This is a very short review of Mathematics: A Very Short Introduc
tion, by Timothy Gowers, a very smart mathematician and an excellent communicator of mathematics (the two concepts are distinct). Gowers's 1998 Fields Medal might be considered a proof of the first statement; this book, together with his already famous Clay Lecture on the importance of mathematics [ 1 ] , an argument for the second. He has done a wonderlul job in producing this rich little book on the essentials of mathematics.
The main part consists of three chapters:
• Models (turning practical problems into mathematical ones).
• The abstract method (or axiomatic method), for which he gives good arguments showing its power and suggesting its pedagogical usefulness (I will not open here the Pandora's box of discussing didactics).
• Proofs, which some people consider to be at the heart of mathematics. Gowers gives examples of proofs, and of seemingly obvious statements that need proofs.
The rest of this charming book is made up of stories, illustrations, examples, and well-illustrated concepts such as limits, estimates, dimension, infinity . . . . Gowers has chosen a list of simple concepts of great mathematical importance, which he presents to the curious reader. Finally, the chapter "Some frequently asked questions" responds to what people generally ask about mathematicians (or so mathematicians imagine). Among the questions, "Why do so many people positively dislike mathematics?"
When people asked Henri Poincare why they never understood mathematics at school, he answered them, "What I don't understand is that people don't understand mathematics!". Gowers suggests using the abstract approach. We wish him all the best!
I would strongly recommend this book to first-year undergraduates, although professional mathematicians will also find it a useful introduction to many beautiful examples on Gowers's Web site, such as the astonishing results that won him the Fields Medal [2]. The general audience too will fmd much of interest on the Web site, including an article about what is definable in mathematics, and the text version [3] of the Clay Lecture, in which Gowers shows how mathematics is a subject where importance and beauty are connected.
REFERENCES
[1 ] The Importance of Mathematics. A Lecture
by Timothy Gowers, Springer VideoMATH,
2002. VHS/NTSC video tape. ISBN 3-540-
92652·6.
[2] http ://www . dpmms.cam .ac. uk/�wtg1 0
/vsipage.html
[3] http://www. dpmms.cam . ac . uk/�wtg 1 0/
importance.pdf
ADDED IN PROOF
See also The Princeton Companion to Mathe
matics, under the supervision of T. Gowers,
to appear in 2006, Princeton University Press.
Universite de Paris VII
75251 Paris Cedex 75005
France
e-mail: [email protected]
Basebal l's Al l-Time Best H itters by Michael Schell
PRINCETON, NJ: PRINCETON UNIVERSITY PRESS, 1 999.
xxi + 295 PP. US $1 7.95 ISBN 0691- 1 2343·8 (PAPER)
REVIEWED BY JIM ALBERT
Baseball is one of the most popular tean1 games in the United States.
Professional baseball started near the end of the 19th century. Currently in the United States and Canada, there are 30 professional teams in the American and National Leagues, and millions of people watch games in ballparks and on television.
Baseball is a game between two teams of nine players each, played on an enclosed field. A game consists of nine innings. Each inning is divided into two halves; in the top half of the inning, one team plays defense in the field and the second team plays offense, and in the bottom half, the teams reverse roles. The team that is batting during a particular half-inning, the offensive team, is trying to score runs. A player from the offensive team begins by batting at home base. A run is the score made by this player who advances from batter to runner and touches first, second, third, and home bases in that order. A team wins a game by scoring more runs than its opponent at the end of nine innings.
A basic play in baseball consists of a player on the defensive team, called a pitcher, throwing a spherical ball (called a pitch) toward the batter. This
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number 1 , 2006 63
Batting Career At-
Rank Player Average bats Career
Ty Cobb 0.3664 11434 1905-1928
2 Rogers Hornsby 0.3585 8173 1915-1937
3 Joe Jackson 0.3558 4981 1908-1920
4 Lefty O 'Doul 0.3493 3264 1919-1934
5 Ed Delahanty 0.3459 7505 1888-1 903
6 Tris Speaker 0.3447 1 0195 1907-1928
7 Ted Williams 0.3444 7706 1939-1960
8 Billy Hamilton 0.3443 6268 1888-1901
9 Dan Brouthers 0.3421 6711 1879-1904
10 Babe Ruth 0.3421 8399 191 4-1935
confrontation is called the batter's atbat. The batter is attempting to make contact with the pitch using a smooth round stick called a bat. A strike is a pitch that is struck at by the batter and missed, or is not struck by the batter and passes through a region called the strike zone. A ball is a pitch that is not struck at by the batter and does not enter the strike zone in flight.
After a number of thrown pitches, the batter will either be put out or become a runner on one of the bases. The batter may be put out in several ways: (1) he hits a fly ball (a ball in the air) that is caught by one of the fielders, (2) he hits a ball in "fair" territory, and first base is tagged before the batter reaches first base, (3) a third strike is caught by the catcher.
A hitter can advance to become a runner and reach base safely by: (1) Receiving four pitches that are balls (outside of the strike zone). In this case, the batter receives a walk or base-onballs and can advance to first base. (2) Hitting a ball in fair territory that is not caught by a fielder or thrown to first base before the runner reaches first base. There are different types of hits depending on the advancement of the runner on the play. A single is a hit when the runner reaches first base, a double is a hit when the runner reaches second base, a triple is a hit when a runner reaches third base, and a home
run is a "big" hit (usually over the outfield fence) when the runner advances around all bases safely.
Rating Players
Baseball players are rated with respect to their ability to hit and their ability to field. The classical measure of a
64 THE MATHEMATICAL INTELLIGENCER
player's ability to hit is the batting average (AVG), defined as the proportion of "official at-bats" (essentially a player's at-bats that are not walks) when the player gets a base hit. Above is a table listing the players since the beginning of professional baseball (1876) that have the ten highest batting averages. From this table, it appears that Ty Cobb is the best baseball hitter of all time. But was he really the best hitter? This book provides a serious attempt to make the proper adjustments to batting average so that one can make a fair comparison of players who played during different time periods.
Why Consider Batting Average?
It should be clarified what this book is about and what this book is not about. Although batting average (AVG) has been the standard way of measuring a player's batting ability since the beginning of professional baseball in 1876, it is well known that AVG is a relatively poor measure. The objective of a hitter is to help create runs for his team, and there are alternative measures of a player's hitting success that are much more positively associated with runs than AVG. Albert and Bennett (2003) give an overview of the many superior alternatives to AVG.
The book under review has been criticized by many people for comparing hitters with respect to batting average instead of alternative "good" measures of batting performance. I think this criticism is unfair. Schell makes it clear in the introduction that his analysis is not finding the best "batters," but rather the best "hitters"-the ones who have the best chance of getting a hit. Given the current popularity
of the AVG as a measure of hitting per-formance, I believe that this is a rea-sonable aim. The definition of bat-ting average is carefully explained, and Schell can focus his efforts on the proper adjustment of this measure to make comparisons of players. (Schell is currently completing a second book, Premier Batsmen, that makes similar adjustments to other measures of hit-ting performance.)
What Adjustments Should
Be Made?
To illustrate some of the issues discussed in this book, let's compare two great baseball hitters, Ty Cobb and Ted Williams. On the surface, Cobb appears to be the better hitter, for his career batting average was 0.366 compared to Williams's 0.344. But this superficial comparison ignores some relevant concerns. First, we question whether 0.366 and 0.344 are representative measures of the ability of the two players to get hits. Ty Cobb played baseball for 24 seasons. Figure 1 plots his season batting averages against his age.
In it, a smooth-fitting (loess) curve is placed on top of the plot. We see a general pattern in this plot that is typical for many players. Baseball players, like many other athletes, mature and become more proficient in performance until a particular age, after which their performance deteriorates until retirement. A player's career batting average includes the periods of maturation and deterioration during which the player has a low batting average. A better measure of hitting performance might be the batting average for a player during the middle years of his career.
A second general concern in comparing Cobb and Williams is that they played baseball during different eras and their careers did not overlap. The basic rules of baseball have remained the same over the years, but the playing conditions that affect the effectiveness of the pitchers have changed, and these changes have had a significant effect on the batting averages of players. It is difficult to say that Cobb was a better hitter than Williams, because they played against different pitchers and the game was different in the two eras. For these reasons, a reported batting
• •
0.4 •
0.35 (!) �
0.3
0.25 •
20 25 AGE 30 35 40
careers. In the "Second Base" chapter,
Schell makes an adjustment for seasons
during which the batters had general ad
vantages or disadvantages versus the
pitchers-he calls these periods hitting
feasts and famines. A mean adjusted bat
ting average is computed, which is the
ratio of the player's batting average to
the average league batting average for
that particular season. This adjustment
diminishes the achievement of players
that had high batting averages during pe
riods where hitting was relatively easy.
More subtle adjustments to batting
average are made in the "Third Base"
and "Home" chapters. The third ad
justment accounts for differences in
the talent pool during different eras.
Schell claims that the standard devia
tion of players' batting averages is a
useful measure of the talent pool of
players, and small standard deviation
values indicate more talent. A new ad
justed batting average is proposed that
takes into account the difference in tal
ent pools. This method essentially as
sumes a percentile ranking for each
player for the era in which he played,
and then combines the percentile rank
ings into a single list. In the "Home
Base" chapter, Schell makes the final
adjustment. A player's batting average
is adjusted for the ballpark where he
played half of his games. This is a te
dious calculation, for adjustments have
to be made for each of the ballparks
where Major League baseball has been
played over the past 130 years. This
method multiplies a batting average by
a specific "ballpark effect."
Figure 1
average of, say, .320 is not very mean
ingful unless we understand when this
average was achieved. Schell makes
the plausible assumption that the num
ber of great hitters has stayed constant
over the years, and it is not meaning
ful to look at a player's absolute bat
ting average. Rather, he believes, a
player's batting average should be
judged relative to the batting averages
of other players who played during the
same seasons.
A third concern is related to the pool
of talented players during different
eras of baseball. Schell describes how
the recruiting pool of potential ball
players has changed over the years.
Most of the players in the early years
of baseball were whites from the North
east United States. In later years, base
ball included players from the South,
later still, African-Americans, and now
players from Latin America and Asia.
The change in U.S. population and the
source of eligible players has had a sig
nificant impact on the general talent of
the group of professional players. Gen
erally, it can be said that the range of
talent has decreased over the years
(players are currently more homoge
nous in terms of ability), and this
change should affect the comparison
of batting averages from different eras.
A final concern is that all ballplay
ers play half of their games in their
home ballpark and half of their games
in ballparks away from their home
town. There are significant differences
in the dimensions of the ballparks, the
weather, the type of grass, and the con
dition of the pitcher's mound (the hill
from which the pitcher throws), and
these ballpark characteristics can have
a significant effect on hitting statistics.
If one player hits for a 0.300 batting av
erage in a "hitter's ballpark" and a sec
ond player hits for an average of 0.300
in a "pitcher's ballpark," then one
would rate the second hitting perfor
mance as superior because it was more
difficult to get hits in his ballpark
Thus, one needs to adjust a player's
batting average for his ballpark
Making the Adjustments
Schell carefully describes how to make
the four types of adjustments de
scribed above to find the 100 best base
ball hitters of all time. He starts by only
considering "qualifying" players-the
retired players who had at least 4000
at-bats and the current players who
have had at least 8000 at-bats. Note that
one of the players in the above list,
Lefty O'Doul, would be excluded from
the top list because he didn't have the
required 4000 at-bats. With this exclu
sion, Schell displays the "traditional"
list of 100 best hitters, and in the four
chapters that follow (called "First
Base," "Second Base," "Third Base,"
and "Home"), adjusts the traditional
list using the four concerns.
In the "First Base" chapter, Schell
adjusts a player's batting average for
late career declines. Essentially, this
method adjusts a career A VG by con
sidering only the proportion of hits in
the first 8000 at-bats. This adjustment
has the effect of raising the batting av
erages of players whose averages de
teriorated in the closing years of their
And the Best Hitter Is?
After performing all of these adjust
ments to batting average, who are the
best hitters of all time? As expected,
many of the players who had high bat
ting averages during the feast period of
the 1920s and 1930s get devalued, and
many of the current players rise in the
ratings. For example, Roger Hornsby,
who played in the 1920s, drops from
second on the traditional list to fifth on
Schell's list, and Rod Carew, a 1970s
player, rose from 28th (traditional) to
3rd (Schell). Surprisingly, the best hit
ter of all time is Tony Gwynn, who re
tired in the 2001 season. Schell's re
sults received much media attention,
© 2006 Springer Science+Business Media, Inc., Volume 28, Number I , 2006 65
and the author had an opportunity to
meet with Gwynn at the San Diego ball
park to discuss his findings.
General Assessment
This book provides a nice historical
view of baseball from a statistical
perspective. There have been notable
changes in the game over the years,
and one can view these changes by use
of time series plots of the appropri
ate statistics. Through this historical
study, one learns a lot about the great
hitters of all time. Also, although many
baseball fans may disagree about the
final ranking of best hitters, the author
uses scientifically sound methods to
make the necessary adjustments in bat
ting average. Although I might choose
different methods to make my adjust
ments, I suspect that I would reach
similar conclusions. My only concern
in reading this book is the relatively
high technical level of the presentation;
the book can be read only by baseball
fans with a quantitative background,
yet most fans would be interested in
the conclusions. I also believe that the
adjustment procedures described here
would be helpful in constructing simi
lar types of adjustments to measures of
performance in other sports.
REFERENCE
Albert J. and Bennett, J . , Curve Ball, Springer
(2003).
Department of Mathematics and Statistics
Bowling Green State University
Bowling Green, OH 43403
USA
e-mail: [email protected]
Teaching Statistics Using Basebal l by Jim Albert ---.. -- ·-----
WASHINGTON. DC, MATHEMATICAL ASSOCIATION OF
AMERICA, 2003. 304 PP US $46.95. ISBN 0·88385-727-8
(PAPER)
REVIEWED BY MICHAEL J. SCHELL
Teaching Statistics Using Baseball (TSUB hereafter) addresses vari
ous issues in statistics for which base-
66 THE MATHEMATICAL INTELLIGENCER
ball can provide insights. The author,
Jim Albert, suggests that TSUB "can be
used as the framework for a one-se
mester introductory statistics class
that is focused on baseball" or "as a re
source for instructors who wish to in
fuse their present course in probability
or statistics with applications from
baseball." Regarding the first sugges
tion, I would call such a course "sta
tistics appreciation," for TSUB does
not cover the traditional statistics
material contained in introductory
courses, and very little formal infer
ence is done.
The topics of TSUB are closely
linked with those of [2] . The book is or
ganized into 9 chapters. The first chap
ter is an introduction to the book; the
remaining chapters are composed of
an average of 5 case studies and 19 ex
ercises, including "leadoff exercises"
featuring Rickey Henderson, probably
baseball's greatest leadoff hitter.
Chapter 2 introduces stemplots
(also called stem-and-leaf diagrams),
the "five-number summary" of a dis
tribution (minimum, maximum, and
three quartiles ) , histograms, and graph
ing of time series data. Chapter 3 com
pares batches of data, principally using
parallel stemplots, boxplots, or stan
dardization. Chapter 4 examines rela
tionships between variables using scat
terplots, correlation coefficients, and
regression, using the RMSE to evaluate
how well alternative measures (like
batting average and slugging average)
predict run-scoring. In Chapter 5, prob
ability concepts are introduced and
applied to the design of baseball table
top games. Plots of cumulative rates of
some baseball statistics (like the
earned run average of a pitcher over
the course of a season) are also fea
tured. In Chapter 6, the binomial dis
tribution and goodness-of-fit Pearson
residuals are presented in the case
studies, and the Poisson and geomet
ric distributions are introduced in the
exercises. Chapters 7 and 8 focus on
statistical inference. Typically, data are
simulated according to various distri
Obutional assumptions, and inference
among alternatives is conducted using
Bayes's rule, probability intervals, and
visual inspection of graphs. Chapter 9
models a baseball game as a Markov
Chain and uses transition probability
matrices to derive conclusions.
The case studies and exercises pose
interesting baseball questions, and the
book and accompanying Web site pro
vide easy access to much baseball data.
Here are a couple of my favorites:
Case Study 2-5 presents an interest
ing look at managerial strategy, by
studying the use of the sacrifice bunt.
When the dotplot of sacrifice attempts
is displayed in parallel (separately) for
the American and National Leagues,
there are noticeably fewer bunts in the
AL. The designated hitter rule provides
a "simple explanation for this discrep
ancy."
Case Study 9-4 is built on previous
case studies in Chapter 9 and uses
Markov Chain concepts for its devel
opment. Within an inning, there are 24
possible "states," based on the number
of outs (0, 1, or 2) and the runner con
figurations (e.g., bases loaded). From
this model, Albert estimates the average
value of a home run to be 1.40 runs to
the team. This corroborates similar esti
mates obtained by others by regressing
wins on various team batting statistics.
Albert has adroitly kept the statistical
development work to the minimum re
quired to solve the question.
In other case studies, though, the
author simplifies the analyses ( un
doubtedly for pedagogical reasons).
Students should know that these analy
ses lack the rigor needed for more de
finitive answers to the questions posed.
I now provide several such cases as an
aid to the potential instructor.
Case Study 3-5 is close to my heart,
for it involves comparison of four of the
best single-season batting averages in
baseball history. Albert standardizes the
raw averages by using the means and
standard deviations of "regular players"
(those with 2::400 at bats) from the four
seasons. For Rod Carew's 1977 and Tony
Gwynn's 1994 seasons, Albert obtains
batting average z-scores of 4.07 and 3.15,
respectively. Given that Gwynn's season
ranks first in my book, [3] (Carew's sea
son ranks fourth), I was surprised that
the z-score differential in TSUB was so
disparate. Two simplifications of the
problem led to a substantial shift in the
findings. First, batting averages were
not adjusted for the players' home ball-
park in TSUB. Second, the definition of
a "regular player" biased the standard
ization, because a players' strike short
ened the 1994 season. Consequently,
only 63 players (2.2 players per team)
from 1994 were "regular players" com
pared to 168 ( 6. 5 players per team) from
1977. These simplifications gave Carew
13 and 16 net batting points over
Gwynn, respectively, compared to the
more detailed analysis given in my
book, which yielded z-scores of 4.24 for
Gwynn and 4.03 for Carew.
For Case Study 7-3, Albert shows re
sults from 150 simulations but notes
that they "aren't too precise," so he re
simulates the example 1000 times.
Then, in actually solving the problem,
he uses about 333 simulations. If 150
isn't precise, and 1000 is, what about
333? I would have preferred that 10,000
simulations be used.
Sometimes TSUB relies heavily on
the reader's baseball knowledge. Having
discussed leadoff hitter Rickey Hender
son's distribution of plate appearances
in the 24 runner and out situations for
1990 (p. 260), Albert asks the reader
whether Barry Bonds's 1990 profile
should be similar. The question seems to
spin on whether Bonds was a leadoff hit
ter or not. Well . . . early in his career,
Bonds WAS a leadoff hitter. Bonds led
off only 13 games in 1990, but in 1989 he
led off 106 of the 159 games he played.
The casual fan might not know anything
about Bonds's lineup position at all. The
intermediate fan might think of him as a
cleanup hitter. The advanced fan wor
ries about when he made the transition
from being a leadoff hitter.
In Chapter 8, TSUB proposes as
sessing the "streakiness" of players by
visual comparison of player moving
average plots to simulated ones. How
ever, most of the plots based on simu
lated data are of different size, render
ing comparison very difficult.
In summary, Jim Albert is to be ap
plauded for providing a host of baseball
data and examples for use in learning
statistics. Sports examples provide one
of the best avenues for helping stu
dents develop an interest in statistical
concepts. In 1997, Frederick Mosteller
[ 1] sounded the call that "statisticians
should do more about getting statistical
material intended for the public written
in a more digestible fashion" and relate
the interests of sports enthusiasts "to the
more general problems of statistics."
Through Curve Ball and Teaching Statistics Using Baseball, Jim Albert has
responded well to that call.
REFERENCES
1 . Mosteller, Frederick. Lessons from sports
statistics. American Statistician, 51 :305-
3 1 0, 1 997.
2 . Albert, Jim and Bennett, Jay, Curve Ball:
Baseball Statistics and the Role of Chance
in the Game, Copernicus Books. New York,
2001 .
3. Schell, Michael J. Baseball's All-Time Best
Hitters: How Statistics Can Level the Play
ing Field, Princeton University Press, Prince
ton, 1 999.
Department of Biostatistics
University of North Carolina
Chapel Hi l l , NC 27599
USA
e-mail: [email protected]
Machine Learning: Discriminative and Generative by Tony Jebara
BOSTON, KLUWER,THE KLUWER INTERNATIONAL
SERIES IN ENGINEERING AND COMPUTER SCIENCE,
VOL. 755. 2004. 224 PP. HARDCOVER US$82,
ISBN 1 -400-7647-9
REVIEWED BY MARINA MElLA
I f "probability and statistics [ . . . ]
have separated, then divorced" [2],
machine learning is the place where
they coexist happily and occasionally
get up to interesting things together.
This book is the refreshing result of
just such an encounter.
To pursue another metaphor of the
same flavor, machine learning (ML) is
the estranged child of Artificial Intelli
gence, which gave up on high-level,
rule-based representations of knowl
edge for low-level, often stochastic
models estimated from data. It em
braced probability from its inception,
discovered statistics in the 1990s, all
the while maintaining its roots in com
putation. This was fortunate, as some
of machine learning's most famous
successes include an algorithmic in
gredient.
One of the oldest and most studied
problems in ML is classification. Auto
mated classification is often desired in
practical applications of computers.
For example, credit card companies
detect when a card is stolen from the
pattern of activity in the account on the
past few days; banks give credit ratings
"good = will pay = + 1" or "bad = will
default = - 1" to potential customers
based on a set of a dozen or so ob
served characteristics (usually kept se
cret by the banks). Recently, with the
advent of the massive scientific data
bases, automatic classification is in
demand in the sciences as well. In
biological sciences, automatic protein
classifiers are being developed that
predict the functional class of a protein
from its chemical formula. Astronomy
has introduced massive sky surveys,
with a telescope continually photo
graphing the night sky for as long as a
few years. The number of celestial ob
jects, from asteroids to remote galax
ies, recorded this way is in the billions.
Hence, classifying each of them must
be done automatically.
Mathematically speaking, a classi
fier is a function that takes some inputs
(the characteristics of the customer in
the credit rating problem, or the posi
tion and image of the object if one
wants to classify astronomic observa
tions) and outputs a label, e.g. ::!:: 1.
For example, a linear classifier is de
fined by
f(x) = sign(81x1 + 8zxz + . . . 8mxm),
where x1, x2, . . . Xm are the inputs and
el, 8z, . . . em are the parameters of the
classifier. This classifier defines a hy
perplane in m-dimensional space. All
examples (x1, Xz, . . . Xm) falling on one
side of the hyperplane are labeled + 1
(e.g., "good credit rating"), while the
points on its other side are labeled - 1
("bad rating"). The hyperplanef(x) = 0
itself is called the decision surface. The
values of the parameters, i.e., the
choice of the particular hyperplane, is
made based on a set of hand-labeled
examples called the training data.
Hence, training a classifier consists
of picking a function! and a set of val-
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number 1 . 2006 67
ues for its parameters e. More ab
stractly, one chooses a family '!fo of
functions parametrized by 81, . . . Om, a
criterion for selecting the values of the
parameters J(O; training data), and
some algorithm for obtaining the opti
mal O's according to J. The first task,
choosing '!fo, is sometimes left to the
"application expert," but the second
and third are at the core of ML.
One way of practically approaching
classification was pioneered by Sir
Ronald Fisher and, to no surprise, typ
ifies the point of view of statistics: As
sume that one knows that the good
credit applicants have a distribution
p+ (x) (for example a multivariate nor
mal) and the bad credit applicants are
distributed according to p-(x). Then,
to decide whether xnew deserves a
good or a bad credit rating we compare
the probabilities that the two mod
els assign to xnew. For instance, if
p+(xnew)IP-(xnew) > 1, xnew receives
a good rating, otherwise it receives a
bad one. This approach can also be put
in aj(x) form, with
f(x) = sign[log p+(x) - log p-(x)] (1)
Because p+, p- are referred to as the
generative models of the two classes,
this approach is called the generative
approach to classification. By contrast,
directly choosing a decision surface
f(x) without resorting to generative
models is called discriminative classi
fication.
The statistical approach induces
one to think of how the examples of
each class were generated, which is
more intuitive than directly selecting a
class '!fo of decision surfaces, especially
in high dimensions and when one deals
with more than 2 classes. Statistics
also offers principles-like Fisher's
Maximum Likelihood-and algorithms
for estimating the parameters of each
model from data. It was also proved
that, when the true model classes for
p± are known and the number of train
ing examples tends to infinity, the gen
erative classifier is optimal.
The problems appear in real condi
tions, when one knows little about
the true distributions p±. What P to
choose? One can seemingly bypass this
68 THE MATHEMATICAL INTELLIGENCER
question by choosing a family of gen
erative models rich enough to approx
imate well any distribution the data
may have. But this way one may fall
prey to overfitting, which is exaggerat
ing the importance of the inputs in pre
dicting the data, when some of them
may be noisy or irrelevant. For exam
ple, if a student sees a cat on the day
he fails an exam, and concludes that
seeing a cat on future exam days will
predict failure, then he is probably
overfitting the data.
The discriminative way to look at
classification is to pragmatically pick a
classifier f E '!fo that classifies the train
ing set best, doing away with the gen
erative model, i.e., with the assumption
that we know something about the
form of the "true" sources that gener
ated the data. A now classic theory [ 1 ]
allows one to predict the classification
error rate on future data based on just
three numbers: the number of training
examples, the number of errors off on
these examples, and the capacity of '!fo the class of functions we are optimiz
ing over. To understand the im
portance of the last factor, think of over
fitting. If the class '!fo is richer, something
measured by its capacity, then it is more
likely that the chosenfis overfitting the
observed examples but will do poorly
on new examples Gust like not seeing a
cat the next exam day will be a poor
predictor of success).
Discriminative methods have the
advantage of directly optimizing the
quantity of interest (the classification
error) instead of the description of the
classes separately through p+ , p-. In
practice they have been shown to be
more robust to overfitting, especially
when few training examples are avail
able. However, their output cannot be
converted easily into a confidence
measure, and incorporating other
knowledge about the domain, some
thing easily done with generative mod
els, was only achieved on a case-by
case basis.
How to combine the advantages of
generative models-ideal for incorpo
rating knowledge about the domain
and returning a probabilistically mean
ingful answer-with the principles of
discriminative classification that seem
to govern problems with finite data?
"Machine Learning: Discriminative and
Generative" offers a way of doing just
that: it is Maximum Entropy Discrimi
nation (MED for short).
While introducing the core of the
MED method to a machine learning ed
ucated audience can be done in one re
search paper, this monograph has a
more ambitious goal. It aims to situate
MED in the context of its sources, to
motivate it, and to exhibit its many ties
with other old or recent advances in
machine learning.
The book opens with an overview of
machine learning, stressing the fun
damental trade-off between choosing
a model expressive enough to solve
the problem and avoiding overfitting
the observed data. The next chapter
weaves in the precursor ideas for MED.
The generative and discriminative ap
proaches are introduced. Each is illus
trated by a notable success story in
teresting in itself, but which also
supports the understanding of the rest
ofthe book For generative models, the
jewel of the crown are the "graphical
models," featured on the cover of the
book itself. Graphical models have be
come a language for expressing prob
abilistic dependencies between many
variables by way of graphs. Like any
good language, the concise represen
tations are precise in a probabilistic
sense, and translate into very efficient
algorithms both for making inferences
from the model and for estimating the
models themselves.
The discriminative methods' reso
nant success is the Support Vector Ma
chine (SVM), also symbolized on the
book cover by a decision surface. SVM
demonstrates that important inventions
need more than one good idea; in the
present case three ideas converged from
domains as different as convex opti
mization, reproducing kernel Hilbert
spaces, and statistical learning theory.
Chapter 3 introduces the MED core
ideas: Similarly to SVM, a family '!fo of
discriminative functions is chosen and
one explicitly enforces good classifica
tion of the training examples. But, in
stead of choosing just one f E '!fo, one
hedges the bets by averaging over all
the options. It is a weighted average, in
which an f gets a higher weight if it
makes fewer mistakes on the training
data. Beautifully and surprisingly, it
turns out that for many classes 21', solv
ing this apparently more complex prob
lem (an average over an infinite set of
functions) is no harder than choosing
one optimal! Even better, the task can
be cast into a convex optimization
problem with unique solution, that can
be solved by a generic algorithm.
One nice consequence of learning a
distribution instead of a single f is that
now one can naturally achieve most of
the goals listed above: One can construct
thefs as in (1) from generative models,
yet be explicitly optimizing the classifi
cation error. One can interpret the clas
sifier's output both discriminatively and
probabilistically. Prior knowledge of var
ious kinds can be introduced.
Extensions to the main method are
presented in chapters 4 and 5. The for
mer extends the MED approach to a
wider range of problems, like classifi
cation with 3 or more classes (multi
way classification), predicting a real
valued "label" (regression), selecting
which inputs are relevant for classifi
cation and which are not (feature se
lection). Experimental results also
highlight the behavior of MED com
pared with other methods. Chapter 5 discusses the case when some relevant
inputs are not observed (these are
called latent variables). For instance,
two stars that are close together in an
image may be part of a binary system
or may appear close only seen from the
Earth (much less likely!). Which is the
case is something that may perhaps be
inferred given other data, but cannot
be observed directly; it is thus a latent
variable. Finally, the discussion in
chapter 6 summarizes the ideas in the
book and outlines some directions for
further research.
Experiments punctuate most sec
tions and the illustrations are very help
ful (one wishes though that the Matlab
plots reproduced better). As the book
spans domains so varied as statistics,
optimization and functional analysis,
each with its own jargon, it is a challenge
for any author to fuse these into a co
herent and precise language. The text
does a good job of it, but readers must
expect to do their part too. Those who
prefer a fully rigorous mathematical pre
sentation will sometimes have to follow
the references. This is especially so in
the background chapters where not
everything is formalized, partly to keep
the book a manageable size.
For the ML researcher, understand
ing MED and its sources of inspiration
is highly recommended, as the method
is at the center of a rapidly expanding
area of research. Chances are that,
even after reading this monograph, one
will see possible ways of expanding the
MED framework to new situations. For
someone wanting to apply ML solu
tions to engineering or science, the
book offers a very flexible one-a so
lution that can be adapted to a variety
of tasks and of problem characteris
tics. And for the reader interested in
mathematical ideas at work in another
field, the topics in this book rate five
stars. The fundamental issues of ma
chine learning are the mathematical
expression of problems that every sci
entist has to deal with. The ideas MED
is based on (like maximum entropy,
exponential models, VC theory [ 1 ]) are
each on its own very powerful statisti
cal and mathematical tools, lying at the
core of most notable successes of ma
chine learning in the last decade. MED
itself is the result of combining them in
an elegant and surprising way.
Overall, the text is as an enthralling
presentation of ideas that move machine
learning these days. Let us remember
that ML is ultimately a field with practi
cal significance, which currently is ex
periencing substantial growth. It's excit
ing to be there, and this book takes you
to the heart of the matter.
Department of Statistics
University of Washington
Seattle, Washington 981 95-4322
USA
e-mail: [email protected]
REFERENCES
[ 1 ] V. Vapnik. Statistical Learning Theory. Wi
ley, 1 998.
[2] Marc Yor. Review of "Weighting the Odds:
A Course in Probability and Statistics" by
David Williams. The Mathematical lntelli
gencer, 25(2):77-78, 2003.
The Pol itics of Large Numbers. A History of Statistical Reasoning by Alain Desrosieres
Translated by Camille Naish
CAMBRIDGE, MA, HARVARD UNIVERSITY PRESS,
PAPERBACK, 2002. 384 PP. US $20.50
ISBN 0-674-00969-X
REVIEWED BY PATTI W. HUNTER
Today, the term statistics carries a
double meaning. On the one hand,
it refers to collections of numerical
facts about (among other things) polit
ical, medical, or agricultural conditions.
On the other, statistics is a scientific dis
cipline, a method of analyzing data. For
most of the nineteenth century and be
fore, only the first meaning had wide
spread use-a statistician was a col
lector of numerical data. Not until the
twentieth century did any statisticians
think of themselves as mathematical
scientists.
How administrative and mathemat
ical statistics came together is the sub
ject of Alain Desrosieres's book, The Politics of Large Numbers. Drawing on
historical research of the last few
decades, Desrosieres examines the in
teraction of these two domains, ana
lyzing their development from the
point of view of the sociology of sci
ence. He is particularly interested in
how the evolution of statistics ( admin
istrative and scientific) sheds light on
the process by which such phenomena
as unemployment, poverty, and infla
tion acquire objective status.
Covering the seventeenth through the
early twentieth centuries, Desrosieres
alternately considers the two faces of
statistics. Three chapters (1, 5, 6) ex
amine the origins and development of
administrative statistics and their role
in the state, comparing Germany, Eng
land, France, and the United States.
Chapters 2, 3, and 4 deal with subjects
more closely connected to the mathe
matical face of statistics: seventeenth
and eighteenth-century probability,
averages, correlation, and regression.
© 2006 Springer Science+ Business Med1a, Inc., Volume 28, Number I , 2006 69
Chapter 7 examines the social conditions in which sampling techniques originated. Chapter 8 considers problems associated with choosing categories into which to classify people and things. Moving into the twentieth century, chapter 9 traces the history of modem econometrics.
The book has a number of interesting and informative sections. Chapter 1 includes a detailed discussion of how the French Revolution shaped people's understanding of which aspects of social and demographic information were important and of what sets of categories most effectively classified that information. The chapter exploring Adolphe Quetelet's "average man" (chapter 3) provides an engaging glimpse into nineteenth-century debates among French medical scientists over the value of data about public health, particularly in the context of efforts to identify the causes of the cholera epidemic of the 1830s. In the second half of chapter 6, Desrosieres traces the professionalization of what became the United States Census Bureau. Here we see the influence of antebellum debates about the productivity of Northem versus Southern states, of disagreements about immigration policy, and of Franklin D. Roosevelt's New Deal.
Readers not familiar with the historical or mathematical details of the topics treated by Desrosieres may find his sociological analysis more accessible after first reading for themselves the sources from which the author draws his material (for example, [ 1 , 2, 3] discuss the history of probability before the twentieth century). In these works, they will find the writings of the historical figures analyzed carefully; they will learn something of the biographies of people who play important roles in Desrosieres's discussion and of the professional and scientific contexts in which they worked. Mathematicians will need no introduction to Blaise Pascal or Simeon Poisson. But what about Wesley Mitchell? The author introduces him only as a "former census statistician" (p. 198), whose work took on new importance during World War I when Woodrow Wilson centralized national data collection.
70 THE MATHEMATICAL INTELLIGENCER
Some 80 pages further in, we learn that Mitchell founded the National Bureau of Economic Research in 1920, but the first discussion of this agency occurs thirty pages later. About Tjalling Koopmans, whose ideas figure prominently in the history of econometrics covered in chapter 9, some readers may find it helpful to be reminded of something more than the fact that he had a degree in physics. From where? What impact did this and his other training have on his economic thought? Desrosieres notes in his introduction that he is addressing readers from diverse cultural backgrounds. He seems to treat the household names of each culture (and sometimes their ideas) as familiar to all.
This lack of attention to readers' diverse knowledge of history would not alone make The Politics of Large
Numbers difficult for scholars new to Desrosieres's interests. Particularly in discussions of the broader questions about the sociological implications of the history of statistics, the exposition relies on specialized vocabulary and rather complicated sentences. Setting out the purpose of the book, Desrosieres explains that he wants to link the technical history of statistics with its social history. "The thread that binds them," he writes, "is the development-through a costly investment
of technical and social forms. This enables us to make disparate things hold together, thus generating things of another order" (p. 9, italics in the original). Desrosieres thus seeks to understand the development of these "forms," as well as how social phenomena like unemployment, and technical phenomena such as correlation managed to achieve objective status. As he puts it,
The amplitude of the investment 'in forms realized in the past is what conditions the solidity, durability, and space of validity of objects thus constructed: this idea is interesting precisely in that it connects the two dimensions, economic and cognitive, of the construction of a system of equivalences. The stability and permanence of the cognitive forms are in relation to the breadth of investment (in a general sense) that produced them. This relationship is of
prime importance in following the creation of a statistical system (p. 1 1 ). The introductory and concluding
chapters are most densely populated with such statements, but they can be found throughout the text. From a comparison with the original 1993 French edition, this style does not seem to be a consequence of the translation.
Desrosieres's exploration of the relationship between statistics and the public sphere raises some intriguing questions about the mutual impact of mathematical ideas and the functions of the state. Readers with an interest in those questions and the willingness to fill in some historical detail and work their way through the exposition may find some thought-provoking answers.
Department of Mathematics
Westmont College
Santa Barbara, CA 931 08 USA
e-mail: [email protected]
REFERENCES
[ I ] Lorraine Daston, Classical Probability in the
Enlightenment, Princeton University Press,
1 988.
[2] Theodore Porter, The Rise of Statistical
Thinking: 1820- 1900, Princeton University
Press, 1 986.
[3] Stephen M. Stigler, The History of Statis
tics: The Measurement of Uncertainty be
fore 1 900, Harvard University Press, 1 986.
Mathematical Circles, vol. I , II, I l l by Howard Eves
WASHINGTON, DC, THE MATHEMATICAL ASSOCIATION
OF AMERICA, 2003. US $98.00. ISBN 0-88385-542-9, 0-
88385-543-7, and 0-88385-544-5
REVIEWED BY STEVEN G. KRANTZ
G ian-Carlo Rota observed that we mathematicians are more likely to
be remembered for our expository work than for our research. (The exceptions are figures like Gauss, Riemann, and Cauchy.) While only a handful of research mathematicians from the 1950s and 1960s are worth even a
mention, the name of Howard Eves (191 1-2004) stands tall. Everyone has heard of Howard Eves. Why? Eves did little research, but he wrote texts and articles on a vast array of subjects, ranging from combinatorial topology to geometry to complex variables to number theory. He is best remembered for his many wonderful expository books. Notable among these is the sixvolume Mathematical Circles collection.
Using Google, I found no fewer than 2400 hits for Howard Eves. He is spoken of in glowing terms-"eminent mathematician and educator Howard Eves." Sounds perhaps like Saunders MacLane or Steve Smale. But it is Howard Eves, because Eves spoke to all of us with authority, with credibility, and with sincerity, in language that we can all understand. What was his secret?
Eves was a considerable authority on mathematical history. That is a rather recondite subject area, and had he limited his publications to math history journals then he would probably languish in well-deserved obscurity. But he chose to share his erudition by way of anecdotes and aphorisms about our collective heroes. He wrote well, he wrote accurately, and he wrote with conviction. Eves's Mathematical Circles volumes constitute one of our collective treasures. The original Prindle, Weber, and Schmidt editions are now
Calculus: The Elements MICHAEL COMEN ETZ
537 pp $46 softcover (981 -02-4904-7)
$82 hardcover (981 -02-4903-9)
Both editions have sewn bindings
out of print, and we are fortunate that the Mathematical Association of America has seen fit to republish the six books in three elegant volumes.
What is so special about these books? These days, with the great vogue in popular mathematical writing, we are beset with volumes about Andrew Wiles, about the Riemann hypothesis, about the history of zero, about chaos and fractals, and about any other quasi-mathematical topic that the public has a hope in hell of understanding. It should be noted that Eves was one of the pioneers of mathematical popularization and of mathematical story-telling. And his books are serious. He does not tell-at least in his first five volumes-of the digestive quirks of Descartes or the romantic peccadilloes of Galois. He really wants to tell us about the mathematical enterprise, about the people who do mathematics and why they do it. Eves's purpose is serious, and his result-his written record-is substantive and valuable. We are fortunate to have these collections of stories.
Among the more charming anecdotes are • The story of why Thales never mar
ried. • Four versions of the death of Archi
medes. We have all heard the story of how Archimedes told one of Marcellus's troops to get out of his light-the soldier was disturbing his
circles. The irate soldier ran the great scholar through with his lance. Eves offers at least three other possible versions of the story.
• The story of how Napier identified his servant who was stealing.
• The story of how l'Hopital obtained the rule named after him from Johann Bernoulli.
• A determination of who were the second and third most prolific mathematicians in history (after Leonhard Euler).
• The story of how the Indiana legislature passed a law to declare it possible to square the circle (this may be the first and primary source for the story).
• The story of how Walter Koppelman (in 1970) at the University of Pennsylvania was shot and killed by his graduate student Robert H. Cantor.
Thus we see a range of events, from the historical to the current. The stories are told with a compelling accuracy and authority, rendered in concise and lively prose. Reading these stories is like eating Fritos: you cannot stop with just one.
There is always a danger with sequels: You have told all your best stuff in the first volume. When your public or your publisher comes to you with demands for more, then you must cook something up. And it may not be up to the standard of the "stories of a lifetime" that you set forth in Volume I.
A CALCULUS BOOK WORTH READING • Clear narrative style • Thorough explanations and accurate proofs • Physical interpretations and appl ications
"Unlike any other calculus book I have seen . . . Meticulously written for the intelligent person who wants to understand the subject. . . Not only more intuitive in its approach
to calculus, but also more logically rigorous in its discussion of the theoretical side than is usual. . . This style of explanation is well chosen to guide the serious
beginner . . . A course based on it would in my opinion definitely have a much greater chance of producing students who understand the structure, uses, and arguments of calculus, than is usually the case . . . Many recent and popular works on the topic will appear intellectually sterile after exposure to this one." -Roy Smith, Professor of
Mathematics, University of Georgia (complete review at publ isher's website)
"One has the feeling that it is a work by a mathematician still in close touch with physics . . . The author succeeds well in giving an excel lent i ntuitive introduction while
ultimately maintaining a healthy respect for rigor." -Zentralblatt MA TH (online)
A selection of the Scientific American Book Club World Scientific Publishing Company
http://www. worldscientific.com 1 -800-227-7562
© 2006 Spnnger Sc1ence+Business Media, Inc., Volume 28, Number 1, 2006 71
Howard Eves stood up pretty well to
this challenge. But around Volume 5 one can perceive some dissipation.
Whereas Volume 1 contained stories
like the invention of the slide rule and
the mathematical treasures of the
Rhind papyrus, Volume 5 contains trite
stories about the addition of vectors
and the formulation of IOU notes. Vol
ume 6 degenerates to the boringly
anecdotal:
A student once threw a book at a
mathematics instructor.
"I wouldn't have done that," re
marked another student.
"Why not?" asked the culprit.
"Because that's no way to treat a
book," was the reply.
"Must we first take all these prelim
inary courses?" asked a mathemat
ics student.
"There's only one endeavor in which
one can start at the top, and that's dig
ging a hole," replied the instructor.
The trouble with humor is that it can
rapidly become dated. I imagine that
someone thought these last two stories
were amusing at some time or another,
but they seem pretty pointless at the
moment.
Still and all, Howard Eves's magnum opus is a noble one, and an important
part of our literature. It is arguably the
first source for many important and
popular mathematical stories. A great
many of us cut our teeth on these books,
and we re-read them with pleasure.
Eves has a great spirit, and a great sense
of mathematical culture. He makes us
feel good about ourselves and what we
do. He shows us as erudite and human
and charming all at the same time. His
stories are never mean-spirited or criti
cal. They show mathematicians for
what they are, in the strongest and most
positive sense of the word. They make
us feel good about our profession and
our enterprise. They leave us thirsting
for more.
When I read modem inept volumes
about the history of 7T and the proof of
Fermat's last theorem and the lore of
wavelets-mostly written by people
who do not know what they are talk
ing about, just anxious for a quick
buck-I wish that Howard Eves were
still writing. I wish that mathematical
72 THE MATHEMATICAL INTELLIGENCER
exposition and popularization were ex
ecuted by those who are qualified to do
it. Of course the cognoscenti are all too
busy proving theorems and going to
conferences. You will not catch them
writing books like Mathematical Circles. And we are all the poorer for it.
Howard Eves was a virtuoso at his
craft. He knew exactly what he knew,
and he knew his limitations. He wrote
about things that he had mastered and
understood. He set a marvelous exam
ple, to which we could all aspire. And
I hope that some of us will.
Department of Mathematics
Washington University in St. Louis
St. Louis, MO 631 30
e-mail: [email protected]
http://www.math.wustl.edu/�sk
Encyclopedia of Genera l Topology edited by K. P. Hart,
Jun-iti Nagata, and J. E. Vaughan ----- · --------- -- ---- . -----
AMSTERDAM, ELSEVIER, 2004. 526 PP € 1 45, ISBN 0-444-50355-2.
REVIEWED BY HANS-PETER A. KUNZI
As a student you may have been
taught that normality is neither
hereditary nor finitely productive.
Later on in your career you may have
encountered intricate problems in var
ious contexts that could be reduced to
seemingly plain topological questions
which, nevertheless, you could not im
mediately solve. For instance, in my
case, I could not find answers to some
questions about the behaviour of nor
mality in box products, or to the prob
lem of whether each compact topology
is coarser than a compact topology in
which all compact subsets are closed.
It is difficult to be a mathematician
and not use some basic concepts and
methods from General Topology every
now and then. Over the past decades a
few polished techniques and some
fundamental basic terminology of this
field have become so well known and
efficient that they now belong to the
folklore of today's university mathe
matics. Less known to the mathemati-
cal community are many modem-and
often much more sophisticated and
complex-developments that thus far
have not had many applications be
yond their area of origin.
The encyclopedia under review tries
to cover the basic as well as many of
the more specialized and new develop
ments in the field of topology. It ad
dresses both established mathemati
cians and students, independent of their
area of specialization, and it is designed
to lead them quickly to the terminology
and results that might be useful to their
own investigations in other areas.
It explains terms like "Hedgehog",
"Weak P-Point", "Cliquish Function",
"Talagrand Compactness", "Thick Cov
ers", and "Resolutions". Similarly, it
discusses concepts like "the Sous
lin Hypothesis", "Hit-and-Miss Topolo
gies", "the Normal Moore Space Con
jecture", "the Blumberg Property", "the
Class MOBI", or "Bing's Example G". It
also deals with difficult open problems:
Is the class of meta-Lindelof spaces
preserved under perfect maps, or is
there a Dowker space with a u-disjoint
base? Finally, a wealth of known re
sults and methods are treated: What
is the Dugundji Extension Theorem?
When is the completion of a topologi
cal ring a field? Does the Sorgenfrey
line have a connected Hausdorff com
pactification? How can one prove with
the method of elementary submodels
that the cardinality of a first-countable
compact Hausdorff space is at most
that of the continuum?
The encyclopedia includes about 120 articles contributed by a similar number
of topologists from all over the world.
Mostly written by experts in the spe
cialized fields, these articles outline, in
a short sequence of definitions and re
sults, many basic aspects of the treated
topics. The average length of a contri
bution is about four pages. Very few
proofs are given. The articles are col
lected under ten headings that imitate
Section 54 of the 2000 Mathematics Sub
ject Classification as used by Mathe
matical Reviews and Zentralblatt MATH.
The key words listed below will give
the reader an idea about the contents of
the work. They roughly follow the titles
of the articles as they are listed in the
table of contents of the book. Some
readers might want to skip the list below. I decided to include it in spite of a frowning referee, because it gives the reader a thorough and comprehensive first impression of the book In light of the high concentration of the presented results in the volume, the full flavour of the book cannot be grasped by merely glancing through a few examples of theorems and techniques, as they are discussed in the present review.
GENERALITIES: Topological Spaces, Modified Open and Closed Sets, Cardinal Functions, Convergence, Several Topologies on One Set (Minimal and Maximal Topologies).
BASIC CONSTRUCTIONS: Subspaces, Relative Properties, Product (Quotient, Adjunction, and Cleavable) Spaces, Hyperspaces, Inverse and Direct Systems, Covering Properties, Locally (P)- and Rim(P)Spaces, Categorical Topology, and Special Spaces.
MAPS AND GENERAL TYPES OF SPACES DEFINED BY MAPS: Continuous (Open, Closed, Perfect, and Cell-Like) Maps, Extensions of Maps, Topological Embeddings (Universal Spaces), Continuous Selections, Multivalued Functions, The Baire Category Theorem, Absolute Retracts, Extensors, Generalized Continuities, Spaces of Functions in Pointwise Convergence, Radon-Nikod:Ym (Corson, Rosenthal, and Eberlein) Compacta, Topological Entropy, and Function Spaces.
FAIRLY GENERAL PROPERTIES: Separation Axioms, Frechet (Sequential, and Pseudoradial) Spaces, Compactness (Local Compactness, Sigma-Compactness, Countable Compactness, and Pseudocompactness ), The LindelOf Property, Realcompactness, k-Spaces, Dyadic Compacta, Paracompact Spaces (Generalizations and Countable Variants), Extensions of Topological Spaces, Remainders, The Cech-Stone Compactification (in Particular of N and �), Wallman-Shanin Compactification, H-Closed Spaces, Connectedness, Connectifications, and Special Constructions.
SPACES WITH RICHER STRUCTURES: Metric Spaces, Metrization, Special Metrics, Completeness,
Baire Spaces, Uniform and QuasiUniform Spaces, Proximity Spaces, Generalized Metric Spaces, Monotone Normality, Probabilistic Metric Spaces, and Approach Spaces.
SPECIAL PROPERTIES: Continuum Theory, Dimension Theory (General, and of Metrizable Spaces), Infinite Dimension, Dimension Zero, Linearly Ordered and Generalized Ordered Spaces, Unicoherence and Multicoherence, Topological Characterizations of Spaces, and Higher-Dimensional Local Connectedness.
SPECIAL SPACES: Extremally Disconnected (Scattered, and Dowker) Spaces.
CONNECTIONS WITH OTHER STRUCTURES: Topological Groups (Rings, Division Rings, Fields, and Lattices), Free Topological Groups, Homogeneous Spaces, Transformation Groups and Semigroups, Topological Discrete Dynamical Systems, Fixed Point Theorems, and Topological Representations of Algebraic Systems.
INFLUENCES OF OTHER FIELDS: Descriptive Set Theory, Consistency Results in Topology (Quotable Principles, Forcing, and Large Cardinals), Digital Topology, Computer Science, Non Standard Topology, Topological Games, and Fuzzy Topological Spaces.
CONNECTIONS WITH OTHER FIELDS: Banach Spaces, Measure Theory, Polyhedra and Complexes, Homology, Homotopy, Shape Theory, Manifolds, and Infinite-Dimensional Topology. Throughout the book it is assumed
that the reader has some basic knowledge of set theory, algebra, and analysis.
It is always easy to criticize various shortcomings of a book of this kind: Without doubt, each expert in the area will find some subject that in his or her opinion is lacking or is dealt with insufficiently. Similarly, it is easy to feel that some topics treated are of minor importance for the field and could have been less stressed or even completely neglected. Thus some readers of this encyclopedia might miss sections on Constructive Topology, Topological Ordered Spaces, or Frame Theory,
while others might wonder whether the concept of Cleavability or the theory of Approach Spaces are already so well established that they deserve their own sections in this volume.
Also, in light of the large number of authors, the nature of the articles is uneven in many respects, even after some unifying work was done by the editors. In particular the articles about the more specialized topics often and unavoidably have to assume a certain familiarity of the reader with basic concepts. This familiarity cannot be gained by reading a few introductory articles in the encyclopedia. Thus while the encyclopedia will certainly look impressive on the shelves of a library or on the desk of a mathematician, the question remains whether it is of practical use.
As can be guessed from the short description above, the book will be very helpful to those who want to get a first overview of specific areas in the field and are looking for some references. However the work surely cannot replace the usual text books, monographs, or original research papers. In some sense, the value of encyclopedias in mathematics is quite limited: Essentially they can only provide quick orientation for informed readers. But they can hardly be relied on to teach complete novices, and they are generally too superficial for the working specialist.
Those who want to study some of the sketched theories in any depth will necessarily have to learn more from the references at the end of each article, and from a general list of basic references that is used throughout the book Persons who are simply interested in the solution to an isolated problem, for instance, need to know at once the basic facts of "the theory of bomological convergences", might prefer finding the latest literature dealing with their question with the help of the Internet. Indeed, the book will be most useful to those who already know a lot about topology and now still want to deepen their understanding of various areas closely related to their own field of interest.
Certainly some discussions in topology seminars could be based on parts
© 2006 Springer Science+ Business Media, Inc., Volume 28, Number 1 , 2006 73
of the more elementary articles. The students would be asked to fill in the missing arguments and appropriately expand the presentation of the material by consulting the pertinent literature.
Nevertheless the Encyclopedia of General Topology is a remarkable book, and one to which the editors have contributed a huge amount of work Such efforts are important in a time when the value of specialized original research is often overvalued at the cost of a systematization of the natural historic developments of mathematical knowledge.
The book represents a significant attempt to classify and order much of the work done in general topology and related areas in recent decades. It will inform future generations of mathematicians about the research already conducted and may thus avoid unnecessary duplications and subsequent disappointments. Without such books, complex theories cannot develop properly, and even excellent mathematical concepts and results are soon forgotten and do not survive their discoverers or inventors.
Let us finally mention that the historical background of many of the theories presented is discussed in some detail in the three volumes thus far published by C. E. Aull and R. Lowen ( eds.) under the title "Handbook of the History of General Topology", Kluwer Academic Publishers, 1997-2001.
Department of Mathematics and Applied Mathematics
University of Cape Town
Rondebosch 7701
South Africa
e-mail: [email protected]
74 THE MATHEMATICAL INTELLIGENCER
SUBSCRIPTION INFORMATION: The Mathematical Intelligencer is pub
lished by Springer Science+ Business Media, Inc. winter, spring, summer and
fall. Volume 28 ( 4 issues) will be published in 2006. ISSN: 0343-6993 COPYRIGHT: Submission of a manuscript implies: that the work described
has not been published before (except in the form of an abstract or as part
of a published lecture, review or thesis); that it is not under consideration
for publication elsewhere; that its publication has been approved by all coau
thors, if any, as well astacitly or explicitly6by the responsible authorities at
the institution where the work was carried out. Transfer of copyright to
Springer becomes effective if and when the article is accepted for publica
tion. The copyright covers the exclusive right (for U.S. government employ
ees: to the extent transferable) to reproduce and distribute the article, in
cluding reprints, translations, photographic reproductions, microform,
electronic form (offline, online) or other reproductions of similar nature.
All articles published in this journal are protected by copyright, which cov
ers the exclusive rights to reproduce and distribute the article (e.g., as
reprints), as well as all translation rights. No material published in this jour
nal may be reproduced photographically or stored on microfilm, in electronic
data bases, video disks, etc., without first obtaining written permission from
Springer. The use of general descriptive names, trade names, trademarks,
etc., in this publication, even if not specifically identified, does not imply that
these names are not protected by the relevant laws and regulations.
An author may self-archive an author-created version of his/her article on
his/her own website and his/her institution's repository, including his/her fi
nal version; however he/she may not use the publisher's PDF version which
is posted on www.springerlink.com. Furthermore, the author may only post
his/her version provided acknowledgement is given to the original source of
publication and a link is inserted to the published article on Springer's web
site. The link must be accompanied by the following text: The original pub
lication is available at www.springerlink.com. Please use the appropriate
URL and/or DOl for the article (go to Linking Options in the article, then to
OpenURL and use the link with the DOl-). Articles disseminated via
www.springerlink.com are indexed, abstracted and referenced by many ab
stracting and information services, bibliographic networks, subscription
agencies, library networks, and consortia.
While the advice and information in this journal is believed to be true and
accurate at the date of its publication, neither the authors, the editors, nor
the publisher can accept any legal responsibility for any errors or omissions
that may be made. The publisher makes no warranty, express or implied,
with respect to the material contained herein. Springer publishes advertise
ments in this journal in reliance upon the responsibility of the advertiser to
comply with all legal requirements relating to the marketing and sale of prod
ucts or services advertised. Springer and the editors are not responsible for
claims made in the advertisements published in the journal. The appearance
of advertisements in Springer publications does not constitute endorsement,
implied or intended, of the product advertised or the claims made for it by
the advertiser.
Special regulations for photocopies in the USA. Photocopies may be made
for personal or in-house use beyond the limitations stipulated under Section
107 or 108 of U.S. Copyright Law, provided a fee is paid. All fees should be
paid to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers,
MA 01923, USA ( WEB: http://www.copyright.com; FAX: 978-646-8600; TEL:
978-750-8400), stating the ISSN 0343-6993, the volume, and the first and last
page numbers of each article copied. The copyright owner's consent does
not include copying for general distribution, promotion, new works or re
sale. In these cases, specific written permission must first be obtained from
the publisher.
The Ca nada Institute for Scientific and Technical Inforrnation (CISTI)
provides a comprehensive, world-wide document delivery service for all
Springer journals. For more information, or to place an order for a copyright
cleared Springer document, please contact Client Assistant, Document De
livery, CISTI, Ottawa KIA OS2, Canada. TEL: 613-993-9251; FAX: 613-952-8243; E-MAIL: [email protected]. You may also contact INFOTRIEVE (WEB:
h(tp://www.infotrieve.com/; E-MAIL: orders @infotrieve.com; fax: 310-208-5971; TEL: 800-422-4633), UnCover: http://www.ingenta.com; UMI: http://www.
umi.cornl; lSI: http://www.isinet.com
RESPONSIBLE FOR ADVERTISEMENTS: Printed in the United States on acid-free paper.
© 2006 Springer Science+ Business Media, Inc.
Periodicals postage paid at New York, New York and additional mailing
offices. POSTMASTER: Send address changes to: The Mathema-tical Intelli
gencer, Springer, 233 Spring Street, New York, NY 10013, USA.
kj@ij,j.iq.U.I§i R o b i n W i l s o n I
The Ph i lamath' s Alphabet-L
Lagrange: Joseph-Louis Lagrange (1736-1813) wrote the first 'theory of functions', using the idea of a power series to make the calculus more rigorous, and his mechanics text Mechani
que analytique was highly influential. In number theory he proved that every positive integer can be written as the sum of four perfect squares. Laplace: Pierre-Simon Laplace (1749-1827) wrote a fundamental text on the analytical theory of probability and is also remembered for the
Lagrange
Please send all submissions to
the Stamp Corner Editor,
Robin Wilson, Faculty of Mathematics,
The Open University, Milton Keynes,
MK7 6AA, England
e-mail: r.j .wilson@open .ac.uk
'Laplace transform' of a function. His monumental five-volume work on celestial mechanics, Traite de mechanique celeste, earned him the title of 'the Newton of France'. Leibniz: Although Newton could claim priority for the calculus, Gottfried Wilhelm Leibniz (1646-1716), who developed it independently, was the first to publish it. His notation, including dyldx and the integral sign, was more versatile than Newton's and is still used. Leibniz's calculus was different from Newton's, being based on sums and differences rather than velocity and motion. Liu Hui: An ancient Chinese work, the Jiuzhang suanshu (Nine chapters on the mathematical art), contains the calculation of areas and volumes, the evaluation of roots, and the systematic solution of simultaneous equations. Around 260 AD, while revising the Jiuzhang suanshu, Liu Hui calculated the
Laplace
areas of regular polygons with 96 and 192 sides and deduced that 7T lies between 3.1410 and 3. 1427. Logarithmic spiral: The Fibonacci sequence 1, 1, 2, 3, 5, 8, 13, . . . occurs throughout nature, and the ratios of successive terms 1/1, 2/1, 3/z, 5/3, . . . tend to the 'golden ratio' 1/2(1 + Y5) =
1.618. . . . A 'golden rectangle' with sides in this ratio has the prope1ty that the removal of a square from one end leaves another golden rectangle; this process is shown on a Swiss stamp, which also features the closely related logarithmic spiral, found on snail shells and ammonites. Lyapunov: The Russian mathematician Aleksandr Lyapunov (1857-1918) was much influenced by Chebyshev. He worked on the stability of rotating liquids and the theory of probability. In 1918 his wife died of tuberculosis; Lyapunov shot himself the same day and died shortly after.
100
leibniz
t·<:� . . 'HELVETIA If '· ! l / I
Liu Hui
S;' , �- .:· . // I �\ I !a ' \ . / - ! . ' /><: �l �· _ 80
Logarithmic spiral
Lyapunov
76 THE MATHEMATICAL INTELLIGENCER © 2006 Springer Science+ Business Media. Inc.