The Mathematical Intelligencer volume 26 issue 1

'·'·"·"·'·'

Thoughts on the Riemann Hypothesis G. J. Chaitin

The Opinion column offers

mathematicians the opportunity to

write about any issue of interest to

the international mathematical

community. Disagreement and

controversy are welcome. The views

and opinions expressed here, however,

are exclusively those of the author,

and neither the publisher nor the

editor-in-chief endorses or accepts

responsibility for them. An Opinion

should be submitted to the editor-in

chief, Chandler Davis.

The simultaneous appearance in May 2003 of four books on the Rie

mann hypothesis (RH) provoked these reflections. I briefly discuss whether the RH should be added as a new axiom, and whether a proof of the RH might involve the notion of randomness.

New Pragmatically Justified

Mathematical Axioms that Are

Not at All Self-evident

A pragmatically justified principle is one that is justified by its many important consequences-which is precisely the opposite of normal mathematical practice.1 However, this is standard operating procedure in physics.

Are there mathematical propositions for which there is a considerable amount of computational evidence, evidence that is so persuasive that a physicist would regard them as experimentally verified? And are these propositions fruitful? Do they yield many other significant results?

Yes, I think so. At present, the two best candidates2 for useful new axioms of the kind that GOdel and I propose [ 1) that are justified pragmatically as in physics are:

• the P =I= NP hypothesis in theoretical computer science that conjectures that many problems require an exponential amount of work to resolve, and

• the Riemann hypothesis concerning the location of the complex zeroes

of the Riemann zeta function

?cs) =I_!_= II -1 -. n ns p 1 - _l_

pS

(Here n ranges over positive integers and p ranges over the primes. )3 Knowing the zeroes of the zeta function, i.e., the values of s for which ?(s) = 0,

tells us a lot about the smoothness of the distribution of prime numbers, as is explained in these four books:

• Marcus du Sautoy, The Music of the Primes, Harper Collins, 2003.

• John Derbyshire, Prime Obsession, Joseph Henry Press, 2003.

• Karl Sabbagh, The Riemann Hypothesis, Farrar, Strauss and Giroux, 2003.

• Julian Havil, Gamma, Princeton University Press, 2003.4

The Riemann zeta function is like my n number: it captures a lot of information about the primes in one tidy package. n is a single real number that contains a lot of information about the halting problem. 5 And the RH is useful because it contains a lot of number -theoretic information: many number-theoretic results follow from it.

Of the authors of the above four books on the RH, the one who takes Godel most seriously is du Sautoy, who has an entire chapter on Godel and Turing in his book In that chapter on p. 181, du Sautoy raises the issue of whether the RH might require new ax-

'However, new mathematical concepts such as v'=1 and Turing's definition of computability certainly are

judged by their fruitfu lness-Fran(:oise Chaitin-Chatelin, private communication.

2Yet another class of pragmatically justified axioms are the large cardinal axioms and the axiom of determi

nancy used in set theory, as discussed in Mary Tiles, The Philosophy of Set Theory, Chapters 8 and 9. For

the latest developments, see Hugh Woodin, "The continuum hypothesis," AMS Notices 48 (2001 ), 567-576,

681 -690.

3\fou start with this formula and then you get the full zeta function by analytic continuation.

4Supposedly Havil's book is on Euler's constant y, not the RH, but ignore that. Sections 1 5.6, 1 6.8, and 1 6. 1 3

of his book are particularly relevant t o this paper.

5!1 = 'lp ha�s 2-iol is the halting probability of a suitably chosen universal Turing machine. !1 is "incompress

ible" or "algorithmically random." Given the first N bits of the base-two expansion of !1, one can determine

whether each binary program p of size io1 ,; N halts. This information cannot be packaged more concisely. See

[2], Sections 2.5 through 2.1 1 .

4 THE MATHEMATICAL INTELLIGENCER © 2004 SPRINGER-VERLAG NEW YORK

ioms. On p. 182 he quotes Godel,6 who specifically men

tions that this might be the case for the RH. And on p. 202

of that chapter, du Sautoy points out that if the RH is un

decidable this implies that it's true, because if the RH were

false it would be easy to confirm that a particular zero of

the zeta function is in the wrong place.

Later in his book, on pp. 256-257, du Sautoy again

touches on the issue of whether the RH might require a

new axiom. He relates how Hugh Montgomery sought re

assurance from Godel that a famous number-theoretic con

jecture-it was the twin prime conjecture, which asserts

that there are infinitely many pairs p, p + 2 that are both

prime-does not require new axioms. Godel, however, was

not sure. In du Sautoy's words, sometimes one needs "a

new foundation stone to extend the base of the edifice" of

mathematics, and this might conceivably be the case both

for the twin prime conjecture and for the RH.

On the other hand, on pp. 128-131 du Sautoy tells the

story of the Skewes number, an enormous number

wwlo34

that turned up in a proof that an important conjecture must

fail for extremely large cases. The conjecture in question

was Gauss's conjecture that the logarithmic integral

Li(x) = Jx du 2 ln u

is always greater than the number 1r (x) of primes less than

or equal to x. This was verified by direct computation for

all x up to very large values. It was then refuted by Little

wood without exhibiting a counter-example, and finally by

Skewes with his enormous upper bound on a counter

example. This raised the horrendous possibility that even

though Gauss's conjecture is wrong, we might never ever see a specific counter-example. In other words, we might

never ever know a specific value of x for which Li(x) is less

than 1r(x). This would seem to pull the rug out from under

all mathematical experimentation and computational evi

dence! However, I don't believe that it actually does.

The traditional view held by most mathematicians is that

these two assertions, P =I= NP and the RH, cannot be taken

as new axioms, and cannot require new axioms, we simply

must work much harder to prove them. According to the

received view, we're not clever enough, we haven't come

up with the right approach yet. This is very much the cur

rent consensus. However, this majority view completely ig-

no res 7 the incompleteness phenomenon discovered by

Godel and Turing, and extended by my own work [2] on in

formation-theoretic incompleteness. What if there is no

proof?

In fact, new axioms can never be proved; if they can,

they're theorems, not axioms. So they must either be justi

fied by direct, primordial mathematical intuition, or pragmatically, because of their rich and important consequences,

as is done in physics. And in line with du Sautoy's observa

tion, one cannot demand a proof that the RH is undecid

able before being willing to add it as a new axiom, because

such a proof would in fact yield the immediate corollary

that the RH is true. So proving that the RH is undecidable

is no easier than proving the RH, and the need to add the

RH as a new axiom must remain a matter of faith. The

mathematical community will never be convinced. 8 Someone recently asked me, "What's wrong with calling

the RH a hypothesis? Why does it have to be called an ax

iom? What do you gain by doing that?" Yes, but that's be

side the point; that's not the real issue. The real question

is, Where does new mathematical knowledge come from?

By "new knowledge" I mean something that cannot be

deduced from our previous knowledge-from what we al

ready know.

As I have been insinuating, I believe that the answer to

this fundamental question is that new mathematical knowl

edge comes from these three sources:

a. mathematical intuition and imagination ( \!=]\ b. conjectures based on computational evidence (explains

calculations), and

c. principles with pragmatic justification, i.e., rich in con-

sequences (explains other theorems). 9

And items (b) and (c) are much like physics, if you replace

"computational evidence" by "experimental evidence." In

other words, our computations are our experiments; the

empirical basis of science is in the lab, the empirical basis

of math is in the computer.

Yes, I agree, mathematics and physics are different, but

perhaps they are not as different as most people think, per

haps it's a continuum of possibilities. At one end, rigorous

proofs, at the other end, heuristic plausibility arguments,

with absolute certainty as an unattainable limit point.

I've been publishing papers defending this thesis for

more than a quarter of a century, 10 but few are convinced

by my arguments. So in a recent paper [ 1] I've tried a new

6Unfortunately du Sautoy does not identify the source of his Gbdel quote. I have been unable to find it in Gbdel's Collected Works.

7 As du Sautoy puts it, p. 1 8 1 , "mathematicians consoled themselves with the belief that any1hing that is really important should be provable, that it is only tortuous

statements with no valuable mathematical content that will end up being one of Gbdel's unprovable statements."

8The situation with respect to P * NP may be different. In a paper "Consequences of an exotic definition for P = NP," Applied Mathematics and Computation 145 (2003), pp. 655-665, N. C. A. da Costa and F. A. Doria show that if ZFC (Zermelo-Fraenkel set theory+ the axiom of choice) is consistent, then a version of P = NP is consistent with ZFC, so a version of P * NP cannot be demonstrated within ZFC. See also T. Okamoto, R. Kashima, "Resource bounded unprovability of compu

tational lower bounds," http://eprint. iacr.org/2003/1 87 /.

9A possible fourth source of mathematical knowledge is (d) probabilistic or statistical evidence: A mathematical assertion may be deemed to be true because the prob

ability that it's false is immensely small, say <1 0-99999

Here is a practical example of this: The fast primality testing algorithm currently used in Mathematica does not necessarily give the correct answer, but mistakes

are highly unlikely. Algorithms of this sort are called Monte Carlo algorithms.

10See, for example, the introductory remarks ·In my 1 974 J. ACM paper [3].

VOLUME 26, NUMBER 1 , 2004 5

tactic. I use quotes from Leibniz, Einstein, and Godel to

make my case, like a lawyer citing precedents in court ... .

Even though I am touting the Riemann hypothesis as an

excellent new-axiom candidate-whether Godel agrees or

merely thinks that a new axiom might be needed to prove

the RH, I'm not sure-let me briefly wax enthusiastic over

a possible approach to a proof of the RH. Disclaimer. I'm

not an expert on the RH. What I'm about to relate is defi

nitely an outsider's first impression, not an expert opinion.

A Possible Attack on the Riemann Hypothesis?

Here is a concrete approach to the RH, one that uses no

complex numbers. It's a probabilistic approach, and it in

volves the notion of randomness. It's originally due to Stielt

jes, who erroneously claimed to have proved the RH with

a variant of this approach.

The Mobius JL function is about as likely to be + 1 or - 1

(see Derbyshire, Prime Obsession, pp. 322-323).

{ 0 if k2 divides n, k > 1, f.L(n) = b rct·rr · ct· · r ( _ 1 )num er o 1 erent pnme tvtsors o n if n is square-free. The RH is equivalent to the assertion that as k goes from 1

to n, JL(k) is positive as often as negative. More precisely,

the RH is closely related to the assertion that the difference

between

• the number of k from 1 ton for which JL(k) = -1, and

• the number of k from 1 to n for which JL(k) = + 1

is O(Vn\ of the order of square root of n, i.e., is bounded

by a constant times the square root of n. This is roughly

the kind of behavior that one would expect if the sign of

the JL function were chosen at random using independent

tosses of a fair coin. u

This is usually formulated in terms of the Mertens func

tion M(n): 12

n M(n) = .2: p.,(k).

k�l

According to Derbyshire, pp. 249-251,

M(n) = O(Vn) implies the RH, but is actually stronger than the RH. The

RH is equivalent to the assertion that for any E > 0, I

M(n) = O(n2+E).

Could this formula be the door to the RH?!

This probabilistic approach caught my eye while I was

reading this May's crop of RH books.

I have always had an interest in probabilistic methods

in elementary number theory. This was one of the things

that inspired me to come up with my definition of algorithmic randomness and to find algorithmic randomness

in arithmetic [6] in connection with diophantine equations.

However, I doubt that this work on algorithmic random

ness is directly applicable to the RH.

In particular, these two publications greatly interested

me as a child:

• Mark Kac, Statistical Independence in Probability, Analysis and Number Theory, Carus Mathematical

Monographs, vol. 12, Mathematical Association of Amer

ica, 1959.

• George P6lya, "Heuristic reasoning in the theory of num

bers," 1959, reprinted in Gerald W. Alexanderson, The Random Walks of George P6lya, Mathematical Associa

tion of America, 2000.

I think that anyone contemplating a probabilistic attack on

the RH via the JL function should read these two publica

tions. There is also some interesting work on random

sieves, which are probabilistic versions of the sieve of

Eratosthenes:

• D. Hawkins, "Mathematical sieves," Scientific American, December 1958, pp. 105-112.

As P6lya shows in the above paper-originally American Mathematical Monthly 66, pp. 375-384-probabilistic

heuristic reasoning can do rather well with the distribution

of twin primes. By the way, this involves Euler's y constant.

Can a refmement of P6lya's technique shed new light on JL and on the RH? I don't know, but I think that this is an in

teresting possibility.

By the way, P :1: NP also involves randomness, for as

Charles Bennett and John Gill showed in 1981-SJAM Journal on Computing 10, pp. 96-113-with respect (relative)

to a random oracle A, pA :1: NPA with probability one [7].

Further Reading-Four "Subversive" Books

• On experimental mathematics:

Borwein, Bailey, and Girgensohn, Mathematics by Ex

periment, Experimentation in Mathematics, A. K. Peters, 2003.

(See [8]. There is a chapter on zeta functions in volume

two.)

• On a quasi-empirical view of mathematics:

Tymoczko, New Directions in the Philosophy of Mathematics, Princeton University Press, 1998.

• On pragmatically justified new axioms and information

theoretic incompleteness:

Chaitin, From Philosophy to Program Size, Tallinn Cy

bernetics Institute, 2003.

(There is also an electronic version of this book [2].)

And regarding the adverse reaction of the mathematics

community to the ideas in the above books, I think that it

is interesting to recall Godel's difficulties at the Princeton

Institute for Advanced Study, as recounted in:

11 For a more precise idea of what to expect if the sign of the IL function were chosen at random, see the chapter on the law of the iterated logarithm in Feller, An In

troduction to Probability Theory and Its Applications, vol. 1 , Vlll.5 through VII I .?. 12See [4, 5].

6 THE MATHEMATICAL INTELLIGENCER

• John L. Casti, The One True Platonic Heaven, John

Henry Press, 2003.

According to Casti, one of the reasons that it took so long

for Godel's appointment at the lAS to be converted from

temporary to permanent is that some of Godel's colleagues

dismissed his incompleteness theorem. Now, of course,

Godel has become a cultural icon13 and mathematicians

take incompleteness more seriously-but perhaps not seri

ously enough.

Mathematicians shouldn't be cautious lawyers-! much

prefer the bold Eulerian way of doing mathematics. Instead

of endlessly polishing, how about some adventurous pioneer

spirit? Truth can be reached through successive approxi

mations; insistence on instant absolute rigor is sterile

that's what I've learned from incompleteness.14

WEB REFERENCES ( 1 ] Two philosophical applications of algorithmic information theory.

http://www.cs.auckland.ac.nz/CDMTCS/chaitin/dijon.html

[2] From philosophy to program size.

http://www.cs.auckland.ac.nz/CDMTCS/chaitin/ewscs.html

[3] Information-theoretic limitations of formal systems. http://www.

cs.auckland .ac. nz/CDMTCS/chaitin/acm 7 4. pdf

[4] Mertens function. http://mathworld.wolfram.com/

MertensFunction .html

[5] Mertens conjecture. http://mathworld .wolfram.com/

MertensConjecture.html

[6] Randomness in arithmetic. http://www.cs.auckland.ac.nz/

CDMTCS/chaitin/sciamer2. html

(7] Relative to a random oracle A, pA * NPA * co-NPA with proba

bility 1 . http://www.research.ibm.com/people/b/bennetc/

bennettc1 981 497f3f4a.pdf

[8] Experimental mathematics website. http://www.expmath. info

[9] Apostolos Doxiadis home page. http://www.apostolosdoxiadis.com

IBM Research

Yorktown Heights, NY 1 0598

USA

e-mail: [email protected]

131n this connection, I should mention Incompleteness, a play and a theorem by Apostolos Doxiadis, which is a play about Gbdel. For more information, see [9]. 141n this connection, see da Costa and French, Science and Partial Truth, Oxford University Press, 2003.

Solution Kept Secret


EUGENE GUTKIN

The Toeplitz-Hausdorff Theorem Revisited: Relating Linear Algebra and Geometry

Genesis

In the beautiful paper [24] 0. Toeplitz associated with any

complex n X n matrix a compact set in the complex plane.

As his title suggests, he was inspired by a theorem of L. Fe

jer [6] concerning a relationship between planar curves and

Fourier series. Apart from this, the paper [24] is self

contained. Let en be the standard vector space with the

scalar product <u, v>. I will not distinguish between the

n X n matrices and operators on en. Let C be one such. It is determined by its "bilinear" form <u, Cv>. The compact

set that Toeplitz introduces is the image, W = W(C) C e, of the unit sphere in en, under the quadratic map u �

<u, Cu>. He cof\jectures that W(C) is a convex set, and

proves that the outer boundary of W(C) is a convex curve.

A year later F. Hausdorff proved Toeplitz's col\iecture

[12]. The Toeplitz-Hausdorff theorem was born. For several

reasons, it continues to attract the attention of researchers.

Extensions of Toeplitz's setting came up in robust control;

hence the thriving engineering literature on the subject.

See [20, 21, 5]. My own preoccupation with the Toeplitz

Hausdorff theorem has its genesis in a joint project with

electrical engineers [15, 10]. Despite (or because of) the simplicity of the Toeplitz

Hausdorff framework, basic questions in the subject remain

open [14]. For instance, it is not known what domains are

realizable as W(C) for C on en. The present article aspires

to attract attention of the general mathematical readership

to the fascinating interplay of linear algebra, geometry, and

analysis that the papers [24, 12] initiated.

My plan is as follows. I analyze in some detail the original

papers of Toeplitz and Hausdorff. Then, following the view

point of [24], I associate with an arbitrary C a linear pencil

of hermitian operators H( · ) . This allows me to cast the analy

sis into the language of convex geometry: Support lines and

support functions come in. The crucial observation is that the

support function of W(C) is the highest eigenvalue, A(·), of

H( · ) . This brings in both the algebraic geometry and the con

vex geometry. R. Kippenhahn was the first to exploit this ob

servation. In his Dissertation [16] he introduces and develops

this point of view. To illustrate this approach, I immediately

derive rough bounds on the size of W( C) in terms of the spec

tral attributes of C. I also reproduce without proof the much

more sophisticated estimates of Kippenhahn [16]. Then I bring in the differential geometry by calculating

the curvature of the boundary curve aW(C). To show the

usefulness of this viewpoint, I apply it to obtain new bounds

on the size of W( C) in terms of the standard attributes of

C. These estimates, although still very crude, are sharper

than those I got out of the support function. The differen

tial geometry viewpoint turned out to be especially suitable

to study the multidimensional version of W( C), the joint nu

merical range [10].1 I conclude with a brief survey of the

literature and a personal remark

I thank the referees for helpful comments.

Historical Remarks

Toeplitz coined the name "Wertvorrat" for W(C). A literal

English translation is the value supply or the stock of values. Variations of "Wertvorrat" dominate the German liter

ature on the subject. For instance, A. Wintner, during the

Leipzig period of his prolific career, used the expressions

"Wertevorrat" (values supply) and "Wertbereich" (value domain) [26].2

The modern literature intermittently uses field of values

1There are many generalizations of the numerical range of an operator in the modern literature. It would take several pages just to give the relevant definitions. The con

cept of the joint numerical range and the awareness that it is the natural multi-dimensional extension of the numerical range is already in the founding papers [24, 1 2).

2Wintner emigrated to America shortly after the University of Leipzig refused to award hirn the Habilitation. The book [26) is apparently his Habilitationschrift.


and numerical range. s I don't like either expression. The

former adds one more item to the litany of mathematical "fields"; the latter is plain awkward. The original name is

better in every respect except one: It is German and therefore unacceptable in the English literature. 4 Some proposed alternatives (template, form range, contracted graph) did not fly. I fmd the expression numerical range the lesser of two evils, and I will use it in what follows. 5

Toeplitz proves several propositions relating W( C) and the spectrum of C. For instance, he shows that W(C) contains the spectrum, and if C is a normal operator, then W( C)

is the convex hull of the spectrum. But the centerpiece of [24] is "Satz 8," the convexity of the outer boundary.

The penultimate §5 of [24) offers several informal comments, and points out the difference between convexity of the outer boundary and convexity of the set. Then Toeplitz says: "I will now discuss a generalization of the entire setting, which . . . also shows the difficulties that stem in the general case from the possibility of holes." He goes on to introduce what is now called the joint numerical range of any number q of hermitian operators A1, .. . , Aq. The set in question, W.(A1, . .. , Aq) c !Rq, is the image of the unit sphere in en, under the map u f-i> ( <u, A1u>, . . . <u, Aqu> ). The decomposition C = A1 + iA2 implies W(C) = W.(A1,A2).6 Toeplitz demonstrates that W.(Al, . . . , Aq) is not convex, in general. He concludes: "Whether this can already happen for q = 2 remains possible, athough unlikely."

Toeplitz missed that he actually proved the desideratum! Indeed, to a modem reader, it seems that Toeplitz essentially settled the convexity conjecture. To us, it suffices to prove it for n = 2; for, if <u, Cu> and <v, Cv> belong to W(C), and the numerical range of the restriction of the form C to eu + ev is convex, then the claim holds. And in §5

Toeplitz shows that the numerical range of an operator on e2 is either an elliptic disc, or a segment, or a point-in each case, it is convex! In fact, this is how the ToeplitzHausdorff theorem is proved in modem textbooks [9, 11, 14].1 Amazingly, in the 80-some years since [24), nobody, including Hausdorff, noticed that the Toeplitz-Hausdorff theorem is implicitly proved in [24).

In the 3-page-long, focused, beautiful paper [ 12), Hausdorff proves Toeplitz's conjecture. On the one hand, he proves it from scratch, without using Satz 8 of [24). On the other, he goes just a step further than Toeplitz to show that the intersection of W(C) with any straight line is the image of a connected subset of the unit sphere under a continuous mapping, and hence is connected. 8 In a one-sentence remark Hausdorff points out that his results and the Toeplitz argument combine to prove the convexity of the

outer boundary of the surface W.(Al, A2, As) for any triple

of hermitian operators.

A natural generalization of the Toeplitz-Hausdorff theorem would have been the convexity of W.(A1, . . . , Aq) for all hermitian operators on any en. Although this claim is ''very false" [ 1 1), W.(A1, A2, As) for any triple A1. A2, As on en is convex if n 2:: 3. Remarkably, it was established 60 years after the papers [24, 12]! There are several proofs of this in the literature [10], and some are based on the Hausdorff connectedness idea [5] . The convexity claim for W.(Al, A2, As, A 4) for operators on en fails for any n [5]. Although this is unfortunate from the engineering viewpoint [21], there

are nontrivial interpretations of this "phase transition" [10].

But let us return to the subject. How could it be that neither Toeplitz nor Hausdorff realized that [24] contained a proof of the convexity of the numerical range? It is quite likely that Hausdorff overlooked the relevant part of [24] .

However, the Commentary by S. D. Chatterji in Hausdorffs Collected Works [ 13] reveals a curious fact in this respect. The Hausdorff Archives in Greifswald contain two handwritten notes for [ 12), dated September 19 and October 12

of 1918. In one of them Hausdorff works out the numerical range of any two-by-two matrix. He shows, as Toeplitz had already done, that it is a (possibly degenerate) ellipse.

Bringing in the Geometry

My interpretation of the approach of [24] is as follows. Let C be an n X n matrix, and let W(C) be the numerical range. Toeplitz associates with C a linear pencil of hermitian operators H( · ), parametrized by the circle of directions. The highest eigenvalue, A ( · ), of H( · ) is the support function of W(C). I will now explain this in detail.

Let <u, v> denote the standard scalar product on en, linear (resp. antilinear) in the second (resp. first) argument. As usual, I lull = V <u, u>. Let C be an operator on en with the adjoint C*, and let

C =A+ iB: A*= A, B* = B (1)

be the decomposition into hermitian operators. For 0 :S t < 27T set

1 . . H(t) = 2 [e-'tC + e'tC*] = (cos t)A + (sin t) B. (2)

The space of rays (i.e., oriented lines) in !R2 is parametrized by S1 X IR [22]. Namely, the ray r(t, p) has direction t, and the signed distance p from the origin. The notion of support lines is well known [1, 22]. I will associate with any compact set, XC IR2, the family, u(t), 0 :S t < 27T, of its support rays. For any 0 :S t < 27T the set of p E IR such that

3See [9] for historical comments on this terminology. The name "numerical range" is due to M . H. Stone [23].

4The German-English hybrids "eigenvalue, eigenvector" are the fortunate exceptions .. . . I don't know who coined them or how, but I am happy that I don't have to

use the awkward "proper value, proper vector, characteristic number," etc.

51t could have been worse. F. D. Murnaghan refers to W(C) as " ... the region of the complex plane covered by these values under the hypothesis that .. . " [1 8].

6Thus, the patent on the joint numerical range belongs to Toeplitz and not to Hausdorff [5].

7 A proof of the Toeplitz-Hausdorff theorem based on this idea is due to W. F. Donoghue [4). He explicitly calculates the ellipse in question. An elegant calculation of

aW(C) if n :s 3 is due to Murnaghan [1 8). Although he points out that aW(C) is an ellipse when n = 2, Murnaghan is not concerned with the region W(C) itself.

8Hausdorff's elegant argument is limited to finite dimensions, because he diagonalizes hermitian operators. The extension of the Toeplitz-Hausdorff theorem to infinite

dimensions is due to Stone [23). See [1 1 ) for a proof of N. P. Dekker [3) that combines Hausdorff's idea with the reduction to IC2

VOLUME 26, NUMBER 1, 2004 9

<3:>

Figure 1 . Support rays and the eigenvalues.

r( t, p) intersects X is compact; let p( t) be the maximal such p. Then a{t) = r(t,p(t)) is the support ray of X in ilirection t.

The following proposition is essentially Satz 8 of [24].

Proposition 1. Let C = A + iB be an operator on en and let H(t) = (cos t)A + (sin t)B, 0 :o; t < 27T, be the associated pencil of hermitian operators. Let

(3)

be the eigenvalues of H(t), and let Ei(O) c en be the eigenspace9 corresponding to Ai(O).

Let u(t), 0 ::5 t < 27T, be the support rays of W(C). Then the intersection point of a(t + 7T/2) with r(t,O) is A1(t)(cos t, sin t). Using this point as the origin in u(t + 7T/2), identify a{t + 7T/2) with R Then a{t + 7T/2) n W(C) C IRis the convex hull of the spectrum of the form H(t + 7T/2) restricted to E1(t).

Proof The unit circle acts on operators, C � e-iac, and on e, by rotations. The statement is equivariant with respect to these actions. Therefore, it suffices to verify the claims for the direction t = 0. We have H(O) =A, H(7T/2) = B, the ray r(O, 0) is the x-axis, and u( 7T/2) is the vertical ray supporting W from the right. See Figure 1. The points z = x + iy of the numerical range have the form z = <u, Cu>, llu ll = 1. By (1), x = <u, Au>, y = <u, Bu>. Therefore, the projection of Won the horizontal axis is the interval [An(A), A1(A)]. The right extremity of this interval is the intersection point with the ray lT( 7T/2). This proves one claim.

The intersection of lT( 7T/2) with W is given by

z = {<u, Au> + i <u, Bu>: llull = 1, <u, Au>= A1(A)}.

9Another fortunate hybrid!


y

u(i)

1(A)

In view of the above, our subset of IR is formed by <u, Bu>, where u runs through the unit sphere in E1(A). The numerical range of an hermitian operator is the convex hull of its spectrum. This proves the other claim. •

Proposition 1 has several far-reaching consequences. First of all, it implies that the outer boundary aW(C) is convex [24]. Second, it describes the support rays of W(C) via the eigenvalues of the hermitian pencil H( · ) of (2). These support rays determine the convex hull of aW(C). Since W(C) is convex, as we now know, they determine the set W(C) itself. Thus, Proposition 1 yields a description of the numerical range of C in terms of the spectrum of the associated pencil H(·).

Since the publication of [24], many authors have developed this observation in several directions. One of these directions may be called algebra-geometric. Its starting point is the algebraic curve

det(xA + yB + zi) = 0. (4)

This paper exploits another direction, which may be called "proper geometric." It takes off with an immediate corollary of Proposition 1. To formulate it, I will recall the notions of the support function and the width function of a convex set [1, 22]. Let X c lh£2 be convex and compact, and let lT(t), 0 :o; t :o; 27T, be the support rays of X. The distance between the parallel lines lT(t), lT(t + 1r) is the width of X in direction t. The support function is the signed distance of lT( t) to the origin. Denote the support and the width functions by h(t) and w(t), respectively. Then w(t) = h(t) + h(t + 7T).

Corollary 1. Let C be an operator on en, let H(·) be the associated pencil of hermitian operators, and let An(·) :S

· · · ::::; A 1 ( ·) be the eigenvalues of H(-). Then the support and the width of the numerical range of C are

h(t) = A1(t - 1r/2), w(t) = A1(t - 1r/2) - An(t - 1r/2). (5)

Proof. Proposition 1 yields the first claim. The second follows from the first and the identity H( t + 1r) = -H( t). •

Although the Toeplitz paper [24] is the precursor of both geometric directions, it was the work of R. Kippenhahn [ 16] that explicitly gave birth to them. 1 0 From now on I will concentrate on the proper geometric direction, referring the reader to the literature on the algebra-geometric direction. See, for instance [19].11

I will now use Corollary 1 to estimate the size of the numerical range of C in terms of the standard attributes of the operator C. The size of a planar convex compact set X is expressed via its area, diameter, breadth, and perimeter. Let w( ·) be the width of X. The breadth and the diameter of X are the minimum and the maximum of w, respectively. The perimeter and the area of X are also controlled by the width function [ 1]. If X = W( C), then w( ·) is determined by the spectrum of the hermitian pencil H( ·) which, in tum, is determined by the operator C. Among the standard attributes of C are its spectrum a( C) and the operator norm jcj. The number w(C) = maxAwCC)( IAi - A1} is the diameter of the spectrum.

For any a, b E I[ W(aC +b)= aW(C) + b.

Hence the size of the numerical range does not change under the transformations C � C + tl. Denote by Jtn the linear space of operators on en, and let .M� c Mn be the subspace of traceless operators. The function jCio = mintE<C jc + t� is a norm on the quotient space Jtn/{al}. The projection Co = C - tr(C)In I induces a linear isomorphism of Mn/{al) and Ml Note that jcj0::::; jC0j, and for generic C the inequality is strict. The following very rough estimates are essentially contained in [24].

Corollary 2. Let C be an operator on IC", and let W be 'its numerical range. Then

AreaCW) ::::; 4lcl6, Perimeter(W) ::::; 8jCjo;

w(C) :S Diameter(W) :S 2jC!o,

(6)

(7)

Proof For 0::::; t::::; 21r let R(t) be the rectangle formed by the four supporting rays. See Figure 2. Since w(H(t)) = A1(t) - An(t) and jH(t)j = max{ jA1(t)j, jAn(t)j }, we have

Area(W)::::; 4jH(t)l·jH(t + 1r/2)j, Perimeter(W) ::::; 4IH(t)1 + 4jH(t + 7T/2)j.

Using that jH(-)j ::::; lei, and the invariance of the preceding argument under C � C + al, we obtain (6). The upper bound in (7) follows from w(H(t)) ::::; 2jH(t)i :S 2jCj and the invariance principle. The obvious fact that W contains the spectrum of C implies the lower bound. •

The roughness of the estimates in Corollary 2 occurs for two reasons, one geometric and one analytic. The geometric reason is that W is much smaller than the circumscribed rectangles R(t). The analytic one is that the bounds A1(t) -An(t) :S 2jH(t)l ::::; 2jCj are very crude. Using convex geometry and subtle but elementary analysis, Kippenhahn obtained much better estimates [16]. Although the restriction tr C =

0 that [16] imposes, can be removed and the inequalities further improved, I will only state the relevant results of [16]. THEOREM 1. Let c be an operator on en such that tr c = 0, and let W = W(C) be its numerical range. Then

_!_ Vtr2(CC*) - itr C2j2::::; Area(W) ::::; n

(8) 2(n - 1)

Vtr2(CC*) - itr C2j2; n

4 Vn Vtr(CC*)

::::; Perimeter(W)::::; 21r � Vtr(CC*). (9)

The following corollary demonstrates the strength of Theorem 1. Corollary 3. The numerical range of a matrix C has empty interior iff C = a + bH, where H is hermitian.

I leave the proof to the reader, as an exercise. (Hint: Use (8) and the Cauchy-Schwarz inequality.) If C = a + bH, and H is hermitian, then C is normal. Hence, the numerical ranges of non-normal matrices have positive area.

Lest the reader think that the spectral properties of C matter only for the size of W(C), I hasten to add a few comments. Let X c IC be convex and compact. A point z E ax is a corner point12 if X has more than one support line at z. An eigenvalue A of C is normal if there exists an eigenvector v with the eigenvalue A such that Cl'v = Av. The following theorem [16, 4] gives an example of a completely different relation between the spectral characteristics of a matrix and the geometry of its numerical range. 1 3

THEOREM 2. Let c be an operator on en, and let w = W(C) be its numerical range. Then the corner points of W are among the normal eigenvalues of C.

Bringing in the Differential Geometry

An arbitrary convex compact W c I[ is determined by its support function. If the boundary a W is (piecewise) twice differentiable, then W is also determined by the curvature

1ilThe note [1 8] contains a few beautiful remarks about the algebraic geometry of iJW(C), but it does not pursue the matter.

11Complains about the scarcity of citations of Kippenhahn's work.

12Sharp point in the engineering literature. 1 3See [ 15] for a differential-geometric proof.

VOLUME 26, NUMBER 1 , 2004 1 1

y

u(t + 1r

Figure 2. Numerical range enclosed within a rectangle.

u(t

function, x(") 2: 0. As opposed to the support function, the

curvature is intrinsically defined by aw. The radius of curvature p(·) = x-1(·) is sometimes handier to use. Now let

W be the numerical range of a finite-dimensional operator,

C. Let H(·) be the corresponding pencil of hermitian oper

ators. By Corollary 1, the support function of W is the high

est eigenvalue A(·) of H(·). I will now express the radius of

curvature of aW in terms of A(·). A matrix is often called regular if its eigenvalues are

simple.

Definition 1. Let C be an operator on en, and let H(t), 0 :s t :s 27T, be the corresponding pencil of hermitian operators. Then C is Toeplitz regular if for all 0 :s t :s 27T the maximal eigenvalue of H(t) is simple.

THEOREM 3. Let W c 1Ri2 be the numerical range of an operator C on en. Let H(·) be the associated pencil of hermitian operators and let A(·) be the maximal eigenvalue of H( · ).

Suppose that C is Toeplitz regular. Then the junction A(·) is infinitely differentiable, and A + A" > 0. The set W is strictly convex, the boundary aw is twice differentiable, and its radius of curvature satisfies

p(t + 7T/2) = A(t) + A"(t). (10)

Proof Denote by E(t) c en the eigenspace of H(t) corre

sponding to the maximal eigenvalue. Let e E E(O) be a unit

vector. Then there is a unique vector function v(t), 0 :s t :s

27T, such that E(t) = Cv(t), llvCOII = 1, v(O) = e, and <v(t), v'(t)> = 0.14

141n general, v(21T) = {3v(O). The factor {3 has to do with Berry's phase.

1 2 THE MATHEMATICAL INTELLIGENCER

u(t + 37r/2)

By construction

H(t)v(t) = A(t)v(t). (11)

Differentiating this equation twice yields

and

(H' - A')v + (H - A)v' = 0 (12)

(H'- A")v + 2(H' - A')v' + (H - A)v" = 0. (13)

But H satisfies H' = -H. Substituting this into (13),

(A + A")v = 2(H' - A')v' + (H - A)v". (14)

Take the scalar product of (14) with v(t). Equation (12) im

plies

A + A"= 2 <v', (A - H)v'> .

But A is the top eigenvalue of H and v' is perpendicular to

its eigenspace E(A). Hence

A + A"> 0. (15)

Denote by ;£ the ray family (O"(t), 0 :s t < 27T}, where O"(t) has direction t + 7T/2 and intersects r(t, 0) at the point

A(t)(cos t, sin t). The positivity condition (15) implies that

the envelope, A(;£) C C, is a strictly convex curve, with the

parametric equations

x(t) = A(t) cost- A'(t) sin t, y(t) = A(t) sin t + A'(t) cost. (16)

Moreover, A(:£) is twice differentiable, and its radius of cur

vature is given by (10) [22, 1]. Since, by Proposition 1, :£ is

the family of support rays of W, we have A(:£) = aw. • Not every operator C on en is Toeplitz regular. If C is

normal, then W(C) is a polygon, hence it is not strictly con

vex. By Theorem 3, normal matrices are not Toeplitz reg

ular. In fact, by Theorem 2, the non-regularity of W(C) al

ways has to do with a partial normality of C. Fortunately,

there are plenty of Toeplitz regular operators.

Proposition 2. The complement to the set of Toeplitz regular operators in _Mn is contained in a closed hypersurface. Proof. Let �n denote the space of n X n hermitian opera

tors. By (1), _Mn =�nEB i�n. Replacing cost, sin t in (2) by

independent variables, we obtain an algebraic mapping, <p, from _Mn into the algebraic variety Gz(�n) of subspaces in

�n of dimension at most 2. The set of hermitian operators with multiple eigenval

ues is an algebraic variety, Xn C �n, of codimension 3. Therefore, the set of L E G2(�n), such that L n Xn i= 0 is a codimension one subvariety, Yn C G2(�n). Since

<p : _Mn >---7 G2C�n) is surjective, the preimage <p-1(Yn) C _Mn

is a hypersurface. But the complement of the set ofToeplitz

regular operators belongs to <p -1(Yn). • The following is immediate from Proposition 2.

Corollary 4. The set of Toeplitz regular operators on en is open and dense.

I will now use Theorem 3 to sharpen the bounds on the

size of the numerical range. Moreover, I will do it for

bounded operators on any Hilbert space �. Recall that if

dim � = oo and C is a bounded operator on it, the numeri

cal range W(C) c Cis bounded and convex [23], but not

necessarily closed. The operator norm C] and the reduced

operator norm ICio = mintEdC + til have the same basic

properties as in the case dim � < oo.

THEOREM 4. Let C be a bounded operator on any Hilbert space, and let W be its numerical range. Then

Perimeter(W) ::5 2'7TIC]o, Area(W) ::5 '7TIC]5. (17) Proof Let � be the Hilbert space where C acts. Assume

first that dim � < oo. Then � = en, and I will use the pre

ceding material. Let H( t), 0 < t < 2'7T, be the corresponding

pencil of hermitian operators and let A(·) be the highest

eigenvalue of H(·). Suppose first that C is Toeplitz regular.

From Theorem 3 and standard differential geometry [1, 22] we have

127T Perimeter(W) = (A + A")(t)dt

and

0

= f7T A(t)dt::::; f7T IA(t)ldt (18) 0 0

Area(W) = _!_ J27T (A2- A'2)(t)dt::::; _!_ J2A2(t)dt. (19) 2 0 2 0

Bounding above lAO I as in the proof of Corollary 2, we ob

tain the estimates

(20)

Both sides of these inequalities depend continuously on the

operator. Because the set of Toeplitz regular operators is

dense, (20) holds for arbitrary C on en. Using the invari

ance under C >---7 C + tl, as in the proof of Corollary 2, we

replace the norm in (20) by the reduced norm. This proves

our claim in the finite-dimensional case.

Now let dim� = oo, and let C be a bounded operator on

�. Let ie C � be a finite-dimensional subspace, let C be

the restriction of the form C to ie, and let W be the nu

merical range of C. Then W and C satisfy the bounds of

(20). Using that IC:I ::::; lei and that

Perimeter(W) = sup Perimeter(W), Area(W) = sup Area(W), �c� �c�

we conclude that (20) holds for C. Now use again the in

variance under C >---7 C + ti. •

Concluding Remarks

Although the bounds of Theorem 4 improve those of Corol

lary 2 by the factor of 411T, they are still very rough. The

same or better bounds on the size of the numerical range

W( C) can be obtained using elementary geometry. Let X C C be compact. Denote by r(X) the numerical radius of X, i.e., the radius of the smallest disc D(X), centered at (0,0) and containing X. Toeplitz proved in [24] that

19_::::; r(W(C)) ::::; lei. 2

(21)

Since W(C) c D(W(C)), (21) implies (20) and the inequal

ity Diameter(W) ::::; 2ICI. Invoking the invariance principle,

we obtain (17) and the upper bound of (7).

Set W1(C) = {zl- z2 : Z1, Zz E W(C)}. The set W1(C) C (: is symmetric about the origin and convex and satisfies [25]

W1(C) = { <u, Cv> + <v, Cu>: llull = llvll = 1,

This implies

Diameter(W(C)) =

<u, v> = 0}. (22)

max I <u, Cv> + <v, Cu> 1. (23) llull=llvJI= l,<u,v>=O

This in tum yields the bounds

Diameter(W(C))::::;

max 21 <u, Cv> I ::::; 2ICI. (24) llull=llvll= l,<u,v>=O

Invoking the same invariance principle, we obtain from (24) the upper bound of (7). There are other approaches to es

timating the size of W(C). For instance, [2] employs the

Gershgorin disc theorem to obtain quadratic bounds on the

area of W(C) for certain nilpotent matrices.

In view of these results and those of [16), of course, the

main justification of Theorem 3 is not in the bounds on the

size of the numerical range that it yields. The justification

is the elegant formula (10) for the curvature of the bound-

VOLUME 26, NUMBER 1, 2004 1 3

AUTHOR

&UOENE OUTKIN

ary of the numerical range. The estimates ( 17) follow from it by very crude estimates. The formula (10) seems to be novel. My only "precursor" M. Fiedler identified in spectral terms the boundary curvature of numerical range in special cases [7, 8]. There is no immediate relationship between his formulas and (10). I hope that (10) will find other applications to the remarkable subject that grew out of the Toeplitz-Hausdorff theorem.

It goes without saying that geometric considerations pervade the literature on numerical range. Several researchers have used the ideas above for purposes other than estimating the size of W(C). For instance, in [17] (16) helps to uncover new examples of domains satisfying the famous "porism of Poncelet."15

Before stopping, I will give unsolicited advice to the reader. There is a pervasive custom of concentrating on the latest literature while doing research. I am no exception to this rule. However, my experience with the study of numerical range brought me to the conclusion:

It is useful to read the work of "founding fathers"!

15A related way of using the numerical range to construct such examples is pre

sented in [27].


REFERENCES [ 1 ] T. Bonnesen and W. Fenchel, Theorie der konvexen K6rper,

Springer-Verlag, Berlin, 1 97 4.

[2] M.-T. Chien, Y.-H. Lin, On the area of numerical range, Soochow

J. Math. 26 (2000), 255-269.

[3] N. P. Dekker, Joint numerical range and joint spectrum of Hilbert

space operators, Dissertation, Free University of Amsterdam , 1 969.

[4] W. F. Donoghue, Jr. , On the numerical range of a bounded oper

ator, Mich. Math. J. 4 (1 957), 261 -263.

[5] A. Feintuch and A. Markus, The Toeplitz-Hausdorff theorem and

robust stability theory, Math. lntelligencer 21 ( 1 999), 33-36.

[6] L. Fejer, Ober gewisse durch die Fouriersche und Laplacesche

Reihe definierten Mittelkurven and Mittelflachen, Rend. Circ.

Matern. Palermo 38 (1 9 1 4) , 79-97.

[7] M . Fiedler, Geometry of the numerical range of matrices, Lin. Alg.

Appl. 37 (1 98 1 ), 8 1 -96.

[8] M. Fiedler, Numerical range of matrices and Levinger's theorem,

Lin. Alg. Appl. 220 (1 995), 1 71-180.

[9] K. E. Gustafson and D. K. M . Rao, Numerical Range, Springer

Verlag, Berlin, 1 997.

[1 OJ E. Gutkin, E. Jonckheere, and M. Karow, Convexity of the joint nu

merical range: Topological and differential geometric viewpoints,

preprint, 2002.

[1 1 ] P. Halmos, A Hilbert space problem book, Springer-Verlag, New

York, 1 982.

[1 2] F. Hausdorff, Der Wertvorrat einer 81/inearform, Math. Zeitschrift 3 (1 9 1 9), 31 4-3 1 6.

[1 3] F. Hausdorff, Gesammelte Werke, Band IV: Analysis, Algebra and

Zahlentheorie, Springer-Verlag, Berlin, 2001 .

[ 14] R. Horn and C. Johnson, Topics in Matrix Analysis, Cambridge

University Press, Cambridge, 1 991 .

[1 5] E. Jonckheere, F. Ahmad, and E. Gutkin, Differential topology of

numerical range, Lin. Alg. Appl. 279 (1998), 227-254.

[1 6] R. Kippenhahn, Ober den Wertevorrat einer Matrix, Math. Nachr.

6 (1 951 ) , 1 93-228.

[ 1 7] B. Mirman, V. Borovikov, L. Ladyzhensky, and R. Vinograd, Nu

merical ranges, Poncelet curves, invariant measures, Lin. Alg. Appl.

329 (2001 ) , 61 -75.

[1 8] F. D. Murnaghan, On the field of values of a square matrix, Proc.

Natl. Acad. Sci. USA 18 (1 932), 246-248.

( 1 9] H. Nakazato and P. Psarrakos, On the shape of numerical range

of matrix polynomials, Lin. Alg . Appl. 338 (2001 ), 1 05-1 23.

[20] D. H. Owens, The numerical range: a tool for robust stability stud

ies?, Sys. Control Lett. 5 ( 1 984), 1 53-1 58.

[21 ] M . G. Safonov, Stability robustness of multivariable feedback sys

tems, MIT Press, Cambridge, MA, 1 980.

[22] L. A. Santal6, Integral geometry and geometric probability, Addison

Wesley, London, 1 976.

[23] M . H . Stone, Linear transformations in Hilbert space and their ap

plications to analysis, A.M.S. , New York, 1 932.

[24] 0. Toeplitz, Das algebraische Analogon zu einem Satze von Fejer,

Math. Zeitschrift 2 (1 9 1 8) , 1 87-1 97.

[25] N . -K. Tsing, Diameter and minimal width of the numerical range,

Lin. Mult. Alg. 14 (1 983), 1 79-1 85.

[26] A. Wintner, Spektraltheorie der unendlichen Matrizen, Verlag S .

Hirzel, Leipzig, 1 931 .

[27] P. Y. Wu, Polygons and numerical ranges, Amer. Math. Monthly

107 (2000), 528-540.

Theories of Vision Enul · •I holz

I .

I I .

From Grosholz, Emily; Shorts and Headlands. Copyright © 1 988 by Princeton

University Press. Reprinted by permission of Princeton University Press.

1 6 THE MATHEMATICAL INTELLIGENCER © 2004 SPRINGER-VERLAG NEW YORK

I l l .

IV.

v.

hildren

b · '< nd

aking

Department of Philosophy

Pennsylvania State University

University Park, PA 1 6802

USA


1&ffij.i§j:@hl£113'fil§4fiijlj:i§:id M ichael Kleber and Ravi Vaki l , Ed itors

Cutting a Polygon into Triangles of Equal Areas Sherman Stein

This column is a place for those bits of

contagious mathematics that travel

from person to person in the

community, because they are so

elegant, suprising, or appealing that

one has an urge to pass them on.

Contributions are most welcome.

Please send all submissions to the

Mathematical Entertainments Editor,

Ravi Vakil, Stanford University,

Department of Mathematics. Bldg. 380,

Stanford, CA 94305-21 25 , USA


This tale, like so many in mathematics, begins with a simple ques

tion, answers it, and ends with questions that have yet to be resolved.

Fred Richman in 1965 wondered whether it is possible to cut a square into an odd number of triangles of equal areas. The key word here is "odd," for a moment's reflection shows that a square can be cut into any even number of triangles of equal areas.

Before I go on to describe the research that grew out of that question over the last third of a century, I will stop to introduce a few terms for the sake of clarity.

A dissection of a polygon into triangles of equal areas I will call an equidissection. An equidissection into m triangles I call an m-equidissection. An m-equidissection with m odd will be called an odd equidissection, and with m even, an even equidissection. Richman was asking whether every

equidissection of a square is even.

Richman's colleague, John Thomas, became interested in the problem and proved that there is no odd equidissection of a unit square in standard position in the xy-plane if the coordinates of the vertices of the triangles are rational with odd denominators. When he submitted his work to Mathematics Magazine, "The referee thought the problem might be fairly easy (although he could not prove it) and possibly well-known (although he could find no reference to it)." The referee suggested that Thomas submit it as a Monthly problem and if no one solved it, the paper should be published. It appeared in 1968 [12].

Paul Monsky in 1970 [4], building on Thomas's proof, showed that the answer to Richman's question is, "No, there is no odd equidissection of a square." His argument uses two tools, Spemer's Lemma from combinatorial topology, and 2-adic valuations from algebra. I will describe both.

In 1928 Emanuel Spemer published a theorem which he used to prove sev-

eral topological theorems, including the fact that a ball of dimension n is not homeomorphic to a subset of a lower dimension space. It was soon applied by others to give a short proof of Brouwer's fixed-point theorem. He stated it for simplices in all dimensions, but I will present it just for polygons in the xy-plane.

Consider a polygon cut into triangles. For simplicity, assume that two triangles that touch each other intersect either in a complete edge of both or in a vertex of both. All the vertices are labeled A, B, or C. Figure 1 is an example.

l@ldii;IIM

An edge of a triangle whose ends are labeled A and B will be called complete. A triangle whose vertices are labeled A, B, and C will also be called complete. Spemer's reasoning shows that the number of complete edges on the boundary of the polygon has the same parity as the number of complete triangles. In Figure 1 the respective numbers are 3 and 9. In particular, if there are an odd number of complete edges on the boundary there must be at least one complete triangle. That implication is what Spemer used, and so will we.

The other tool is a 2-adic valuation, <P, which is a function defined on the real numbers. First, for a non-zero integer n, <P(n) is the number of 2's in the prime factorization of n. If a and b are non-zero integers, <P( alb) is defined as <P(a) - <P(b). At 0, <P is set equal to oo.

It turns out that <P can be extended to

© 2004 SPRINGER-VERLAG NEW YORK, VOLUME 26, NUMBER 1 , 2004 17

all the reals in uncountably many ways, each having the following properties:

<P(xy) = <P(x) + <P(y) <P(x + y) ::::: minimum of <P(x)

and <P(y) <P(x) = oo only when x is 0.

It follows from these properties that for <P(x) < <P(y), <P(x + y) = <P(x). As examples we have <P(1) = 0, <P( -1) = 0, <P( -x) = <P(x), <P(112) = -1, <P(V3/2) =

-1, and <P(\12) = 112. For each prime p there are p-adic

valuations defined in a similar manner. A more detailed treatment of Spemer's Lemma and p-adic valuations is to be found in Chapter 5 of [9].

With the aid of 2-adic valuations, we can divide the xy-plane into three sets, which I will call A, B, and C. The set A consists of the points (x,y) for which both <P(x) and <P(y) are positive. B consists of the points (x,y) for which <P(x) ::::; 0 and <P(x) ::::; <P(y). C consists of the remaining points, namely, those (x,y) for which <P(y)::::; 0 and <P(y) < <P(x). We label each point A, B, or C, depending on which of the three sets contains it. For instance (0,0) is labeled A, (1,0) is B, (1,1) is B, and (0,1) is C. The point (\13/2, \13/2) is labeled B. It is easy to check that translating a point by a point labeled A (viewing them for a moment as vectors) does not change the label: if P is a point, then P and P - A have the same labels.

One of the key tools, going back essentially to Thomas, is that if the three vertices of a triangle have all three labels, A, B, and C, then the valuation of the area of the triangle is less than or equal to -1. To show this, first translate the complete triangle by the vertex labeled A. Let us say that the three vertices are now (0,0), (a,b), and (e,d), with (a,b) labeled B and (e,d) labeled C. The area of the triangle is the absolute value of (ad - be )/2. Since <P(a) ::::; <P(b) and <P(d) < <P(e), <I>( ad) = <P(a) + <P(d) < <P(b) + <P(e) = <P(be), Thus <P(ad - be) = <P(ad)::::; 0, and it follows that <P((ad - be)/2) = ::::; -1.

Note that as a consequence a line in the xy-plane cannot meet all three sets A, B, and C.

Now consider an m-equidissection of a polygon of area A. Assume that at


least one of its m triangles is complete. Since the area of the triangle is Aim, it follows that <P(Aim) ::::; -1.

Thus <P(A) - <P(m)::::; -1, from which we conclude that <P( m) ::::: <P(A) + 1.

So, if we knew that <P(A) is larger than -1, m must be even. In particular, if A is an integer, m is even.

To apply the information just obtained, we have to be sure that there is at least one complete triangle in the dissection. This is where Spemer's Lemma enters the picture.

After just one more observation, we will be ready to prove the RichmanThomas-Monsky theorem.

Consider a finite set of points in a complete line segment. Each of these points is labeled either A or B. The points divide the segment into shorter sections. The number of these sections that are complete must be odd. One way to show this is to drop pebbles in each section next to an end labeled A,

and then add them up in two ways, by points and by sections.

When a line segment that is not complete is divided into sections, the number of complete sections is always even. For instance, there are no complete sections when a segment with ends B and C (or A and C) is cut into sections, because no line meets all three sets, A, B, and C. A segment with ends A and A (or B and B) has an even number of complete sections.

With all the machinery in place, we are ready to prove that a square has only even equidissections.

Consider an equidissection of a square. It is no loss of generality to assume that the area of the square is 1 and that its vertices are (0,0), (1,0), (0,1), and (1,1). This is shown in Figure 2, with the labels of its four vertices.

lplijiJ;IfM

C (O, l )b

A (0, 0) B ( l , 0)

All the vertices, not just the comers of the square, are labeled A, B, or C. No matter how those vertices are labeled, there will be an odd number of complete sections along the bottom edge of the square, which is complete. The other three edges have no complete sections. Thus the total number of complete sections on the boundary of the square is odd. Hence there is at least one complete triangle in the dissection. It follows that the equidissection is even.

That is where the subject of equidissections remained until 1979, when David Mead [3] obtained a generalization from a square to a cube in any dimension. He proved that when an n

dimensional cube is divided into

simplices all of which have the

same volume, the number of the

simplices must be a multiple of n! .

In addition to Spemer's Lemma in higher dimensions, he used p-adic valuations for all primes p that divide n.

In 1985, when Elaine Kasimatis was presenting the result for a square in G.

Donald Chakerian's geometry seminar, I wondered, "What about the regular pentagon?" She found the answer and went on to prove that in any m-equidissection of a regular n-gon with at least

five sides, m must be a multiple of

n. In the proof she had to extend p-adic valuations to the complex numbers for the prime divisors of n. Her work appeared in 1989 [1]. In a sense it was another generalization of the theorem about equidissections of a square.

A year later she and I published [2] the results of an investigation of equidissections of trapezoids and other quadrilaterals. Among these was yet another generalization of a square, namely quadrilaterals whose four vertices are (O,Q), (1,0), (a, a), and (0,1), where a is any positive number, illustrated in Figure 3.

Mptdll;l¥4 (a, a)

(0, 1)

(0, 0) (1, 0)

The area of such a quadrilateral is a. If <P(a) is greater than 0, then the boundary of the quadrilateral has two complete edges, and the hypothesis of Spemer's Lemma doesn't hold. Incidentally, had it held, we would have been able to conclude that in any mequidissection of the quadrilateral m has to be a multiple of 4, since <P( m) would be greater than 1. Because m can be as low as 2, we could have predicted that there are an even number of complete edges on the boundary.

In this case we apply the linear mapping that takes (x,y) to (xla, y). The image of the original polygon has area 1 and vertices (0,0), (1/a,O), (1,a), and (0,1), as shown in Figure 4.

( 1 , a)

(0, 1)

(0, 0) ( l !a, 0)

Now there is only one complete edge on the boundary, and Spemer's Lemma applies. Hence m is even, as in the case of the square.

If -1 < <P(a) ::; 0, the labeling of the initial quadrilateral has one complete edge, and there is no need to introduce a linear mapping. Again, m must be even.

When <P(a) = -1, there may be odd equidissections as the case a = 3/2 shows. Figure 5 illustrates this case, where the quadrilateral is cut into three right triangles, each of area 1/2.

( 1 , 0)

More generally, if a = b/(2c), where b and c are odd integers, the corresponding quadrilateral has an odd equidissection. However, if <P(a) = -1 and a is irrational, I don't know what can be said. Even the case a = v3/2 is not settled. Does the quadrilateral with vertices (0,0), (1,0), ((v3/2, v'3!2), and (0, 1) have an odd equidissection?

In any case, this attempt to generalize the result for squares failed. The question remained: What is there about a square that forces all its equidissections to be even? In other words, What is the most general class ofpolygons that have no odd eq uidissection?

One simple generalization is that any parallelogram has no odd equidissection. This follows immediately from the result for a square, for any parallelogram is the image of a square by a linear mapping. Since a linear mapping magnifies all areas by a constant, it takes an equidissection into an equidissection.

A parallelogram being centrally symmetric suggests that perhaps any centrally symmetric polygon has no odd equidissection. Kasimatis's theorem about regular n-gons, when n is even, gave me enough extra evidence that I investigated centrally symmetric polygons, trying to produce a counter-example. Instead I proved in 1989 [8] that every centrally symmetric 6-gon or 8-gon has no odd equidissection. Monsky in 1990 [5] proved the theorem in general.

Even so, I did not feel that that was the last word. There was another class of polygons that I suspected would generalize the square. To construct this type of polygon, I start with the unit square in Figure 2 and then distort its boundary, changing opposite edges in the same way. The resulting polygon still tiles the plane by translates using all integer vee-

Mpt§ll;iiM

(0, 1 ) •�• ( 1 , 1 )

(0, 0) •�• (1 , 0)

tors. Figure 6 shows such a distorted square.

It seemed to me that complicating the boundary would lessen the chance that the resulting polygon would have an odd equidissection. I proved for a few simple families made this way, such as polygons formed by adding one dent, as in Figure 7, that my suspicion was valid.

·------· ( 1 , 0)

Some years later a surprising breakthrough occurred, which I described in a paper published in 1999 [10]. It concerns a unit square in the xy-plane whose comers have integer coordinates, such as the one in Figure 8.

+§lijii;i+:W (5, 7) • B

B A (5, 6) •'------• (6, 6)

Note that the square in Figure 8 has one complete edge. A moment's thought shows that exactly one vertex of any such square has both coordinates even, hence labeled A. Its two neighboring vertices are then labeled B and C. That implies that the square has exactly one complete edge.

It follows immediately that any

polygon in the xy-plane made up of

an odd number of such unit squares

has no odd equidissection. To see this, place a pebble inside each square in the polygon next to its complete edge. Because there is an odd number of pebbles, there must be an odd number of complete edges on the boundary. Moreover, since the area is an integer, it follows that the number of triangles must be even.


It struck me as odd that by assum

ing that the polygon has an odd num

ber of squares I was able to deduce that

the number of triangles was even. I

checked a few cases where the poly

gon had an even number of squares,

enough to convince me that it was true

in general, but left it to someone else

to treat that case. Iwan Praton in 2002

[6] disposed of the even case. His proof

showed that if the number of squares

is of the form 2rb, where b is odd, then

there is a translate of the image of the

polygon by a linear mapping that takes

(x,y) to (x/2u, y/2v), where u + v ::5 r, to which a stronger version of Sperner's

lemma applies.

Consequently any polygon com

posed of the unit squares described has

no odd equidissection. There is an

other, more suggestive way to state this

result: Any polygon in the xy-plane

whose edges are parallel to the

axes and have rational lengths has

no odd equidissection. To show this,

first translate the polygon so that one

of its vertices is at the origin. Then

magnify this image by a mapping that

takes (x,y) to (qx, qy), where q is an in

teger divisible by all the denominators

of the lengths of the edges. The image

consists of congruent squares and has

no odd equidissection. Hence the orig

inal polygon has no odd equidissection.

The next conjecture is inevitable.

What if the assumption that the edges

have rational lengths is removed? I conjectured that any polygon whose edges are parallel to the axes has no

odd equidissection. I then faced three classes of poly

gons that I either knew or suspected

have no odd equidissections: centrally

symmetric, distorted square, edges

parallel to the axes. The first case was

already settled, and there was ample

evidence for the remaining two cases.

Figure 9 illustrates the three types. As

I stared at polygons like those, I no

ticed a property that they all shared. To

describe this property I orient the

boundary, turning each edge into a vec

tor whose direction is compatible with

the orientation. Then I call two vectors

on the boundary equivalent if they are

parallel. All three types have the prop

erty that the sum of the vectors in each equivalence class is the zero vector. I


ii'riil;l¥+

O D Centrally symmetric Distorted square

Edges parallel to axes

called such a polygon special and con

jectured that each special polygon has

no odd equidissection.

I showed that the conjecture is true

when the special polygon has only a

few edges. The smallest possible num

ber of edges is four, and the polygon is

then a parallelogram, for which the

conjecture is true. There are no special

polygons with five sides, as may easily

be checked. There are three types of

special polygons with six sides, con

structed as follows.

The first step is to determine the

number of edges in an equivalence

class. There must be at least two in a

class and at most three, for if there

were four, two would be forced to be

adjacent. The partitions of six meeting

these conditions are 6 = 3 + 3 and 6 =

2 + 2 + 2. The second step is to see

how the equivalence classes could be arranged on the boundary. Take the

ljMiiijiitl

q

q � "' "' ... .. .. .. ... ... ..

P,,'' ....

\p

. . . . � : . . . .

q '" ...... _ ,' q - . · ... .. .. .. .. ..

p

p

(a)

p

q

(b)

p

3 + 3 case first. Denoting parallel vec

tors by the same letter, the only possi

bility is to alternate the vectors of the

two classes, as shown in Figure lOa.

That schema can be realized by a spe

cial polygon, as shown in Figure lOb.

Without loss of generality, we can as

sume its edges are parallel to the axes.

The 2 + 2 + 2 case leads to two es

sentially different schemas, as shown

in Figure l l and later in Figure 13.

I#Mil;li!i p

, .,. ... ... .. .. .. .. .... ..

q,/' "'\ r

. . . . ' I . . . .

r \.. ,/ q ... .. , ... .. .. .. .. .. ..

p

This schema can be realized by any

centrally symmetric polygon with six

sides, as shown in Figure 12.

l!'dli;Jifl p

p

The other possible schema is shown in Figure 13.

. .

q " ...... ..

p

p

It, too, can be realized by a special

polygon, shown in Figure 14.

Each case can be treated with the

aid of Sperner's Lemma, 2-adic valua

tions, and a variety of affine mappings,

that is, mappings that take (x,y) to

(ax + by + e, ex + dy + f), where a, b,

c, d, e, andf are constants and ad - be

is not 0.

p

To determine the special polygons

with seven sides, I first list the parti

tions of seven in which the summands

are at least two and at most three.

There is only one such partition,

namely 7 = 3 + 2 + 2. It can be real

ized in two different ways by schemas,

and each schema has a geometric re

alization, as shown in Figure 15.

+pMil;ii .. ii f.· · - < ' ' q/ \ p ' ' '

;- •• _ _ _ _ _ .·'q p

f!. · - -< ' ' r :

' \ P ' ' . : q\_ .. ·'q .. ... .. ...

p

p

q��r r[__fP

p

Again I managed to show that both of

these types of special polygons have no

odd equidissection [11) . Because the

proofs break into a couple of dozen

cases, I have hesitated to go on to the

eight-sided special polygons. In any

event, these 7-gons provide substantial

evidence for the general conjecture,

which I had wanted to call the "mother

of all conjectures," but was restrained

by the referee to name it simply a "gen

eralized conjecture."

As is customary in science, we are

left with more questions than we had

when we started. Perhaps we have

i

found the fundamental property of the

square that is the basis of the Richman

Thomas-Monsky theorem. Perhaps

not. That raises the first of several

questions:

Does a special polygon ever have an

odd equidissection?

The next four questions are suggested

by the special polygons.

How many partitions are there of a

positive integer n if the summands

are at least 2 and at most n/2?

Is each such partition representable

by a combinatorial schema?

If so, by how many?

Is each combinatorial schema rep

resentable by a special polygon?

Even if all these questions are an

swered, many questions about equidis

sections would remain. For instance,

does a trapezoid whose parallel edges

have lengths in the ratio of v'2 to 1 have any equidissections? I think that

the answer is no and make the follow

ing conjecture:

Consider a trapezoid whose parallel

edges have lengths in the ratio of r to 1, where r is algebraic. I conjec

ture that if r has at least one nega

tive conjugate, then the trapezoid

has no equidissection.

Little has been done about equidis

sections into simplices in higher dimen

sions aside from [3]. It was shown in [2)

that in any dissection of a regular octa

hedron into m simplices of equal vol

umes, m must be a multiple of 4. Is it

true that in any dissection of a centrally

symmetric polyhedron into m simplices

of equal volumes, m must be even?

That is where Richman's question

has led. The path that he discovered

seems to have no end.

i n t

- l i ha I I I .

Acknowledgment

I wish to thank Anthony Barcellos for

providing the illustrations, using Co

hort's software, Coplot.

REFERENCES 1 . E. A Kasimatis, Dissections of regular

polygons into triangles of equal areas, Dis

crete and Camp. Geometry 4 (1 989),

375-381 .

2 . -- and S . Stein , Equidissections of

polygons, Discrete Math. 85 (1 990),

281 -294

3. D. G. Mead, Dissection of hypercubes into

simplices, Proc. Amer. Math. Soc. 76

(1 979), 302-304.

4. P. Monsky, On dividing a square into tri

angles, Amer. Math. Monthly 77 (1 970),

1 61 -1 64.

5. -- A conjecture of Stein on plane dis

sections, Math. Zeit. 205 (1 990), 583-592.

6. I. Praton, Cutting Polyominos into Equal

Area Triangles, Amer. Math. Monthly 1 09

(2002) 81 8-826.

7. F. Richman and J. Thomas, Problem 5471 ,

Amer. Math. Monthly 74 ( 1 967), 329.

8. S. Stein, Equidissections of centrally sym

metric octagons, Aequationes Math. 37

(1 989), 3 1 3-31 8.

9. -- and S. Szabo, Algebra and Tiling,

Mathematical Association of America,

Washington, D . C. 1 994.

1 0 . -- Cutting a polyomino into triangles of

equal areas, Amer. Math. Monthly 1 06

(1 999), 255-257.

1 1 . -- A generalized conjecture abourt cut

ting a polygon into triangles of equal areas,

Discrete and Camp. Geometry 24 (2000),

1 41 -1 45

1 2. J. Thomas , A dissection problem, Math.

Mag. 41 (1 968), 1 87-190.

Department of Mathematics

University of California Davis

1 Shields Avenue

Davis, CA 9561 6-8633

email: [email protected]

u 1

l lha r

VOLUME 26. NUMBER 1. 2004 21

M a t h e m a t i c a l l y B e n t

The proof is in the pudding.

Opening a copy of The Mathematical

Intelligencer you may ask yourself

uneasily, "What is this anyway-a

mathematical journal, or what?" Or you may ask, "Where am /?" Or even

"Who am !?" This sense of disorienta

tion is at its most acute when you

open to Colin Adams's column.

Relax. Breathe regularly. It's

mathematical, it's a humor column,

and it may even be harmless.

Column editor's address: Colin Adams,

Department of Mathematics, Bronfman

Science Center, Williams College,

Williamstown, MA 01 267 USA


Col in Adam s , Editor

Rumpled Stiltskin Colin Adams

Once upon a time there was a topol

ogist who lived with his daughter

in a tiny office in the Math Building at

the University of Chicago. One day the

Chairman of the department happened

to stop to talk to a colleague just out

side the door of the topologist's office.

"The hiring season looks tough," said

the Chairman, a bit discouraged. "I hope

we can fmd someone extraordinary."

The topologist, who was barely

known to the Chair, stepped out of his

office. "Pardon me," he said timidly. "I

hate to interrupt. But I know of an

extraordinary mathematician. She can

turn coffee into theorems."

"Really?" said the Chair. "And who

is this mathematician?"

"She is my daughter," said the topol

ogist.

"Then send her to my office this af

ternoon," said the Chair.

That afternoon, the topologist's

daughter went to the Chair's office. She

was quite apprehensive, as she had no

idea how to turn coffee into theorems.

"Follow me," said the Chair as he led

her to the department lounge.

"Here you see a coffee maker, and

three pots of coffee. I want you to turn

the three pots of coffee into theorems

by morning. If you do not, then I will

see to it that the only job you ever get

is at a regional university with high

research expectations and a teaching

load of four courses per semester."

With that he left the lounge, locking

the door behind him.

The poor girl was disconsolate. She

fell sobbing on the couch. "Oh, what

ever will I do?" she cried. "My career

is over before it has even started."

Suddenly, as if by magic, the door to


the lounge swung open, and in walked

a squat, disheveled creature, with long

matted beard and hair. He was a squat

man dressed in a dirty t-shirt, jeans, and

an even dirtier red sports coat. He

seemed surprised to see her.

"What are you doing here?" he

asked.

"Oh, the Chair has said that I must

turn this coffee into theorems, or he

will destroy my career."

"And you don't know how to tum

coffee into theorems?" asked the little

man, a smile playing across his lips.

"Oh, no. I have no idea how. My fa

ther just said I could to impress the

Chair."

"And what will you give me if I turn

this coffee into theorems?" said the un

sanitary fellow.

"Ummm, how about this mint con

dition copy of Stewart's Calculus

book?" she suggested, pulling the book

from her briefcase.

"Let's see," he replied. "Is it the new

edition? I could get 50 bucks for that.

You got a deal."

And with that, the strange man

gulped down all three pots of coffee.

His bloodshot eyes began to glow. His

eyebrows started to twitch. Then he sat

down before a pad of paper and wrote

furiously for three hours. When he was

finished, the pages of three pads of pa

per were covered with the most beau

tiful lemmas the girl had ever seen.

"That ought to do the trick," said the

little man, and with that, he scooped

up the calculus book and was gone.

The next morning the Chair un

locked the door, expecting to find the

girl crying or sleeping, with nothing to

show for her night. But his jaw dropped

when he saw the scribblings on the

pad. "This is really quite good," he said.

"Thank you," said the girl timidly.

"Can I go now?"

"What, are you kidding? This is the

beginning of some really good mathe

matics. But you need to fill in the de

tails. Flesh out the theory. Come back

this evening."

When the girl arrived that night, the

Chair pointed to six pots of coffee sit

ting on the table. "If you don't turn this

coffee into theorems, I will make sure

the only work you get is as a recitation

instructor, teaching fifteen problem

sessions a week for large calculus lec

tures." And again, he locked her in the

lounge.

The girl fell sobbing on the couch.

But she said to herself, if the little man

can do it, why can't I? With that she

went over and poured herself a cup of

the steaming brown liquid. She took

a sip and spat it out immediately.

"Ahhrgh," she said, "This tastes like it

has been sitting in the pot for the last

12 hours"-which in fact it had.

But then the door swung open again,

and in walked the little man. His jeans

were tom at the knee and his teeth ap

peared never to have experienced the

friction of a toothbrush.

"Back again, are we?" he said.

"Yes, the Chair said I must tum

these six pots of coffee into theorems

by morning, or he will tum me into a

recitation instructor."

"And what, pray tell, will you give

me, if I do it for you?"

The girl thought for a moment and

then pointed to her computer brief

case.

"How about my laptop?" she asked

hopefully.

She handed it to the man. "Hmmm,

looks like a Mac Titanium Power PC

G4, 800 MegaHertz, with one megabyte

L3 and 256K L2 cache. You got a deal."

So again, he gulped down the cof

fee, and set to work Six hours later, he

had filled six pads of paper with the

orems and proofs.

"This should do it," he said. And

grabbing up the laptop, he disappeared

out the door.

When the Chair arrived the next

morning, he was flabbergasted by the

beauty of the mathematics on the pads.

"This is really good stuff," he said

enthusiastically. "These are the germs

for a whole new theory. I am really im

pressed."

"Good," said the girl nervously.

"Now can I go?"

"Yes, but you must come back

tonight," said the Chair. "You have

more work to do."

That evening, the Chair sat her

down before twelve pots of coffee.

"If you don't tum this coffee into

theorems," he threatened, "I will

make you into a permanent grader for

our remedial algebra course. But, if

you do succeed, I will give you a

tenure-track position on the faculty

here at Chicago." Then he turned and

left the lounge, locking the door be

hind him.

The girl fell on the couch sobbing.

It was too much to hope the smelly

man would be back to help her once

more. And besides, she had nothing left

to give him.

Suddenly the doorknob turned and

in he walked.

"Still trying to turn coffee into the

orems, are we? Haven't learned how to

do it yet?"

"Oh, no," she said. "I can't do it. And

the Chair is going to make me into a

permanent grader. Oh, woe is me."

"And what will you give me if I do

it for you?" asked the hair-encumbered

individual.

"I don't know," said the girl. "I don't

have anything left to give."

The little man grinned mischie

vously. "Oh, I think you do," he said. "I

want you to give me your first-born

theorem."

"What do you mean?'' asked the girl.

"The first theorem that you prove

yourself, I want you to give it to me, to

claim as my own."

Now the topologist's daughter knew

that if she said no, she wouldn't ever

have the opportunity to create her own

theorems, anyway. So there wouldn't

be anything to lose. On the other hand,

if she did survive all this nonsense, and

had a career as a mathematician, what

was one theorem more or less? So she

agreed. The little man laughed delight

edly.

"Oh, yes we have a bargain," he

laughed as he danced about the room.

Then he gulped down all twelve pots

of coffee, and worked through the en

tire night, finishing just before day

break.

"Remember our deal," he said as he

slipped out the door, leaving twelve

pads of paper filled with mathematics

on the table.

When the Chair arrived, he was

stunned by the level of work that he

saw.

"You have a job, a tenure-track job,"

said the chair, shaking her hand en

thusiastically.

So the young woman began her

career at Chicago. She was an able

teacher, and enjoyed that aspect of her

job. But at first she found it difficult to

work on her research, as her other du

ties were so numerous.

But one day, she attended a number

theory seminar. The speaker gave a dis

cussion of Catalan's Conjecture, which

says that the only two consecutive

powers of whole numbers are the in

tegers 8 and 9. She found the question

quite fascinating. Soon, she was spend

ing all her time working on the prob

lem. She would have worked even

more but sometimes exhaustion over

came her. Finally, one evening, want

ing to continue her work but unable to

keep her eyes open any longer, she

stumbled into the department lounge

and quickly swallowed a cup of coffee,

before she had a chance to gag.

Suddenly, she felt awake. Within

minutes, the caffeine was coursing

through her system, and her neurons

seemed to be firing every which way.

She worked all that night and by morn

ing, she had proved Catalan's Conjec

ture.

Although she was tired and in great

need of sleep, she decided to wait un

til the Chair arrived at 8:00 to tell him

the good news. At 7:30, just as her eyes

were closing with exhaustion, the door

to her office swung open, and the little

man, whom she had not seen for the

last two years, bounded in.

"I am here to collect my debt," he

said.

"Oh no," pleaded the assistant pro

fessor. "It's too good. You can have my

next one."

"I don't want your next one," said

the diminutive hairball. "I want this

one."

"Please, please, don't take it. It has

taken me all this time to learn how to

tum coffee into theorems. I can't give

it up."

'Til tell you what," said the little

man, an evil grin on his face. "If you

can guess my name, I will not take your

theorem. And I will give you three days


to guess it." He laughed then and

scooted out of the office.

The young woman thought to her

self that this couldn't be so hard. After

all, he had made no rules about the

guessing. She could guess as many

names as she wanted. Eventually she

would get it right.

The next morning, the door to her

office opened and in popped the minor

mutant.

"And what do you guess is my

name?" he asked.

"Is it Pythagoras?" she queried. "Is

it Zeno? Is it Euclid?"

"No, no, and no." He hopped de

lightedly from one foot to the other.

"Is it Nicomachus? Is it Diophantus?

Is it Pappus?"

"Not even close."

"Is it Fibonacci? Is it Newton? Is it

Liebniz?"

"Ha."

"Is it Bernoulli, or Euler, or Lagrange?

"No, no, and no again. You will have

to do better than that." And with that

he was gone.

That day, the young woman

searched for every name she could

think of. She asked others around the

department for any other names they

might know.

When the little man arrived the next

morning, she asked, "Is it Gauss? Is it

Cauchy? Is it Mobius?"

"No, no, and no," he laughed, hardly

able to contain himself.

"Is it Lobachevsky? Dirichlet?

Liouville?"

"Be serious."

"How about Weierstrass? Cayley?

Hermite? Cantor? Dedekind? Bel

trami?"

"No, no, no, no, no, no."

"What about Lie, or Poincare or

Peano or Hurwitz or Hilbert or Cartan?"

"Give me a break."

"Maybe Zermelo, Dickson, or

Lebesgue?"

"No, no, and no. Tomorrow is your

last chance." And with that, he disap

peared out the door.


The young professor was crushed.

She didn't know what to do.

"Oh, woe is me," she cried. All that

day, she wrung her hands. That evening,

as she went to get a tissue from the bath

room to dab her tears, she heard a voice

singing from within the Men's Room.

"I am so happy. I could sing, as I

shower in the sink.

For she doesn't realize who I am,

And how with Chicago I link.

She doesn't know that I live in the

Lounge,

She doesn't know my game.

And she doesn't know the most

important part,

Rumpled Stiltskin is my name."

She immediately went to her fa

ther's office.

"Pop," she said, "Have you ever heard

of someone named Rumpled Stiltskin?"

"Oh sure, everybody knows about

Rumpled Stiltskin. One of the most

brilliant minds ever to grace this

campus."

"Who is he?"

"Who was he is the more appropri

ate question. Bob Stiltskin was a gradu

ate student here thirty years ago. A real

star. But he got hooked on Catalan's

Conjecture. Spent all his time trying to

prove it. Couldn't bring himself to solve

an easier problem and get a Ph.D."

"So what happened?"

"After eight years, they cut his sup

port, and threw him out of the program.

But he still hung around. Used to sleep

in the Math Lounge. Somehow he had

gotten hold of a key. About ten years

ago, he disappeared entirely. Nobody

knows where he went. But there are ru

mors of a sighting every now and then."

"And why is he called Rumpled

Stiltskin?"

"Well, he always wore the same red

sports coat, and calling it rumpled is

being generous."

The next morning, the pungent per

son sprang into her office, and said,

"Last chance. What's my name?"

The girl smiled and said, "Is it Ve

blen, or Noether, or Sierpinski?"

"No, no, and no again."

"Is it Birkhoff, or Lefschetz, Little

wood, or P6lya?"

"No, no, nope, and no."

"Maybe Ramanujan or Banach, Cech, or Bloch?"

"No, oh no, oh no, and no."

"Klein, Wiener, Nevanlinna, or

Urysohn?"

"Not even close."

"Artin or Zariski?"

"Double no."

"Church or Whitehead?"

"No and again a big no. Looks like

you are plumb out of luck." He was

grinning from ear to ear.

"I guess I don't know," she said

pausing for a second. "Unless of

course, perhaps, it is Rumpled Stilt

skin."

The odoriferous oddball froze,

stunned for an instant. "How could

you? ... how did you?" he spluttered.

"I guess I keep the proof of Catala

nis conjecture after all," she said.

"Arghhh," screamed the minute mis

creant, his face turning as red as his

jacket. He stomped his feet and

gnashed his teeth, and pulled forcefully

on his matted hair. His eyes rolled up

in their sockets, and then he stormed

out of the office, never to be seen at

the University of Chicago again.

Since then, every once in a while

reports filter down from the Univer

sity of Illinois at Chicago of coffee

pots found empty just minutes after

they had been full. And at Northwest

em, department copies of Stewart's

Calculus disappear at an alarming

rate.

The young professor went on to a

very successful career at Chicago. She

and her father wrote some joint papers,

on the basis of which her father was

promoted to an office of reasonable

size. And although she did drink coffee

for the next five years, she switched to

herbal tea after receiving tenure. And

even then, the theorems kept coming.

1$Ffiili§i•£1h¥1MQ.Jj.i .. iii,ihi¥J Marjorie Senechal , Editor I

Connections, Context, and Community: Abraham Wald and the Sequential Probability Ratio Test Patti Wilger Hunter

This column is a forum for discussion

of mathematical communities

throughout the world, and through all

time. Our definition of "mathematical

community" is the broadest. We include

"schools" of mathematics, circles of

correspondence, mathematical societies,

student organizations, and informal

communities of cardinality greater

than one. What we say about the

communities is }ust as unrestricted.

We welcome contributions from

mathematicians of all kinds and in

all places, and also from scientists,

historians, anthropologists, and others.

Please send all submissions to the

Mathematical Communities Editor,

Marjorie Senechal, Department

of Mathematics, Smith College,

Northampton, MA 0 1 063 USA

e-mail: senechal@minkowski . smith. edu

In the early 1940s, Abraham Wald, a

Hungarian-born mathematician who

had emigrated to the United States from

Vienna in 1938, joined a group of scien

tists doing defense-related research for

the U.S. government. Wald's work with

the Statistical Research Group (SRG)

resulted in the creation of a theory

which had immediate as well as long

term implications for research in math

ematical statistics-the theory of se

quential analysis. W ald had developed

the idea that launched sequential analy

sis, the sequential probability ratio test

(SPRT), in 1943, in response to some in

spection sampling problems put to him

by two other members of the SRG-W.

Allen Wallis and Milton Friedman.

The story of the development of the

SPRT is inextricably bound to the im

portance of community in the develop

ment of scientific ideas. The events sur

rounding Wald's discovery of the SPRT

are well documented and clearly reflect

the influence of his immediate col

leagues on the problems he was work

ing on in 1943 when he formulated the

test [64, 9]. What has not been explored

in detail is the broader historical context

in which W ald worked and the influence

of the wider intellectual community on

his contributions to sequential analysis

and to statistics more generally.

The mathematical statistics com

munity in the United States, the com

plex web of professional relationships

in which Wald was involved during his

years in Vienna, and World War II, with

its impact on the American scientific

and social science communities,

played important roles in Wald's work

in sequential analysis. Furthermore,

his ideas had an impact on the broader

historical context. This article exam

ines these reciprocal connections.

American Statisticians:

Creating a Community

When Wald arrived in the United States

in 1938, the American community of

mathematical statisticians had been in

official existence for less than a decade.

Indeed, the principal professional sup

port for mathematical statisticians in

the United States, the Institute of Math

ematical Statistics (IMS), had been

formed only three years earlier, and the

first American periodical devoted to

the discipline, the Annals of Mathematical Statistics, had been in print

only since 1929. However, the origins

of the community go back nearly a cen

tury, to 1839, when a small group of

physicians, lawyers, and ministers met

in Boston to organize the American

Statistical Association (ASA). 1 The

founders of the ASA, and indeed most

people who called themselves statisti

cians in the nineteenth century, looked

upon statistics as "the collection and

comparison of facts which illustrate

the condition of mankind, and tend to

develop the principles by which the

progress of society is determined" [3,

p. v]. They emphasized the collection

and presentation of numerical data

about social and economic problems,

treating statistics as a tool that could

help explain social phenomena. 2 By the turn of the twentieth century,

that tool had come to be seen as a

means of making the solutions to so

cial problems more scientific. In his

1908 presidential address to the ASA,

Simon N. D. North, director of the

United States Census, explicitly tied

the advancement of the disciplines of

sociology and economics to the use of

statistics. He charged that "if the claim

of these sciences to be exact sciences

is to be made good, it follows that the

economist or the sociologist must also

be a statistician" [31, p. 439]. The use

of statistics by social researchers to

certify their status as scientists came

as these researchers in the United

States were establishing the organiza

tions and publication outlets of pro

fessional academic disciplines. The

American Historical Association (AHA)

1 For some of the history of the founding of the ASA, see [67] and [1 5].

2See [1 5] for a brief discussion of this perspective.

© 2004 SPRINGER-VERLAG NEW YORK, VOLUME 26, NUMBER 1, 2004 25

and the American Economic Association (AEA) held their charter meetings in the mid-1880s. The American Political Science Association followed in 1903.3

The establishment of the professional social science disciplines, as well as the view among statisticians in the ASA that statistics could serve those disciplines, led to what became a regular practice in the ASA of holding its annual meetings in cooperation with such groups as the AEA and the AHA. Walter Willcox, a professor of economics and statistics at Cornell who served as president of both the ASA and the AEA, felt that statisticians benefited from association with other organizations. He particularly thought that the ASA's "best connections l[ay] with societies devoted to economics, political science, and law" [66, p. 288] .

These close ties between the ASA and the social science disciplines would eventually lead to tension between the main constituency of the ASA and the small but growing group of members who wanted to develop the mathematical aspects of statistical methods. Some of this mathematics began making its way into the work of statisticians in the ASA as early as the 1890s, but the relationship between collections of numerical data about society and the mathematical theory of probability had just begun to emerge, and several decades passed before a group of researchers would focus their inquiries on it specifically.4

In the 1920s, these researchers began to see the usefulness for research in economics, biology, and agriculture of the growing set of tools of statistical inference, but they found themselves struggling to find places to publish their results. Many of them were trained and employed as mathematicians and their interests overlapped

with a number of disciplines, but it seems that no single periodical or organization provided a comfortable professional home. 5 The mathematical statisticians felt no particularly warm welcome from either the ASA or the American Mathematical Society (AMS)-perhaps the most likely supporters of mathematical statistics. On the one hand, the members of the ASA were interested in data collection and the use of numerical information, but as Carver, founder of the Annals of Mathematical Statistics, put it, "most of their membership were economists, bankers and census people whose knowledge of mathematics was very limited."6 Most American mathematicians, on the other hand, focused their research on pure mathematics, and while papers in mathematical statistics included theorems and proofs, they often had a particular use of those theorems as a starting point. In a discussion leading up to the founding of the IMS, Henry Rietz, a mathematician at the University of Iowa, who would become the organization's first president, commented about the mathematical research community that "when it comes to practice accepting papers for publication it seems not much material is acceptable that is a bit tainted with possible applications to statistical data."7

Carver put out the first issue of the Annals in 1929 (initially with some financial backing from the ASA), and mathematical statisticians formally organized the IMS as an independent professional organization in 1935. In 1938, it assumed complete financial responsibility for the Annals.8

The IMS and the Annals provided a means for mathematical statisticians to establish formal citizenship in their emerging disciplinary community. They created a sort of boundary around that discipline, setting it off from its border-

3For discussions of the professionalization of American social science, see [41 , 1 1 ].

Figure 1. Abraham Wald. Illustration courtesy

of the Columbia University Archives-Colum

biana Library.

ing fields of inquiry. 9 But the boundary was not impenetrable. Although they tended to draw attention to the importance of theoretical statistics, mathematical statisticians still welcomed opportunities to support and collaborate with the users of statistics. As the mathematical statisticians were establishing their community, several groups of applied statisticians had organized themselves into societies supporting their particular interests. The Econometric Society was formed in 1930; the Psychometric Society, in 1935. By the late 1930s, biologists and medical scientists in the ASA had organized its first disciplinary section, the Biometric Section. The group began publishing a journal, Biometrics, in 1945. The work of these three organizations focused on applying statistics to studies based in their parent disciplines-economics, psychology, and biology.

The emerging community of mathematical statisticians in the United States had ties to these organizations.

4During the nineteenth century, the probabilistic tools of the normal distribution and the method of least squares had found most of their applications in the physical

sciences of astronomy and geodesy, occasionally appearing in actuarial work on mortality tables. For a discussion of these developments, see [35, 46].

5The variety of professional societies and journals in which the mathematical statisticians participated is discussed in [1 6].

6Harry C. Carver to Jerzy Neyman in [30, p. 1 72].

7Henry Rietz to E. B. Wilson, 27 July 1 935, in (1 3, p. 289].

8Until then, except for the ASA money, Carver had funded the journal out of his own pocket. See [30, p. 1 72].

9"fhe founding of the IMS marks perhaps only a middle chapter in the story of this process. The story continues with the impact of WWII on American science and

subsequent developments within American universities. While some consequences of the war will be explored below, the rest of the story is beyond the scope of this

discussion.

26 THE MATHEMATICAL INTELUGENCER

Of particular relevance to the place that Abraham W ald would find in the community when he emigrated from Europe were its connections to the Econometric Society. To understand the importance of those links for Wald's inclusion in the American mathematical statistics community, we must trace his path there back to his days in Vienna.

Mathematics and Economics in

Inter-war Vienna: A Network of

Communities

Wald arrived in Vienna in 1927. His hometown in Transylvania, known today by its Romanian name, Cluj, was part of Hungary when Wald was born on October 31, 1902. In 1920, Transylvania became part of Romania, but had a large Hungarian minority. W ald, a Jew, spoke Hungarian and "never developed any affinity for Romania," nor a knowledge of its language [29, p. 361] . Educated at home because the local school required attendance on the Sabbath, Wald had passed the gymnasium examination recognized by the University of Cluj, and after fmishing there enrolled in the University of Vienna at age 25 to study mathematics. He eventually took classes from and wrote a dissertation under Karl Menger, working on metric spaces and differential geometry. 10

Menger's father Carl Menger had made his mark on Austrian economics with his work on marginal utility. The younger Menger was well versed in his father's work and well connected to various "circles" of the Viennese intelligentsia, including the philosophical Vienna Circle and a number of overlapping groups of economists. 11 He led his own circle, the Mathematical Colloquium, which met and published its proceedings from 1928 to 1937. The Colloquium hosted an impressive array of local and international luminaries, including John von Neumann, Alfred Tarski, Karl Popper, and Kurt Godel

(whose incompleteness theorem was first presented to the Colloquium).l2

Wald joined the Colloquium in 1930. In some sense, this gathering formed for Wald the center of a network of communities that fundamentally shaped his research. He contributed twenty-one papers to the Colloquium's proceedings, Ergebnisse eines mathematischen Kolloquiums, between 1931 and 1937, co-editing the last two volumes with Menger, Gbdel, and Franz Alt. Many of these papers communicated his research in pure mathematics, but a few, discussed below, point to the connections among disciplines and communities that the Colloquium made for Wald.

Wald's connection to the world of Viennese economics would be among the most important for his passage to the United States. At Menger's invitation, the banker and economist Karl Schlesinger presented his work on equations of economic production to the Mathematical Colloquium, and W ald quickly became interested in the field, publishing [51, 52, 53] in 1935 and 1936.13 In addition to these "first publications in his long list of contributions to mathematical economics" [27, p. 18], Wald's contact with Schlesinger led to some work as the latter's private tutor in mathematics. Menger had encouraged this connection, knowing that as a Jew, W ald had no chance of employment at the university. For the same reason, he introduced Wald to Oskar Morgenstern, who hired him to work at the Austrian Institute for Business Cycle Research, which Morgenstern directed. According to Menger's recollection, Wald had not been disturbed by his lack of opportunity in the academic world: "W ald, with his characteristic modesty, told me that he would be perfectly satisfied with any small private position which would enable him to continue his work in our Mathematical Colloquium" [27, p. 18]. Not only did Wald's work in the Collo-

10Details of Wald's education and research can be found in [27, 29].

quium continue, but his private positions opened up new intellectual opportunities.

His relationships with Schlesinger and Morgenstern formed important threads in Wald's network of communities. These men were active in influential circles of Vienna's economists, both in and outside of the university. At the university, Schlesinger and Morgenstern participated in the seminar of Hans Mayer, appointed to a chair in economics in 1923.14 Apparently, Menger and Wald attended the seminar occasionally as well [5, p. 12] .

Morgenstern and Schlesinger also attended the private biweekly seminar of Ludwig von Mises. Passed over for a position at the university, Mises was nevertheless considered "the central figure in the Viennese economic community" at the time [5, p. 14]. He held his seminar in the 1920s and 1930s at the Vienna Chamber of Commerce where he was employed as the Secretary. Mises and his seminar participants formed the core of the National Economic Association, which Mises revived in the 1920s, becoming its vice-president, with Mayer as president. 15 The group met in a conference room of the National Banker's Association, thanks to its president, Karl Schlesinger. Papers they presented often appeared in the Zeitschrift fiir NationalOkonomie, a periodical edited by Mayer with Morgenstem's assistance [5, p. 18] . 16

Menger presented a paper at an Economic Association meeting on the Petersburg paradox, but he later recalled that Mayer discouraged its publication because of its strongly mathematical character [28, p. 259]. Morgenstern, on the other hand, encouraged the inquiry of mathematically minded researchers into economic questions. As the director of the Institute for Business Cycle Research, another Mises-promoted organization, Morgenstern employed not only Wald, but another student of

1 1 For discussions of Menger's many connections, particularly to the intellectual groups meeting outside the walls of the university, see [5, 1 0, 23, 44, 45]. 1 2The proceedings of the Colloquium have been reissued along with commentaries in [6].

1 3Schlesinger earned his Ph.D. in 1 91 4 under E. von Bohm-Bawerk at Vienna [2, p. 23].

14Mayer succeeded Friedrich von Wieser, famous for his work in opportunity cost theory. See [5].

15Mises seems to have wanted to insure that Mayer and his students were included in the professional society over against Mayer's rival at the university, Othmar

Spann. This motivation may explain why Mayer received the presidency [5, p. 1 7].

16The Zeitschrift was not, however, an official publication of the National Economic Association.


Menger, Franz Alt, and the economist,

Gerhard Tintner, who later noted that

"at the Institute there was a much more

scientific attitude to economics than

elsewhere in Wien at this time. "17

As Wald's contact with economists

in Vienna fostered his interest in math

ematical economics, his work gradu

ally became known abroad. At a 1936

meeting of the Econometric Society in

Chicago, Tintner reported on some of

Wald's results [21, p. 188]. Schlesinger

and Wald himself attended a 1937

meeting of the Econometric Society in

France [36]. Wald's contacts in the

Econometric Society would form im

portant links for him to the statistics

community in the United States.

His economics research in Vienna

had touched upon issues related to sta

tistics, and he had published a paper in

the Ergebnisse on Richard von Mises's

notion of a collective, a concept that

played a role in the axiomatization of

probability.l8 But his immersion in the

ideas of mathematical statistics would

come in the United States. There, for

Wald, the web of communities woven

together for him in Vienna would con

verge with the statistics community

that had been forming in the United

States since the 1920s. The boundary

that American mathematical statisti

cians had drawn around their disci

pline as well as the connections cross

ing that boundary, particularly to the

econometrists, would further shape Wald's research, as well as his place in the scientific community.

Mathematical Statistics and

World War II: Communities

Converge

The Cowles Commission for Research

in Economics formed the thread link-

1 7From an interview reported in [5, p. 20].

ing Wald to the United States. The

Commission had been organized in

1932 by Alfred Cowles, the president of

an investment counseling firm in Col

orado Springs. He had come into con

tact with several members of the

Econometric Society in 1931, when he

began researching methods of fore

casting the stock market. At their sug

gestion, Cowles opened his research

institute and soon after began financ

ing the Society's new periodical Econometrica [ 4].

Both the Econometric Society and

the Cowles Commission had ties to the

Viennese economics and mathematics

communities, as well as to the Ameri

can statistics communities. Karl Menger

participated in the Society's organiza

tional meeting, held in Cleveland, Ohio,

in December 1930 [40]. Gerhard Tint

ner attended its meetings in the early

1930s and joined the Cowles Commis

sion staff in 1936. Other active mem

bers, also present at the organizational

meeting, included Harold Hotelling and

Walter A. Shewhart, both founding

members of the IMS. Hotelling became

one of the American mathematical sta

tistics community's most respected

spokesmen for the discipline [ 15, 16].

It may have been Tintner's connec

tions that resulted in Wald's invitation

to join the staff of the Cowles Com

mission in 1937, which he accepted,

though not without some delay and

hesitation about leaving Vienna. 19

Menger had departed in 1937 for the University of Notre Dame as the polit

ical climate in Austria was becoming

increasingly unbearable. When Hitler's

troops marched into Vienna in March

1938, Wald had not yet left. Morgen

stem was in the United States on a lec

ture tour and stayed, taking a position

in the economics department at Prince

ton when he heard that he had been

blacklisted by the Nazis [5, p. 29] . Wald,

having been dismissed by Morgen

stem's Nazi successor, fmally made his

way to Colorado Springs.20

His stay there was brief-within a

few months he left for Columbia Uni

versity to work with Harold Hotelling.

Hotelling had been teaching eco

nomics and building up a program in

mathematical statistics in Columbia's

economics department since 193!.21

Through the next decade, several early

members of the American mathemati

cal statistics community received

some training at Columbia under

Hotelling, including Samuel S. Wilks

and Joseph L. Doob [ 16].

Funded from 1938 to 1942 by a grant

Hotelling had obtained from the

Carnegie Corporation, Wald worked at

Columbia, first as a research assistant

and then teaching courses in mathe

matical statistics and economics.22 It

was during these first years at Colum

bia that Wald became immersed in the

ideas of mathematical statistics. Char

acteristically, according to his col

leagues, he "worked with prodigious

energy and endurance" [ 14, p. 18], with

"most of his waking moments during

this and the next several years . . .

given to work" [68, p. 2]. Wald became

an assistant professor of economics in

1942, and made his way through the

ranks to professor of mathematical statistics in 1945, finally becoming the chair of an independent department of

mathematical statistics at Columbia in

1946.23 Wald was a popular lecturer

from his first years of teaching at Co

lumbia. Students flocked to his lec

tures, which were "noted for their lu

cidity and mathematical rigor" [68, p.

18Wald 's interest in collectives followed a presentation on the subject at the Collocuium by Karl Popper. Richard von Mises was the brother of Ludwig. His collectives

briefly vied with Kolmorgorov's measure-theoretic formulation for a role in the foundations of statistics. See, for example, [47, 1 7] .

1 9Roy Weintraub makes this speculation about Tintner's role in [65].

20Except for one brother who eventually joined Wald in the U.S. , all of his immediate family perished in the Holocaust. Wald was just one of many scholars making their

way out of Europe in the wake of the Nazi takeover. For an account of the experiences of emigre mathematicians in the U.S., see [38]; on scholars more generally,

see [7]. 21As a graduate student, Harold Hotelling had applied unsuccessfully for an economics fellowship at Columbia. Hoping he could pursue his interests in probability and

economics elsewhere, he went to Princeton in 1 921 on a mathematics fellowship, but he found no one working in his areas of interest. Instead, he did his research in

topology and differential geometry with Oswald Veblen and Luther P. Eisenhart. This was only a temporary shift in Hotelling's focus, however. He later applied some

topological theory to his statistical research, but with the exception of the published version of his dissertation and one other research paper, the rest of his publica

tions dealt with statistical topics. See [1 6]. 22 See Series Ill .A., box 1 1 4, folder 5, Carnegie Corporation of New York Records, Columbia University. 23Hotelling had just left to chair a new department of mathematical statistics at the University of North Carolina. See [33].


Figure 2. Harold Hotelling. Illustration cour

tesy of the Columbia University Archives

Columbiana library. Photo by Alman & Co.

3] . His colleagues described him as "a

gentle and kindly friend" [ 14, p. 19], re

porting that the students, who came

from all over the world, "loved and re

spected him" [29, p. 366].24

The recently organized American

mathematical statistics community

quickly became Wald's professional

home. By 1943, he was a fellow of the

IMS and was elected its president in

1948 while simultaneously serving as

vice-president of the ASA. But Wald

had also been a fellow of the Econo

metric Society since 1939, and his net

work still included economists, many

of them European emigres. One of

them, Jacob Marschak of the New

School for Social Research, had come

to New York in 1940 by way of the Uni

versity of Oxford's Institute of Statis

tics, which he had directed after being

dismissed from the University of Hei-

delberg in the wake of the Nazis' Jew

ish boycott [1 ] .25 He started a seminar

on econometric methods with others in

the New York area. Hotelling, Wald,

and several others with connections to

the mathematical statistics community

attended and contributed to the semi

nar. One of these, Henry Mann, had re

ceived his Ph.D. from Vienna in 1935

for a dissertation on algebraic number

theory and emigrated in 1938. He tu

tored in New York until obtaining fund

ing from the Carnegie Corporation to

study statistics at Columbia. He and

W ald collaborated on several papers,

including one that grew out of their

work in Marschak's seminar [24].

The local communities supporting

Wald's research in New York, like

those in Vienna, transcended univer

sity and disciplinary boundaries. In ad

dition to his colleagues in Marschak's

seminar and at Columbia, Wald worked

for more than two years with statisti

cians and economists on the staff of

the Statistical Research Group (SRG),

a branch of the National Defense Re

search Committee (NDRC). That orga

nization, the brainchild of Vannevar

Bush, president of the Carnegie Insti

tution of Washington, served to "cor

relate and support scientific research

on the mechanisms and devices of war

fare" [34]. Bush had organized the

NDRC in 1940 under an order from

President Roosevelt, and although it

initially had no division for research in

mathematics, Bush added the Applied

Mathematics Panel (AMP) in 1942,

which included the SRG. W. Allen

Wallis, Milton Friedman (both econo

mists), Hotelling, and Wald were

among the principal staff members of

the statistical group. Their work fo

cused on studies of damage to aircraft

from anti-aircraft guns, on methods of

most effectively bombing targets, and

on statistical methods of inspection in

production [37, 64].

As part of this team, Wald devel-

oped the sequential probability ratio

test, an idea that would later play im

portant roles in the theory and appli

cation of statistics. The details of

Wald's discovery have been recorded

by his colleague Allen Wallis [ 64]. Early

in 1943, Wallis had begun to work for

a Navy captain on some inference

problems involving ordnance testing.

Discussing the problems involved with

performing large numbers of tests, the

captain suggested that a "mechanical

rule which could be specified in ad

vance stating the conditions under

which the [testing] might be termi

nated earlier than planned" could serve

to eliminate waste in the testing

process [64, p. 325]. Wallis mentioned

the problems to Friedman, and the two

began discussing it informally, outside

of their regular work for the SRG. De

termining that the problem required

more statistical knowledge than they

possessed, Wallis and Friedman ex

plained the problem to Wald. Initially

unenthusiastic about the prospects of

solving it, Wald called two days later

with an outline of the basic ideas of the

SPRT, a test that uses data as they are

gathered to determine when to stop an

experiment or an inspection. Rather

than basing the experiment or test on

a fixed sample size, a sequential sam

pling plan provides a rule for deciding,

after each trial, whether to take a cer

tain action or to make another obser

vation.

Soon after formulating his ideas,

Wald began work on a monograph

treating the theoretical properties of

his test [57] , while Harold Freeman, a

professor of statistics in the depart

ment of economics and social science

at the Massachusetts Institute of Tech

nology, began a manual describing its

applications [8]. Wald also spoke about

some of the theory underlying his se

quential methods at the 1944 summer

meeting of the IMS held with the Amer

ican Mathematical Society [58] and

24 An historical analysis of the program begun by Hotelling and Wald at Columbia would provide 1nteresting information about the development of the American math

ematical statistics community in the second half of the twentieth century. Such an inquiry, which would go beyond an exploration of the connections between Wald's

communities and his research on sequential analysis, is outside the scope of this discussion. 25The New School for Social Research had opened in 1 91 9, founded by a group of progressive scholars that included John Dewey, Charles Beard, Thorstein Veblen,

and Franz Boas. In addition to providing opportunities for research for social scientists, the school offered an adult education program modeled on the German Volks

hochschulen. Reorganized in 1 922 under Alvin Johnson, an economist and editor of the New Republic, the New School focused more narrowly on adult education un

til 1 933 when Johnson saw an opportunity to rebuild the school 's research program. Over the next year, he brought a dozen social scientists dismissed from their po

sitions in Germany to New York and established what became the Graduate Faculty of Political and Social Science. See [20, 42).

VOLUME 26. NUMBER 1. 2004 29

published a long paper in the Annals of Mathematical Statistics in 1945 discussing the theory and applications of the SPRT [61]. That same year he contributed a non-technical, expository paper on the fundamental ideas and applications of the test to Journal of the ASA [60]. As the editor of the Journal explained in a footnote to the paper, W ald had specifically written it "to be accessible to statisticians with little mathematical background" [60, p. 277, note]. The appearance of Wald's ideas in these two periodicals-at different levels of mathematical sophistication-highlights the distinction still present in the mid-1940s between the communities of applied and mathematical statisticians.

Wald's ideas started a flurry of efforts on the part of other researchers to explore questions raised by his discoveries.26 Much of this research emerging from Wald's ideas found its way onto the pages of the Annals of Mathematical Statistics. That discussion about Wald's new ideas in sequential sampling occurred in what had become the official publication of the American mathematical statistics community suggests that the Annals had come to play a crucial role in advancing the community's discipline. No longer did its existence simply add to the distinctiveness of mathematical statistics by providing the discipline with an important professional accoutrement; mathematical statisticians like Wald seemed to regard it as having the credibility to record their continuing conversations about their theoretical work.

In addition to providing a subject for fruitful theoretical research of the sort that appeared in the Annals, sequential sampling offered a practical means of reducing the number of observations needed for testing and quality control. In his introduction to the Summary Technical Report of the Applied Mathematics Panel, Warren Weaver commented on the usefulness of the SPRT,

saying that the "Quartermaster Corps reported in October 1945 that at least 6,000 separate installations of sequential sampling plans had been made" [37, p. 614].

Sequential analysis provides an important example of an area of research that combined mathematical theory with statistical applications. Its problems and their solutions addressed the practical needs of manufacturers and scientists, and at the same time attracted the technical and theoretical interests of the mathematical statisticians. Abraham W ald himself seemed to be an ideal member of the community practicing this discipline situated between theory and application. He brought to his investigations, as his student and collaborator Jacob Wolfowitz wrote, "a high level of mathematical talent of the most abstract sort, and a true feeling for, and insight into, practical problems" [68, p. 4]. In this case, work on practical problems of defense promoted theoretical advances.

These theoretical advances extended beyond the field of inspection sampling in which the SPRT originated. In fact, the theory of sequential analysis became an important aspect of Wald's theory of decision functions. Decision theory generalized the questions addressed by statistical inference by determining a rule based on randomly selected observations for choosing the best course of action from a set of possibilities.

W ald had begun developing decision theory early in his time of formal study of modem statistics with Hotelling, several years before his World War II work on the SPRT. In 1939 he published a paper in the Annals of Mathematical Statistics introducing its central ideas. Here he articulated the idea of generalizing the problems of hypothesis testing and constructing confidence intervals, seeking to build a theory that would include them as special cases. Wald conceived of an approach that would pro-

vide a means of choosing among any number of hypotheses (in contrast to the Neyman-Pearson theory, which admitted only two) by specifying a system of acceptance regions according to criteria that would "depend on the relative importance of the different possible errors" [55, p. 301].

Wald lectured briefly on these ideas in a 1941 series of addresses organized by Menger at the University of Notre Dame, but only resumed research on them a few years later, after beginning his work in sequential analysis. 27 By then John von Neumann and Oskar Morgenstern had published their 1944 landmark work, Theory of Games and Economic Behavior [50]. Perhaps surprisingly, these two had not met before settling at Princeton. They had had a number of contacts in Vienna in common, including Wald and Menger. Both had also begun thinking about ideas related to game theory before meeting, and their earlier work had some connections to the economics and mathematics communities in Vienna. In particular, Menger had written a book taking a mathematical approach to social ethics that influenced some of Morgenstern's research in the 1930s.28

Since his days in Vienna Wald had been familiar with some of von Neumann's work in economics, having edited the latter's paper on equilibrium in a dynamic economy for the final volume of the Ergebnisse [49]. This paper had some connection to von Neumann's first work on game theory, published in 1928 [48], and while working with the SRG, W ald mentioned to a colleague that some of his ideas in decision theory were based on that 1928 paper.29 In a discussion of the work by von Neumann and Morgenstern in Mathematical Reviews, Wald pointed out in 1945 that "the theory of games has applications to statistics . . . , since the general problem of statistical inference may be treated as a zero-sum two-person game" [59].

His next paper on decision func-

26More than 1 8 papers related to sequential sampling were published between 1 945 and 1 950 [9, pp. 8-9]. A bibliography published in 1 960 lists 374 references deal

ing with sequential analysis that appeared through 1 959 [1 9] .

27Menger organized a Mathematical Colloquium at Notre Dame fashioned after the one i n Vienna. Wald's lectures at that Colloquium were published as [56].

28Menger's book and his motivation for writing it are discussed in [23]. The influence of Menger on Morgenstern's ideas is treated in [22].

29For an account of that conversation, see [64, p. 334]. The connection between von Neumann's two papers is described in [65].

30 THE MATHEMATICAL INTELUGENCER

tions, appearing the same year in the

Annals of Mathematics [62], elabo

rated on the connections between sta

tistical inference and the zero-sum

two-person game. The Annals of Mathematics was one of the key publication

venues for the (pure) mathematical re

search community in the United States,

and this paper was not Wald's only con

nection to that community. He was a

member of the American Mathematical

Society and had already published two

papers in the Society's Transactions. W ald would publish several more in the

Annals of Mathematics as well as in

the Bulletin of the AMS over the next

five years. These papers treated math

ematical issues raised by sequential

analysis, decision theory, and game

theory.

The results linking game theory

with decision function theory highlight

the interaction among the communi

ties in which Wald participated, both in

Vienna and in the United States. Wald's

success in raising and answering ques

tions of interest to a variety of re

searchers was perhaps due in part to

what one colleague described as "his

open-mindedness" to others' pursuits.

"He was ever ready to listen to the prob

lems other scholars encountered and he

was eager to speak about the work he

had in progress himself' [29, p. 366].

In the late 1940s, the publications of

Wald, von Neumann, and Morgenstern

influenced the research of economists

at the Cowles Commission. This work,

which was an "attempt to discover

what kind of behavior on the part of an

individual or group in specified cir

cumstances would most completely

achieve the goals pursued," drew on

the ideas of Wald and von Neumann

and Morgenstern, and led to research

in decision making under uncertainty

by Jacob Marschak and Leonid Hur

wicz [4, p. 48].30 So the convergence of

Wald's communities in the 1930s and

1940s, across national and disciplinary

lines, had fundamental connections to

his research in statistics, particularly to

his work on sequential analysis and de

cision function theory.

Figure 3. Abraham Wald in 1950. Illustration

courtesy of the Columbia University Archives

Columbiana Library.

Conclusion

Wald's early training with Menger in

geometry was far removed from his

work in decision theory-research that

colleagues at the time called his most

significant contribution to statistics. 31

Wald shifted his interests from pure

mathematics to statistics in less than a

decade, and from the perspective of

the disciplines themselves, this shift

has the appearance of a clean break, a

jump discontinuity. A wider historical

focus, however, that considers the con

text of the scientific communities to

which Wald belonged, brings some

continuity to light and helps explain

the connections between Wald's many

professional relationships and discipli

nary interests.

From his earliest years in Vienna,

although studying pure mathematics,

Wald found himself working with re

searchers engaged in a wide range of

intellectual pursuits. Karl Menger, in

particular, introduced him to the cir

cles of Viennese economists. Although

the use of mathematical methods in

economics research was not the dom

inant fashion in the discipline, Wald

met some economists whose mathe-

30Some of Marschak's work in this field appeared in [25]; some of Hurwicz's contributions can be found in [1 8] .

3 1See, for example, [68, p. 9 ] and [14 , p. 1 9] .

matical inclinations drew him into the

world of econometrics, a professional

community on the border of several

fields.

Because these economists were not

tied exclusively to the university, these

connections provided him immediate

employment in Vienna at a time when

the political and social climate barred

him from traditional academic em

ployment. The increasingly interna

tional character of this econometric

community had an even more far

reaching impact on Wald's opportuni

ties. Some of his colleagues, including

Menger, were active in an international

network of researchers with ties to the

Econometric Society, which had been

holding meetings in the United States

and Europe since its inception in 1930.

Wald's research and abilities caught

the attention of this network, resulting

first in a job at the Cowles Commis

sion-his ticket out of Nazi Europe

and then eventually in significant

influence on the direction of the econo

metric community's research.

In the meantime, however, after his

few months at the Cowles Commis

sion, Wald moved to Columbia to work

with Harold Hotelling. Here, his formal

contributions to the discipline of math

ematical statistics began. His connec

tions to the American statistics com

munity began to grow stronger as well.

Like the world of Viennese economics

that W ald had left behind, the statistics

community in the United States had

somewhat fluid disciplinary and insti

tutional boundaries, as well as impor

tant international connections. Its

members did research in the theory of

statistics as well as in statistical appli

cations to economics and biology,

among other fields. Universities, gov

ernment, and private organizations

supported the community's work.

Wald benefited from and con

tributed to the efforts of these patrons

of statistics-at the privately financed

Cowles Commission, as a researcher

and professor at Columbia with

Carnegie funding, and through his de

fense-related research in the SRG. His


A U T H OR

PATTl WlLQIR HUNTER

international connections had helped

him move to the United States, and he

continued to interact with his fellow emigres as well as with researchers

abroad. In late 1950, Wald was on a lec

ture tour through London, Paris, and

Rome. En route to speak at the Indian

Statistical Institute, he died in a plane

crash on December 13.

As a reviewer of Wald's text on de

cision theory wrote in 1951, "Wald's

death [gave] Statistical Decision Functions [63] an altogether new sig

nificance. When it appeared, it was

threatened with rapid obsolescence by

the activity of Wald himself." All but

the fmal comments of this review had

been written before Wald's death,

when the reviewer "believed Wald's

great work on statistical decision to be

but a fraction of what he was about to

achieve" [43, p. 67] . How his later

achievements might have affected the


research of the network of profes

sional communities in which he par

ticipated of course cannot be known.

The importance of the connections be

tween that network and what Wald did

accomplish is clear.

REFERENCES [ 1 ] Kenneth J . Arrow. "Marschak, Jacob." In

International Encyclopedia of the Social

Sciences, ed. David L. Sills. The Free

Press, New York, 1 979.

[2] Kenneth J. Arrow, "Von Neumann and the

Existence Theorem for General Equilib

rium. " In John von Neumann and Modern

Economics, ed. M . Dare et al. University

Press, Oxford, 1 989, 1 5-28.

[3] Bela B. Edwards. Preface. Collections of

the American Statistical Association 1 , part

1 (1 843). v-x.

[4] Carl F. Christ. "History of the Cowles Com

mission, 1 932-1 952." In Economic Theory

and Measurement. Cowles Commission

for Research in Economics, Chicago,

1 952, 3-65.

[5] Earlene Craver. "The Emigration of the

Austrian Economists." History of Political

Economy 1 8 (1 986), 1 -32.

[6] Egbert Dierker and K. Sigmund, eds.

Ergebnisse eines Mathematischen Kof/o

quiums. Springer-Verlag, Vienna, 1 998.

[7] Laura Fermi. Illustrious Immigrants, 2d ed. University of Chicago Press, Chicago,

1 97 1 .

[8] Harold Freeman. Sequential Analysis of

Statistical Data: Applications. Statistical

Research Group, Columbia University,

New York, 1 944.

[9] Bashkar K. Ghosh. "A Brief History of Se

quential Analysis." In Handbook of Se

quential Analysis, ed. Bashkar K. Ghosh

and Pranab K. Sen. Marcel Dekker, Inc. ,

New York, 1 991 , 1 -20.

[1 0] Louise Galland and Karl Sigmund. "Exact

Thought in a Demented Time: Karl Menger

and his Viennese Mathematical Collo

quium." The Mathematical lntef/igencer 22

(2000), 34-45.

[1 1 ] Thomas L. Haskell. The Emergence of

Professional Social Science. University of

Il l inois Press, Urbana, 1 977.

[1 2] Frederick L. Hoffman. "Problems of Social

Statistics and Social Research." Publica

tions of the American Statistical Associa

tion 1 1 (1 908-1 909), 1 05-32.

[1 3] Robert V. Hogg. "On the Origins of the In

stitute of Mathematical Statistics." Statis

tical Science 1 (1 986), 285-291 .

[ 1 4] Harold Hotelling. "Abraham Wald." Amer

ican Statistician 5 (1 951 ), 1 8-1 9.

(1 5] Patti W. Hunter. "Drawing the Boundaries:

Mathematical Statistics in 20th-Century

America." Historia Mathematica 23 (1 996).

7-30.

[1 6] Patti W. Hunter. "An Unofficial Community:

American Mathematical Statisticians be

fore 1 935." Annals of Science 56 (1 999),

47-68.

[ 1 7] Patti W. Hunter. "Foundations of Statistics

in American Text books: Probability and

Pedagogy in Historical Context , " in Using

Recent History of Mathematics in Teach

ing Mathemathics. Mathematical Associa

tion of America, forthcoming.

[1 8] Leonid Hurwicz. "Theory of the Firm and

of Investment." Econometrica 1 4 (1 946),

1 09-1 36.

(1 9] J. Edward Jackson. "Bibliography on Se

quential Analysis." Journal of the American

Statistical Association 55 (1 960), 561-580.

[20] Claus-Dieter Krohn. lntef/ectuals in Exile:

Refugee Scholars and the New School for

Social Research. Trans. Rita Kimber and

Robert Kimber. University of Massachu

setts Press, Amherst, 1 993.

[21 ] Dickson H. Leavens. "Report of the

Chicago Meeting, December 28-30,

1 936." Econometrica 5 (1 937), 1 84-1 97.

[22] Robert J. Leonard. "From Parlor Games

to Social Science: von Neumann, Mor

genstern, and the Creation of Game The

ory: 1 928-1 944." Journal of Economic Lit

erature 32 (1 995), 730-61 .

[23] Robert J. Leonard. "Ethics and the Ex

cluded Middle: Karl Menger and Social

Science in Interwar Vienna." Isis 89 (1 998),

1 -26.

[24] Henry B. Mann and Abraham Wald. "On

the Statistical Treatment of Linear Sto

chastic Difference Equations." Economet

rica 1 1 (1 943), 1 73-220.

[25] Jacob Marschak. "The Role of Liquidity

under Complete and Incomplete Informa

tion." The American Economic Review 39

(3) (1 949), Papers and Proceedings of the

Sixty-first Annual Meeting of the American

Economic Association, 1 82-1 95.

[26] Richmond Mayo-Smith. "Statistical Data

for the Study of the Assimilation of Races

and Nationalities in the United States."

Publications of the American Statistical

Association 3 (1 892-1 893), 429-449.

[27] Karl Menger. "The Formative Years of

Abraham Wald and His Work in Geome

try." Annals of Mathematical Statistics 23

(1 952), 1 4-20.

[28] Karl Menger. Selected Papers in Logic,

Foundations, Didactics, and Economics.

D. Reidel Publishing Company, Dordrecht,

1 979.

[29] Oskar Morgenstern. "Abraham Wald,

1 902-1 950." Econometrica 1 9 (1 951 ) ,

361 -367.

[30] Jerzy Neyman. 'The Emergence of Math

ematical Statistics. " In On the History of

Statistics and Probability, ed. D. B. Owen.

Marcel Dekker, New York, 1 976, 1 47-193.

[31 ] Simon N. D. North. "The Relation of Sta

tistics to Economics and Sociology." Pub

lications of the American Statistical Asso

ciation 1 1 (1 908-09), 431 -446.

[32] Notes. Bulletin of the American Mathe

matical Society 27 (1 920-1 921 ) , 336-337 .

[33] E. Shepley Nourse and Bernard G. Green

berg. "Statistical Training and Research:

The University of North Carolina System. "

International Statistical Review 4 6 (1 978),

1 71 -207.

[34] Order Establishing the National Defense Re

search Committee. In Scientists Against

Time, ed. James Phinney Baxter I l l . Little,

Brown and Company, Boston, 1 946, 451 .

[35] Theodore Porter. The Rise of Statistical

Thinking: 1820-1900. University Press,

Princeton, 1 986.

[36] Program of the Annecy Meeting, Septem

ber 1 1 -15 , 1 937. Econometrica 6 (1 938),

9 1 .

[37] Mina Rees. "The Mathematical Sciences

and World War I I . " The American Mathe

matical Monthly 87 (1 980), 607-621 .

[38] Nathan Reingold. " Refugee Mathemati

cians in the United States of America,

1 933-1 941 : Reception and Reaction . "

Annals o f Science 38 (1 981 ) , 31 3-338, or

A Century of Mathematics in America

Part I, ed. Peter Duren et al . Providence:

American Mathematical Society, Provi

dence, 1 988, 1 75-200.

[39] Henry L. Rietz et al. Handbook of Mathe

matical Statistics. Houghton Mifflin Com

pany, Boston, 1 924.

[40] Charles F. Roos. "The Organization of the

Econometric Society in Cleveland, Ohio,

December 1 930." Econometrica 1 (1 933),

7 1 -72.

[41 ] Dorothy Ross. The Origins of American

Social Science. University Press, Cam

bridge, 1 991 .

[42] Peter M. Rutkoff and William B. Scott.

New School: A History of the New School

for Social Research. The Free Press, New

York, 1 986.

[43] L. J. Savage. "The Theory of Statistical

Decision." Journal of the American Statis

tical Association 46 (1 951 ) , 55-67.

[44] Karl Sigmund. "Menger's Ergebnisse -A

Biographical Introduction." In Ergebnisse

eines Mathematischen Kolloquiums, ed.

E. Dierker and K. Sigmund. Springer-Ver

lag, Vienna, 1 998, 5-31 .

[45] Karl Sigmund. "Karl Menger and Vienna's

Golden Autumn." In Karl Menger: Selecta

Mathematica, vol. 1 , ed. Bert Schweizer

et al. Springer-Verlag, Vienna, 2002,

7-21 .

[46] Stephen Stigler. The History of Statistics:

The Measurement of Uncertainty before

1900. Harvard University Press, Cam

bridge, 1 986.

[47] R . von Mises and J. L. Doob. "Discussion

of Papers on Probability Theory." Annals

of Mathematical Statistics 1 2 ( 1 941 ) ,

2 1 5-2 1 7 .

[48] John von Neumann. "Zur Theorie der

Gesellschaftsspiele." Mathematische An

nalen 1 00 (1 928), 295-320.

[49] John von Neumann. "Uber ein okonomi

sches Gleichungssystem und eine Verall

gemeinerung des Brouwerschen Fix

punksatzes. " Ergebnisse eines mathema

tischen Kolloquiums 8 (1 935-36), 73-83.

[50] John von Neumann and Oskar Morgen

stern. Theory of Games and Economic Be

havior. University Press, Princeton, 1 944.

[51 ] Abraham Wald. "Uber die eindeutige pos

itive Losbarkeit der neuen Produktions

gleichungen (Mitteilung 1) . " Ergebnisse

eines mathematischen Kol/oquiums 6

(1 935), 1 2-1 8.

[52] Abraham Wald. "Uber die Produktions

gleichungen der okonomischen Wertlehre

(Mitteilung I I ) . " Ergebnisse eines mathe

matischen Kol/oquiums 7 (1 936), 1 -6.

[53] Abraham Wald. "Uber einige Gleichungs

systeme der mathematischen Okonomie."

Zeitschrift fur Nationalokonomie 7 (1 936),

637-670.

[54] Abraham Wald. "Die Widerspruchsfreiheit

des Kollektivbegriffes der Wahrschein

lichkeitsrechnung." Ergebnisse eines math

ematischen Kolloquiums 8 (1 937), 38-72.

[55] Abraham Wald. "Contributions to the The

ory of Statistical Estimation and Testing

Hypotheses. " Annals of Mathematical Sta

tistics 1 0 (1 939), 299-326.

[56] Abraham Wald. On the Pnnc1ples of Sta

tistical Inference. Notre Dame Mathemat

ical Lectures, No. 1 . University of Notre

Dame, 1 942.

[57] Abraham Wald. Sequential Analysis of

Statistical Data: Theory. Statistical Re

search Group, Columbia University, New

York, 1 943.

[58] Abraham Wald. "On Cumulative Sums of

Random Variables ." Annals of Mathemat

ICal Statist1cs 1 5 (1 944), 283-296.

[59] Abraham Wald. Review of Theory of

Games and Economic Behavior, by John

von Neumann and Oskar Morgenstern.

Mathematical Reviews 6 (1 945), 237.

[60] Abraham Wald. "Sequential Method of

Sampling for Deciding between Two

Courses of Action." Journal of the Ameri

can Statistical Association 40 (1 945),

277-306.

[61 ] Abraham Wald. "Sequential Tests of Sta

tistical Hypotheses." Annals of Mathemat

ical Statistics 1 6 (1 945), 1 1 7-1 86.

[62] Abraham Wald. "Statistical Decision Func

tions which Minimize the Maximum Risk."

Annals of Mathematics 46 (1 945), 265-

280.

[63] Abraham Wald. Statistical Decision Func

tions. John Wiley and Sons, Inc. , New

York, 1 950.

[64] W. Allen Wallis. "The Statistical Research

Group, 1 942-1 945." Journal of the Amer

ican Statistical Association 75 (1 980),

320-335.

[65] E. Roy Weintraub. "On the Existence of

a Competitive Equilibrium: 1 93Q-1 954."

Journal of Economic Literature 21 (1 983),

1 -39.

[66] Walter F. Willcox. "Cooperation between

Academic and Official Statisticians." Pub

lications of the American Statistical Asso

ciation 1 4 (1 9 1 4-1 9 1 5) , 281 -293.

[67] Walter F. Willcox. "Lemuel Shattuck, Sta

tist, Founder of the American Statistical

Association . " Journal of the American Sta

tistical Association 35 (1 940), 224-235.

[68] Jacob Wolfowitz. "Abraham Wald, 1 902-

1 950." Annals of Mathematical Statistics

23 (1 952), 1 -1 3 .


Mathematics in the Library of El Escorial Pieter Maritz

Does your hometown have any

mathematical tourist attractions such

as statues, plaques, graves, the cafe

where the famous conjecture was made,

the desk where the famous initials

are scratched, birthplaces, houses, or

memorials? Have you encountered

a mathematical sight on your travels?

If so, we invite you to submit to this

column a picture, a description of its

mathematical significance, and either

a map or directions so that others

may follow in your tracks.

Please send all submissions to

Mathematical Tourist Editor,

Dirk Huylebrouck, Aartshertogstraat 42,

8400 Oostende, Belgium


The Real Monasterio de San Lorenzo

at El Escorial (henceforth referred

to just as El Escorial) near Madrid is

the monument which best represents

the ideological and cultural aspirations

of the Spanish Golden Age, that period

of nearly two hundred years that in

cludes the reigns of Charles I, King of

Spain (better known as Emperor

Charles V), and his son Philip II.

El Escorial was originally designed

for a variety of purposes: (1) it is a

monastery for the monks of the order

of St. Jerome, whose church was the

pantheon of Emperor Charles V and his

wife, his son, Philip II, his relatives and

heirs; (ii) it is a palace to house the

King, patron of the foundation, and his

entourage; (iii) the college and semi

nary complete the religious function of

the Monastery; (iv) and the Library

complements these three.

The victory of Philip II's army over

Henry II of France at Saint Quentin (a

town near Paris) coincided with the

feast of San Lorenzo (Saint Lawrence)

on August 10, 1557. The victory at Saint

Quentin led, in part, to the naming of

the Monastery at El Escorial. Philip II

had decided to build the Monastery for

two reasons: to give his father a digni

fied burial after the latter had spent

his last years among the Jeronymite

monks of San Yuste, and to show his

gratitude for the victory over Henry II

of France. Therefore, Philip II decided

to build a temple in honor of San

Lorenzo, who was supposed to have

suffered martyrdom on August 10 of

the year 248. Philip II began his search

for the ideal site in 1558, which he fi

nally located in 1562. The name given

to the place was El Sitio de San

Lorenzo el Real, which happened to be

a plateau on the foothills of the

Guadarrama. However, the Monastery

was better known from the nearby

hamlet-El Escurial, or El Escorial.

The Spaniard Juan Bautista de Toledo

was appointed architect to Philip II by

an ordinance dated 15 July 1559. In the

spring of 1562, Toledo measured the


ground, plotted the foundations, and

organized the working teams for El Es

corial [3]. The Monastery is situated on

the south side of the chain of moun

tains dividing the central plain of

Castile. It took 21 years to build: April

23, 1563 to September 13, 1584.

El Escorial can by no means be con

sidered the work of a single architect,

but rather the product of close collab

oration between two men: Juan

Bautista de Toledo and Juan de Her

rera, a young Asturian [1 , p. 10]. Juan

de Herrera first appeared at El Escor

ial in 1563 when Philip II appointed him

Toledo's assistant. The main structure

of El Escorial was made of wood

beams and stone. The walls were hung

with tapestries from the Flemish

looms, and the Milanese, renowned at

that period for superb workmanship in

steel, gold, and precious stones, con

tributed many exquisite specimens of

art [2, p. 164]. The roofs are of slate and

lead pieces. The architectural design is

Toscan and Doric order.

When Juan Bautista de Toledo died

in 1567, the entire south fac;ade had

been built, as well as the two-storied

grandiose Courtyard of the Evange

lists. After Toledo's death, not Herrera

but Giovanni Battista Castello of Berg

amo and Genoa was in charge at El

Escorial. It was not until 1572 that

Juan de Herrera was fully entrusted

with the immense labor. He was re

sponsible for the completion of the

complex, including several parts of

the building which had not been

designed by Toledo. By 1571, the

Monastery area was almost complete;

work commenced on the House of the

King in 1572 and on the Basilica in

1574, which was consecrated in 1595,

the year in which most agree the

Monastery was completed. Nonethe

less the last stone was placed in 1584,

and a few more years were spent on

its decoration.

The great monastic church, the

Basilica, is the "raison d'etre" of El Es

corial. It consists of two churches,

Figure 1 . General view of the Library.

namely the people's church, or the So

tocoro, and the Royal Chapel and

monastic church, which make up the

main body. On leaving the Basilica, one

crosses the Kings' Courtyard, ascend

ing a staircase at the right of the

vestibule to reach the Library.

The Library, which lies at the very

center of the main side of the

Monastery, is one of the Monastery's

great rooms. That it is in such a promi

nent position shows the importance

Philip II attributed to it in its total cre

ation. The Library has over 40,000 edi

tions, including an impressive number

of Latin, Greek, Arabic, and Hebrew

manuscripts.

Because of the number of windows,

the 55-by-10-meter Library is bright,

full of majesty and light [ 1 , p. 28]. The

frescoes on allegorical themes, painted

between 1586 and 1593 by the Italian

painters Pellegrino Tibaldi and Nicolas

Granello in the mannerist style on the

walls and ceilings, are clearly influ

enced by Michelangelo. The complex

and extensive iconography, which

mostly represents or depicts the great

wise men of Antiquity, was the brain

series of frescoes by Tibaldi begins at

the entrance to the Library with a rep

resentation of Philosophy, and at the

child of Father Jose de Sigiienza. The Figure 2. Arithmetica.

far side, the south side or Convent end,

with a representation of Theology. Be

tween the two extremes, the seven lib

eral arts (Artes Liberales) are organized

under the medieval dictum of the Triv

ium (Gramatica, Ret6rica, Dialectica)

and that of the Quadrivium (Aritmetica,

Musica, Geometria, Astrologia). There

is an allegorical representation of each

one of these arts situated in separate

sections of the ceiling with two learned

disciples of each science depicted in

the semicircular "windows" on either

side. Below the ceiling, the friezes are

adorned with yet more references to

the particular science depicted directly

above. Tibaldi's allegories are almost

fully Baroque in their realistic illusion

of space, their stress on volume, and

their contrasting illumination.

For the history of these arts, see [ 4],

and for a recent article on the seven

liberal arts in Munster's Hall of Peace,

see [5].

The floor of the Library is made of

gray marble and the walls are covered

with fmely elaborated Doric bookcases

(see figure 1) that were built by Jose

Flecha, Juan Senen, and Martin de

Gamboa according to designs by Juan

de Herrera.



On the five brown marble tables

distributed throughout the hall, a large

collection of terraqueous and celestial

globes, maps, astrolabes are on dis

play, which suggests the scientific

cabinet status which the Library un

doubtedly had. Also in the Library is

an armillary sphere, made by Antonio

Santucci around 1582, in accordance

with the Ptolemaic system, the earthly

and celestial spheres of Jean Blaeu

from around 1660, and the stone-mag

net that was apparently found during

the Monastery foundational excava

tions.

In 1573, Philip II began to assemble

the bodies of his dead relatives, and to

place them in temporary vaults pre

pared to receive them. He commenced

with his father and mother, for he made

no effort to disturb the bodies of Fer

dinand V and Isabella I at Granada; his

first and third wives were reburied at

El Escorial, but the remains of Mary

Tudor are in Westminster Abbey to this

day. The Mausoleum now holds all the

Spanish monarchs from Emperor

Charles V (that is, Charles I, King of

Spain) to Alfonso XIII (except the re

mains of Philip V, Philip VI, and their

wives), and also of Don Juan de Bour

bon and his wife, Dona Maria de las

Mercedes.

On the morning of September 13,

1598, in a little room off the Basilica in

El Escorial, just as the sun was rising

above the stony peaks of the Guadarrarnas, Philip II died of (probably) dia

betic gangrene at the age of seventy

one [2, pp. 309, 310], [6, p. 726].

Figure 3. Detail of figure 2 (top left). Figure

4. Measuring (middle left). Figure 5. Detail of

figure 4 (bottom).

Figure 6. Calculating.

Figure 7. Detail of figure 6.

Acknowledgments

The author gratefully acknowledges

the assistance rendered by Joan Roux

and Soretha Swanepoel with some of

the photographs.

REFERENCES ( 1 ] Carmen Garcia-Frfas and Jose Luis Sancho

Gaspar. Real Monasterio de San Lorenzo

de el Escorial. Patrimonio Nacional, 1 999.

(2] Charles Petrie. Philip II of Spain. Eyre and

Spottiswoode, London, 1 963.

[3] George Kubler and Martin Soria. Art and Ar

chitecture in Spain and Portugal and their

American dominions 1500 to 1 800. Pen

guin books, Middlesex, 1 959.

(4] B. Artmann. The Liberal Arts. The

Mathematical ln telligencer 20(3), 40-4 1 ,

1 998.

[5] N. Schmitz. Mathematics in the Hall of

Peace. The Mathematical lntelligencer 24(4),

34-36, 2002.

[6] William Thomas Walsh. Philip II. Sheed and

Ward, London, 1 938.

Mathematics Department

University of Stellenbosch

Stellenbosch, South Africa


VOLUME 26. NUMBER 1 . 2004 37

JEAN-MICHEL KANTOR

Mathematics East and West , Theory and Practice : The Example of Distributions

Science with the grievous glance It [mathematics} casts a grievous glance on mankind, and forces it to confront the solid reality, the real fact only, the fact which destroys alike the most magnificent and the most caustic fantasies.

-Robert Musil Notebooks. Excerpt from book #16, "The Spy"

(1923-1924), W I 1979-80

hat lessons can be drawn from the upheavals that characterized twentieth-

century scientific development? Did mathematics undergo the same up-

heaval? Was its status modified, or did it retain at the time of Hiroshima

the moral and aesthetic value which Plato praised? These

questions are too general, but they suggest a debate.

We will study here only a very precise situation and con

text-that of mathematical work conducted in Russia and

France from the 1930s, inspired among others by Jacques

Hadamard's seminal work, and which led to the worldwide

development of mathematical analysis and to the theory of

partial-differential equations. The relevant documents ex

ist, and after more than fifty years, a historical inquiry is

possible.

The death of Laurent Schwartz, a prominent French

mathematician, member of the Bourbaki group, and one of

the driving forces of the mathematical community for more

than twenty years, can be an occasion to think back about

the birth of the theory of distributions. The recent publi

cation of Soviet archives makes it possible to complement

the work of historians, in particular Adolph P. Yushkevich's

comments on the book [Lu] of Jesper Liitzen (whose rec

ognized competence in the history of mathematics and

whose conscientiousness are beyond question). In the Ap

pendix we provide a translation of Yushkevich's article,

where he examines very meticulously among others, the ar-

tides published in Russian (the references here comple

ment those given in his article). Indeed, while times have

changed, language barriers persist, slowing down the ex

change of ideas between the West and Russia, and hinder

ing a wider diffusion of Yushkevich's text, although it was

published as early as 1991 in the historical journal which

he founded. Naturally mathematics is not exempt from

chauvinistic behavior in the international competition (the

"Popov effect," both in the East and in the West), but

Yushkevich is aware of this and does not indulge in it. He

seems to have at heart to show that there was an intense

mathematical life in the East, in the USSR, isolated as it

was by the cold war and the "construction of socialism in

a single country." Let us take a look at the various sensi

bilities and styles revealed by this episode.

This is also an opportunity to take another look at the in

ternational scientific cooperation of the period, which has

hardly ever been studied. The Fields medal was awarded to

Laurent Schwartz at Harvard in 1950 during the Korean

War-and some called it "the Fields medal of the cold war,"

referring to the difficulties experienced by Hadamard and

his nephew Schwartz in getting a visa to the United States.

© 2004 SPRINGER· VERLAG NEW YORK. VOLUME 26, NUMBER 1 , 2004 39

In any case this is a little-known episode in the relationship

between science and politics, as we shall see.

During the 1930s, the idea of generalized function or dis

tribution was "in the air": it was used by the great physi

cist Paul Adrien M. Dirac (1902-1984), and by Salomon

Bochner, whose work anticipated later work on distribu

tions, in particular with respect to the role of Fourier se

ries [Boc]: physicists were using distributions just as

Moliere's Monsieur Jourdain was producing prose-un

awares. The very birth of the theory of generalized func

tions/distributions can thus be rife with lessons for a time

when the relationships between mathematics and physics

are evolving (cf. [JQ]).

Above all, this study is an opportunity to bring into view

two different conceptions of the role of mathematics, in the

East and in the West (to simplify)-one, led by Schwartz

and Bourbaki, focusing on structures, and the other, cen

tered on Sobolev and the Saint Petersburg school, closely

linked to physical sciences. All these questions are of in

terest for the present, and we think that we owe it to the

memory of Laurent Schwartz and his keen sense of the

scholar's civic role to approach them-with honesty and

rigor-at last.

The Actors: Hadamard (1 865-1 963), Sobolev

(1 908-1 989), and Schwartz (1 91 5-2002)Two Different Worlds

Laurent Schwartz is a mathematician admired all over the

world, known indeed beyond specialists' circles for his role

as a "mathematicien dans le siecle" [S2]-a "secular" math

ematician, involved in the social and political world. One

of the active members of the Bourbaki group after World

War II, he was also a militant partisan of all the humani

tarian causes of the twentieth century, from militant Trot

skyism between 1936 and the Resistance to the "Comite

Au din" during the Algerian War, and the cause of the math

ematicians standing for human rights in Eastern Europe.

Schwartz's personality brings together the qualities of the

French intellectual, growing out of a family with a long tra

dition of social ascent, which supplied France with emi

nent intellectuals.

Nothing except mathematics is shared between a Lau

rent Schwartz and a Sergei Sobolev. The latter is also a fine

scholar, but gained much less fame in the West. Sergei

L'vovich Sobolev was born in Saint Petersburg in 1908, in

a family connected to the nobility; his father was a noted

lawyer from Saint Petersburg (later Leningrad). In the en

during rivalry between Moscow and Saint Petersburg, a city

created by Peter the Great in 1703, the mathematical

schools had a particular role: Saint Petersburg was the city

of Euler, who lived there for a large part of his life, as well

as Chebyshev (1821-1894), Markov (1856-1922), Lyapunov

(1857-1918). This already shows that mathematical life in

the city was very open to the sciences and technology. It

was also in Saint Petersburg where the managerial talents

of Steklov (1863-1926), an applied mathematician, led to

the creation of research institutes of the Academy which

later bore his name. A detailed account of political strug-


gles in Moscow and Leningrad within mathematical soci

eties, and their dramatic consequences ("the Luzin affair"),

can be found in several recent publications like [De, Mar,

M-Sh, Viu, Y] , as well as in various issues of the history pe

riodical launched by A. P. Yushkevich.

Sobolev was a brilliant student at a particularly young

age, like a number of other 20th-century Russians. At the

University, which he entered in 1925, he followed the

courses of Grigori'i Mikhailovich Fikhtengolts (1888-1959)

and Nikolai Maksimovich Gunther (1871-1941) (the latter

in potential theory). He met Vladimir Ivanovich Smirnov

(1887-1974), who would be a professor then a co-worker

of Sobolev, a professor from 1925, and later dean of the

"Mat-Mekh" faculty for 25 years. (This did not spare him

the displeasure of criticism in 1957 on the occasion of a

tribute to Euler: after Smirnov praised the positive influ

ence of Frechet, who was attending the ceremony, on So

viet mathematics, Kolmogorov chided him publicly for his

"love of the foreign" ([Y] , page 31).

Sobolev's first publication was a counter-example to a

result announced by Saltykov and reused by Gunther in his

analysis course. In 1929, after obtaining his doctorate, he

joined the Institute of Seismology, where he collaborated

with Smirnov, before joining the Steklov Institute, becom

ing at 24 a corresponding member, then a full member

the youngest-of the USSR Academy of Sciences. Along

side his mathematical career, in which he was always open

to other sciences and other countries despite a difficult

context (he was fluent in French, which he had learned as

a child from his Belgian nanny), he conducted various pro

jects including the creation of the Siberian Centre of the

Academy of Sciences. He always displayed Russian pride

and a strong loyalty to the Soviet power, as a member of

the Party from the 1930s, yet this loyalty did not prevent

him from taking sometimes difficult and courageous posi

tions (for example in the Lysenko affair); but he sometimes

had a more orthodox stance, e.g., as one of the critics blam

ing Luzin in 1936 for his openness and his foreign publica

tions [De].

Somewhere between these two personalities is Jacques

Hadamard-"le petit pere Hadamard" (Daddy Hadamard), as he was familiarly called by his admirers, or "the living

legend of mathematics," an expression used by Hardy to

introduce him to the London Mathematical Society in 1944

[Ka]. After Poincare, Hadamard is without doubt the

Frenchman who most influenced twentieth-century math

ematics. He is also an illustration of the humanist and uni

versalist traditions in French culture at their best. For a

better understanding of the rest of this study, it must be

noted that Hadamard was Laurent Schwartz's great-uncle

by marriage, and followed his studies in secondary school,

then at the Ecole Normale Superieure (ENS). From his sem

inar grew the Bourbaki group (via the Julia Seminar). The

Hadamard seminar was formative for several generations

of students at ENS. Laurent Schwartz recognized (lac. cit.) the crucial part played by Hadamard in his education. We

know well the life of Hadamard [M-Sh]-the immensity of

his mathematical work, and also his left-wing radical com-

Figure 1 . S. L. Sobolev with his children, Moscow, 1940.

mitment, initially motivated by the Dreyfus affair, then by

the rise of Nazism, and his closeness to the French Com

munist Party along with Frederic Joliot-Curie. The archives

of the Academie des Sciences have copies of articles pub

lished during his stays in USSR, in which he praised the po

litical system and the merits of Soviet science [H1] .

The Facts

The 1 930s: Sobolev Functionals

Within the framework of his militant activities for friend

ship between peoples, Hadamard, an indefatigable traveler,

made numerous journeys in the East, in particular to China

and the USSR.

Visits to the USSR:

• 1930: he attends the Congress of Soviet mathematicians

in Kharkov, in July, then travels to Kiev. He meets

Sobolev in Kharkov and later they have discussions in

French in Leningrad. Hadamard asks Sobolev to keep

him informed of his work [M-Sh p. 217];

• May, 1934: Hadamard is a member of a delegation of nine

French academics travelling for the "week of French Sci

ence" in the USSR. In Leningrad he meets Sobolev but

does not participate in the second Congress of Soviet

Mathematicians (24-30 June, 1934), where Serge Sobolev

makes three presentations:

1. A new method for solving the Cauchy problem for hy

perbolic partial differential equations;

2. Generalized solutions of the wave equation;

3. On the diffraction problem for Riemann surfaces.

The contents of these talks were certainly discussed a fort

night earlier with Hadamard, who followed with interest

the works of his colleague: Sobolev himself acknowledged

the influence of the notion of finite part, discovered by

Hadamard in 1903 (!), in his discoveries of 1934-35 (see Ap

pendix).

As underlined in the obituary of Sobolev by Jean Leray

[L3] and the review of the [Lu] book by Yushkevich, the

discovery of generalized functions must be ascribed to

Sobolev in his articles of 1935 and 1936:

• The Cauchy problem in the space of functions, Proceedings (Doklady) of the USSR Academy of Sciences, 1935,

volume III (VIII), N 7 (67) (in French).

• New methods to solve the Cauchy problem for normal

hyperbolic linear equations, Mat. Sbornik, 1936, vol. 1

(43), 36-71 (in Russian).

In these two articles, Sobolev explicitly defines generalized

functionals as continuous forms on the space of differen

tiable functions of a given order m with support in a com

pact set K, for fixed m and K. He establishes the funda

mental properties of generalized functionals.

Why in French?

The year 1934, with the murder of Kirov, a popular Com

munist leader in Leningrad, was a turning point for the

USSR, which began to shut itself off, and where "ideologi

cal" struggles broke out, as illustrated by the campaign al

ready mentioned against Luzin. In this campaign the issue

of whether to publish in Russian or in a more widely ac

cessible language (as was done for most mathematical pub

lications until the war), played an important role. The pub

lication of the seminal article by Sobolev in Russian and in

French in the same volume of Doklady was purposeful.

Sobolev, who had criticized Luzin, was patriotically pub

lishing in Russian; the publication in French, though com

mon at that time, might be risky as a reminder of the so

cial background of the author. It is quite likely that this

double publication was perceived positively by Hadamard

at least, maybe even suggested by him.

In 1936 Hadamard was again in Moscow, returning from

China. In 1945 he made another journey to Moscow and

Leningrad as a member of the French delegation to the cel

ebrations of the 220th anniversary of the Russian Academy

of Sciences. He did not meet Sobolev (we shall see why).

However, as early as 1935, the reports he makes, back in

France, show Hadamard's awareness of the problems. He

evokes the tragic disappearance of a rising star, a clear al

lusion to the suicide of the young and brilliant mathemati-

Figure 2. S. L. Sobolev not reading mathematics, Novosibirsk, 1962.


cian Schnirelman, a number theorist and topologist, in

1938. Hadamard praised the close relationships between

pure and applied science in the USSR, even in mathemat

ics [H2).

Sobolev's Discovery

Sobolev, inspired among other things by Hadamard's work,

first defmed generalized solutions of a wave equation, then,

in 1934-35, "generalized functions," without any mention

of a reference equation (contrary to the description in [Lu],

page 65), first under the name of "ideal" functions (as in

dicated by Mikhlin), probably in reference to the introduc

tion of ideal numbers by Kummer, then as "generalized

functions" in the seminal article of 1935. The older term

dangerously evoked idealist philosophy [M-Sh] at a time

when the Czech-born Marxist philosopher Kolman and the

other followers of "proletarian science" were stirring things

up in Leningrad. This hesitation over the naming, and the

double publication in Russian and in French, confirm that

Sobolev had a clear idea of the importance of his work and

its general character, contrary to the assertions made in

[Lu]. The reader can read in the Appendix a detailed analy

sis of the various articles written by Sobolev and his in

spirers and colleagues. There are hardly any clues to the

ongm of his discovery,

troduction) with a surprisingly partial and anti-chronolog

ical presentation of Sobolev's articles:

Soboleff Proceedings of the Soviet Academy of Sciences, 1, 1936, p. 279-282, Math. Sbornik, 4, 1938, 471---496, Friedrichs: (1939), . . . Kryloff (1947). . . . Some articles mentioned in previous notes were published later than the introduction of distributions, but the authors did not know of distributions, due to the slowness of the publishing process, the slowness of international communications, or delays in my publication. See also Soboleff's functionals (''New Methods" . . . ) .

The first two references have no critical interest. The last

one, "New Methods," is the article already quoted. On the

other hand, he "forgets" to mention the Doklady article of

1935 (received on 7/17/1935). Moreover, this Note remained

unchanged in later editions [S' 1 ] .

The Key to the Mystery

In his autobiography, Schwartz, after a minimal description

of the discovery made by Sobolev in 1935 as found in the

article not mentioned in Note 4 above, wonders ([S2), p.

236) why, after the war,

apart from Hadamard's

work. Sobolev had a clear idea of Sobolev did not continue

his work on generalized

Hadamard's curious and

enthusiastic mind could

not remain indifferent to

this ongoing work; he read

the 1936 article as soon as

the importance of h is work and its general character.

functions.

The answer is instruc

tive. Sobolev disappeared

from mathematical re

search circles and re

it got to the Ecole Normale Superieure. Moreover,

Hadamard always remained a subscriber to the main So

viet mathematical reviews [ManS] . Jean Leray was becom

ing a specialist in partial differential equations and also a

participant in the "prehistory of distributions" with his no

tion of weak solution [L1], the subject of his "Cours Pee

cot" in 1935 at the College de France. He told Sergei Sobolev

in the 1980s that he had discussed his 1936 article with Lau

rent Schwartz before the war (personal communication of

V. Chechkin, holder of the Chair of Partial Differential

Equations at Moscow University and grandson of S.

Sobolev).

It took Schwartz more than ten years, including several

years not dedicated to mathematics and some years of slow

maturation, to bring forth his work of 1945, which reuses

Sobolev's defmition. But meanwhile Sobolev had surrepti

tiously left the stage! Sobolev did not pursue his work in

this direction, though some work with Smirnov was not far

from it. He was awarded the Stalin Prize in 1941, and be

came a deputy of the Parliament of the Soviet Union and

Director (beginning in 1941) of the Steklov Institute. This

left Schwartz free to develop the theory. The missing parts

were mainly Fourier transforms and the topological struc

ture of the space of distributions (see below).

Moreover, the first publication in which Schwartz quotes

his sources [S1] contains a note (Note #4, page 5 of the In-


frained from any foreign contacts from 1943 until 1953 be

cause he was busy with other activities in applied

mathematics-very applied indeed; he became the main

assistant of the director I. V. Kurchatov in "Laboratory 2,"

which was initially located within Moscow University, and

which became LIPAN, where the first Soviet atom bomb

was developed [Viz] .

It is not surprising that both in the West and in the East

great mathematicians played a critical role in the nuclear

projects [Go, p. 383). The complex physics of shock waves

involved in those projects entails the solution of nonlinear

equations, and Bethe (who told von Neumann about it) had

noticed the instability of the numerical approximation in the

solutions; the skills of top mathematicians were needed!

This work, essential to Soviet defense, led Sobolev to the

numerical solution of the equations for a spherical nuclear

reactor. He also studied the so-called gun effect and its vari

ation under neutron bombardment. This work is essential

in applications to assess water loss in reactors (Three Mile

Island and Chernobyl). In 1951 Sobolev received the most

prestigious civilian award, the Hero of Socialist Labour

medal. Naturally any foreign contact was totally forbidden

to him-even his wife did not know Sergei Sobolev's where

abouts when he left for periods of several months after

briefly visiting home. His publication list is much shorter

during this period-apart from his 1950 manual written in

Figure 3. Novosibirsk 1978: In the foreground 0. A. Oleinik, E. 5. Soboleva, 5. L. Sobolev, and French visitor Jean Leray.

a hospital where he was recovering with a broken leg-and

the main part of the work just mentioned is still unpub

lished.

The continuation is known in more detail; Sobolev re

sumed classic scientific activities in the 1960s. Mean

while, with Schwartz's book Distribution Theory and the

line of research pursued by him (tempered distributions

and Fourier transforms, applications of the theory of

topological vector spaces), he came to be considered the

father of the theory. The much-delayed acknowledgement

of Sobolev's paternity came only fifteen years later [L3,

L4] . Schwartz's main contribution, in the heritage of the

Bourbaki project (in a nutshell, the "algebraization of

analysis"), was to bring together Sobolev's definition and

the work begun by Dieudonne on topological vector

spaces in 1940 [Du] following Banach's famous Theorie des operations lineaires and Kothe's work. During the

period 1945-1950, Schwartz understood the importance

of applying TVS theory to the case of generalized func

tions.

This process of discovery by bringing together sepa

rate theories could be called "appropriation by bourbak

ization." It was frequently used-see, e.g., [Gr, Mi, S4]: a

beautiful idea by Minlos, which Gross had also had inde

pendently, was embodied ten years later in the theory of

"radonifying applications" without any acknowledgement

of Minlos. In the case of Sobolev, the author himself had

fostered the process! The term "bourbakization" of course

refers to the "Bourbaki project," which consisted in sin

gling out the deep structures of mathematics to reach the

degree of generality able to give a theory its extensive

power. This clear explanation of the topological vector

space structure paved the way to Schwartz's theorem of

kernels and to the theorem of Malgrange--a student of

Schwartz-on the existence of solutions to partial differ

ential equations in any open set in Euclidean space. Note

also that Schwartz's other student at that time, Jacques

Louis Lions (1928-2001), had been focusing since his dis

sertation on the use of Sobolev-style methods (Sobolev

spaces discovered in the 1930s ) , less elegant but more ef

fective than functional analysis, for example, for cubature

formulas. Lions later became the leader of French applied

mathematics.

"Percolation"

The "percolation" (or "illumination," as he also refers to it)

process discussed by Schwartz in his autobiography prob

ably consisted in the final linkage, made on the occasion

of a problem posed by Gustave Choquet, between Sobolev's

theory of functionals (defined as continuous linear forms)

and the work of Dieudonne and later Dieudonne and

Schwartz. Actually, contrary to the assertions in [Lu], by

January 1946 Schwartz had a good knowledge of Sobolev's

work: a participant recalls that during his "Cours Peccot"

at the College de France "he constantly had Sobolev's name

in his mouth."

Conclusions and Issues

Theory and Practice- East and West

In the era of triumphant socialism in Russia, science was

expected to be at the service of the people for the progress

of mankind. This notion was in fact the new face of an an

cient cultural tradition in Russia, still vital in Saint Pe

tersburg, even in the field of mathematics. One thinks for

example of Pafnutii L'vovich Chebyshev, whose concern

for linkages, ways to cut up fabric, and laws of chance,

were closely related to highly abstract concerns. Cheby

shev has very explicitly described [C] the mutual benefits

of mathematics and practical applications. In the case of

functionals, Smimov, in a profound analysis, shows how

central the experimental sciences remained among the

VOLUME 26. NUMBER 1 , 2004 43

concerns of Russian mathematicians (see Appendix). For the Russian school, in the period discussed here but also later, the value of mathematics is measured by its effectiveness. Even general topology, through Tychonov to Pontryagin, has applications to the study of control systems. More recently, this is also the case in the work of Arnold and his school. We can imagine the difficulties experienced by Lusitania, the famous school created by Luzin in Moscow around function theory, and largely inspired by (German) set theory or (French) function theory. In contrast, France, the country of Descartes, Galois, and Bourbaki, favored an interest in mathematical investigation "pour l'honneur de l 'esprit humain" (a phrase coined by Jacobi): the value of a theory is assessed by its degree of generality-a purifying quality of generality, synonym of efficiency, and evidenced in the connections between apparently remote domains for the production of new theories, and by the elegance of the concepts [B2]. (Similarly in Germany.) For Schwartz, for example, distribution theory develops as he associates the Sobolev definition to the theory of topological vector spaces, thus arriving at the properties of the topology of distribution spaces. This will make possible the work of his students Lions and Malgrange, after the presentation of the kernel theorem at the 1950 Congress in Cambridge, USA; this theorem was the cherry on the cake, and earned him the Fields Medal and the later paternity-in the West at least-of distributions. These two views of mathematics and their role were present simultaneously in both countries, and sometimes in the productions of the same mathematician, as in the cases of Gel'fand in the USSR or, earlier, Fourier in France. In the period we are interested in, the emphases were as we have mentioned above. Though this question goes beyond the scope of this article, we note that recent developments in physical and mathematical sciences show continuing ten-

Figure 4. The young Laurent Schwartz.


sion between effectiveness and rigor (Feynmann integral, string theory; [JQ] for example gives an account of the debate). Should one rejoice that the political upheavals of the last decades threaten to standardize worldwide the practice of mathematical science and the answers to this "essential tension" [Ku]?

Theory and Practice- Probability and Measure

Measure theory and its relevance to probability deserve particular scrutiny: it was the first serious stumbling block in the development of the Bourbaki project [B2]. From the point of view that interests us, distribution theory obviously served as a weighty "ideological" argument at the time. As

an illustration, here is an excerpt from the introduction of [Bl] concerning measure theory: " . . . Thus, integration theory is connected, on the one hand to the general theory of duality in topological vector spaces, and on the other hand to distribution theory, which generalizes certain aspects of the notion of measure, and which we shall present in a later book" It can be seen how much this "structuralist" stancealso present in Schwartz's approach to distributions-concealed the real nature of the phenomena in question, for example the subtleties of random processes. (Another illustration of errors of judgment is found in Andre Weil [Wl], [W2]: " . . . The time has come to try, through closer analysis, to split up Lebesgue's discoveries into various elements in order to identify what is essential in the manipulation of an integral, and what is relevant to the specific operations over sets on which we are most frequently working.")

Rather than disregard for potential applications, it was the desire to give priority to the structure over the phenomenon, and to the architecture over the portrait, that caused a delay of fifteen years in French research on probability. Quite ironic, in the country of Laplace, Lebesgue, Borel, and most particularly Paul Levy, Fortet, Loeve, Ville,

Figure 5. Laurent Schwartz presented with a Vietnamese peasant hat.

and Doeblin, who, in the 1930s, were at the forefront of the

revival of probability theory by developing new trajectorial

aspects of processes, which were to have a wealth of appli

cations in the second half of the 20th century, including in

the solution of the great problems of classical analysis and

its renewal (PDE, Dirichlet's problem, potential theory, etc.).

We hope in subsequent work to revisit this question, on

which Schwartz himself was self-critical ([S2]).

Problems of Communication

From the Russian Revolution to the 1970s, interchanges be

tween mathematicians suffered from many difficulties be

cause of the lack of intellectual freedom in USSR, the cold

war, and internal conflict within the Soviet cultural and uni

versity system from the 1960s. Thus the Soviet delegation

as a group declined the invitation to the 1950 Congress at

Harvard, at the height of the Korean War. This was the Con

gress during which Laurent Schwartz was awarded the

Fields Medal. We suppose that Kolmogorov, even though

he was a member of the medal committee, did not even

mention the name of Sobolev, then assistant director of LI

PAN. In the 1960s, more problems appeared: we witnessed

difficulties of exchange and of publication of mathemati

cal articles in the USSR, which led for example to the creation of the review Funktional'niiAnaliz by Israel M. Gel'

fand in the 1970s.

Mathematics and Politics

At the end of the interview used as a working document by

Liitzen, Laurent Schwartz makes a surprising linkage be

tween distribution theory and political democracy, quoting

the eminent British Marxist historian Moses Finley, for

whom democracy was discovered by the Greeks: "It was

the Greeks, after all, who discovered not only democracy,

but also politics. I am not concerned to deny the possibil

ity that there were earlier examples of democracy . . . . What

ever the facts may be . . . their impact on history, on later

societies, was null. The Greeks, and only the Greeks, dis

covered democracy, precisely as Christopher Columbus,

not some Viking seaman-discovered America." [Fi]

In other words, Sobolev is cast as the Viking, Schwartz

as Columbus. Beyond the general debate on philosophic re

alism (was democracy discovered or invented? and distri

butions?), it is clear that neither mathematics nor political

concepts emerge ex nihilo, and that scientific work is a

process: Schwartz comes after Sobolev, Dirac, and even

Euler! ( cf. Appendix) In retrospect and based on the ex

amination above, this comparison appears to be not merely

excessive but unjustified. The same field of mathematical

analysis saw the emergence of the point of view of alge

braic analysis, whose importance seems much more

promising, if only-in Bourbaki's view-by the "bridges"

which it builds. Going farther, and taking into account the

frequent cases where Schwartz left some things unsaid (see

above), we can wonder whether there may be an allusion

to the ideological power gained by Bourbaki, sometimes

against the will of some members. For example, Claude

Chevalley remained a libertarian all his life. In a beautiful,

nostalgic interview [Che], he confesses that he thought he

was "enlightening the world of mathematics," in a common

desire for renewal. It is in fact in Chevalley's writings that

we find the most interesting remarks on the relationships

between Bourbaki and political thinking: he says that read

ing the political theorist Castoriadis made him understand

the wrongness of his view of mathematical logic!

Schwartz's aura personified that of Bourbaki: modem

mathematics and educational reform, the role of the

scholar in pronouncing what is right, and indirect power in

the life of society: the aura of the mathematician, which

Schwartz knew how to apply "for the good cause," is quite

evocative of Greece. Frequently, top mathematicians seem

to confound mathematical action, political struggle, and

moral principles.

The disappearance of such a strong personality evokes

the end (announced by some) of the era of "great narra

tives"-the disappearance of myth-creating romantic ac

tors (as Bourbaki, the dream of distributions). Is this hap

pening? Time will tell and History will judge.

Acknowledgments

The author thanks the Sobolev family for permission to use

the photographs of S. L. Sobolev; the photographs of

L. Schwartz are by kind permission of the editors of Pour la Science.

REFERENCES [Be] Beaulieu, Liliane. Bourbaki. Une histoire du groupe de mathemati

ciens franc;ais et de ses travaux (1934- 1944). Ph.D. Thesis, Univer

site de Montreal, 1 990, Paris, 1 992.

[Boc] Bochner, Salomon. Review of L. Schwartz's Theorie des distri

butions, Bull. Amer. Math. Soc. 58, ( 1 952), 78-85.

[B1 ] Bourbaki, N. Elements de mathematique, livre VI: Integration, Her

mann, 1 952.

[B2] Bourbaki, N . L'Architecture des Mathematiques, p. 35-47, in Les

grands courants de Ia pensee mathematique, ed. F. Le Lionnais, Edi

tions. Albert Blanchard, Paris, 1 962.

[Boul] Bouleau, Nicolas. Dialogues autour de Ia creation mathematique.

Association Laplace-Gauss, 1 997.

[C] Chebysheff, PafnutiT. Rapport du professeur extraordinaire de


I 'Universite de Saint-Petersbourg sur son voyage a I' stranger. In Oeu

vres de P. L. Tchebychef, A Markoff and N . Sonin, editors, Chelsea,

New York, 1 961 .

[Che] Chevalley, Claude. Nicolas Bourbaki, Collective Mathematician.

The Mathematical lntelligencer, 7, no. 2, (1 985), 1 8-22.

[De] Demidov, Sergei S. The Moscow school of the theory of functions

in the 1 930s in: Golden years of Moscow Mathematics, ed. S.

Zravkovska, P. Duren, vol. 6, AMS, LMS, 1 993.

[Du] Dugac, Pierre. Jean Dieudonne mathematicien complet, Paris,

Jacques Gabay, 1 995.

[Fi] Finley, Moses I . Democracy Ancient and Modern, Rutgers Univer

sity Press, 1 985.

[Ge1 ] Gel'fand Israel M. Some aspects of functional analysis and alge

bra. Proc. Int. Cong. Math., 1 954, Amsterdam (1 957), 253-276.

[Ge2] Gel'fand, Israel M. and Shilov, GeorgiT E. Generalized Functions,

5 vols. , Academic Press (1 977) New York & London.

[Go] Godement, Roger. Analyse Mathematique, tome 2, Springer-Ver

lag , 2000.

[Gr] Gross, Leonard. Harmonic Analysis on Hilbert Space, American

Mathematical Society, 1 963.

[H 1 ] Hadamard, Jacques. Le Mouvement Scientifique en URSS, Rap

port presents en 1 935 a Paris aux journees d 'etude et d 'amitie franco

sovietiques.

[H2] Hadamard, Jacques. Rapport paru en 1 945 apres le 2208 an

niversaire de I 'Academie des Sciences de Russie.

[Ka] Kahane, Jean-Pierre. Jacques Hadamard, The Mathematical lntel

ligencer, 13, no. 1 (1 991 ), 23-29.

[JQ] Jaffe, Arthur; Quinn, Frank. "Theoretical mathematics": toward a

cultural synthesis of mathematics and theoretical physics. Bull. Arner.

Math. Soc. (2) 29 (1 993), no. 1 , 1 -1 3.

[Ku] Kuhn, T, The essential tension, University of Chicago Press, 1 979.

[L 1] Leray, Jean. Sur le mouvement d'un liquide visqueux emplissant

l 'espace, Acta Math. 63 (1 934), 1 93-248.

[L2] Leray, Jean. Travaux de M. Laurent Schwartz, rapport annexe a Ia

candidature de Laurent Schwartz, 1 964, Academie des Sciences, Paris.

[L3] Leray, Jean. Rapport sur ! 'attribution du prix Cognac-Jay (Samar

itaine) 1 972 a Laurent Schwartz, Jacques-Louis Lions et Bernard Mal

grange.

[L4] Leray Jean. La vie et I'CEuvre de Serge Sobolev, La Vie des Sci

ences, serie generale, vol. 7 , 1 990, No. 6, p. 467-471 .

[Lo] Lorentz, G. G. Mathematics and Politics in the Soviet Union, J. Ap

proximation Theory 116 (2002), 1 69-223.

[Lu] Ultzen, Jesper, The Prehistory ofthe Theory of Distributions, Berlin,

Springer-Verlag, 1 982.

[ManS] Mandelbrojt, Szolem. Souvenirs a batons rompus, recueillis en

1 970 et prepares par Benoit Mandelbrot, Cahiers du Serninaire d'his

toire des mathematiques, No 6, 1 985, p. 1 -46.

[ManB] Mandelbrot, Benoit: Chaos, Bourbaki and Poincare. The Math

ematical lntelligencer 11, no. 3 (1 989), 1 0-1 2.

[Mar] Maritz, P. Around the graves of PetrovskiT and Pontryagin. The

Mathernatical lntelligencer 25, no. 2 (2003), 55-73.

(M-Sh] Maz'ya V.-Shaposhinikova T. Jacques Hadamard, AMS-LMS,

1 998.

[Mi] Minlos R. Continuation of a generalized random process to a com

pletely additive measure, Dokl. Akad. Nauk SSSR (N. S.) 119 (1 958),

439-442.

1 From the French translation by Jean-Michel Kantor


[S1 ] Schwartz Laurent. Theorie des Distributions, vol. 1 , Hermann, Paris,

1 950.

(S' 1 ] Schwartz, Laurent. Theorie des Distributions, Nouvelle edition en

tierement corregee a fondue et augmentee. Hermann, Paris, 1 966.

[S2] Schwartz, Laurent. Un Mathematicien aux Prises avec le Siecle,

Editions Odile Jacob, 1 997.

[S3] Schwartz, Laurent, in "Les Mathematiciens," Pour Ia Science, Paris,

1 996.

[S4] Schwartz, Laurent. Seminaire "Applications radonifiantes, "

1 969-70, Ecole Polytechnique.

[Viu] Viucinich, A Soviet mathematics and dialectics in the Stalin Era.

Historia Mathematica 27 (2000), 54-76.

[Viz] Vizguin, V. lstoria sov. atornnogo proekta. lzdat. rousk. kirstian. gu

manitarnogo instituta St. Petersburg.

M Yushkevich, A P. Encounters with Mathematicians, Golden Years of

Moscow Mathematics ed. S. Zdravkovska, Peter L. Duren, History of

mathematics, vol. 6, American Mathematical Society, LMS, 1 991 .

(W1 ] Weil, Andre. Calcul des probabilites, methode axiomatique, inte

gration, in Revue Rose, vol. 1 of CEuvres completes, pp. 260-272.

Springer-Verlag, Berlin, New York, 1 979.

[W2] Weil, Andre. L 'integration dans les groupes topologiques et ses

applications, Hermann, Paris, 1 940; CEuvres, vol. 1 . See also the

commentaries on pages 551 -555.

Added in proof

Kutateladze, S. S. Sergei Sobolev and Laurent Schwartz: two fates and

two fames (in Russian), Novosibirsk, Sobolev Institute, preprint 1 2 1 ,

Oct. 2003.

Appendix

I.

Adolf P. Yushkevich: Some remarks on the history of the

theory of generalized solutions for partial differential

equations and generalized functions. Istoriko-matematicheskie issledovanie, 1991, 256-266 (Russian).1

Since 1968, I have been publishing appreciations of famous

French mathematicians on their Russian colleagues, on the

occasion of their applications as foreign member of the

Academie des Sciences de Paris (the election process has

remained unchanged since the middle of the 19th century).

Often these appreciations are interesting from the point of

view of the history of relationships between the scientists

of our two countries. Naturally these appreciations reflect

the personal point of view of the speakers, and frequently

the judgment on the candidates also depends on the inter

national situation. The appreciations published so far are

those of Chebyshev, Lyapunov, Bernstein, Vinogradov,

Lavrent'ev, and Kolmogorov. It was a great pleasure for me

to receive Paul Germain's authorization to publish Jean

Leray's appreciation on his colleague Sergei Sobolev.

II.

The best appreciation of the work of Sobolev is in the ref

erence [4], published for his 80th birthday: Sergei L'vovich

Sobolev (6. 10. 1908-3.01. 1989) completed his studies at

Leningrad University in 1928. His doctoral dissertation ad

visers were N. M. Gunther (1871-1941) and V. I. Smimov

(1887-1974), both students of V. A. Steklov (1863-1926), himself a student of A. M. Lyapunov (1857-1918). For most of their lives, these four professors worked on the theory of differential equations, the theory of partial differential equations, and their applications in mathematical physics and mechanics. They were eminent members of the mathematical school of Saint Petersburg, later Leningrad, headed by P. Chebychev (1821-1894), one of the professors of Lyapunov. As a student, Sobolev also followed lectures by Fikhtengolts (1888-1959), who was the first to develop in Leningrad the study of functions of a real variable, which prompted the extensive work of the school of Moscow with D. F. Egorov (1869-1931), N. N. Luzin (1883-1950), and their students.

Sobolev belongs to the fourth generation of Chebycheffs school, which systematically exploited the relationships between mathematics and the concrete problems of sciences and technology, without precluding a concern for the introduction of abstract questions-often over and above practical issues (even in number theory). It is necessary also to stress that Sobolev's professors themselves were already using the most recent developments in mathematics-topology, the theory of functions of a real variable, new areas of the theory of functions of a complex variable, integral equations, and the new area of functional analysis. Sobolev's research work began immediately after the end of his studies, in the department of seismology of the Academy of Sciences headed by V. I. Smirnov. While still a university student, he presented a Master's thesis on a topic suggested by Gunther. At the Institute of Seismology, Sobolev again conducted work closely related to this topic previously suggested by Gunther, viz. the analytical theory of partial differential equations and in particular the propagation of elastic waves. Some of his first publications were cosigned with Smirnov. On June 29, 1930, Sobolev presented a paper at the first Congress of Russian Mathematicians: "The Wave equation in a heterogeneous isotropic environment," an abstract of which appeared in the Notes aux Comptes-Rendus de l'Academie des Sciences de Paris. This work interested Jacques Hadamard (1865-1963), who attended the Congress and himself made a presentation on a topic close to Sobolev's: "Partial differential equations and the theory offunctions of a real variable" ([5], in French and in Russian). Sobolev's early work (summarized, after those of Gunther and Smimov, in section 8 of [6]) was already getting considerable attention from Soviet mathematicians, and Sobolev, not yet 25, was elected on 01 .02. 1933 a corresponding member of the Academy of Sciences. He was later elected a member on 29.01.1939.

Ill. In 1932 Sobolev enters the Physico-Mathematical Institute created by Steklov in 1921 . It is in this period that he develops his most important work, which establishes the beginning of the theory of generalized functions. He is the first to define them mathematically and to set about studying their fundamental properties. A summary of his ideas was written by Smirnov ([7] , p. 187-191). Sobolev started

articulating his ideas on distributions, which he calls functionals and were later called "generalized functions," from the late 1920s and the early 1930s-or possibly earlier. He presented them in his lecture "Generalized solutions of the wave equation" on June 29, 1934 at the second Congress of Soviet Mathematicians in Leningrad. Here is the laconic summary by the author: "The class of functions which we can consider as solutions to the wave equation from the classical point of view consists of twice-differentiable functions. But in various practical applications it seems convenient to consider functions with singularities of a welldefined type. We introduce a space of integrable functions in the sense of Lebesgue, in which it is possible to defme the generalized solutions of the wave equations as the limits of twice-differentiable solutions. Using a simple integrability criterion, we give a necessary and sufficient condition for a function to be a generalized solution, and we establish the link between the usual solutions and generalized solutions. Finally, this theory is applied to some concrete examples" ([8] , p. 259). Leray sees considerable importance in Sobolev's work in the theory of generalized functions, called distributions in western mathematical literature, but he dates them back to 1935 and 1936, not earlier. Smimov ([7], p. 187) refers to the article [8] of 1935 and to two other articles quoted in [9) and [ 10). In the bibliographical list [9] , the lecture of 1934 is not even mentioned.

In his two classic volumes on the history of mathematics in the last two centuries, Jean Dieudonne writes that Sobolev began the study of generalized functions in 1937 ( [11 ] , p. 2, [7], p. 171). In the 1982 article "Fonctions generalisees," Vladimirov quotes [9) along with the "generalized solutions" article ([12] , vol. 3, p. 1 102-1 1 10 and 1 116-11 17). It is only in the article written on the occasion of Sobolev's jubilee in 1989, that one of the authors, also named Vladimirov, indicates the article of 1934 "in which the theory of generalized functions appears for the first time." Notoriously, the establishing of chronological priority between several authors of a scientific discovery is not always a harmonious process, but nowadays it does not lead to such negative effects or violent quarrels as in the case, for example, of Newton and Leibniz, the creators of infinitesimal analysis.

IV.

The prehistory of the theory of Sobolev's generalized functions has not been investigated much. Gunther's work should probably be ascribed a role in laying down the core notions; in particular, his smoothing method for insufficiently differentiable functions, which is often quoted by Smimov ([7] p. 184). A path toward the theory of generalized functions is found earlier still in Hadamard's work, starting with his remark "on functional operations" and his "Le<;ons sur la propagation des ondes et les equations de l'hydrodynamique" (Lectures on wave propagation and the equations of hydrodynamics) published in 1903. The academician Steklov drew attention to these sources during the presentation of Hadamard's work when he was elected a corresponding member of our Academy on December 2, 1922.


The article by Steklov is deep and definitely important.

He insists in particular on the significance of the first arti

cle, where Hadamard uses for the first time the term "func

tional," and he discusses in detail the results of the second

article. He insists in particular on the existence of "shock

waves" in compressible liquids and elastic bodies. One re

mark by Steklov is particularly interesting: issues of hydro

dynamics, translated into the language of mathematical

analysis, coincide with the theory of the characteristics of

partial differential equations, "which emerged completely

independently from any physical origin." This remark shows

that Steklov understood perfectly the significance for later

applications of abstract basic research pursued in complete

independence from their use. Moreover he uses the classic

terminology he is familiar with (and uses the term "func

tional" only occasionally), and he could not foresee that a

few years later, it is essentially in his own institute that the

groundwork for the theory of generalized functions was go

ing to be laid. This speech by Steklov was not published un

til 1968 ([1] p. 1 10-115). As regards Hadamard's advances

toward the theory of generalized solutions for partial dif

ferential equations and generalized functions, let us quote

a statement made by G. Shilov (professor at Moscow Uni

versity from 1917 to 1975, and a recognized specialist on

this question), on February 10, 1964, during a memorial ses

sion of the Moscow Mathematical Society: "In solving hy

perbolic equations, Hadamard essentially introduces the de

vice of the theory of generalized functions of one or several

variables. This discovery remained dormant at the time

(Hadamard was many years ahead of the thinking of math

ematicians of his generation), and it was only in the mid

fifties that generalized functions spread worldwide in ques

tions of analysis" ( [ 13], p. 185). Shilov concludes by quoting

Szolem Mandelbrojt's words of 1922 about the famous

"Readings on Cauchy's problem" (translation to French in

1932, and to Russian in 1978): "The notions developed in

this work lead to general topology and to functional analy

sis, and the introduction of the notion of elementary solu

tion has a high degree of generality with respect to distrib

utions (generalized functions)" ([ 14], p. 4-5). Furthermore,

we owe to Hadamard the terms "functional" and "functional

analysis." Jean Leray also mentions this precursor work

What we say here does not by any means detract from

Sobolev's achievement; he is the first to give a rigorous def

inition-and in several ways-of the modem notion of gen

eralized function, and to lay the bases of later developments

in various domains of the theory of generalized solutions to

partial differential equations and generalized functions, as

an autonomous domain of analysis.

v. Almost all of Sobolev's work on the theory of solutions and

generalized functions was published in Russian, except the

article in French of 1936 ([9], 22). So it is not surprising that

in other countries [than the USSR] this work did not attract

immediately the interest it deserved. This remark also ap

plies to the book Some Applications of Functional Analysis in Mathematical Physics (Leningrad, 1950), which cor-


responds to the course that Sobolev taught at the time at the

University. This book was not translated into English until

1963 (into German in 1964). It is quoted several times by

Leray, and, as noted by V. I. Smirnov, "this book played an

important part in the use of the modem ideas and methods

of function theory and functional analysis for the solution of

problems of the theory of partial differential equations" ([6]

p. 191). In Russia, Sobolev's new ideas, following those of

his masters Gunther and Smirnov, diffused fairly quickly, and

were extended and developed from the 1950s.

In the diffusion abroad of these new directions of math

ematical analysis, a major role must be ascribed to the book

Theorie des Distributions (in two volumes) by Laurent

Schwartz, a corresponding member (1973), then a full

member (1975), of the Academy of Sciences, and a profes

sor at the Ecole Polytechnique. Several articles published

by Schwartz between 1945 and 1948 already used the ex

pression "distributions." After the publication in 1950-51 of

Schwartz's book, distribution theory developed consider

ably and received numerous new applications.

The first historical study on research on distributions,

published by Jesper Liitzen in 1980, contains an accurate,

flawless mathematical analysis of the works of Sobolev,

Schwartz, and many earlier or contemporary mathemati

cians. In spite of all these achievements, Jesper Liitzen's

book has some gaps and, from my point of view, uncon

vincing evaluations, which can be explained by an imper

fect knowledge of work in Russian generally and of Sobolev

in particular (although his bibliography contains 1 1 refer

ences which were translated into English, as well as the

1950 book already mentioned and the thick course book in

its third version of 1954). Leray's note on Sobolev's work

is a substantial complement to Liitzen's study.

Without trying to write the history of the question, I shall

make here some remarks on Liitzen's book First, I cannot

agree with his evaluation of the results obtained by

Sobolev, then Schwartz, and their place in the development

of distribution theory. The essence of the differences be

tween their theories, according to Liitzen (p. 64), is that for

Sobolev distributions are a technique to resolve a specific

problem, while Schwartz developed distribution theory un

der multiple angles, and applied it to formulate and resolve

rigorously numerous problems. It is true that in 1934

Sobolev began with Cauchy's problem for the wave equa

tion (which is hyperbolic), but then he did not limit him

self to one of the applications which he had introduced,

and he considerably enriched them, as shown by Jean Leray

("work whose scope, variety, and power are admirable").

It is also true that these various contributions published in

successive articles were not collected into a monograph,

which would doubtless have had the seminal role of

Schwartz's book-which became the basic book for nu

merous researchers abroad and here. Liitzen briefly sum

marizes the fundamental difference between the works of

Sobolev and Schwartz: "So Sobolev invented distributions,

but distribution theory was created by Schwartz" (page 64).

Variants of this reflection occur in the book On page 67,

after quoting Lyusternik and Vishik's words in a speech pro-

nounced on the occasion of Sobolev's fiftieth birthday (1959), Liitzen supports what they say but immediately adds that "further development of the theory was not Sobolev's but Schwartz's achievement." Without intending in the least to detract from the essential importance of Laurent Schwartz's book of 1950, I find more balance in S. Vladimirov's judgment ([12], vol. 4, p. 1104): "The foundations of the mathematical theory of generalized functions were laid by Sobolev in 1936 with the aim of resolving Cauchy's problem for hyperbolic equations, but in the 1950s L. Schwartz gave a systematic statement of the theory and mentioned numerous applications." He could have added that the systematic account in modem terminology in Schwartz's work overshadowed Sobolev's. As regards Schwartz's possible knowledge of Sobolev's previous discoveries, according to the statements made by L. Schwartz in 1950-51 and in 1974, the latter did not know of them before 1945 (p. 67 of [16]). Elsewhere Liitzen writes that Schwartz's attention was called to Sobolev's work by Leray in 1946. Certainly Sobolev and Schwartz arrived at their discoveries of "generalized functions" and "distributions" by different paths-but certainly, too, there is no reason for assigning Sobolev's work to the "prehistory" of distribution theory, as Liitzen does three times (pages 64, 67, and 156).

More generally Liitzen devotes more attention in his book to Laurent Schwartz than to Serge! Sobolev. The statement of the results according to the bibliography is correct; but he could have gone into more detail. In this respect, Leray's note contains valuable complements, but even this note does not contain enough bibliographic data on Sobolev. These indications could have and should have been enriched by the inclusion of Lyustemik and Vishik's text (which Liitzen quotes and uses). There is no reference to Sobolev's teachers, in the text or in the reference index. L. Schwartz's biography is given a very contrasting treatment. In chapter 6, the reader is informed of all the stages in Schwartz's life, the names of the professors at the Ecole Normale (Leray, Leyy, Hadamard), Schwartz's membership in the Bourbaki group, his discovery in six months of distributions, his conversations with de Rham (also mentioned by Leray), etc. All this information is valuable, and it is regrettable that Sobolev's mature work is treated by Liitzen in merely half a page (p. 60).

To be sure, distinctions between the "prehistory" of a theory and its development are a matter of convention. The notion of "distribution" appeared in various authors of the beginning of the 20th century, and one could even go back to Euler (see below), whom Liitzen also mentions. However, we distinguish ideas belonging to prehistory-already born but not introduced yet into a well-defined framefrom ideas belonging to the history of a theory-where they have a precise definition and we focus on the study of their specific properties. Thus one reasoned with functions of one type or another in ancient Greece, in the Middle Ages, and at the beginning of the modem period, but functions as objects of mathematical investigation, in all their generality, appear only at the end of the 17th century. However, the title of Liitzen's book is "Prehistory of etc.", which

situates Schwartz-to whom the largest part of the book is devoted-as part of the prehistory of the theory.

If Liitzen had restricted his study of the prehistory of distributions to Western Europe, it would have been natural to insist on Schwartz's work But for a study of the development of mathematics as a worldwide process (which it has always been), the book's structure seems inadequate. This is shown in any case by the historical study of the facts in our country. Sobolev, following his teachers, played an important role by laying the bases of the numerous studies which began even before 1970, the publication date of Smimov's already mentioned article, which presents a summary of twenty years of work in the theory of partial differential equations-elliptic, hyperbolic, parabolic, or mixedas well as contributions to the general theory, and work by 0. A. Ladyzhenskaya, S. G. Mikhlin, N. N. Ural'tseva, and others. All this work was not isolated from foreign research. Collaborations took place between all the countries involved, albeit it was sometimes made difficult by problems of communication and the lack of personal contacts (which developed a lot in recent years, their earlier scarcity having been supplemented by numerous reference periodicals). The objective of my remarks on the history of the theory of generalized solutions and generalized functions is not only to clarify the conclusions of Liitzen's book, but also to introduce the presentation of Sobolev's candidacy by Leray, which is an essential complement to the Danish historian's account.

I must make a few additional remarks on the protohistory of the theme, which led to the solution of the equation of the vibrating string and to the dispute between d'Alembert and Euler, which went on for almost thirty years from 1750, and which somehow involved all the mathematicians of the 18th century. Briefly stated, d'Alembert completely excluded the case of discontinuity of a derivative, and even more stringently of the function itself. In my book on the history of Russian mathematics before 1917 (Nauka 1968), I showed that Euler, from physical considerations, deemed it necessary to admit, as solutions of problems of mathematical physics, what he called "broken" functions and curves; we would say that the initial position of the string and its initial velocity are functions of position which are continuous by segments, i.e., where discontinuities (in the modem sense) of the first two derivatives are allowed.

Not having the necessary mathematical means at his disposal, Euler gave a simplified geometrical description of the distribution of waves and their reflection for a string fixed at a single point. I take the liberty to quote my own book: "Threads are woven here between Euler's ideas and the new methods of the 20th century up to Sobolev and Schwartz's generalized functions" (p. 166, 169). During the more recent history of the notions of solutions of partial differential equations, the historian S. S Demidov relied as I did on a quote from d'Alembert (Opuscules, volume IX). "Euler essentially constructed a solution of the equation as a generalized solution-for which a correct defmition and, even more, construction, were beyond the capacities of the mathematicians of that time" (p. 179). I added in my book that because of its


A U T HOR

IIQU8S bclis· cont Of

practical utility, Euler's construction had been the object of

the attention of numerous mathematicians over time. I used

Truesdell's well-known study of Euler's work in hydrody

namics and elasticity. References in Russian on this subject

are not known by Ltitzen (for example in volume IX of

d'Alembert's Opuscules, he only mentions Demidov's lecture

at the International Congress on the History of Mathematics

in 1977, which he also uses extensively on page 15). Ltitzen

also traces back to Euler the notion of generalized solution,

and he draws a parallel between Euler and Sobolev. Using

the definition of generalized solutions as limits of series of

classical functions, Ltitzen notes that this idea can be found

in Euler in 1765 and Laplace in 1772, and that the rigorous

definition was introduced in 1935 by Sobolev and later by

the other authors, in particular Schwartz in 1944.

In conclusion, we would say that Euler introduced func

tions which could seem strange to his contemporaries, for

example, (- 1 )x, x being an arbitrary real number . . . but

not the delta function!

REFERENCES [1 ] French-Russian Scientific Relationships, A Grigorian, A Yushke-

vich, Nauka, 1 968.

[2] lstori. Math. /ss/edovanie, 31 (1 989), Nauka.

[3] Ibidem, 90, vol. 32-33.

[4] Bakhalov-VIadimirov-Gonchar, Sergei L'vovich Sobolev, Uspekhi

Mat. Nauk, vol. 43, 5 (1 989), 3-1 3.

[5] Proceedings of the First Congress of Mathematicians of the USSR,

Kharkov 1 930, Gonti, 1 935.

[6] Sobolev SergeT L. Smirnov Seminar, Soviet Academy of Sciences,

1 949.


[7] Smirnov S. (ed .) , Partial Differential Equations, Department of Math

ematics of the University of Leningrad, 1 970.

[8] Proceedings of the Second Congress of Mathematicians of the

USSR, Leningrad, June 24-30, 1 935.

[9] Forty Years of Mathematics in USSR, 191 7-57, Fizmatgiz, 1 959,

vol. 2, Lyusternik-Vishik article.

[1 0] Mathematics in USSR, 1958-67, Bibliography, Nauka, 1 970, vol.

2.

[ 1 1 ] Dieudonne, J . (dir.), Abrege d'Histoire des MatMmatiques,

1 700-1900, Hermann, 1 996.

[1 2] Encyclopedie des MatMmatiques, vol. 5.

[1 3] Shilov, J . Hadamard and the Birth of Functional Analysis, Usp. Mat.

Nauk, 19, 3, (1 964), 1 83-186.

[1 4] Mandelbrojt, S. Article "Hadamard Jacques, " Dictionary of Scien

tific Biography, vol. 6, Charles Scribner, 1 972.

[1 5] Schwartz, Laurent. Theorie des Distributions, Hermann, Paris,

1 950-1 951 '

[1 6] Li.ltzen . The Prehistory of Distribution Theory, Springer-Verlag,

1 980, 232 pages.

[1 7] Leray, Jean. La vie et I 'CEuvre de Serge Sobolev, La Vie des Sci

ences, Comptes Rendus, serie generale, vol. 7 , 1 990, No 6,

467-471 '

A Prest ig ious Col lection from the London Mathematica l Society

v

The Proceedmgs. Journal. Bulletin and Joumal of Computation and

athematiCs of the London Mathematical Society are among h ds lead•ng mathematical

research penod•cals. The subJect cover a e ranges across a broad spectrum of ma hemat1cs, covenng he who e of pure ma hema 1cs

together th some more apphed areas of analys•s. ma hematiCal p ys . t eoretiCal computer sc ence. probab1hty and s a 1s 1cs

To subscnbe pl ase cont journ Is cambrldg .org

w.cambridg . rg

cKichan Ufl.�lll UC'

Some Comments on Sobolev and Schwartz s. Ku dz

I :uul hi •

Sobolev Institute of Mathematics Novosibirsk State University Russia e-mail: sskut@math. nsc .ru

Mathematical Word Processing • U � Typesetting • Computer Algebra

Version 5 Sharing Your Work Just Got Easier

y U IO . rully hyperlink d nd


In


The Reception of the Theory of Distributions Peter Lax

Courant Institute of Mathematical Sciences

New York University

New York, NY 1 00 1 2

USA

e-mai l : [email protected]

ld§ijl§',ifj Osmo Pekon e n , Ed itor I

Feel like writing a review for The

Mathematical lntelligencer? You are

welcome to submit an unsolicited

review of a book of your choice; or, if

you would welcome being assigned

a book to review, please write us,

telling us your expertise and your

predilections.

Column Editor: Osmo Pekonen, Agora

Center, University of Jyvaskyla, Jyvaskyla,

40351 Finland


The Science of Conjecture: Evidence and Probabil ity before Pascal by James Franklin

BALTIMORE, THE JOHNS HOPKINS UNIVERSITY PRESS,

2001

600 pp. $22.50 PAPER ISBN 080-1 86569-7

REVIEWED BY NORMAN LEVITT

Franklin is nominally a mathematician and his book shows a mathe

matician's touch when it deals directly with mathematical matters, but this review must begin with a warning: this is not primarily a history of mathematics, nor is it a book that a strong mathe

matical background makes particularly accessible. Rather, The Science of Conjecture is a history of a lengthy philosophical investigation that has spanned a number of eras and civilizations, and which, of course, continues even now, with no discernible end. The central question is one of partial belief, belief that may be quite pronounced, but which stops short of demonstrative or "mathematical" certainty. Very simply, what kind of evidence and how much of it ought to be necessary to persuade a reasonable inquirer that it is more appropriate to accept a proposition than to reject it? How ought we to order degrees of belief that lie somewhere between absolute conviction and utter dismissal? What degree of belief is necessary to justify an action with grave consequences? This is not the sort of question that mathematicians are given to worrying about, at least not when going about their mathematical business. But it is the central concern of "practical reason," and in various forms it confronts us in many societal roles-as jurors, for instance, or as investors.

Franklin's view is that pre-modem and early modem thinking on this ques-

tion had a considerable influence on the origins of probability calculus, in the work, principally, of Fermat, Pascal, and Huygens. His history, which begins in classical times, closes at this juncture, the take-off point, we might think, of Western scientific rationalism. But by no means would he have us believe that formal, quantitative probability theory (or its evolution into statistics) made earlier modes of inquiry

obsolete or brought them to completion. Rather, the probabilist's quantification of likelihood and expectation, even in its most sophisticated development, covers only a narrow range of experience and informs our judgment at best in limited measure. Thus the meditations of ancient Aristotelians and cloistered Scholastics are far from being mere vestigial echoes of benight

edness. They touched on important epistemological points that have not become that much more transparent even a millennium or two down the line.

I am not an historian of ideas. My knowledge of the secondary, let alone the primary, literature relevant to The Science of Conjecture is too scant to be called even minimal. Yet I will risk the

opinion that Franklin's book is deeply researched and intensely learned. It is a throwback to the days when humanist scholarship meant thorough saturation in a vast ocean of sources, rather than picking out two or three texts and weaving elaborate postmodem curlicues around them. Franklin knows his authors, scores of them, thoroughly, and is scrupulously concerned to represent them with full fidelity to their ideas and their originality. He is not loath to judge those ideas, but his judgments do not come cheap; they are the fruit of care

ful reading and careful thought, not of a prefabricated agenda.

The book is roughly chronological in organization, so that the work of the inventors of mathematical probability culminates the inquiry. But the more important structural principle is to

© 2004 SPRINGER-VERLAG NEW YORK, VOLUME 26, NUMBER 1 , 2004 53

study various notions of likelihood and

less-than-absolute certainty as they

carne to bear on other fields of thought.

The most important of these, overall,

is law, an area in which the same ques

tions of credibility and weight of evi

dence recur from era to era and civi

lization to civilization. Franklin surveys

classical (that is, Hellenic and Roman)

ideas, along with Islamic, Judaic, me

dieval, and Renaissance thinking on

the primary question: what degree of

direct and indirect evidence, what lines

of argument, will suffice to convict the

accused or to win redress for a plain

tiff. Obviously, there is no consensus

to be found here. More discouraging,

perhaps, is that it is far from evident

that the principles of judgment now in

stituted in our courts represent the dis

tilled essence of hard-won wisdom.

The best to be said, I think, is that we

have abandoned divination, along with

the notion that evidence of guilt re

quires confession, which in turn ne

cessitates torture. But this is a long

way from our having a coherent and

systematic legal epistemology.

Because medieval Europe is a cen

tral focus, canon law, as well as civil,

is an area where Franklin finds inter

esting thinkers. This naturally con

nects to the prolific literature of moral

philosophy, whose "practical" side in

cluded the vast array of "manuals" for

the priesthood, guides, principally, to

the quasi-judicial ritual of Confession.

Here, "cases of conscience" come to

the fore: how much doubt must arise

in the mind of the faithful believer in

order to proscribe a possible course of

action as potentially sinful (or, put the

other way, how much doubt must be

effaced before the action may be un

dertaken with a clear conscience)? The

lack of agreement among the school

men, here, as well as the ferocity of

their disputes, is one of the most in

triguing topics of the book. We must

adjust our minds to the ancient usage

of "probable." During the period in

question, this does not mean "more

likely than not" but rather, "sustainable

by at least some evidence, argument,

and authority," in other words, merely

plausible. Thus, "probabilism" is the

doctrine that a justification merely

"probable" in this weak sense suffices


to render actions licit, even in the face

of weightier opposing arguments. This

doctrine carne to be identified with Je

suit thought in particular, and con

tributed to the Jesuits' reputation for

sly equivocation.

Franklin also considers epistemo

logical and evidentiary standards in

pre-modem science. There are some

surprises here; some "modem" philos

ophy turns out not to be so modem as

all that. Even Ockharn's famous Razor

is revealed not so much as a revolu

tionary device heralding the end of me

dieval obscurantism as a tool already

quite familiar and well worn with use

before Ockharn took it up. On the ba

sis of Franklin's shrewd observations,

one learns new respect for the long-de

spised Scholastics. He joins a growing

corps of intellectual historians newly

appreciative of the stunning, singular

mathematical genius of the 14th-cen

tury cleric Nicole Oresme, as amazing,

in its way, as that of Rarnanujan.

At the same time, Franklin shows

that many of the developments we look

upon as practical solutions to thor

oughly secular problems grew out of

attempts to create a calculus capable

of evaluating financial and commercial

dealings from the point of view of sanc

tioned morality. Thus, notions of "ex

pected value" that might apply to such

arrangements as annuities, lotteries,

risk insurance, and what we would

now call futures contracts were devel

oped primarily to distinguish fair deal

ing from usury. Of course, this applies

in large part to the formal treatises of

scholars. One assumes that, in parallel,

men of practical affairs developed their

own formularies for these things, but

here the record is scanty. What is most

interesting in this regard is that the ori

gins of true probability calculus in the

mid-17th century seem closely con

nected to the tradition of parsing

doubtful situations to decide what is

truly fair, equitable, or licit. We usually

think of the problems addressed by

Fermat and Pascal as gambling or gam

ing problems, and thus we assume that

they grew out of serious interest in

these practices, with the mathemati

cians seeking to justify, generalize, or

improve rules of thumb known to de

voted gamblers. Franklin demon-

strates that for the most part this is

quite doubtful. The classic problems of

the early probabilists are usually

phrased in terms of finding an equi

table division of stakes in an inter

rupted game, which would be a curi

ous way of looking at things if the

object were to improve the acumen of

dicers and card-players. But it makes

perfect sense when seen as a continu

ation of the Scholastic tradition of de

ciding what course must be followed

to avoid sinful error. (The sin, note, lies

not in gambling, but in a lack of even

handedness in the setup of the game.)

This is not to say that real-life gamblers

were not on the scene, or that their in

sights were never absorbed by the

unworldly mathematicians. But their

practical concerns are not central. And

we know, of course, that Pascal him

self carne to prefer the high-stakes

game of winning souls through con

version. (Franklin does discuss the fa

mous Wager, but his account is fairly

conventional.)

This explains some curious omis

sions in the underlying justification for

probabilistic formulae. The early mas

ters never seem to refer to the idea that

the probability of an outcome (of, say,

a throw of the dice) is essentially iden

tical with the ratio of the number of

times that outcome is achieved to the

total number of trials when the latter

is very large. Similarly, though notions

of expected value of a game occur, and

are correctly computed, this is not re

lated in any direct way to the average

result over a large ensemble of actual

games. By the same token, the actuar

ial approach, deriving parameters em

pirically from observed frequencies

and averages, and then using these as

probabilities or expected values in fur

ther calculation, is not part of the pro

gram either. Basic probabilities-the

one-in-six that a given face will tum up

on a given throw of a die-are simply

assumed ad hoc, without any sense

that real life may depart from these

ideal frequencies. As well, notions that

we think of as quite elementary-the

rules for deriving probabilities of con

junctions and disjunctions, for in

stance-are not clearly formalized.

Thus, the invention of probability cal

culus, though undeniably a brilliant

leap, is placed firmly in its historical

and cultural context; musty tradition of

all sorts clings to it as it emerges. Why

does it emerge at all, and why at this

time? Franklin's answer flatters math

ematicians: The chief reason was the

existence of a large, active community

of mathematicians who inspired, in

structed, and challenged each other,

and provided an admiring audience

when success was achieved by one of

their number. Readers who are famil

iar with John Aubrey's Brief Lives, whose quirky biographical sketches of

noted contemporaries include dozens

of 17th-century British mathemati

cians, will appreciate this point.

In the end, however, Franklin does

not see quantitative probability theory

as the end point of the schools of

thought he reviews. Nor should he. The

word "probable" as now used, even by

scientists, rarely falls within the rubric

of quantitative probability calculus. It

is easy to see this even within the dis

course of mathematics itself. What do

we mean when we aver that conjecture

X is "probably" true or that a given strat

egy is "likely" to succeed in proving it?

Pretty clearly, there is no way to give a

quantitative significance to these asser

tions, nor, indeed, to translate them into

any suitable formalism. Worse, we re

ally have no idea of a systematic epis

temology that might justify them. Yet re

marks like these are really the working

discourse of research mathematicians;

we work on conjectures that seem

probable, using methods that seem

likely to get somewhere, but all this

"seeming" is tied up in unaccountable

subjective intuition, informed by anal

ogy and experience. The philosophical

status of all this is very unclear. The

same applies to science all down the

line; we think of string theory as prob

able (or not) and likewise for anthro

pogenic global warming or prions as the

cause of Alzheimer's. These judgments

are the stock-in-trade of everyday sci

ence. But there is no widely accepted

justificatory theory of judgment that

stands behind them. In that sense, we

are hardly further along than the cen

turies-dead heroes of Franklin's saga.

What Franklin makes of this is im

portant to note. Good ideas and suc

cesses, even partial ones, to hard prob-

lems are hard-won and tend to rest on

centuries of missed leads and blind al

lies. Thus they should be all the more

precious to us. We are obliged, then, to

reject the flighty notions now far too

popular that the "episteme" is tran

sient, arbitrary, and endlessly mutable,

that fundamental ideas are merely cul

tural fashion statements, and that sci

ence is bound, eventually, to recede as

surely as it once advanced. Science, for

Franklin (and, I hope, all of us), is a cu

mulative achievement as much rooted

in obscure toil as in famous triumph,

which should deepen, rather than di

lute, our esteem for it.


Rutgers University

New Brunswick, NJ

Piscataway, NJ 08854

USA

Prime Obsession by John Derbyshire

WASHINGTON, DC, JOSEPH HENRY PRESS, 2003,

$27.95, ISBN 0-309-08549-7

The Music of the Primes by Marcus du Sautoy ��---------------NEW YORK, HARPER COLLINS, 2003, $24.95, ISBN 0-06-

621070-4

The Riemann Hypothesis by Karl Sabbagh NEW YORK, FARRAR, STRAUSS AND GIROUX, 2003,

$25.00, ISBN 0-374-25007-3

REVIEWED BY HAROLD M. EDWARDS

The nearly simultaneous publica

tion of three books for the general

public about the Riemann hypothesis

(hereinafter referred to as RH) can

probably be explained by the million

dollar prize offered by the Clay Mathe

matics Institute for the resolution of

RH (large sums of money evoke inter

est) and by the many books that were

sold to the general public about Fer

mat's last theorem in the wake of

Wiles's proof (selling books is the goal

of publishing). Whatever the reason for

this sudden flood of interest in one of

the frontiers of pure mathematics, it is

a welcome, if surprising, phenomenon.

Mathematicians are probably the

worst people to review such books. An

architect I once met pleased me by

telling me how he had become con

vinced of the power and beauty of

mathematics by reading a certain pop

ular book on mathematics that he

named. I was so gratified by this devi

ation from the usual "I was never any

good at math" that I rushed to the li

brary to see the book My disappoint

ment was great. To me, it was full of

dubious assertions, exaggerations, over

simplified history, and explanations of

mathematical ideas that could impart

no understanding other than false un

derstanding. But, as the architect

plainly demonstrated by his own ex

ample, the book had achieved its goal

brilliantly, at least for one reader.

Moreover, I have had the experi

ence-and most mathematicians I

have asked about it have had the same

experience-of rereading a book for

nonmathematicians that I had read in

my youth and that I remembered as

having inspired me, only to discover

that it had many explanations I now

found to be misleading at best and

statements I now found to be down

right wrong. Would I recommend the

book to a young reader today? My own

experience would say yes, but my

judgment as a mathematician would

say no.

These considerations have been on

my mind as I pondered these books on

RH. All three are quite well written, and

I can easily imagine any one of them

capturing the nonmathematical reader's

fancy. And, overall, I think each pre

sents a reasonably accurate picture of

the history of RH and the present-day

mathematicians who are working on it.

For that, the mathematical fraternity

can thank all three authors. But, after

all, I am a mathematician, and it is only

as a mathematician that I can evaluate

the books.

My lack of success over the years in

explaining the irrationality of Y2 to

reasonably able liberal arts students

has left me without much hope that ex-


planations of RH intended for inter

ested non-mathematicians will suc

ceed. In other words, I am among the

"many people" who, according to the

first sentence of Karl Sabbagh's Pro

logue, "would say that the task I am

embarking on . . . is futile." He defends

his project by comparing it to anthro

pology and to "describing a remote

tribe whose customs and language are

unfamiliar to the reader, but whom I

understand enough to convey some

thing of their inner and outer lives."

Readers of the Mathematical Intelligencer, as members of that remote

tribe, will be interested to know

whether the descriptions he provides

are accurate and whether they illumi

nate our tribal culture. On both counts,

I am unenthusiastic.

The best parts of the Sabbagh book

are indeed the anthropological ones.

He tells who has worked on-or is

working on-the Riemann hypothesis,

how they became interested in mathe

matics and in this particular problem,

how they view their chances for suc

cess, and so forth. But what makes us

a tribe is our peculiar culture, and there

is no way to describe the interactions

of key members of the tribe without go

ing into the substance of our culture.

On this, Sabbagh is an unreliable guide.

For example, on p. 41 (page numbers

refer to the American edition-the orig

inal English edition is more compact,

so for example, this passage is on page 33 of that version) he says: "So, calcu

lating the value of the sum I lln8, which Riemann believed was possible

but couldn't say so for certain, would

result in a totally accurate number for

the number of primes less than n." Well, calculating the value of I 1/n8 is

certainly possible when the real part of

s is greater than 1, but Sabbagh does

not say which particular values of s will

be needed to produce his "totally ac

curate number." (Later in the book,

complex numbers are introduced, and

on the next-to-last page analytic con

tinuation is mentioned in passing, but

at this point I lln8 is far from being the

same thing as ?(s).) This "totally accu

rate number" must refer to Riemann's

explicit formula for 1r(x), which Sab

bagh seems to believe (see also the end

of Chapter 1) depends on RH; but in


fact what is needed for this formula is

not the evaluation of ?(s) for one or

more values of s but a knowledge of

the zeros of ?C s) in the critical strip; the

formula is valid whether or not the ze

ros are on the critical line. This misap

prehension about the meaning of RH

probably underlies his answer, at the

end of his Prologue, to the question,

"Why is it [RH] so important? . . . A

proof . . . would . . . tell mathematicians

a huge amount about an important

class of numbers-the prime numbers,

which dominate the field of pure math

ematics." The notion that such a goal

accounts for the fascination of RH is a

profound misunderstanding of our

tribal culture, like believing moun

taineers want to climb Mount Everest

in order to get somewhere.

For another example, he often spec

ulates about who might or might not

prove the Riemann hypothesis. On p.

219, Martin Huxley is said to have "both

the desire and the ability to prove the

Riemann Hypothesis." On p. 240, we are

told that "many . . . feel that if anyone

is going to prove the Riemann Hypoth

esis, it will be [Alain Connes]." And not

only is there speculation that Louis de

Branges might be the one to prove RH,

the book includes an Appendix by de

Branges with the title "De Branges's

Proof." This isn't the way research in

mathematics goes. Perhaps in other

fields that require expensive equipment

one might, to a limited extent, predict where the next breakthrough might oc

cur, but in mathematics any attempt to

predict whether there will be a proof

any time soon, much less what shape it

might take or who might devise it, is

completely foolish.

In this connection, Sabbagh gives us

an interesting pair of speculations:

Henryk Iwaniec (p. 36) says, ''I'm only

worried that what may happen is that

a proof will be given by somebody and

I will be unable to understand it," while

Alain Connes (p. 263) worries about

something quite different: "It would be

a tragedy if it just needed a trick to

prove it." Different as these concerns

seem, I suspect that most mathemati

cians sympathize with both. Note that

neither has anything to do with "learn

ing a huge amount about an important

class of numbers."

Another of the authors, Marcus du

Sautoy, is a professional mathemati

cian, so we should expect his state

ments to be correct, but I am puzzled

by his statement (p. 11) that "a proof

of the Riemann Hypothesis would

mean that mathematicians could use a

very fast procedure guaranteed to lo

cate a prime number with, say, a hun

dred digits or any other number of dig

its you care to choose." He goes on to

relate this to RSA cryptography, clearly

implying that RH would have some

practical significance for cryptogra

phy, but I doubt that this is the case. I

suspect, rather, that he feels the gen

eral reader must be given some reason

for the significance of this million-dol

lar question in mathematics, but that

the real reason depends on aspects of

our tribal culture that are too difficult

to explain to the general reader. (Per

haps I am wrong; the supposed con

nection is again mentioned on page

243.) Similarly, on page 12 he says,

"The security of RSA depends on our

inability to answer basic questions

about prime numbers," but I thought it

depended on our inability to factor

large numbers. In fact, I thought the

practicality of RSA depended on the

disparity-in practice, primality test

ing is easy; factoring, hard.

His statement on page 5 that "Mas

tering these building blocks [primes]

offers the mathematician the hope of

discovering new ways of charting a

course through the vast complexities

of the mathematical world" puzzles me

in a different way. Whatever could it

mean? To my taste, this statement, and

much else in the du Sautoy book,

sounds too much like empty enthusi

asm, razzle-dazzle meant to impress

non-mathematicians not with sub

stance-because substantial mathe

matics is beyond their ken-but with

fanfares and flourishes.

Du Sautoy touches on a clash of cul

tures within mathematics that is sel

dom revealed to outsiders and that

might hold some interest for Sabbagh

and others interested in the anthropol

ogy of our tribe. In his last chapter he

sketches in very laudatory terms the

career of Alexander Grothendieck;

"Grothendieck's new language of geom

etry and algebra saw the creation of a

whole new dialectic which allowed

mathematicians to articulate ideas

which were previously inexpressible"

(p. 300). Then he goes on to say of the

brave new world of the Grothendieck

ists that "Even [Andre] Weil was rather

disconcerted by Grothendieck's new

abstract world," and, even more baldly,

quotes Carl Ludwig Siegel as saying, "I

was disgusted with the way in which

my own contribution to the subject had

been disfigured and made unintelligi

ble," and Atle Selberg as saying, "My

thought was that such lectures were

never given in earlier times. I said to

someone after the lecture a thought

which had come into my mind: if wishes

were horses, then beggars [could] ride."

Disagreements at the highest levels of

mathematics are extremely interesting,

and I applaud du Sautoy for bringing

them into the open, although he does

not pursue the subject.

I do not applaud, on the other hand,

his description of the relation between

RH and mental illness. In his last chap

ter, he says "Grothendieck is not the

only mathematician who has gone

crazy trying to prove the Riemann Hy

pothesis," as an introduction to a para

graph about John Nash. "Grothendieck

and Nash illustrate the dangers of math

ematical obsession," he concludes, but

mathematical obsession, whether it is

with RH or the continuum hypothesis,

is surely a symptom, not a cause. Our

tribe may have a stronger than average

association with madness that deserves

to be addressed, but, if so, it deserves

to be addressed with more seriousness

than to talk about going crazy trying to

prove the Riemann hypothesis.

As a sometime historian of mathe

matics, I am dismayed by du Sautoy's

failure to cite a single one of his his

torical sources. On page 104 he tells of

"several drafts" of a letter he says Rie

mann was writing to Chebyshev about

"his own progress" in the investigation

of the prime number theorem. In my

1974 book Riemann's Zeta Function I

published a jotting from Riemann's

helter-skelter notes showing that he

was aware of Chebyshev's existence; if

there is more evidence than this of a

Riemann-Chebyshev connection, I do

not know about it. The account of

Siegel's military history (p. 148) differs

substantially from the one given by

Benjamin Yandell in The Honors Class, and, since Yandell names his sources,

I believe his. I hope most readers will

realize that no sources could possibly

support such statements as "[Pythago

ras] filled an urn with water and

banged it with a hammer to produce a

note" and so forth on p. 77, but many

readers will not. Surely I am not the

only reader who wants to know what

lies behind the surprising reference (p.

128) to "Gauss and Einstein's belief

that space was indeed curved and non

Euclidean." That Gauss might have

considered the possibility of non

Euclidean physical space is plausible

enough, but that he believed it? The

propagation of unchecked and un

checkable anecdotes about the history

of mathematics is a form of pollution

to be combatted. An occasional tall

tale, with appropriate caveats, can cer

tainly be used to spice up the exposi

tion from time to time, but when no

sources are ever given for anything,

such tales become an unacceptable

norm.

Another feature of du Sautoy's writ

ing is his habit of introducing a private

phrase to describe something and for

ever calling it by his new name rather

than the one used by everyone else. For

example, he says on page 20 that "One

of Gauss's greatest early contributions

was the invention of the clock calcula

tor." He goes on to explain what he

means-modular arithmetic, of course,

the "clock" being a reference to arith

metic mod 12-but thereafter there is

no modular arithmetic, only "clock cal

culators" as in "That is because the cal

culations will be done not on a con

ventional calculator, but on one of

Gauss's clock calculators" (p. 234, deal

ing with RSA). Similarly, zeros of ((s) are first described as "points at sea level

in the zeta landscape" (p. 89) and are

called that for the remainder of the

book On page 79, rather than saying

that the harmonic series diverges, he

says it will "spiral off to infmity"-an

odd way to describe gradual increase

without bound-and thereafter series

never diverge but "spiral off to infmity."

The line where the real part of s is t is

not the critical line, it is "Riemann's

magic ley line" (p. 98) or "Riemann's ley

line." I have not found a definition of

"ley" in any American dictionary that

fits this use; it is apparently a term used

in British surveying.

But du Sautoy and Sabbagh were

not writing for mathematicians. It may

well be that the general readers they

have in mind will be intrigued and grat

ified by their descriptions of mathe

matics and mathematicians related to

RH. Certainly there is amusement to be

found in these books, and even math

ematicians will find many interesting

things in them if they are not too dis

tracted by questionable formulations

and implausible anecdotes. No harm is

done as long as cranks are not en

couraged and as long as genuinely in

quiring minds are not put off when

some of the purported explanations do

not seem to make sense.

The goal of the third book, the one

by John Derbyshire, appears to be dif

ferent from that of the other two. He

writes in his Prologue of "a general read

ership" (p. xii), but I think he is unduly

optimistic. He mentions, for example,

that he expects his readers to under

stand basic algebra, such as the fact that

S = 1 + xS becomes S = 1/(1 - x) when rearranged. Certainly anyone set

ting out to understand RH must be com

fortable with this rearrangement, but, in

the first place, I suspect that more edu

cated adults than we like to imagine

would not be comfortable with it, and,

in the second place, more mathematical

sophistication and ability is needed than

this example suggests. Perhaps Der

byshire set out to write a book for the

general reader, but as it developed I

think his goal had to change.

No matter! He has written a won

derful book He does not fudge the

mathematics, which will make parts of

it hard going for most non-mathemati

cians, but for the most important audi

ence of non-mathematicians-those

young ones who might consider becoming mathematicians-it will be a

great resource and inspiration. And for

mathematicians and readers with a fair

amount of mathematical sophistica

tion, it is a book that will inspire, in

form, and entertain. If you believe as I

do that RH for the general reader is a

futile project, you will agree that Der

byshire made the right choices. The


copyright page states that the publisher, the Joseph Henry Press, "was created with the goal of making books on science, technology, and health more widely available to professionals and the public," a goal that is admirably served by this book

(Full disclosure: Derbyshire names me in his acknowledgments and mentions me a few times in the book I met him very briefly at the conference on RH held at NYU in 2002, and at that time I gave him a copy of my book, but I don't recall anything else requiring acknowledgment. Of course, I have made every effort to base my opinion of his book on the book alone.)

Even experts on RH will enjoy this book and learn from it, and I would encourage all readers of the Mathematical lntelligencer to try it. It is interestingly and skillfully written, and it approaches many aspects of the subject in imaginative and thought-provoking ways. For a quick probe, you might try reading pages 90-92. There you will find discussions of the contrast between measuring and counting (as he describes it, numbers legato and numbers staccato), Gauss's attitude toward Fermat's last theorem, Mallory's reason for wanting to climb Mount Everest, the rise of the Germans in 19th-century mathematics and how it may have been related to the Napoleonic wars and the Congress of Vienna-as well as a passing mention of Larry, Curly, and Moe.

If that rushed summary suggests that the writing is contrived or precious or pretentious, the fault is mine. To my taste it is always down-to-earth and treats its topics in natural and appropriate, but interesting, ways.

Naturally I have my disagreements and cavils with the book, but it is remarkable to me how few they are when I consider how dense with information and opinions the book is. The peasantpheasant story about Peter the Great on p. 56 should have been tossed out in the revision process. Derbyshire does an admirable job of keeping the calculus in the book to a minimum (he tells us in the Prologue that his original goal was to have no calculus at all, but that this goal "proved a tad overoptimistic"), but my alarm bells go off


when, on p. 112 and again on p. 113, he describes a definite integral as an "area under a function." I gather that, competent as his mathematics is, he has never taught calculus and had to deal with students who persistently confuse functions with their graphs.

I am most disturbed by his statement about the formula em = - 1 that "Gauss is supposed to have said-and I wouldn't put it past him-that if this was not immediately apparent to you on being told it, you would never be a first-class mathematician," not only because I question the attribution of such a statement to Gauss and no source is given, but mainly because it strikes me as a terrible thing to say to a young student. One's reaction to em = -1 must be awe, not "oh, yes, of course!" If you tell me it was immediately apparent to you when you first saw it I will think you are a fool or a liar, or that your memory is faulty. Derbyshire is wrong to discourage his readers-who will need a good portion of ambition to allow them to penetrate his book-in any way, and particularly to do so on false grounds.

And he is indeed asking a lot of his readers. In his 21st chapter he walks the reader through Riemann's explicit formula

J(x) = Li(x) - I Li(xP) - log 2 + p

LX dt x t( t2 - 1) log t

where J(x) denotes 1T(x) + t'1T(Yx) + t1TCVx) + · · · (a terminating series because 1T(y), which is the number of primes less than y, is zero when y < 2) and where the complex numbers p are the zeros of the zeta function in the critical strip. And I don't mean that he simply explains the definitions of all the terms. He also explains how the series

Lr Li(xP) converges conditionally, so the order of the terms is of the essence and the convergence is very slow, and he actually provides numerical estimates of the various terms in the case x = 20. Once he has completed this, having shown in detail and with clarity how the formula yields the known value

7 J(20) = 9J2, he goes on to show the way in which Mobius inversion combines with this formula to give Riemann's ex-

plicit formula for 1T(x), taking for the sake of illustration the case x = 1,000,000 and carrying it through very clearly to show how it really does work out to give '1T(1,000,000) = 78498. (But he does not adequately explain how he evaluated the slowly converging "secondary terms." He gives them to five decimal places, but in his computation of J(20) he already confessed that 86,000 terms of the sum had to be computed to attain four-place accuracy for this term, and he certainly does not expect us to believe that he found the nine-place result he gives in that case by adding terms of the series!)

Can a beginner follow this chapter? Not unless the beginner is very talented. To tell the truth, I had to read it pretty attentively. But it is interesting. The talented beginner will learn from it, as I learned from it. And those who can't follow it are not being sold a bill of goods, not being encouraged to think they understand and appreciate something they don't understand at all, and not being condescended to. They can give it their best shot, and if they fail they can still admire it and still appreciate much of the rest of the book, and may someday come back to it when they are no longer beginners.

A parting thought. In my opinion, all three books grossly overstate the connection of RH to prime numbers. Derbyshire even chooses the title "Prime Obsession." True, an investigation of the distribution of primes and the Euler product formula led Riemann to RH, but Riemann himself quickly switched to another function he called g(t) (it is the value at s = t + it of f(�) ·

2 s(s; I) . '1T�s/2((s), which, as Riemann proves, is an even function of t that is real on the real axis) and his actual hypothesis was that the zeros of g( t) are real! To me, g( t) is a symmetrized version of ((s)-symmetrized to put the functional equation of ((s) in a simple form and to put the interesting part of the function on the real axis-that is an entire function of one complex variable. RH is simply the statement that its zeros are real. The connection with prime numbers may or may not play a role in explaining the amazing extent to which Riemann's hunch has been borne out by massive modern compu-

tations unimaginable in his day. De

spite this modem evidence in its favor,

and despite its connection to a raft

of fascinating and theoretically impor

tant "generalized Riemann hypotheses"

that also stand up to computational

scrutiny, no one seems to have any idea

why it should be true. Who wants to be

a millionaire?

Courant Institute of Mathematical Sciences

New York University

New York 1 001 2, USA


A Mathematician Grappling with His Century by Laurent Schwartz

BIRKHAUSER VERLAG. 2001

vi1i + 490 pp., ISBN 3764360526; US $49.95.

REVIEWED BY NORBERT SCHLOMIUK

Laurent Schwartz [1915-2002] was

one of the great mathematicians of

the twentieth century. His main con

tribution to mathematics is his work on

distribution theory.

In his "History of Functional Analy

sis" Jean Dieudonne wrote:

The role of Schwartz in the theory of distributions is very similar to the one played by Newton and Leibniz in the history of Calculus: Contrary to popular belief, they of course did not invent it, for derivation and integration were practised by men such as Cavalieri, Fermat and Roberval when Newton and Leibniz were mere schoolboys. But they were able to systematize the algorithms and notations of calculus in such a way that it became the versatile and powerful tool which we know.

The great importance of Laurent

Schwartz's contributions to mathe

matics was recognized by his being

awarded the Fields Medal in 1950, the

first French mathematician to receive it.

Laurent Schwartz was above all an

extraordinary human being: warm, gen

erous, wise, modest, deeply involved in

the struggle for the oppressed, for hu

man rights and the rights of people, a

great and noble figure of the twentieth

century. We are lucky that close to the

end of his life he decided to write an

autobiography.

A Mathematician Grappling with his Century is the translation by Leila

Schneps of the original French edition

published in 1997 by Editions Odile Ja

cob under the title Un mathematicien aux prises avec son siecle. For those

of us who had the good fortune to be

close friends of the author, reading the

book is listening to the beautiful voice

of Schwartz, a marvellous raconteur.

In the Foreword, the author pre

sents the content of the book:

I am a mathematician. Mathematics filled my life: a passion for research and teaching as a professor both in the University and at the Ecole Polytechnique. I have thought about the role of mathematics, research and teaching, in my life and in the lives of others. I have pondered on the mental processes of research and for decades I have devoted myself to urgently necessary reforms within the University and at Ecole Polytechnique. Some of my reflexions are contained in this book, as well as a description of the course of my life. However I do not discuss the University reforms since I have written many articles and books on the subject. Inevitably, mathematics appear in this book, one cannot conceive of an autobiography of a mathematician which contains no mathematics. I have written about them in a historical form which should be accessible to large non-specialist sections of the scientific public; readers impervious to their charm may simply skip them. They concern only about fifteen per cent of the volume.

The reader interested in the con

temporary problems facing universi

ties would find much to think about in

Schwartz's "Pour sauver l'universite"

(Seuil, 1983) and in "Pour la qualite de

l'Universite fran<;aise" by Pierre Merlin

and Laurent Schwartz (P.U.F. 1994).

The Introduction to his autobiogra

phy is titled "The Garden of Eden," a

reference to the property at Autouillet

purchased by Schwartz's parents in

1926. It was at Autouillet where the au

thor found the ideal conditions to work

Schwartz grew up in a very warm fam

ily. His father, a distinguished physi

cian, had a strong influence on his chil

dren. Here is a lesson given by the

father to his son:

If in a given circumstance you find that you are alone with your opinion against everybody else, try to listen to them, because maybe they are right and you are wrong. But if, after having thought it out, you still find yourself alone with your opinion, then you should say it and shout it and let everybody hear it.

His explanation, writes Schwartz,

remained engraved inside me and

guided me in all my political activities

in my adult life. Tolerance, inner free

dom, wisdom were qualities which im

pressed everyone who had the chance

to know him.

I remember one of the public lec

tures Schwartz gave in Montreal about

the life of a mathematician. Many in the

large audience were high school and

college students fascinated by the lec

ture and impressed by the sincerity of

his presentation when he spoke about

his self-doubt. I found his words in the

book:

In spite of my success [in school}, I was always deeply uncertain about my own intellectual capacity. I thought I was unintelligent. And it is true that I was and still am rather slow. I need time to seize things because I always need to understand them fully. Towards the end of the eleventh grade, I secretly thought of myself as stupid. Not only did I believe I was stupid, but I couldn't understand the contraindication between this stupidity and my own good grades. At the end of the eleventh grade I took the measure of the situation and came to the conclusion that rapidity doesn't have a precise relation to intelligence. What is important is to deeply understand things and their relations to each other. This is where intelligence lies. Naturally it is help-

VOLUME 26. NUMBER 1 . 2004 59

ful to be quick, like it is to have a good memory. But it's neither necessary nor sufficient for intellectual success.

And here a last quotation from this

marvellous book:

Self-confidence is a condition of success; of course one must be modest, and every intellectual needs to recall this. I am perfectly conscious of the immensity of my ignorance compared with what I know. It's enough to meet other intellectuals to see that my knowledge is just a drop of water in an ocean. Every intellectual needs to be capable of considering himself relatively and measuring the immensity of his ignorance. But he must also have confidence in himself and in his possibilities of succeeding through the constant and tenacious search for truth.

Very interesting, sincere and illumi

nating are the chapters about the po

litical involvement of Schwartz. Very

early during the infamous Moscow tri

als (1937) he realized the nature of

Stalin's dictatorship. He became a Trot

skyite in 1944, and was an unsuccess

ful candidate as a Trotskyite for the

French Legislative election in 1945.

Much too free a thinker to accept a

"party line," he broke with Trotskyism

in 1947.

The center of his life continued to

be in mathematics. After a one-year ap

pointment in Grenoble, Schwartz joined

in 1945 the Faculty of Science in Nancy,

where he spent seven extremely fruit

ful years both for his research and for

attracting brilliant young mathemati

cians: Grothendieck, Malgrange, Lyons,

Treves, Bruhat. He moved to the Sor

bonne in 1952 and to the Ecole Poly

technique in 1958.

The torture and murder of the young

and gifted mathematician Maurice Au

din by repressive French forces in Al

geria led Schwartz to a deep involve

ment in chairing the Audin Committee,

in protesting against torture and mur

der committed by some of the official

French forces, and in signing the fa

mous "Manifesto of the 121 ," which

proclaimed the right of French youth

to rebel against the Algerian war. For


this he was dismissed from the Ecole

Polytechnique by the then Minister of

Defense Pierre Messmer.

Later, during the Vietnam war,

Schwartz again became politically in

volved, and again that involvement was

not one-sided. During trips to Vietnam

after Ho Chi Minh's victory, he contin

ued to intervene with the Vietnamese

authorities on behalf of dissidents.

Together with Henri Cartan and

Marcel Broue, Laurent Schwartz was a

founder of the Committee of Mathe

maticians. The task of the Committee

was to fight against human rights vio

lations of mathematicians, mainly in

the Soviet Union but also in Morocco,

in Czechoslovakia, in Uruguay-and

indeed, all over the world.

Each chapter of Schwartz's autobi

ography is beautiful and interesting.

Mathematicians will be particularly in

terested in chapter VI, "The invention

of distributions." Few mathematicians

have written about the act of creation

in mathematics, Poincare and Hadamard

for instance. The author's pages are, I

think, in the same league.

Laurent Schwartz is no longer among

us; we cannot listen to his beautiful

warm voice carrying messages of wis

dom, humor, and understanding; but

we have this marvellous autobiography

of a splendid man.

Departement de Mathematiques et de

Statistique

Universite de Montreal

C. P. 61 28, Succursale Centre-Vil le

Montreal, H3C 3J7 , Canada.

e-mail: [email protected] .ca

The Mathematician Sophus Lie: It Was the Audacity of My Thinking by Arild Stubhaug, translated from the Norwegian by Richard H. Daly

BERLIN HEIDELBERG, SPRINGER-VERLAG 2002

556 pp. LIS $44.95, ISBN 3-540-421 37 ·8

REVIEWED BY JESPER LUTZEN

The year 2000 saw the publication of

two remarkable books about the

Norwegian mathematician Sophus Lie

and his work: Thomas Hawkins's

Emergence of the Theory of Lie Groups (Hawkins [2]) and Arild Stub

haug's Norwegian biography of Lie

(Stubhaug [3]). The present review

deals with the English translation of

the latter.

The two books complement each

other very well. Hawkins's book, which

represents the crowning result of many

years of research, gives a thorough

mathematical analysis of Lie's inven

tion of Lie groups and the further de

velopment of the subject for the next

half-century. It is aimed at an audience

of mathematicians who know the the

ory and can follow the technical details

of its development. Stubhaug, on the

other hand, draws a vivid picture of the

person Sophus Lie and the time he

lived in. His book is aimed at a general

audience, and does not go into mathe

matical detail. That does not mean that

mathematics is left out of Stubhaug's

biography; in fact, mathematics is con

stantly present in the book as Lie's

great passion, and it is clear that Lie's

greatness was due to his mathematical

creativity. Lie's mathematical cre

ations are described in general terms,

and in a chapter titled Into Mathemat

ical History Stubhaug does a good job

of setting them into historical and

philosophical perspective. A non-math

ematical reader will learn from Stub

haug that mathematics is fascinating

and important and will get a good

glimpse of the creative, dramatic, and

even existential aspects of mathemati

cal research. For this reason alone one

must hope that the book will be widely

read, especially at a time when scien

tific and mathematical research is un

der public scrutiny. The mathemati

cian-reader who finds the description

of Lie's mathematics superficial can

turn to Hawkins, or to Armand Borel's

new book Essays in the History of Lie Groups and Algebraic Groups [ 1 ] .

Both mathematicians and non-mathe

maticians will enjoy other aspects of

Stubhaug's book; like Stubhaug's ear

lier biography of Niels Henrik Abel, it

is both extremely well written and well

researched. Richard H. Daly, who also

translated the Abel biography, has

again done an excellent job. For the

non-Norwegian reader there have been

inserted a few explanations of locali

ties and personalities that are well

known to a Norwegian audience. In a

few cases this has been overdone, as

for example when the often-mentioned

Nordmarka is repeatedly explained to

be a forest north of Christiania. In one

respect the translation of the Lie book

is even more successful than the Abel

book: The mathematical terms have

with a few exceptions been translated

correctly.

The style is literary rather than

scholarly. For example, there are no

references and only a few footnotes in

the main text. And the style is narra

tive rather than analytic. The beginning

of the book may suggest otherwise.

Here Stubhaug explains that everyone

who met Lie would later tell stories and

anecdotes about him. "Nobody seems

to have been able to pass him by in si

lence. What, in detail, did the stories

recount, and what, on the other hand,

do we know with certainty? What was

imaginary and what was real?" This

may sound like an introduction to an

analysis of the sources pertaining to

Lie's life, but in fact the succeeding

chapter does not provide such an

analysis. The questions are rather a

stylistic trick to begin a 20-page over

view of Lie's life, that serves as a sort

of abstract of the book. This introduc

tion is helpful to the reader, who may

otherwise get lost in the wealth of ma

terial presented in the main part of the

book. For readers who do get lost,

however, there is a 5-page schematic

chronology at the end.

With its literary narrative style the

book reads like a novel. However, as is

revealed by the Bibliography, it is

based on a thorough study of a wealth

of sources. Many mathematicians and

historians of mathematics have writ

ten biographies of Lie-for example,

Friedrich Engel, Poul Heegaard, Elling

Holst, Max Noether, and Ludvig Sylow

(a list is provided in Stubhaug's book)

but this is the first book-length biogra

phy of the Norwegian mathematical ti

tan. One reason that Stubhaug's book

is longer than earlier biographies is

that he embeds the story of Lie's life in

a rich cultural, political, and institu

tional context. Another reason is that

new sources uncovered by Stubhaug's

research, in particular many collec

tions of letters to and from Lie, have al

lowed him to paint a very detailed pic

ture of Lie and his life. In this respect

Stubhaug's Lie book is more ground

breaking than his Abel book.

The scholarly aspects of Stubhaug's

work are also revealed by the 50 pages

of endnotes. Within them are refer

ences to the sources as well as addi

tional details that enhance the main

text. In many cases these enhance

ments seem just as relevant and inter

esting as the main text, and they may

have been relegated to endnotes only

because they would have hindered the

flow of the narrative.

The book is richly illustrated with

photographs of people and places of

significance to Lie, and with reproduc

tions of paintings representing the pre

vailing view of the grand Norwegian

Nature that give the reader an idea of

the existing zeitgeist. Unfortunately

the book does not contain a map of

Norway or Europe showing the places

mentioned in the text. That would have

been a great help, in particular for non

Norwegian readers.

As in most biographies, the main

part of Stubhaug's book is written

chronologically. After the summary

presented in the first part, the second

part deals with Lie's family background

and his upbringing. We learn that he

was born on December 17th, 1842, as

the penultimate child out of six of the

vicar Johan Herman Lie in Nordfjordeid

and Mette Maren nee Stabell, who ran

the vicarage as a model farm. When he

was 9 years old he moved with his fam

ily to Moss, where his father remained

a Parish Vicar for the rest of his life.

Sophus's mother died a year after the

move. In 1856 Lie completed the edu

cation at the local science school, and

after a year of private tutoring he fol

lowed his older brother to Nissens

School in Christiania (now Oslo). Such

are the bare essentials, but such a sum

mary does not do justice to the rich

ness of part 2 of this biography. The 27-

page section contains a host of details

imbedded in an extremely well-in

formed cultural-historic context. Stub

haug explains the family tree of the Lie

family; in particular he goes into some

detail about the life of Lie's father. We

learn that after having taken his theo

logical degree, father Lie began to

work as a teacher. Only when his ap

plication for the job as headmaster

of the school where he worked was

turned down did he tum to an ecclesi

astical career. However, he continued

to be very actively interested in the en

lightenment of his parishioners, not

only in religious but also in scientific

matters. In particular he initiated a se

ries of lectures on the natural sciences

for the workers in Moss, hoping that an

awareness of the laws of nature would

reveal the creative hand of God. Here

it is interesting to note that Sophus

Lie's later hero Abel had also been the

son of a vicar with rational scientific

leanings.

Stubhaug also tells about the village

of Nordfjordeid and in particular about

the vicarage and its buildings. We hear

about father Lie's versatility as a vio

linist, his two terms as mayor of the vil

lage, and many other things about the

rather happy everyday life in the vic

arage in Nordfjordeid. All this is seen

on the background of the general

trends in Norwegian society at the

time. After the move to Moss and the

death of Lie's mother, more stringent

and less happy conditions reigned in

the vicarage. But at school Lie did very

well and graduated as number one in

his class. In connection with his stud

ies at Moss Realschool, Stubhaug gives

a long explanation of the ongoing re

forms of the Norwegian school system.

This explanation continues in Part

3, which is devoted to Lie's time at Nis

sen's school in Christiania. Stubhaug

gives a 12-page account of how this

school was founded as a clear alterna

tive to the old Latin and Cathedral

schools, how it became an example for

many subsequent schools that valued

science and modem languages on a par

with the classical languages, and how

its creation was a reflection of and a

great influence on a broad debate in

Norway about the means and goals of

education, a debate in which Lie later

took an active part. We hear that the

founder of the school, Nissen, was a

former student of father Lie; we learn

about the school buildings, about the

teachers, both those that taught Lie


and the ones that preceded them, and

about some of Lie's classmates and

their families.

As I present the content of this chap

ter it may sound far too detailed and

long, but in fact I found all of it inter

esting and relevant for an understand

ing of the context in which Lie grew

up. The only thing I missed was a more

detailed account of what happened

when Lie during his first year at Nis

sen's school had Ludvig Sylow as his

mathematics teacher. We learn only

that Lie was the best in his class in this

subject, and that Sylow later recounted

that he had not seen any special math

ematical genius in the young student.

Do the sources say nothing more about

the matter?

In the 3rd part of the book the char

acter Sophus Lie gradually becomes

visible. As a student at Nissen's school

he did very well in all his subjects, and

in the end he was number two in the

entrance exam to the University in

Christiania. He began to study science

at the University in 1861, and from then

on his whereabouts are rather well

documented. He continued to do bril

liantly in most subjects, in particular in

the mathematical sciences, physics, as

tronomy, and chemistry, and he was

very active in the Scientist Association.

He also began a habit, that lasted al

most to the end of his life, of taking

strenuous hikes of several weeks dur

ing the summer into the mountainous

regions of Norway. As a boy he had ex

celled in physical strength, and his long

and fast hikes became legendary.

What kind of a person was Lie? As

a short answer to this question and as

a good example of the beauty of Stub

haug's language, let me quote the open

ing words of the book:

According to most accounts of Sophus Lie, he was the embodiment of an archetypical character in a theatrical drama-with his forceful beard, his sparkling green eyes magnified by the stout lenses of his spectacles-the blond Nordic prototype, as it was called across Europe-the Germanic gigantic being-a primal force, a titan replete with the lust for life, with audacious goals and an indomitable


will. These descriptions of his physical and mental strength also contained a subtext, an embryonic notion, not only about this brilliant man of science, the prophet, who intuitively conceived new mathematical truths, but also about the colossus who, in his constant zeal for new knowledge, might push others aside, and inadvertently trample them underfoot. He was described as highly committed, richly innovative, someone with unusual physical strength, and the stamina to overcome the majority of obstacles, but also, a man who afterwards had to pay for this with correspondingly great swings of mood and temperament.

One could add that he was a warm

person, often friendly, strong-willed,

direct in his speech, undiplomatic

"the blond Nord ic prototype . a t itan . . "

sometimes even raw in his manners,

and rather self-centered. He wrote

about himself that he "had little talent

for socialising with folk," and "what is

fatal is that I am so diametrically dif

ferent from [Felix] Klein with respect

to the ability to be able to get into the

thinking of others" (p. 257).

The mood swings that Stubhaug

mentions showed themselves for the

first time during Lie's last year as a uni

versity student. He had set himself the

high goal of graduating from University

with the highest grade overall. After

the first three years this still seemed an

attainable goal, but the last year's study

of the biological sciences did not go so

well, so he only got the second-highest

grade. He became depressed, was un

able to sleep, and even planned suicide.

This psychologically unstable state

lasted for the next few years, when he

was plagued by a sense of lacking a

calling for his life.

In fact it is remarkable that, unlike

Abel, who in high school had already

begun to study the masters and make

original contributions to mathematics,

Lie did not seriously begin to pursue a

career in mathematical research until

three years after his graduation from

university. Indeed, with his many phys

ical as well as intellectual talents he

might well have chosen a different pro

fession. In school he contemplated go

ing into philology, but at the entrance

exam to the university he got only the

second-highest mark in Greek. He was

so dissatisfied with his performance

that he opted for science instead. And

even as a science student he toyed with

the idea offollowing the example of his

older brother, who was an officer in the

army. In fact in 1864, when the Danish

borders were threatened by German

troops, Lie followed the general Scan

dinavism among the Norwegian stu

dents, and volunteered to defend the

brother country. However, before his

military training in Christiania was

complete, Denmark had surrendered.

Lie continued to serve for a few years

as a reserve lieutenant, but it turned

out that his eyes suffered from oblique

cornea, and therefore he could not pur

sue a military career.

Even after graduation from univer

sity, Lie did not have a clear idea about

his vocation. He began to work at the

observatory and to give popular lec

tures on astronomy in the Student So

ciety, and he planned to write a book

on the subject. However, probably as a

result of disagreements with the pro

fessor of astronomy, he did not obtain

the vacant job as assistant at the ob

servatory, and in 1867 he gradually

turned to mathematics and composed

a small textbook on trigonometry.

Only during the following year did

he become convinced that "there was

a mathematician in him." The turning

point was the meeting of the Scandi

navian natural scientists in Christiania

that year. Here Lie met Sylow again.

While a student at the university Lie

had followed a course on Galois theory

(the first lecture on that subject any

where in the world after Liouville's pri

vate lectures in the 1840s) that Sylow

had given while he was a substitute for

the mathematics professor Ole Jacob

Broch, who had been elected Member

of Parliament. Lie had in the meantime

lost his notes from those lectures, and

now asked Sylow permission to bor

row his notes. "I believe that group the

ory will become very important," Lie

prophetically told Sylow. Moreover,

Lie developed a friendship with the two

Danish mathematicians Adolph Steen

and Hieronymus Georg Zeuthen, who

attended the meeting. The former gave

a talk about integration of differential

equations, and Zeuthen talked about a

subject from the new geometry. These

three subjects, group theory, differen

tial equations, and geometry, became

the central elements in all of Lie's fu

ture mathematical work.

His depressive moods gone, Lie

threw himself into a study of various

recent works in geometry that Zeuthen

had referred him to, and he began to

do independent research. By Decem

ber he got the idea of his so-called

imaginary geometry. The following

year he lectured on it in the Science So

ciety and privately published a short

pamphlet about it, followed by two pa

pers in the Proceedings of the Christiania Academy of Sciences and a pa

per in Grelle's Journal fur die reine und angewandte Mathematik. This

closely paralleled the way the young

Abel had published his first original re

search. From the outset, Lie reached

out to an international audience. No

one in Norway could fully appreciate

the value of Lie's new contribution to

mathematics, and yet many of the best

Norwegian mathematicians and scien

tists were aware that it was important.

As early as 1869 he was given a travel

stipend that allowed him to stay for a

half-year in Berlin and for a half-year

in Paris, the two cities that Abel had

visited a half-century earlier. Unlike

Abel, however, Lie preferred the

French style in mathematics and was

not particularly fond of the rigor that

characterized the mathematicians in

Berlin. Still, his stay in Berlin became

very fruitful, and he had the triumph of

impressing Eduard Kummer by solving

a geometrical problem he could not

solve himself. This may be considered

Lie's breakthrough in international

mathematics.

But the most important aspect of his

stay in Berlin was his encounter with

the 20-year-old Felix Klein. The two

soon became friends, and they began

to collaborate on research. When Lie

continued on to Paris in the spring of

1870, Klein came along, and they both

enjoyed the regular meetings with

Camille Jordan and Gaston Darboux.

However, when the Prussian-French

war broke out in July, Klein immedi

ately returned to Germany. Lie also

left, for Italy. He had planned to walk

through France over the Alps to Milan,

where he wanted to meet with Luigi

Cremona. However, he only made it as

far as Fontainebleau before he was ar

rested as a German spy. When he was

released a month later, thanks to the

intervention of Darboux, he took the

next train to Switzerland.

In Berlin and Paris and in prison, Lie

pursued his geometrical research and

began to work on contact transforma

tions. His international success made

him hope for either a better stipend or

a permanent position in Norway, but

the applications he sent from abroad

did not bear fruit. When he came home

he composed a doctoral thesis about

the line-sphere transformation that he

had discovered in Paris. No one in Nor

way understood its content, but its im

portance was soon recognized interna

tionally, and when Lie in the fall of 1871

applied for a professorship in Lund,

Sweden, many Norwegian intellectuals

realized that Norway was on the verge

of repeating the mistake they had made

when they did not offer Abel a profes

sorship. The newspapers published let

ters of recommendation from Alfred

Clebsch, whom Lie had met in Gottin

gen, and from Cremona, whom Lie had

not succeeded in meeting in Milan. It

was mentioned that other letters of

support had come from Berlin, Copen

hagen, and Paris.

In 1872 Cabinet minister and math

ematician Broch convinced the Nor

wegian Parliament to appoint Lie as

extraordinary professor. Usually, pro

fessors were appointed by the Cabinet

of the Swedish king, who also ruled

Norway. Thus Lie's appointment was a

small part in a power struggle between

the freely elected Norwegian Parlia

ment and the Cabinet, and more gen

erally a strong statement in the Nor

wegian struggle for independence.

The same year Lie became engaged

to be married to Anna Birch, who was

a granddaughter of Abel's uncle. He

wanted to be married as soon as pos

sible, but she wanted to wait. In the

meantime they conducted an intense

correspondence, which Stubhaug de

tails in the beginning of Part 5. I must

admit that Lie's constant begging for an

early date of marriage became tire

some. It is the only part of the book

that in my opinion would have bene

fited from cutting. In 187 4 Lie finally

persuaded Anna to marry him, and

over the course of the next 10 years

they had three children: two daughters

and a son. Their marriage was happy.

The period 1872-1886, while Lie was

a Parliamentary professor in Christia

nia, was also his most productive pe

riod. He traveled often to Germany,

where he continued his friendship and

exchange of ideas with Felix Klein, and

to Paris, where he presented his new

work on differential equations and his

theory of transformation groups,

which he developed as a means to

solve them. He published a host of new

results, first in the Norwegian journal

Archiv for Mathematik og Naturvidenskab, which he founded together

with two colleagues, and later in inter

national journals, mainly in the Mathematische Annalen, which Klein edited

from 1877 together with Adolf Mayer.

In 1884 Klein arranged for Engel to go

to Christiania to help Lie present his

new ideas in a more polished book form.

Their intensive collaboration during

nine months in Christiania and later in

Leipzig resulted in Lie's main work

Theorie der Transformationsgruppen, published in three volumes in 1888--1893.

From 1873 Lie also worked with Sylow

on a new complete edition of Abel's

works. He also suggested to the

Swedish mathematician Gosta Mittag

Leffler that they found a new Scandi

navian research journal for mathemat

ics. This resulted in the creation of

Acta Mathematica. Lie sometimes lectured on aspects

of his new mathematics, but he did not

have any first-rate Norwegian students.

Therefore he immediately accepted

when, at the instigation of Klein, he

was offered the professorship vacated


at Leipzig University when Klein moved to Gottingen. Lie's stay in Leipzig from 1886 to 1898 was a mixed experience. His lectures drew many good students. In particular Lie was proud that his Parisian colleagues sent some of their best students of the Ecole Normale, e.g., Arthur Tresse and Ernest Vessiot, to Leipzig to study with him. Moreover, he had several good assistants and Private Docents such as Engel (whose collaboration with Lie continued) and Eduard Study, Issai Schur, and Georg Scheffers, who published Lie's lectures in three large volumes: Differential Equations with Known Infinitesimal Transformations, Continuous Groups with Geometric and other Applications, and Geometry of Contact Transformations (all in German). Finally, he had a good relationship with some of the other professors such as Adolph Mayer and Wilhelm Ostwald, but his relationship with his mathematics colleague Carl Neumann was strained.

In 1889 Lie suffered a mental breakdown that confmed him to a psychiatric hospital for seven months. Even after he released himself from hospital and tried to walk the depression out of his system, he was strongly depressive for several years, and his personality seems to have changed for good. Even before the breakdown he had begun to assert his priorities vis-a-vis Klein, and after the breakdown there was a complete break between the two former friends. Lie also began to attack his other collaborators and supporters such as Engel, and he accused Wilhelm Killing of stealing his ideas. On the whole he turned away from his German colleagues and oriented himself more toward Paris, where his works won increasing acceptance.

Part 6 of Stubhaug's Lie biography, which deals with the Leipzig period, is in my opinion the weakest in the book. Stubhaug deviates more from a chronological presentation than in other parts, and that makes the part somewhat disconnected. In particular it is unfortunate that there is not a clear distinction between the periods before and after Lie's mental breakdown. Moreover, when writing about the cultural and institutional setting in Leipzig, Stubhaug


seems to lack the mastery with which he deals with the Norwegian scene.

Yet Norwegian affairs are constantly mentioned also in this part. Indeed, Lie continued to keep abreast with the developments in his native country through regular reading of Norwegian newspapers, through correspondence, and through the young Norwegians who came to study with him in Leipzig. He even continued to participate in the public debate on educational affairs in his homeland, and he was active in the preparations for the centenary of Abel's birth in 1902. In particular, he tried to raise funds for an Abel Prize in Mathematics matching the newly founded Nobel Prizes in other disciplines. As is well known, this idea was not implemented for more than a century.

While his wife and children soon adapted to the new social situation and thrived in Leipzig, Lie had a hard time getting used to the tone at the University, to the much larger teaching load, to German militarism, and to the heat, and he found it difficult to lecture in a language that he had not mastered. He was happy to have left the provincial atmosphere in Christiania, but he missed his friends and in particular Norwegian nature. The first summers in Germany Lie rented a vacation home near Leipzig and went to the Alps for hiking tours, but from 1888 he began to spend parts of each summer in Norway.

His warm feelings for his home country and his hope for its freedom from Sweden became clear to everyone at the university during his inauguration as a Professor in Leipzig. During the ceremony the Rector mentioned that he had heard that the peasant representatives in the Norwegian Parliament had treated the noble King Oscar badly; therefore he could well understand that Lie wanted to leave Christiania. At this point Lie protested loudly and left the room.

In Norway the movement that resulted in independence from Sweden in 1905 was gaining momentum in the 1890s. One of the strategies among the cultural and academic elite was to display the great Norwegian talent. As a part of this strategy, the polar explorer Fridtjof Nansen, the mathematician Elling Holst, and the leading poet Bj0m-

stjeme Bj0mson, who had himself recently returned from a long stay in Paris, conceived a plan to bring Lie back to Norway. After negotiations with Lie in 1893 they succeeded in convincing the Norwegian Parliament to upgrade his professorship (from which he had obtained a leave when he moved to Leipzig) to a Professorship of Transformation Group Theory with a salary that matched his high German salary. Lie accepted, but it took him another 4 years before he finally resigned his position in Leipzig and returned for good to Christiania. During those four years he traveled back and forth from Norway several times, and even had an idea of trying to arrange a joint position at the two universities. When he returned to Christiania in September of 1898 he was not in good health, and soon it became clear that he suffered from pernicious anemia. He died from this disease on February 6 of the following year.

Many obituaries and later biographies have tried to capture this extraordinary mathematical genius, but none of them have been as complete, as well researched, or as well rooted in the cultural context as Stubhaug's wellwritten book. I can recommend it to all mathematicians as well as non-mathematicians, who have an interest in the human aspects of scientific creation.

REFERENCES [ 1 ] Armand Borel, Essays in the history of Lie

groups and algebraic groups. History of

Mathematics 21 , Providence, Rl: American

Mathematical Society; Cambridge: London

Mathematical Society, 2001 .

[2] Thomas Hawkins, Emergence of the The

ory of Lie Groups. An Essay in the History

of Mathematics 1 869-1926, New York:

Springer-Verlag, 2000.

[3] Arild Stubhaug, Oat var mine tankers

djervhet-Matematikeren Sophus Lie. Oslo:

Aschehoug, 2000.

[4] Arild Stubhaug, Niels Henrik Abel and his

Times. Called too Soon by Flames Afar,

Berlin, Heidelberg: Springer-Verlag, 2000.


Copenhagen University

Universitetsparken 5

DK-21 00 Copenhagen 0 Denmark


Fibonacci Numbers by Nicolai N. Vorobiev

BIRKHAUSER. 2002 1 76 pp. €31 ISBN 3-7643-61 35-2

paperback

REVIEWED BY FREDRIC T. HOWARD

The Fib�nacci �umbers ar� defined

by u1 - 1 , Uz - 1, and Un - Un�l + Un�2 for n 2: 3. They were first men

tioned in 1202 in the Liber Abaci, a

book written by Leonardo of Pisa to in

troduce the Hindu-Arabic numeral sys

tem to western Europe. Leonardo, per

haps the greatest mathematician of the

Middle Ages, wrote under the name of

Fibonacci, a contraction of "filius

Bonacci" (son of Bonacci). In Liber Abaci the numbers appeared in the fa

mous rabbit problem, but they were

not called "Fibonacci numbers" until

much later.

The rich and interesting early his

tory of the Fibonacci numbers can be

found in L. E. Dickson's History of the Theory of Numbers, volume 1. During

the last half of the twentieth century,

interest in the numbers and their ap

plications increased dramatically, and

in 1961 N. Vorobiev published his ele

mentary but influential book titled Fibonacci Numbers. Written for high

school students, this book introduced

the basic properties of the Fibonacci

numbers to a new generation of readers.

Vorobiev revised his book in 1989,

and it has now been re-issued by

Birkhauser Verlag. I will briefly sum

marize the chapters and compare them

to the material in the 1961 edition.

Chapter 1 is titled "The Simplest

Properties of the Fibonacci Numbers."

Vorobiev first proves some simple

formulas, like u1 + u2 + · · · + Un = Un+2 - 1, and then works his way up to

more sophisticated formulas like the

one for ur + u� + . . . + u�. Along the

way, he discusses binomial coefficients

and proves the Binet formula. In the old

edition, he ends the chapter with the re

sult that Un is the nearest integer to

an!v'5, where a = (1 + V5)/2. The new

edition goes further; Vorobiev continues

to discuss the last result, and he studies

Sn, the sum of the reciprocals of the first

n Fibonacci numbers, and the limit lim n�x

Sn. He shows how to write a given num

ber as a sum of Fibonacci numbers, and

he makes a comparison of the Fibonacci

system and the decimal system. He gen

eralizes to other bases.

Chapter 2 is titled "Number-Theo

ertic Properties of Fibonacci Num

bers." Vorobiev proves the well-known

fact that if m divides n, then Um divides

Un. He then discusses general properties

of the greatest common divisor, and he

proves that gcd(um, Un) = Ugcd(m, n} In

this chapter he has a statement and

proof of the Euclidean Algorithm. The

new edition extends the divisibility

properties of Un far beyond this. As the

chapter progresses, Vorobiev proves

and uses Fermat's Little Theorem and

The Fundamental Theorem of Arith

metic; he defines and discusses The

Euler phi-function and congruence

modulo m. He even gives an example

of the Quadratic Reciprocity Law. This

chapter, with parts of chapters 1 and 3,

could be used as an introductory

course in number theory (with an em

phasis on Fibonacci numbers).

Chapter 3 is called "Fibonacci Num

bers and Continued Fractions." Voro

biev proves some basic and well

known properties of continued fractions

and their convergents. Naturally, he is

especially interested in representa

tions of numbers like Un+ 11un. He dis

cusses infinite continued fractions, and

he proves some of the standard in

equalities for convergents. In the new

edition, he goes much further and

proves some nonstandard inequalities.

He proves a well-known inequality due

to Legendre, and then he proves lesser

known theorems of Vahlen, Borel, and

Hurwitz. He also discusses "equivalent

numbers" in some detail. Unfortu

nately, the new material does not have

much to do with Fibonacci numbers.

Chapter 4 is "Fibonacci Numbers

and Geometry." Vorobiev defines the

"golden section," and he gives examples

in geometry. The pentagon and the

golden section rectangles are discussed,

and the familiar "proof' that 64 = 65 (by

cutting up a certain square of side Uzn and reassembling it) is given. The new

edition continues with more examples

of patterns that occur in nature, and it

discusses at length a game using di

rected graphs and Fibonacci numbers.

Chapter 5 is a completely new chap

ter titled "Fibonacci Numbers and

Search Theory." Evidently Vorobiev felt

this material should be added because,

in his opinion, number theory lost its

paramount position in mathematical re

search and "the interest in optimization

problems gained a sudden weight." The

chapter is long and detailed, and the

use of the Fibonacci numbers seems

minimal. In the reviewer's opinion, this

material is the least appropriate of the

new additions, but that might be a mat

ter of personal taste.

In general, this book provides an ex

cellent introduction to the Fibonacci

numbers, and it also covers some of the

other basic topics in elementary num

ber theory. On the whole, it is suitable

for a good high school student with a

minimal background in mathematics.

The changes from the old edition, with

the possible exceptions of chapter 5

and the game of chapter 4, strengthen

the book One thing that is missing is

a bibliography; at the very least, there

should be a list of articles and books

for recommended reading.


Wake Forest University

Box 7388 Reynolda Station

Winston-Salem, NC 271 09

USA


New Visual Perspectives on Fibonacci Numbers by Krassimer Atanassov, Vassia

Atanassova, Anthony Shannon,

John Turner

WORLD SCIENTIFIC, SINGAPORE, 2002 332 pp. paperback U.S $36 ISBN 981 -238-734-1

REVIEWED BY FREDRIC T. HOWARD

Back in 1202, Leonardo of Pisa, bet

ter known as "Fibonacci," intro

duced the sequence of numbers 1, 1, 2, 3, 5, 8, 13, . . . as the solution to a prob

lem involving reproducing rabbits.

Leonardo could never have predicted

that eight hundred years later his num

bers would still be of intense interest


to people all over the world, and he

could never have foreseen the amazing

number of books and articles that have

been published about the numbers. As

A. F. Horadam says in the introduction

to the book under review, "It has been

observed that three things in life are

certain: death, taxes and Fibonacci

numbers."

It is refreshing to see a new book

that is totally different from all the

other Fibonacci publications. This

book certainly contains nonstandard

and highly original material that is not

found anywhere else. The book has

weaknesses, but its great strength is its

originality and its suitability for stu

dent research projects.

The book is organized into two

parts: part A (Number Theoretic Per

spectives) and part B (Geometric Per

spectives). Part A, written mainly by

K. Atanassov, V. Atanassova, and

Tony Shannon, constitutes about one

fourth of the book. Part B is primar

ily the work of John Turner. All of

these authors are well known and

highly respected in the Fibonacci

community.

In part A, the authors discuss "2-Fi

bonacci Sequences," which generalize

the usual sequence. Suppose {an} and

(bn} are sequences such that an+z = an+ 1 + bn and bn+2 = bn+ 1 + an. The

authors work out properties and for

mulas for these numbers, and they con

sider several variations, like bn+Z = an+ 1 + bn and an+2 = bn+ 1 + an. They

extend this idea by using three se

quences and by using sequences with

four-term recurrences. They also look

at the recurrence an+2 = an+ 1bn. All of

these new sequences are interesting,

and they would make good research

topics for students. The proofs are

pretty routine and computational, with

a heavy use of induction. In this part of

the book there are no new visual per

spectives, so the title does not really

fit. The second half of part A is con

cerned with the number trees of John

Turner. The basic idea is a method for

constructing a sequence ( Tnl of tree

graphs such that Tn has Fn leaf nodes,

where Fn is the Fibonacci number.

Some generalizations and extensions


of this idea are worked out. Here again,

as the reviewer can attest from his own

experience, the ideas are suitable for

student projects.

In part B, the first section is con

cerned with "Fibonacci vector geome

try." The basic idea is the Fibonacci

vector <Fn- 1, Fn, Fn+ 1) and its gener

alizations. Among other things, the au

thor works out some elementary geo

metric results and proves some vector

identities. The material in this section

is highly original, and it contains many

novel definitions; for example, the au

thor studies the "honeycomb plane"

x + y - z = 0, which contains all of

the Fibonacci vectors. He examines in

some detail the locations and various

configurations of the integer-points in

the plane.

The last half of part B is a presen

tation of "Goldpoint Geometry," which

is defined as the "study of geometric

figures into which golden-mean points

(called goldpoints) have been con

structed or introduced." The author

studies fractals in goldpoint geometry,

triangles and squares marked with

goldpoints, plane tessellations with

goldpoint squares, and games (such as

"goldpoint chess") with goldpoint

tiles. In this section there is a host of

new definitions which are probably

not found anywhere else in the litera

ture.

As mentioned earlier, the strength

of this book is its originality and its

suitability as a basis for student re

search. It is a refreshing and stimulat

ing book, but it could benefit from

tighter editing. Here are some exam

ples, most of which are admittedly

picky. In the first part of the book, the

period is used as the multiplication

symbol, and this is irritating. For ex

ample, the number 3k + 2 is written

3.k + 2 (see page 10). On page 39 the

notationfn is used, when it should ob

viously be Fn. The notation fn is used

later (page 151) in another way. In

chapter 2, for clarity the notation Xnm should always be written Xn,m· The no

tation is inconsistent here. The same

sort of problem occurs on page 72,

where Tijk should always be written

TiJ,k, as it is in some places. On page

63 it is not clear why the notation gi. is

used instead of gi (without the period).

On pages 121-122, it is not clear if

Li(QUV) means the area of QUV or (as

on page 122) just the triangle QUV.

Perhaps some of the tables, like the

one on pages 261-263, are overdone

and not that useful. In a few places, the

exposition could be made clearer and

less idiosyncratic. For example, in the

definition of goldpoint on page 184, the

notation lAP : BPI is used without ex

planation. As another example, the un

defined term "pink tile" is used on page

246.

On the whole, however, this is an in

teresting and unique book. As Profes

sor Horadam says in the introduction,

"Indeed, there is something to be

gleaned from this book by most read

ers." The reviewer agrees.


Wake Forest University

Box 7388 Reynolda Station

Winston-Salem, NC 27709


Kepler's Conjecture: How Some of the Greatest M inds in History Helped Solve One of the Oldest Math Problems in the World by George G. Szpiro

HOBOKEN, NEW JERSEY: JOHN WILEY & SONS, INC. 2003

296 pp. US $24.95 1SBN 0-471 -0860 1 -0

REVIEWED BY KARL SIGMUND

If you pour unit spheres randomly

into a large container, you will fill

only some 55 to 60 percent of the

space. If you shake the box while you

are filling it, you will get a denser pack

ing-something like 64 percent. What

is the densest packing possible? In a lit

tle booklet which he published in 1611 ,

Johannes Kepler claimed that the

hexagonal close packing did the trick:

pack one horizontal layer so densely

that each sphere is surrounded by six

spheres, then add the next layer by

placing spheres into the dimples

formed by the first layer, etc. The den

sity is now 74.05 percent. And so Kep

ler's conjecture was born.

Kepler did not fail to point out that

street vendors stocked their apple

carts in exactly that way. (He must

have relished this kind of "market re

search": his later booklet on volumes

started out with observing how the

content of wine-casks is gauged by in

troducing a measuring rod).

In a remarkably candid piece of in

trospection, the young Kepler once

wrote that his appearance, his ap

petites, his habits, and his character

traits displayed "in every way a dog

like nature. " We may add that Kepler's

conjecture still has a vicious bite. Who

ever approaches it does so at a con

siderable risk

George Szpiro, the author of "Kep

ler's Conjecture," will surely agree.

Right after Thomas Hales announced,

in August 1998, that the problem was

at long last solved, and that the dens

est way of packing unit spheres is the

obvious way, Szpiro sat down to do, for

Kepler's conjecture, what Singh had

done for Fermat's, and he wrote a

splendid book full of insights, anec

dotes, and context to celebrate Hales's

achievement. Now, five years later, the

book has appeared but the proof still

hasn't. It is true that the paper by Hales

and Ferguson will be published even

tually in the Annals of Mathematics, but with an introductory remark by the

editors, a disclaimer as it were, stating

that they had been unable to verify the

correctness of the 250-page manu

script with absolute certainty. The

proof is so huge, and based to such an

extent on massive use of computers,

that the platoon of mathematicians

charged with the task of checking it ran

out of steam. Robert MacPherson, the

Annals editor in charge of the project,

stated that "the referees put a level of

energy into this that is, in my experi

ence, unprecedented." But they ended

up being only "99 percent certain" that

the proof was correct.

Hales, too, was exhausted. Rather

than take up MacPherson's suggestion

to re-write the manuscript, he started

on a new project, the Formal Proof of

Kepler, FPK (felicitously dubbed Pro

ject Flyspeck). After computer-based

theorem-proving, this is the next great

leap forward: computer-based proof

checking. Pushed to the limit, this

would seem to entail a self-referential

loop. Maybe the purists who insist that

a proof is a proof if they can under

stand it are right after all. On the other

hand, computer-based refereeing is

such a promising concept, for review

ers, editors, and authors alike, that it

seems unthinkable that the community

will not succumb to the temptation.

It appears that Flyspeck will require

twenty man-years to verify every single

step of Hales's proof. If all goes well,

we then can be 100 percent certain.

You shouldn't wait twenty years before

reading about the tormented history of

Kepler's conjecture, though. In fact,

Szpiro's book, by appearing so patently

at the wrong moment, gains an extra

quality. It becomes what some of my

colleagues from the humanities call a

"meta-text." The whole book, which is

mostly about the treacherous nature of

the sphere-packing problem, has itself

become yet another victim of that

treachery.

This does not speak at all against

this well-crafted piece of popular sci

ence writing. The topic leaves, of

course, nothing to be desired: many as

pects are based on simple geometric

arguments which are easy to illustrate.

Crystals and wallpapers, Euler's theo

rem and Voronoi's cells, packings and

coverings, nets and knots, all have an

immediate appeal to the brain and the

senses alike.

More importantly, the four hundred

years of persistent intellectual struggle

with the elusive conjecture offer a

wonderful mine of historical detail,

starting with Sir Walter Raleigh and his

concern for storing cannon-balls, and

including Buckminster Fuller with his

re-invention of the geodesic dome. The

"greatest minds in history" include, of

course, the usual suspects: Kepler,

Newton, Lagrange, Gauss, and Hilbert.

But Szpiro avoids the lure of touring

the summits only, and offers lively

vignettes of many figures less well

endowed with biographers. Axel Thue's

muddled attempts at solving the circle-

packing problem, the polite collabora

tion of Schutte and van der Waerden

on the "kissing number" (how many

unit spheres can touch a unit sphere),

and the almost hundred-year-long race

for an upper bound in sphere-packing

densities are carefully researched and

retold, as is the valiant struggle of Las

zlo Fejes-T6th, who predicted in the six

ties that computers would be needed to

crack the conjecture. George Szpiro, a

mathematician who became a reporter

(he currently works as the Israel cor

respondent of the prestigious Neue Zurcher Zeitung), displays his inves

tigative talents for digging up unlikely

stories, and a professional eye for the

telling detail. The review which Gauss

wrote about a book by Seeber (proving

in a few lines a lot more than Seeber

had achieved in two hundred pages) is

described with gusto and wit, as are the

recent debates surrounding Hsiang's

claims to having solved the problem.

A substantial part of Szpiro's book

is devoted to the diffidence with which

many mathematicians approach com

puter-based proofs. Most mathemati

cians agree that they are ugly, should

only be used as a last resort, and ought

to be replaced, as soon as possible, by

some of these short and elegant

"proofs from the book" in which God,

according to Erdos, keeps all true

mathematical insights. At the time

Szpiro wrote his text, the proof by

Hales and Ferguson was seen as such

a necessary evil (and shared the same

social status, among working mathe

maticians) as the classification of finite

groups: no single human brain had ever

understood it in its entirety, but the

community seemed to have resigned it

self to accept it. But now, this appears

to be no longer the case. Hence, the

story of Kepler's conjecture will have

to go on for a few more chapters. It is

to be hoped that it will eventually be

told with the same elan as in Szpiro's

book

Institute for Mathematics

University of Vienna

Strudlhofgasse 4

1 090 Vienna

Austria

e-mail: karl [email protected]


George Green, Mathematician and Physicist 1 793- 1 841 : The Background to H is Life and Work D. M. Cannell

SIAM, 2001 $75.00. ISBN 0-89871 -463-X XXXIX + 3 1 6 pp.

REVIEWED BY STEVEN G. KRANTZ

Every calculus student learns of Green's theorem as perhaps the

most benign multidimensional version of the fundamental theorem of calculus. Coupled with its delightful interpretations in terms of fluid flow and electrostatics, this result can be construed as one of the capstones of a freshman education in mathematics. But most calculus books contain almost no information about the life of George Green (1793-1841) himself, and most mathematicians have little knowledge of the man. More's the pity, for George Green was one of the more fascinating and important characters of early British science.

George Green was born in Nottingham. His father, an uneducated man, was a successful baker who set up his own mill in nearby Sneinton. He built a large family home on a substantial property near the mill. As was the custom of the time, George went to work for his father at an early age. Recognizing his son's talents, the elder Green sent his son to Robert Goodacre's academy at the age of 8. After four terms, young George had outstripped his teachers. He left the academy and returned to work at the mill.

George faithfully worked at his father's mill until 1829, when his father died. Then George was able to divest himself of the mill and devote himself to mathematics. In spite of his commitment to his father's business, George had been able to produce his fundamental (and first) paper, "An Essay on the Application of Mathematical Analysis to the Theories of Electricity and Magnetism." It was published by private


subscription in 1828, and only about 50 people saw it. The response was polite indifference, and poor George returned despondently to milling. By good fortune, Sir Edward Bromhead (a member of the local intelligensia) became aware of Green's work in 1830, and he encouraged him to take up mathematics again.

With Bromhead's support, Green enrolled at Caius College, Cambridge in 1833. He was already 40 years old. He achieved 4th Wrangler on the dreaded Tripos in 1837 and was elected to the status of College Fellow in 1839. Unfortunately, ill health forced Green to leave his position after just four terms. He died in 1841.

George Green is considered by many to have been the father of British mathematical physics. He published just ten papers in the period 1828-1839. Of these, his first (referenced above) is thought to have been the most important, and the most influential. It contains or introduces

• the idea of potential function; • what we now call Green's theorem; • the idea of reciprocity; • the idea of singular value; • the idea of the Green's function.

Later papers include ( i) the first rendition of what we now call the Dirichlet principle, ( ii) an important asymptotic method for solving certain partial differential equations in divergence form, and (iii) a preliminary version of the idea of tensors (indeed a particular tensor is today named after George Green).

Mary Cannell's book is a remarkable and profound effort. Little is known of Green's early years, and (as a senior scientist) he left behind little correspondence, no diary, and no working papers. An especially arduous effort was required to piece together the story of his life. It should be stressed that this book is not a commentary on Green's scientific work (although some of the appendices do treat this aspect of Green's life). In fact, the avowed purpose of the book is to treat the personal aspects.

Cannell tells us that Green had seven children by Jane Smith, the daughter of his father's mill manager; yet they never married. It appears that

Smith remained in the background of Green's life-although it should be noted that all the children ultimately adopted the name Green. Cannell marvels over the fact that Green mastered many of the techniques of French analysis at a time when these ideas were virtually unknown in England. She goes to great lengths to trace his personal and intellectual heritage.

And she certainly mourns Green's lack of personal and scientific recognition during his lifetime. In fact it was Lord Kelvin who rediscovered Green's "Essay" (his first paper) in 1845-four years after the man's death. He ultimately arranged for the paper to be properly published in Grelle's Journal in the 1850s. Finally, on the 200th anniversary of George Green's birth, a plaque in his honor was placed in Westminster Abbey-in front of the statue of Isaac Newton.

One can only speculate, and Mary Cannell does so at length, about what sort of career George Green might have had if he had had benefit of a proper education at the appropriate time in his life, and if he had lived in a more nurturing environment (such as Paris), rather than the stultifying wreckage that was British science in the early nineteenth century. It seems inarguable that George Green had a profound influence on such leaders of nineteenth-century British mathematical physics as Maxwell, Stokes, and Rayleigh. Cannell makes a point of the great effect that Green's ideas had on twentieth-century mathematical physics. For example, the Nobel-Prizewinning work of Julian Schwinger on quantum electrodynamics makes considerable use of Green's function. On the occasion of the 200th anniversary of Green's birth, such luminaries as Freeman Dyson and Schwinger gathered to help pay tribute to him.

We owe a debt to D. Mary Cannell for penning this, the only full-length biography of George Green in existence. It is a shame that this new edition was published posthumously, but a tribute to her scholarship and dedication to an important cause. One of the more interesting and daring points that Cannell makes in her book is that a more formal educa-

Left: The windmill where George Green ground corn for a living. Be

low: Green's grave, St. Stephen's Courtyard, Sneinton, Nottingham.

(Photograph by Jan Crosbia.) Both figures reproduced from The Mathematical tntelligencer, vol. 1 1 (1989), no. 4, pp. 39, 40.


tion�specially in the England of the

early nineteenth century-might have

actually suppressed George Green's

creativity. It does seem that Green was

the product of his environment-but in

a surprising and delightful way.


Washington University in St. Louis

St. Louis, MO 631 30

USA

e-mail: [email protected] .edu

Mathematical Apocrypha by Steven G. Krantz

MATHEMATICAL ASSOCIATION OF AMERICA, 2002, 205 pp $32.95 US, ISBN 0·88385-539-9

REVIEWED BY MARION D. COHEN

In Woody Allen's Midsummer Night's Sex Comedy, a stodgy old philosophy

prof, on the eve of his planned wed

ding to especially-lovely-in-this-film

Mia Farrow, waxes quite enthusiastic

about introducing his new wife to his

faculty associates. "I can't WAIT to in

troduce you to Prof. Eddy and his wife.

They're such an entertaining and amus

ing couple. He specializes in Dr. John

son, she teaches Boswell . . . . " Mia Far

row's demure non-expression betrays

that her enthusiasm does not match

his. As the evening progresses, we see

the prof at the piano, blissfully singing

Schubert Lieder, oblivious to Mia Far

row's boredom as she fans herself and

yawns, contemplating the prospect of

the rest of her life.

The academic life is possibly an ac

quired taste. At any rate, it's rather spe

cialized, an arena which, among other

things, has its particularities, its ru

mors, its "in-jokes," "in-wisdom"-in

other words, apocrypha. And often

what academic people such as Woody

Allen's professor find "entertaining and

amusing" wouldn't be given the time of

day by people who are perhaps not so

hard up.

The anecdotes in Mathematical Apocrypha are not all meant to be

funny per se, but humor seems, to var

ious degrees, to be their underlying

feeling and raison d'etre, if not theme.


Indeed, the very contrast between the

mathematical life and the "ordinary"

life provides a solid basis for humor.

Some of these stories are funny-ha-ha,

but others are, by turns, funny-familiar,

funny-wise, funny-sad, and funny-dark

(Apocryphes noirs?). Working backwards, we have tales

of the "notorious anti-Semite" Ludwig

Bieberbach and of murderous mathe

maticians, such as the recently infa

mous "Unabomber" and the perhaps

less known Walter Petryshyn, who

hammered his wife to death.

As an example of funny-sad (pathos),

we have the aged and senile Heinz Hopf

walking up to someone in the math de

partment hallways and remarking, "I

hear that the great Professor Hopf is

visiting us next week"

I wish there were more wise tales,

and perhaps the reason there aren't is

that smart people are often not wise. At

any rate, we do have Kakutani (Or, asks

the author, was it P6lya?) explaining

Brownian motion in three versus two

dimensions: "A drunken man will usu

ally find his way home. A drunken bird

has no hope." And Hardy and Little

wood's four "axioms" concerning their

collaboration shows considerable hu

morous wisdom. Here are the first two:

"Axiom 1: . . . it was completely indif

ferent whether what he wrote was right

or wrong . . . . Axiom 2: . . . When one

received a letter from the other, he was

under no obligation whatsoever to read

it, let alone to answer it. . . . " Here's my favorite funny-familiar

anecdote (otherwise known as an "in

joke"): "One year at MIT they decided

that calculus would henceforth be

taught non-rigorously. The dreaded ep

silons and deltas would no longer be

part of the course, and that was de

partment policy . . . some professors

took umbrage . . . one professor walked

into his calculus class on the first day

and announced, 'In this class we will

make use of certain constants called

sigma and tau. We defme a limit as fol

lows: For every tau greater than zero

there exists a sigma greater than zero

such that . . . '

As for funny ha-ha, the book offers

some downright slapstick Professor X,

in an effort to explain infinity as "like

a long line that never stops," walked

"through and out the window" and . . .

wound up "two floors below, spread

eagled in the bushes (and unharmed)."

Also deserving mention are the

funny-interesting, such as the descrip

tions of R. L. Moore's unusual way of

teaching. There are even the funny

poignant, like the clash of egos be

tween Atle Selberg and Paul Erdos.

"Paul, " began a third party, Kaplansky,

"you always say that mathematics is

part of the public trust. Nobody owns

the theorems. They are out there for all

to learn and to develop. So why do you

continue this feud with Selberg?" "Ah," replied Erdos, "but this is the prime

number theorem."

And then there are those which, like

the colleagues of the stodgy professor

in the Woody Allen movie, are perhaps

simply NOT funny, such as the mes

sage wired by Dirichlet to his father-in

law when his first child was born: "2 +

1 = 3". (Or perhaps this is funny

subtle, or funny-ya-hadda-be-there.)

Are there apocrypha that specifi

cally concern math itself? Happily, the

answer is yes-for example, the story

about von Neumann. "The lecturer ex

hibited a slide with many pieces of ex

perimental data and, although they

were badly scattered, he argued that

most of them lay on a curve. It is said

that von Neumann murmured, 'At least

they lie on a plane.' " Another example:

"Moore began a lecture by saying, 'Let

a be a point and b be a point.' Lefschetz

shouted, 'But why don't you just say,

"Let a and b be points"?' Moore replied,

'Because a may equal b.' " All told, this book gives a good pic

ture of "the life" and, as the author

says in the Preface, "the people who

live it"-although the book is, un

avoidably, more accessible to those

who themselves "live it." "I have stren

uously," he continues, "avoided the

telling of stories that are mean-spirited

or critical or that depict people in a

bad light. I want these stories to make

people happy, not sour." However, in

the next paragraph he admits that they

"are not always flattering," and much

of his book bears him out. And while

the consensus of opinion might be that

it portrays mathematicians as "hu

man," perhaps a little psychotherapy

would have made some of these same

mathematicians MORE human, and

less INhuman. Indeed, even in the

chapters on "Great Ideas" and "Great

People" we find some "not always flat

tering" examples. To wit: "Wiener

woke up in the middle of a lecture. He

peered slowly at each of the black

boards, evidently saw nothing of in

terest, burst into a fit of coughing,

staggered from the room, and was

seen no more. The coughing ceased as

soon as he left the lecture room."

Krantz divides his book into six

chapters: "Great Foolishness", "Great

Affrontery", "Great Ideas", "Great Fail

ures", "Great Pranks", and "Great Peo

ple". Sometimes it didn't seem clear

whether a particular anecdote had

been placed in the correct chapter. For

example, certainly the one about the

mathematician who hammered his

wife to death has no place in "Great

Ideas".

Other ways of dividing up the sto

ries might come to mind: Great Self

Absorption, Great Psychos, Great Be

fuddlement, Great Egos, Great Repres

sion, Great Manipulations, Great Ex

pectations, Great Power-trips, and Great

Bad-Jokes, perhaps all coming under the

heading Great Personality Defects, or

even Dysfunctional Analysis!

What does this book say about the

psychology of academic life? How

much do the stories betray a longing

for the ordinary (perhaps non-aca

demic) life? Amidst the pages of this

book Bertrand Russell muses, "I've

made an odd discovery. Every time I

talk to a savant I feel quite sure that

happiness is no longer a possibility. Yet

when I talk with my gardener, I'm con

vinced of the opposite."

What do the anecdotes have to say

about mathematicians and their brand

of yearnings, vulnerabilities, strengths?

Might academics have something to

learn about their lives and how to live

them, in particular how to communi

cate? What price do some of these

mathematicians and their psyches pay

for the kind of super-human-ness which

they possess?

Department of Physics, Mathematics,

Statistics, and Computer Science

University of the Sciences in Philadelphia

Philadelphia, PA 1 91 04

USA

e-mail: mathwoman1 [email protected]

STATEMENT OF OWNERSHIP, MANAGEMENT, AND CIRCULATION (Required by 39 U.S. C. 3685). (1) Publication title: Mathematical Intelligencer. (2)

Publication No. 001-656. (3) Filing Date: 10/03. ( 4) Issue Frequency: Quarterly. (5) No. of Issues Published Annually: 4. (6) Annual Subscription Price:

$66.00. (7) Complete Mailing Address of Known Office of Publication: 175 Fifth Avenue, New York, NY 10010-7858. Contact Person: Joe Kozak, Telephone

212-460-1500 EXT 303. (8) Complete Mailing Address of Headquarters or General Business Office of Publisher: 175 Fifth Avenue, New York, NY 10010-

7858. (9) Full Names and Complete Mailing Addresses of Publisher, Editor, and Managing Editor: Publisher: Springer-Verlag New York, Inc., 175 Fifth Av

enue, New York, NY 10010-7858. Editor: Chandler Davis, Department of Mathematics, University of Toronto, Toronto, ON, Canada, M5S 1Al. Managing

Editor: Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010-7858. (10) Owner: Springer-Velag Export GmbH, Tiergartenstrasse 17, D-

69121 Heidelberg, Germany and Springer-Verlag GmbH & Co. KG, Heidelberger Platz 3, D-14197 Berlin, Germany. (11) Known Bondholders, Mortgagees,

and Other Security Holders Owning or Holding 1 Percent or More of Total Amount of Bonds, Mortgages, or Other Securities: Springer-Verlag Export

GmbH, Tiergartenstrasse 17, D-69121 Heidelberg, Germany. (12) Tax Status. The purpose, function, and nonprofit status of this organization and the ex

empt status for federal income tax purposes: Has Not Changed During Preceding 12 Months. (13) Publication Name: Mathematical Intelligencer. (14) Is

sue Date for Circulation Data Below: Summer 2003. (15) Extent and Nature of Circulation: (a.) Total Number of Copies (Net press run): Average No.

Copies Each Issue During Preceding 12 Months, 3450; No. Copies of Single Issue Published Nearest to Filing Date, 4100. (b.) Paid and/or Requested Circulation: (1) Paid/Requested Outside-County Mail Subscriptions Stated on Form 3251: Average No. Copies Each Issue During Preceding 12 Months 2865;

No. Copies of Single Issue Published Nearest to Filing Date, 2865. (2) Paid In-County Subscriptions: Average No. Copies Each Issue During Preceding 12

Months, 0; No. Copies of Single Issue Published Nearest to Filing Date, 0. (3) Sales Through Dealers and Carriers, Street Vendors, Counter Sales, and

Other Non-USPS Paid Distribution: Average No. Copies Each Issue During Preceding 12 Months, 0; No. Copies of Single Issue Published Nearest to Fil

ing Date, 0. (4) Other Classes Mailed Through the USPS: Average No. Copies Each Issue During Preceding 12 Months, 0; No. Copies of Single Issue Pub

lished Nearest to Filing Date, 0. (c.) Total Paid and/or Requested Circulation: Average No. Copies Each Issue During Preceding 12 Months, 2865; No.

Copies of Single Issue Published Nearest to Filing Date, 2865. (d.) Free Distribution by Mail (samples, complimentary, and other free): (1) Outside-County

as Stated on Form 3541: Average No. Copies Each Issue During Preceding 12 Months, 156; No. Copies of Single Issue Published Nearest to Filing Date,

156. (2) In-County as Stated on Form 3541: Average No. Copies Each Issue During Preceding 12 months, 0. No. Copies of Single Issue Published Nearest

to Filing Date, 0. (3) Other Classes Mailed Through the USPS: Average No. Copies Each Issue During Preceding 12 Months, 0; No. Copies of Single Issue

Published Nearest to Filing Date, 0. (e.) Free Distribution Outside the Mail: Average No. Copies Each Issue During Preceding 12 Months, 182; No. Copies

of Single Issue Published Nearest to Filing Date, 182. (f.) Total Free Distribution: Average No. Copies Each Issue During Preceding 12 Months, 338; No.

Copies of Single Issue Published Nearest to Filing Date, 338. (g.) Total Distribution: Average No. Copies Each Issue During Preceding 12 Months, 3203;

No. Copies of Single Issue Published Nearest to Filing Date, 3203. (h.) Copies not Distributed: Average No. Copies Each Issue During Preceding 12 Months,

247; No. Copies of Single Issue Published Nearest to Filing Date, 897. (i.) Total (Sum of 15g. and h.): Average No. Copies Each Issue During Preceding 12

Months, 3450; No. Copies of Single Issue Published Nearest to Filing Date, 4100. Q.) Percent Paid and/or Requested Circulation: Average No. Copies Each

Issue During Preceding 12 Months, 89.70%; No. Copies of Single Issue Published Nearest to Filing Date, 89.45%. (16) Publication of Statement of Owner

ship will be printed in the Winter 2004 issue of this publication. I certify that all information furnished on this form is true and complete. (17) Signature

and Title of Editor, Publisher, Business Manager, or Owner:

Dennis M. Looney

Senior Vice President and Chief Financial Officer, 9/27/03


,.-jfi111.19.h.l§i Robin Wilson I

The Phi lamath' s Alphabet-(

Caratheodory: Constantin Cara

theodory ( 1873-1950) is the most

significant Greek mathematician of

recent times; but, encouraged by

Minkowski, Klein, and Hilbert, he spent

most of his life in Germany. He made

significant contributions to the calcu

lus of variations and its applications to

geometrical optics, the theory of func

tions (especially conformal represen

tation), and measure theory. In applied

mathematics, he wrote on thermody

namics and relativity theory.

Caratheodory

Cauchy

Chebyshev

Cauchy: Augustin-Louis Cauchy (1789-

1857) was the most important analyst

of the early nineteenth century. In the

1820s he transformed the whole area

of real analysis, providing a rigorous

treatment of the c:;alculus by formaliz

ing the concepts of limit, continuity,

derivative, and integral. In addition, he

almost single-handedly developed the

subject of complex analysis, and many

results in this area are named after him:

'Cauchy's integral formula' appears on

the stamp.

Chebyshev: Pafnuty Chebyshev (1821-

1894) is remembered mainly for his work on orthogonal functions ('Cheby

shev polynomials') and probability

('Chebyshev's inequality') and for an im

portant contribution to the proof of the

1 2 � 8

nederland

Please send all submissions to

the Stamp Corner Editor,

Computer graphics

Robin Wilson, Faculty of Mathematics,

The Open University, Milton Keynes,

MK7 6AA, England



Condorcet

prime number theorem. He also worked

on quadratic forms and integrals and

studied theoretical mechanics and link

ages. He taught at the University of St

Petersburg, and founded the St Peters

burg School of Mathematics.

Computer graphics: Computer-aided

design has developed rapidly in recent

decades, and in 1970 the Netherlands

produced the first set of computer-gen

erated stamp designs. The stamp below

shows an isometric projection in which

the circles at the centres of the faces

gradually expand and become trans

formed into squares.

Condorcet: The Marquis de Condorcet

(1743-1 794) clarified the foundations

of probability theory, and studied the

solutions of ordinary and partial dif

ferential equations. His most signifi

cant contributions, however, were to

'social mathematics'-in particular, to

the analysis of models of voting pat

terns. After the French Revolution he

was arrested while fleeing for his life,

and died in captivity.

Counting on the fingers: From ear

liest times, people have needed to be

able to count and measure the objects

around them. Early methods of count

ing included forming stones into piles,

cutting notches in sticks, and counting

on the fingers. It is undoubtedly due to such finger counting that our familiar

decimal system emerged.

Counting on the fingers

Documents

The Mathematical Intelligencer volume 26 issue 1