Preface
The first part of these notes are based on a graduate course on modular forms I gave inSpring 2011 at the University of Oklahoma. The second part of these notes, still in progress,consists of additional topics I did not cover in the course, but which I might imagine coveringin a second semester.
Preface to first part
In the first part, I have tried to give an introduction to modular forms with a view towardsclassical applications, such as quadratic forms and functions on Riemann surfaces, as opposedto āmodern applicationsā (in the sense of requiring a more modern perspective) such asFermatās last theorem and the congruent number problem.
At the same time, I have tried to give a suitable introduction to lead into a one-semestercourse on automorphic forms and representations (in Fall 2011), which meant a slightlydifferent balance of material than in a course wholly focusing on classical modular forms.
I also tried to keep the prerequisites as minimal as possible while attempting to meetboth of these goals.
For those considering using these notes: you can get an idea of the contents by lookingat the table of contents and skimming through, so I wonāt elaborate on them here, but justsay the following points, which distinguish the presentation from some other treatments:
(1) My general philosophy is to find a balance between simplicity and completeness,focusing on what I think is important to understanding the ideas and being able to seeapplications, rather than favoring either generalities or minutiaāof course my preferredbalance may be different than yours. (2) I try to give geometric motivation to the definitionof modular forms. (3) I primarily focus on modular forms on Ī0(N) and do not evenintroduce modular forms with character (nebentypus). (4) I try to be fairly explicit witharithmetic (e.g., working out Fourier expansions of Eisenstein series with level and explicitformulas for representation numbers of quadratic forms), though I donāt go overboard (oreven as far along the board as I wanted). (5) I finish the first part with a brief treatment ofL-functions, which I will hopefully expand on later, possibly in the second part.
A warning and an apology are in order.The warning: several of the sections were written in a rush, and may have some (hopefully
not serious) errors. Please email me if you find any mistakes, so I can correct them.The apology: due to weather and travel, we missed many lectures, and as a result there
are many things missing that I would have liked to include, such as Siegel modular forms
1
2
and the structure of modular functions. Further, there are many details that I would like tohave included which I had not the time for. In addition, I realize now I could have done abetter job filling in some prerequisite material. Upon teaching this course again, or possiblyearlier if I am inspired, I plan to revise these notes. (I have made minor revisions to thefirst part, but no serious ones since.) If you have any comments or suggestions, I would behappy to hear them.
I would like to thank my students, for asking many questions and pointing out manymistakes in early versions of these notes, in particular: Kumar Balasubramanian, Jeff Breed-ing, James Broda, Shayna Grove, Catherine Hall, Daniel McLaury and Salam Turki. I amalso grateful to Victor Manuel Aricheta and Roberto Miatello for pointing out additionalerrors.
Preface to second part
Recently (2014ā2015) I started thinking again to write up notes on some additional topics Idid not cover in the one-semester modular forms course, e.g.: newforms, half-integral weightforms (though I didnāt even do odd weight in the first part!), quaternionic modular forms,EichlerāShimura theory, modularity of elliptic curves, Hilbert modular forms, Siegel modularforms. (Though, perhaps surprising to some, I still have no desire to cover modular formswith characterāthereās no accounting for taste, you know). I was contemplating teaching asecond semester course in modular forms in Spring 2016, but there was no modular formscourse in Fall 2015, and I instead decided to teach a course on (Quaternion) Algebras inNumber Theory. The notes for that course should eventually discuss quaternionic modularforms and the JacquetāLanglands correspondence, at least in simple situations.
Still, I made some progress in 2015ā2016 by adding an unpolished chapter on newformsto begin the second part, which is a slow work in progress (currently in progress: a chapteron Hilbert modular forms). While I donāt plan on doing a comprehensive treatment of thetopics in the second part, or necessarily give complete proofs, I hope to give a more-or-lessworking introduction to these topics.
Contents
Preface 1
I The basic course 5
1 Introduction 6
2 Elliptic functions 112.1 Complex analysis review: Holomorphy . . . . . . . . . . . . . . . . . . . . . . 112.2 Complex analysis review II: Zeroes and poles . . . . . . . . . . . . . . . . . . 142.3 Periodic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162.4 Doubly periodic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202.5 Elliptic functions to elliptic curves . . . . . . . . . . . . . . . . . . . . . . . . 23
3 The PoincarƩ upper half-plane 263.1 The hyperbolic plane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263.2 Fractional linear transformations . . . . . . . . . . . . . . . . . . . . . . . . . 283.3 The modular group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 313.4 Congruence subgroups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 353.5 Cusps and elliptic points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
4 Modular Forms 444.1 Modular curves and functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 444.2 Eisenstein series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 494.3 Modular forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 614.4 Theta series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 664.5 Ī· and ā . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
5 Dimensions of spaces of modular forms 785.1 Dimensions for full level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 785.2 Finite dimensionality for congruence subgroups . . . . . . . . . . . . . . . . . 87Appendix: Dimension Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
6 Hecke operators 956.1 Hecke operators for Ī0(N) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 966.2 Petersson inner product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
3
CONTENTS 4
7 L-functions 1137.1 Degree 1 L-functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
7.1.1 The Riemann zeta function . . . . . . . . . . . . . . . . . . . . . . . . 1137.1.2 Dirichlet L-functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
7.2 The philosophy of L-functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 1177.3 L-functions for modular forms . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
II Selected topics 124
8 Newforms and oldforms 1268.1 Hecke operators via double cosets . . . . . . . . . . . . . . . . . . . . . . . . . 1278.2 Hecke operators on Eisenstein series . . . . . . . . . . . . . . . . . . . . . . . 1308.3 AtkināLehner operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1348.4 New and old forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135
9 Hilbert modular forms 1429.1 Basic definitions and results . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
References 146
Index 149
Chapter 1
Introduction
There are many starting points for the theory of modular forms. They are a fundamentaltopic lying at the intersection of number theory, harmonic analysis and Riemann surfacetheory. Even from a number theory point of view, there are several ways to motivate thetheory of modular forms. One is via the connection with elliptic curves, made famousthrough Wilesā solution to Fermatās Last Theorem. This connection has many amazingimplications in number theory, but we will emphasize the role of modular forms in thetheory of quadratic forms.
Let Q be a quadratic form of rank k over Z. This means Q is a homogeneous polynomialof degree 2 over Z in k variables with a certain nondegeneracy condition. For example, Qmight be a diagonal form
Q(x1, . . . , xk) = a1x21 + a2x
22 + Ā· Ā· Ā·+ akx
2k (1.0.1)
where each ai 6= 0; or it may be something like Q(x, y) = x2 + 2xy + 3y2. Regardless, thefundamental question about Q is
Question 1.1. What numbers does Q represent? In other words, for which n does
Q(x1, . . . , xk) = n
have a solution in Z?
There is a more quantitative version of this question as follows.
Question 1.2. Let rQ(n) denote the number of solutions (in Z) to
Q(x1, . . . , xk) = n.
Determine rQ(n).
Note that an answer to Question 1.2 provides an answer to Question 1.1, since Question1.1 is simply asking, when is rQ(n) > 0? (Sometimes rQ(n) is infinite, and instead onecounts solutions up to some equivalence, but we will not go into this here.)
Let us consider the specific examples of forms
Qk(x1, . . . , xk) = x21 + x2
2 + Ā· Ā· Ā·+ x2k,
6
CHAPTER 1. INTRODUCTION 7
and write rQk(n) simply as rk(n). So Question 1.1 is simply the classical question, whatnumbers are sums of k squares? Lagrange proved that every positive integer can be writtenas a sum of four squares (and therefore ā„ 4 squares by just taking xi = 0 for i > 4). Thecases of sums of two squares and sums of three squares were answered by Fermat and Gauss.So a complete answer to Question 1.1 for the forms Qk has been known since the time ofGauss (who did fundamental work on quadratic forms), but the answer for k > 4 is not sointeresting, as it is trivially encoded in the answer for k = 4.
Hence at least for these forms, we see the question of determining rk(n) is much more in-teresting. Furthermore, Question 1.1 is typically much more difficult for arbitrary quadraticforms Q than it is for Qk, and a general method for answering Question 1.2 will provide usa way to answer Question 1.1.
Now let us briefly explain how one might try to find a formula for rk(n). Jacobi consideredthe theta function
Ļ(z) =āā
n=āāqn
2, q = e2Ļiz. (1.0.2)
This function is well defined for z ā H = x+ iy : x, y ā R, y > 0. Then
Ļ2(z) =
( āā`=āā
q`2
)( āām=āā
qm2
)=ā`,m
q`2+m2
=ānā„0
r2(n)qn.
Similarly,Ļk(z) =
ānā„0
rk(n)qn. (1.0.3)
(More generally, if Q is the diagonal form in (1.0.1), then we formally haveāki=1 Ļ(aiz) =ā
rQ(n)qn.) It is not too difficult to see that Ļk satisfies the identities
Ļk(z + 1) = Ļk(z), Ļk(ā1
4z
)=
(2z
i
) k2
Ļk(z). (1.0.4)
Indeed, the first identity is obvious because q is invariant under z 7ā z + 1.The space of (holomorphic) functions on H satisfying the transformation properties is
(1.0.4) is defined to be the space of modular forms Mk/2(4) of weight k/2 and level 4. Thetheory of modular forms will tell us that Mk/2(4) is a finite-dimensional vector space.
For example, when k = 4, M2(4) is a 2-dimensional vector space, and one can find abasis in terms of Eisenstein series. Specifically, consider the Eisenstein series
G(z) = ā 1
24+āān=1
Ļ(n)qn,
where Ļ(n) is the divisor function Ļ(n) =ā
d|n d. Then a basis of M2(4) is
f(z) = G(z)ā 2G(2z), g(z) = G(2z)ā 2G(4z).
Hence Ļ4(z) is a linear combination of f(z) and g(z). How do we determine what combina-tion? Simply compare the first two coefficients of qn in af(z) + bg(z) with Ļ4(z), and onesees that
Ļ4(z) = 8f(z) + 16g(z).
CHAPTER 1. INTRODUCTION 8
Expanding this out, one sees that
Ļ4(z) =ānā„0
r4(n)qn = 1 + 8ānā„1
Ļ(n)qn ā 32ānā„1
Ļ(n)q4n. (1.0.5)
Consequently
r4(n) =
8Ļ(n) 4 - n8Ļ(n)ā 32Ļ(n/4) 4|n.
If one wishes, one can write this as a single formula
r4(n) = 8 (2 + (ā1)n)ād|n,2-d
d. (1.0.6)
In particular, it is obvious that r4(n) > 0 for all n, in other words, we have Lagrangeāstheorem that every positive integer is a sum of four (not necessarily nonzero) integer squares.Furthermore, we have a simple formula for the number of representations of n as a sum offour squares, in terms of the divisors of n.
In this course, we will develop the theory of modular forms, and use this to derive variousformulas of the above type. Along the way, we will give some other applications of modularforms. We will also introduce the theory of L-functions, which are an important tool inthe theory of modular forms, and are fundamental in the connection with elliptic curves.Time permitting, we will introduce generalizations of modular forms, such as Siegel modularforms and automorphic forms.
As much as possible, we will try to keep the prerequisites to a minimum. Certainly,working knowledge of linear algebra is expected, as well as some familiarity with groups andrings. Familiarity with elementary number theory is helpful but not necessary (modulararithmetic will be used, as well as some standard notation from elementary number theory).We may at times discuss some aspects of basic algebraic number theory, but these discussionsshould be sufficiently limited to not greatly affect the flow of the text if ignored.
Particularly in the beginning, we will be discussing geometric ideas, as this providesmotivation for studying modular forms (why study functions on the upper-half plane Hsatisfying seeming strange transformation laws as in (1.0.4)?). Here, familiarity with suchthings as Riemann surfaces, isometry groups and universal covers will be helpful, but wewill develop the needed tools as we go. Some basic notions of point-set topology (open sets,continuity, etc.) will be assumed.
Perhaps most helpful will be a solid course on complex analysis (fractional linear trans-formations, holomorphy, meromorphy, Cauchyās integral formula, etc.) and familiarity ofFourier analysis. For those lacking (or forgetting) this analysis background, I will recall thenecessary facts as we go, but the reader should refer to texts on complex analysis or Fourieranalysis for further details and proofs.
There are a variety (but not a plethora) of exercises intertwined with the text. Most ofthem are not too difficult, and I encourage you to think about all of them, whether or notyou decide itās worth your while to work out the details. I have starred certain exerciseswhich I consider particularly important. You may find a larger selection of exercises fromthe references listed below.
CHAPTER 1. INTRODUCTION 9
There are many good references for the basic theory of modular forms. Unfortunately,none of them do exactly what we want, which is why I am writing my own notes. Alsounfortunately, the terminology and notation among the various texts is not standarized. Onthe other hand, they are better at what they do than my notes (and likely with fewer errors),so you are encouraged to refer to them throughout the course.
ā¢ [Ser73] Serre, J.P. A course in arithmetic. A classic streamlined introduction to mod-ular forms of level 1. Many of the details you need to work out for yourself.
ā¢ [Kob93] Koblitz, Neal. Introduction to elliptic curves and modular forms. A solid intro-duction to modular forms of both integral and half-integral weight (or arbitrary level),if slightly dense. The goal is to present them in connection with elliptic curves andshow how they are used in Tunnellās solution, assuming the weak BirchāSwinnerton-Dyer conjecture, of the ancient congruent number problem.
ā¢ [Kil08] Kilford, L.J.P. Modular forms: a classical and computational introduction. Anew book, and it seems like a good introduction to modular forms. Has errata online.The one thing lacking for our course is that it does not cover L-functions.
ā¢ [Zag08] Zagier, Don. Elliptic modular forms and their applications, in āThe 1-2-3 ofmodular forms.ā A beautiful overview of modular forms (primarly level 1) and theirapplications. Available online.
ā¢ [Lan95] Lang, Serge. Introduction to modular forms. Itās Serge Lang. Covers someadvanced topics.
ā¢ [Apo90] Apostol, Tom. Modular functions and Dirichlet series in number theory. Anice classical analytic approach to modular forms.
ā¢ [Mil] Milne, J.S. Modular functions and modular forms. Online course notes. Thistreatment, somewhat like [DS05] or [CSS97], has a more geometric focus (e.g., modularcurves).
ā¢ [DS05] Diamond, Fred and Shurman, Jerry. A first course in modular forms. An ex-cellent book, perhaps requiring more geometric background than others, focusing onthe connections of modular forms, elliptic curves, modular curves and Galois repre-sentations. Available online.
ā¢ [Iwa97] Iwaniec, Henryk. Topics in classical automorphic forms. An excellent andfairly elementary analytic approach using classical automorphic forms. Many interest-ing applications are presented.
ā¢ [Miy06] Miyake, Toshitsune. Modular forms. A fairly advanced presentation of thetheory of modular forms, starting with automorphic forms on adeles. Contains usefulmaterial not easily found in many texts. Available online.
ā¢ [Bum97] Bump, Daniel. Automorphic forms and representations. A thorough (at leastas much as possible in 550 pp.) text on automorphic forms and representations onGL(n). The first quarter of the book treats classical modular forms.
CHAPTER 1. INTRODUCTION 10
ā¢ [Ste07] Stein, William. Modular forms: a computational approach. Perhaps a usefulreference for those wanting to do computations with modular forms.
ā¢ [CSS97] Cornell, Silverman and Stevens (ed.s). Modular forms and Fermatās last the-orem. A nice collection of articles giving an overview of the theory of elliptic curves,modular curves, modular forms and Galois representations, and how they are used toprove Fermatās last theorem.
Chapter 2
Elliptic functions
Before we introduce modular forms, which, as explained in the introduction, are functionson the upper half-plane, or the hyperbolic plane, satisfying certain transformation laws, itmay be helpful to get a basic understanding of elliptic functions, which are functions on Csatisfying certain simpler transformation laws. That is our goal for this chapter.
Elliptic functions are a classical topic in complex analysis, and their theory can be foundin several books on the subject, such as [Ahl78], [Lan99], [FB09] (available online) or [Sta09](available online), as well as many texts on elliptic curves and modular forms (e.g., [Kob93],[Iwa97]). In fact, the complex analysis books [FB09] and [Sta09] discuss modular forms.Since elliptic functions are not a focus for this course, but rather a tool for motivation ofthe theory of modular forms, we will not strive for rigorous proofs, but merely conceptualunderstanding. Put another way, this chapter is a sort of summary of the pre-history ofmodular forms.
We will explain later how the theory of elliptic functions is essentially a āEuclideanversionā of the theory of modular forms, and in fact modular forms first arose from thestudy of elliptic functions.
2.1 Complex analysis review: Holomorphy
First let us review some basic facts from the theory of functions of one complex variable.Let C denote the complex plane. Throughout these notes, z will denote a complex
number, and unless stated otherwise, x and y will denote the real and imaginary parts,x = Re(z) and y = Im(z), of z. I.e., z = x+ iy where x, y ā R.
Let f : C ā C. As vector spaces, we clearly have C ' R2 via the linear isomorphismz 7ā (x, y), hence we may also view f as a function from R2 to R2. In fact, C and R2
are isomorphic as topological spaces, so the notion of continuity is the same whether weregard f as a function on C or R2. The conceptual difference between functions on C andR2 arises when we study the notion of differentiablilty, and the difference arises because onecan multiply complex numbers, but not (naturally) elements of R2.
Specifically, look at the condition
f ā²(z) = limhā0
f(z + h)ā f(z)
h. (2.1.1)
11
CHAPTER 2. ELLIPTIC FUNCTIONS 12
If we want to think of f as truly a function of C, this means we should be taking h ā Cin the above limit. In other words, unlike in the real case where h can only approach 0from the left or right, here h can approach 0 along any line or path to the origin in C. Inparticular, (2.1.1) should be true for h ā R and h = ik ā iR. Note, restricting to h ā Rin the limit above amounts to differentiating with respect to x, and restricting to h ā iRamounts to differentiating with respect to y.
We can write f(z) uniquely as f(z) = u(z) + iv(z) where u : Cā R and v : Cā R, i.e.,u = Re(f) and v = Im(f). We also write u(z) = u(x, y) and v(z) = v(x, y). Then condition(2.1.1) taking h ā R means
f ā²(z) = limhā0
f(x+ h+ iy)ā f(x+ iy)
h
= limhā0
u(x+ h, y) + iv(x+ h, y)ā u(x, y)ā iv(x, y)
h
= limhā0
u(x+ h, y)ā u(x, y)
h+ i lim
hā0
v(x+ h, y)ā v(x, y)
h
=āu
āx+ i
āv
āx.
Similarly, taking h = ik ā iR in (2.1.1) means
f ā²(z) = limkā0
f(x+ i(y + k))ā f(x+ iy)
ik
= limkā0
u(x, y + k) + iv(x, y + k)ā u(x, y)ā iv(x, y)
ik
= limkā0
u(x, y + k)ā u(x, y)
ik+ i lim
kā0
v(x+ k, y)ā v(x, y)
ik
=1
i
(āu
āy+ i
āv
āy
)=āv
āyā iāu
āy.
Comparing the real and imaginary parts of these 2 expressions for f ā²(z) gives the CauchyāRiemann equations
āu
āx=āv
āy,
āv
āx= āāu
āy. (2.1.2)
Definition 2.1.1. Let U ā C be an open set. We say f : U ā C is (complex) differ-entiable, or holomorphic, at z if the limit limhā0
f(z+h)āf(z)h exists (for h ā C). In this
case, the derivative f ā²(z) is defined to be the value of this limit.We say f is holomorphic on U if f is holomorphic at each z ā U . If f is holomorphic
on all of C, we say f is entire.
As we saw above, being holomorphic on an open set U means that the CauchyāRiemannequations will hold (and the partial derivatives will be continuous). (This is in fact if and onlyif.) Contrast this to differentiable functions on R2: if one knows the partial derivatives existand are continuous on an open set in R2, the function is (real) differentiable there. TheCauchyāRiemann equations give a much stronger condition for a function to be complexdifferentiable.
CHAPTER 2. ELLIPTIC FUNCTIONS 13
The biggest consequence of the CauchyāRiemann equations is that if f is holomorphic onU , so is f ā². Hence being differentiable once means being infinitely differentiable. This featuremakes complex analysis much nicer than real analysis. In particular, any differentiablefunction on U has a Taylor series expansion around any z0 ā U :
f(z) = f(z0) + f(z ā z0) +f ā²ā²(z0)
2!(z ā z0)2 +
f ā²ā²ā²(z0)
3!(z ā z0)3 + Ā· Ā· Ā· .
Recall that any power seriesā
nā„0 an(z ā z0)n has a radius of convergence R ā [0,ā]such that the series converges absolutely for |z ā z0| < R and diverges for |z ā z0| > R.
Definition 2.1.2. Let U be an open set in C and f : U ā C. We say f is analytic atz0 ā U if, for some R > 0, we can write
f(z) =ānā„0
an(z ā z0)n, |z ā z0| < R.
We say f is analytic on U if f is analytic at each z0 ā U .
Note if we can write f(z) as a power seriesā
nā„0 an(z ā z0)n about z0, and this serieshas radius of convergence R, then f is analytic on the open disc |z ā z0| < R.
One of the main theorems of complex analysis is
Theorem 2.1.3. Let U be an open set of C and f : U ā C. The following are equivalent.(i) f is holomorphic on U ;(ii) f is infinitely differentiable on U ;(iii) f if analytic on U ; and(iv) If u = Re(f) and v = Im(f), then the partial derivatives of u and v with respect to
x, y exist, are continuous, and satisfy the CauchyāRiemann equations.
From the definition, it is clear differentiation over C satisfies the usual differentiationrules of calculus (sum, product, quotient and chain rules; as well as the power rule andderivative formulas for trigonometric and exponential functions).
For n ā Z, the power functions f(z) = zn are well defined and entire for n ā„ 0, andholomorphic on the punctured plane Cā 0 for n < 0.
Exercise 2.1.4. Consider f(z) = 1z . Show f satisfies the CauchyāRiemann equations.
Deduce that it is analytic on its domain, but there is no single power series expansion validfor all z ā Cā 0.
One can define ez, sin(z) and cos(z) by the usual Maclaurin series expansions
ez =āān=0
zn
n!,
sin(z) =
āān=0
(ā1)nz2n+1
(2n+ 1)!,
CHAPTER 2. ELLIPTIC FUNCTIONS 14
and
cos(z) =āān=0
(ā1)nz2n
(2n)!.
These series have radius of convergence ā, and hence define entire functions.On the other hand, the logarithm cannot be extended to an entire function. The best
one can do is make it well defined on the complex plane minus a ray from the origin. Thisinvolves choosing a ābranch cut.ā Unless otherwise specified, we will chose our branch cutsso that the complex logarithm is holomorphic function on Cā Rā¤0
Power functions for non-integral exponents can be defined in terms of the logarithm.Specifically, one can formally define za = ea log z. This will be holomorphic when the loga-rithm is, which will typically be Cā Rā¤0 for us.
Some basic facts about analytic functions are recorded in the following.
Theorem 2.1.5. Let z0 ā C and f be defined on a neighborhood of z0. Suppose f is analyticat z0. Then f is analytic in a neighborhood U of z0. Furthermore, we have the following.
(a) If f ā² is nonzero on U , then f is conformal, i.e., f preserves angles.(b) If f ā²(z0) 6= 0, then f is locally invertible at z0, i.e., there exist a function g which is
analytic near f(z0) such that g f = id near z0.(c) Let S be a subset of U containing an accumulation point. If g : U ā C is analytic
and f(z) = g(z) for z ā S, then f(z) = g(z) for all z ā U .
The first part of the theorem follows from the fact we already remarked after Definition2.1.2. Part (a) says that analytic function preserve a lot of geometry. Even though theydo not in general preserve distance, this property of preserving angles forces certain rigidbehavior of analytic functions. For instance, conformality implies that there is no analyticfunction mapping an open disc to an open square. Another example of this rigid behaviouris Liouvilleās function, which we will state later, but geometrically says that no analyticfunction can map the complex plane to any bounded region. Part (c) also describes arigidity feature: an analytic function is determined by its values on merely a countable setof points (which contains an accumulation point).
2.2 Complex analysis review II: Zeroes and poles
The most basic analytic information about an analytic function is the location of its zeroesand poles. Letās start off discussing zeroes, although thereās not much to say at the moment.
Definition 2.2.1. Let f be analytic at z0 such that f(z0) = 0. We say f has a zero oforder m at z0 if limzāz0
f(z)(zāz0)m exists and is nonzero, or, equivalently, if the power series
expansion of f at z0 has the form
f(z) =
āān=m
an(z ā z0)n
with am 6= 0.
CHAPTER 2. ELLIPTIC FUNCTIONS 15
Note that zeroes of a nonconstant analytic function must be isolated, i.e., they form adiscrete set. To see this, suppose f is analytic on U , and let S be the set of zeroes of f . Iff is nonconstant and S has an accumulation point, then Theorem 2.1.5(c) implies f ā” 0 onU . In particular, in any bounded region, an nonzero analytic function has a finite numberof zeroes.
Definition 2.2.2. Let U be a neighborhood of z0 and f be a nonconstant analytic functionon U ā z0. We say f has a pole of order m at z0 if 1
f(z) has a zero of order m atz0, or, equivalently, limzāz0(z ā z0)mf(z) exists and is nonzero. In this case, we writef(z) = limzāz0 f(z) =ā.
If one wants to be a little more formal about assigning a value of ā to f , consider theRiemann sphere C = P1(C) = C āŖ ā. (Here the open sets of C are generated by theopen sets of C together with the balls about infinity, z : |z| > ĪµāŖā, for Īµ ā Rā„0. HenceC is topologically a sphere.) Even though there are infinitely many āreal directionsā to gooff to infinity in C (meaning picturing C as R2), we think of them all leading to the samepoint, ā, on C.
In fact, one can push this idea further. If f : C ā C, one can define a value for f(ā)and think of f : Cā C, and one can talk about analytic maps from the Riemann sphere toitself.
We need a term for analytic functions with poles (since functions are not called analytic,or holomorphic, at their poles).
Definition 2.2.3. Let U ā C be an open set and f : U ā C. Let S = fā1(ā) ā U be theset of poles of f in U . If S is discrete and f is analytic on UāS, we say f is meromorphicon U .
In particular, if f and g are holomorphic functions on U and g 6ā” 0, then f/g is meromor-phic on U . This follows from the fact that f/g is differentiable outside of the set of zeroesof S, which we remarked above is discrete. Hence all rational functions are meromorphicon C. Specifically, if f and g are nonzero polynomials, then the number of zeroes (countingmultiplicity, i.e., summing up the orders of zeroes) of f/g is deg(f) and the number of poles(again counting with multiplicity) is deg(g).
Just like a holomorphic function has a power series expansion about any point in itsdomain, a meromorphic function has a Laurent series expansion around any point in itsdomain.
Suppose f : U ā C is meromorphic, and let z0 ā C. If f(z0) 6= ā, then we just havea usual power series expansion f(z) =
āān=0 an(z ā z0)n valid near z0. Suppose instead f
has a pole of order m at z0. Then (zā z0)mf(z) is holomorphic near z0, so we have a powerseries expansion
(z ā z0)mf(z) =
āān=0
an(z ā z0)n.
This implies, in a neighborhood of z0, we have the following Laurent series expansion
f(z) =
āān=ām
an+m(z ā z0)n =
āān=ām
bn(z ā z0)n,
CHAPTER 2. ELLIPTIC FUNCTIONS 16
where we put bn = an+m for n ā„ ām. (A Laurent series is simply a series of the above form,i.e., a power series where the exponents are allowed to start at a finite negative number.)
Exercise 2.2.4. Write down a Laurent series for z+2z2(z+1)
about z0 = 0.
Proposition 2.2.5. Let z0 ā C and U be a neighborhood of C. Suppose f : Uāz0 ā C isanalytic and limzāz0 |zā z0|m|f(z)| = 0 for some m ā N. By defining f(z0) = limzāz0 f(z),the extension f : U ā C is meromorphic on U .
There are two cases in the proof. Either limzāz0 f(z) exists as a finite complex numberor not. In fact if f(z) is bounded as z ā z0, one can use Cauchyās integral formula (whichwe will recall later) to show the limit exists and the extension of f to U is analytic at z0.In this case we say f has a removable singularity.
Otherwise, g(z) = (z ā z0)mf(z) has a removable singularity, so by the above g(z) canbe extended to be analytic on U . If g ā” 0, then f ā” 0, which cannot happen since we haveassumed limzāz0 f(z) is not finite. Hence g(z) has a zero of some finite order 0 < k < m atz0. Then 1
f(z) = (zāz0)m
g(z) has a zero of order mā k at z0, i.e., f(z) has a pole of order mā kat z0, and is by definition meromorphic.
We remark that if the condition limzāz0 |zā z0|m|f(z)| = 0 does not hold for some m inthe above proposition, then f(z) has what is called an essential singularity. One exampleis e1/z at z = 0. While we do not need this, the behavior at essential singularities is soremarkable, we would be remiss not to point it out. For the next two results we assume Uis an open neighborhood of z0 and f : U ā z0 ā C is analytic.
Theorem 2.2.6. (Weierstrass) Suppose f has an essential singularity at z0. Then on anyneighborhood V ā U of z0, the values of f come arbitrarily close to any complex number,i.e., f(z) : z ā V ā z0 is dense in C.
This is a rather surprising result, though it is not too difficult to prove. However, oneof the most amazing theorems in complex analysis (or even all of mathematics) is thatsomething much stronger is true.
Theorem 2.2.7. (Big Picard). Suppose f has an essential singularity at z0. Then on anyneighborhood V ā U of z0, the values of f range over all complex number with at most onepossible exception, i.e., |Cā f(V )| = 0 or 1.
For example, since we know e1/z is never 0 for z ā C, Big Picard tells us that in everyneighborhood of 0, e1/z attains every nonzero complex value!
(Note the above theorem is called āBig Picardā because Picard has another beautiful the-orem in complex analysis named after him, now called Little Picard, which is a consequenceof Big Picard. We will recall Little Picard later.)
2.3 Periodic functions
Let Ļ ā Cā0 and Ī© a region in C, i.e., Ī© is a nonempty connected open subset of C. Weassume the map T (z) = TĻ(z) = z+Ļ is an isometry of Ī©. Since T preserves distance in C,this simply means T (Ī©) = Ī©. For instance, if Ļ = 1 (or any nonzero real number), we could
CHAPTER 2. ELLIPTIC FUNCTIONS 17
take Ī© to be a horizontal strip a < Im(z) < b for some fixed a, b. This is clearly invariantunder T .
a
b
Ī©z T (z)
(A slightly stranger looking region for Ī© could be the region between the curves y = sin(x)and y = 1 + sin(x).)
We say f : Ī© ā C has period Ļ if f(z + Ļ) = f(z) for all z ā Ī©. Note that if Ī© isnot preserved by the translation T , then there are no functions with period Ļ on Ī© becausef(z) and g(z) := f(z + Ļ) would have different domains.
Let Ī be the cyclic group of isometries of Ī© generated by T , i.e.,
Ī =. . . , Tā2, Tā1, I = T 0, T, T 2, . . .
= TkĻ : k ā Z ,
i.e., Ī is the group of all translations on Ī© by an integer multiple of Ļ.Any time we have a group of isometries acting on a metric space, we can consider the
quotient, in this case X = Ī©/Ī. What this means is the following. Define two points of Ī©to be Ī-equivalent if they differ by some element Ļ ā Ī, i.e., z ā¼Ī zā² if Ļ(z) = zā² for someĻ ā Ī. Since Ī is a group, ā¼Ī is an equivalence relation, and we can let [z] denote theĪ-equivalence class of z.
Then the quotient space X = Ī©/Ī, as a set, is defined to be the set of Ī-equivalenceclasses [z] : z ā Ī© of point of Ī©. This naturally inherits a topology from Ī©, the quotienttopology. Specifically, the open sets of X are of the form [z] : z ā U, where U is an openset of Ī©.
Graphically, in the above example where Ļ = 1 and Ī© = a < Im(z) < b, we can pictureX = Ī©/Ī as below.
a
b
X
Ļ
CHAPTER 2. ELLIPTIC FUNCTIONS 18
Here we identify the left and right borders of X. Topologically this is homeomorphic toS1 Ć (a, b). Precisely, we have a āflatā open cylinder of diameter 1 and height b ā a. Theadjective āflatā here refers to the fact that, while X is topologically a cylinder, its geometryis Euclidean, i.e., it has no curvature.
While X is technically not a subset of Ī©, let alone a specific subset of Ī© as above, it isoften convenient identify X with a subset of Ī© (at least as a setāgeometrically, we haveto identify the left and right borders of the shaded region above to get X). This is donethrough the notion of a fundamental domain.
Definition 2.3.1. Let Ī© ā C be a region and Ī a group of isometries of Ī©. We say a closedsubset F of Ī© with connected interior is a fundamental domain for Ī (or Ī©/Ī) if
(i) any z ā Ī© is Ī-equivalent to some point in F ;(ii) no two interior points of F are Ī-equivalent; and(iii) the boundary of F is a finite union of smooth curves in Ī©.
(Note: different authors impose different conditions on fundamental domains. For some,the fundamental domain would be the interior of what we called the fundamental domain.Others may have different conditions on the shape of the boundary, or require convexity.)
So, in our above example, we can take a fundamental domain F to be the shaded regionin the previous picture, including the boundary, i.e.,
F = z ā C : a < Im(z) < b, 0 ā¤ Re(z) ā¤ 1 = [0, 1]Ć (a, b).
(While this is not closed in C, it is closed in Ī©.) One could of course construct a differentfundamental domain by translating F to the left or right. A slightly different fundamentaldomain F ā² is pictured below.
a
b
F ā²
Ļ
Now we can view a function f : Ī© ā C with period Ļ as a function of X since thevalue of f(z) only depends upon the Ī-equivalence class of z. In particular, in our aboveexample, we can identify continuous functions on Ī© with period Ļ with continuous functionson X, i.e., continuous functions on the fundamental domain F = [0, 1] Ć (a, b) such thatf(iy) = f(1 + iy) for y ā (a, b).
Moving back to the case of a general region Ī© invariant under T = TĻ, there are twobasic approaches to constructing periodic functions on Ī©:
CHAPTER 2. ELLIPTIC FUNCTIONS 19
(1) Clearly the function e2Ļiz/Ļ has period Ļ. Hence we can use this to construct otherfunctions with period Ļ.
(2) We can start with a function f(z), which decreases sufficiently fast, such as f(z) = 1z2,
and average f over all elements of Ī, e.g.,
f(z) =ānāZ
f(z + nĻ).
The condition on the rate of decay of f will guarantee thatāf(z + nĻ) converges, and it
is clear that f(z + Ļ) = f(z).
For now, letās focus on (1). It turns out that we can write all (analytic) periodic functionsin terms of e2Ļiz/Ļ, but approach (2) will be useful in more complicated situations.
By making a change of variable z 7ā z/Ļ, which transforms the domain Ī© to Ļā1Ī©, wemay assume our period Ļ = 1. For simplicity, let us assume Ī© = C, so X = Ī©/Ī = C/Z.Thus, for a fundamental domain F for C/Z is F = z ā C : 0 ā¤ Re(z) ā¤ 1, and we identifythe sides Re(z) = 0 and Re(z) = 1 to get X. In other words, X is a flat cylinder of infiniteheight, and X is in bijection with the subset Fā = 0 ā¤ Re(z) < 1 of C.
Fā
10
Consider the image of Fā under the map f(z) = e2Ļiz. Write z ā Fā as z = x+ iy where0 ā¤ x < 1. Then
f(z) = e2Ļiz = eā2Ļye2Ļix = reiĪø,
where we put r = eā2Ļy and Īø = 2Ļx mod 2Ļ. Viewing f : z 7ā (r, Īø), it is clear that theimage of Fā under f is (0,ā) Ć [0, 2Ļ). Further this map is injective. Thinking back interms of f : Fā ā C, we see e2Ļiz is an analytic one-to-one map of Fā onto CĆ = Cā 0.
Suppose g : Cā C has period 1. Assuming g is continuous, we have a Fourier expansion
g(z) =
āān=āā
an(y)e2Ļinz,
where the n-th Fourier coefficient an(y) is given by
an(y) = g(n) =
ā« 1
0g(z)eā2Ļinzdx, (z = x+ iy).
A priori, an(y) is a just function of y.
CHAPTER 2. ELLIPTIC FUNCTIONS 20
However if g is meromorphic, then we have a stronger result. Put Ī¶ = f(z). Thenthere is a unique meromorphic G : CĆ ā C such that g(z) = G(Ī¶). It is natural to askwhen G extends to a meromorphic function of C. By Theorem 2.2.5, this is if and onlyif limĪ¶ā0G(Ī¶)|Ī¶|m = 0 for some m. Note that Ī¶ = e2Ļiz ā 0 for z ā Fā if and only ifIm(z)āā. Hence we may restate the above condition on G(Ī¶) being meromorphic at 0 aslimIm(z)āā g(z)|e2Ļimz| ā 0 for some m, i.e.,
for Im(z) large, there exists m such that |g(z)| < e2Ļmy. (2.3.1)
Since Ī¶ ā 0 corresponds to Im(z) ā ā, we say g is meromorphic at iā if (2.3.1) issatisfied.
Assume now that g is also meromorphic at iā. Then G(Ī¶) has a Laurent series expansionabout 0,
g(z) = G(Ī¶) =
āān=ām
cnĪ¶n =
āān=ām
cne2Ļinz,
where m is the order of the pole of G at 0. (We can take m = 0 if G does not have a poleat 0.) This must agree with the Fourier expansion above, so we have an(y) = cn for alln ā„ ām and an(y) ā” 0 for n < ām. Hence for meromorphic periodic functions, the Fouriercoefficients are constant (independent of y) and all but finitely many of the negative Fouriercoefficients vanish.
These facts about the Fourier expansion will be crucial in our study of modular forms.
2.4 Doubly periodic functions
Let Ļ1, Ļ2 be two complex periods, i.e., two nonzero complex numbers linearly independentover R. Then they generate a lattice Ī = ćĻ1, Ļ2ć = aĻ1 + bĻ2 : a, b ā Z ā C. We mightpicture Ī as āparallelogram gridā follows.
Ļ1
Ļ2
Ļ1 + Ļ2
0
While technically the lattice Ī is only the set of vertices of the above parallelogram grid,it is convenient to draw it as above.
CHAPTER 2. ELLIPTIC FUNCTIONS 21
Definition 2.4.1. A meromorphic function f : C ā C is elliptic (or doubly periodic)with respect to Ī if
f(z + Ļ) = f(z) for all z ā C, Ļ ā Ī.
The space of all such functions is denoted E(Ī).
Note f ā E(Ī) if and only if f(z + Ļ1) = f(z + Ļ2) = f(z). As with singly periodicfunctions, we can identify doubly periodic functions with functions on a quotient spaceC/Ī. Note this quotient space is a (flat) torus. We can take for a fundamental domain Fthe (closed) parallelogram whose vertices are drawn above.
Ļ1
Ļ2
Ļ1 + Ļ2
0
F
Then C/Ī is identified with the torus obtained by gluing together both opposite pairsof edges of the fundamental parallelogram F .
We remark that elliptic functions were originally studied because they contain, as aspecial case, the inverse functions of the classical elliptic integrals (integrals involving thesquare root of a cubic or quartic polynomial, which arise in the problem of finding the arclength of certain elliptical shapes, such as spirals and cycloids). This is also, as you mayguess, the root of the modern terminology.
We would like to be able to describe the space of meromorphic functions on C/Ī. Earlier,we suggested two methods for constructing periodic functions: (1) write functions in termsof a simple periodic function you know; and (2) average functions over Ī. Since there areno obvious nonconstant doubly periodic functions, here weāll start with approach (2).
In order to get something that converges, we need to average a function that decayssufficiently fast. A first idea might be to try averaging 1
z2over the lattice Ī = ć1, ić. This
would be āĻāĪ
1
(z + Ļ)2=āa,bāZ
1
(z + (a+ bi)2). (2.4.1)
If this converges, it would have a pole of order 2 at each lattice point, and nowhere else.However, it does not converge (absolutely). To see this, take z = 0, and sum up the absolutevalues of all terms excluding the 1
z2term to getā
(a,b)āZ2ā(0,0)
1
|a+ bi|2=āa,b
1
a2 + b2ā 4
ā« ā1
ā« ā1
dx dy
x2 + y2āā« 2Ļ
0
ā« ā1
r dr dĪø
r2=ā.
CHAPTER 2. ELLIPTIC FUNCTIONS 22
(Here of course the approximations are certainly good enough to formally check divergenceof the right most integral implies divergence of the sum on the left.)
Since (2.4.1) does not converge absolutely, the next simplest idea would be to consider
g(z) =āĻāĪ
1
(z + Ļ)3.
Indeed this sum converges absolutely, except of course when z ā Ī, in which case we geta pole of order 3. However, it turns out that one can modify the sum in (2.4.1) to getan elliptic function with only poles of order 2 at each lattice point. Precisely, we defineWeierstrass pe function (with respect to Ī) to be
ā(z) =1
z2+
āĻāĪā0
(1
(z + Ļ)2ā 1
Ļ2
).
This can be shown to converge and although not as obvious as (2.4.1), to be doubly periodic(see exercises below). Given this, it is clear this has a pole of order 2 at each z ā Ī and nopoles elsewhere, so ā is an elliptic function of order 2. (The order of an elliptic function isthe sum of the orders of its poles in C/Ī.)
Since ā is analytic on Cā Ī, it is differentiable, and on checks that its derivative is
āā²(z) = ā2g(z) = ā2āĻāĪ
1
(z + Ļ)3, (2.4.2)
which has order 3.
Exercise 2.4.2. Show ā(z) converges absolutely for z 6ā Ī and uniformly on compact setsin Cā Ī.
Exercise 2.4.3. Verify Equation (2.4.2).
Exercise 2.4.4. Use the previous exercise together with the fact that ā is even to show āis elliptic with respect to Ī.
Now one might ask: are there any (nonconstant) elliptic functions of order 0 or 1? Abasic result of complex analysis is the following.
Theorem 2.4.5. (Liouville) Any bounded entire function is constant.
If f ā E(Ī) has no poles, then it must be holomorphic on C, i.e., entire. Further the onlyvalues attained by f are the ones attained by restricting f to the fundamental parallelogramF , so f is also bounded. This shows there are no nonconstant elliptic functions of order 0(āā holomorphic).
Furthermore, one can show there are no elliptic functions of order 1. For those of youwho remember your complex analysis, the argument goes as follows. The double periodicitycondition, together with Cauchyās theorem, implies that the integral around any fundamentalparallelogram with no poles on the border must be 0. (We may assume by shifting ourfundamental domains that there are no poles on these parallelograms.) Hence by the residue
CHAPTER 2. ELLIPTIC FUNCTIONS 23
theorem, the sum of the residues in any parallelogram must be 0. In particular, we cannothave only one simple pole inside such a parallelogram.
The point is that ā is the simplest (nonconstant) elliptic function. Furthermore, it is nottoo difficult to show that the space of elliptic functions E(Ī) is a field, and it is generatedby ā and āā², i.e.,
E(Ī) = C(ā, āā²),
i.e., any elliptic function can be expressed as a rational function in ā and āā².
2.5 Elliptic functions to elliptic curves
The theory of elliptic functions gave rise to the modern theory of elliptic curves and tomodular forms. This is very beautiful and important mathematics, so despite the fact thatit will not be so relevant to our treatment of modular forms, I feel obliged to at leastsummarize these connections.
In this section, x and y do not denote the real and imaginary parts of a complex numberz.
Definition 2.5.1. Let F be a field. An elliptic curve over F is a nonsingular (smooth)cubic curve in F 2.
We wonāt give a precise definition of nonsingular, but by cubic curve we mean a curvedefined by a cubic polynomial (in this case, in two variables).
Theorem 2.5.2. Let F be a field of characteristic zero and E/F be an elliptic curve. Then,up to a change of variables, we may express E in Weierstrass form as
y2 = x3 + ax+ b, (a, b ā F ).
Furthermore, an equation of the above type defines an elliptic curve if and only if the dis-criminant ā = ā(E) = ā16(4a3 + 27b2) 6= 0.
The nonzero discriminant condition is what forces the equation to be nonsingular. E.g.,the equation y2 = x3 has a cusp at the origin (graph it over R).
Actually, one typically wants to consider projective elliptic curves rather than just theaffine curves. Suffice it to say that, for a curve in Weierstrass form, the projective ellipticcurve can be thought of as the affine curve given above together with a point at infinity. Itis well known that the points of an elliptic curve form an abelian group, with the point atinfinity being the identity element. However to prove it directly from the above definitionis not so easy.
Let Ī = ćĻ1, Ļ2ć be a period lattice in C, and ā be the associated Weierstrass pe function.Then for all z ā Cā Ī,
āā²(z)2 = 4ā(z)3 ā 60G4ā(z)ā 140G6,
where Gk =ā
ĻāĪā0 Ļāk. In other words, the map z 7ā (ā(z), āā²(z)) maps C onto (the
affine points) of the elliptic curve EĪ : y2 = 4x3 ā 60G4xā 140G6 defined over C, assuming
CHAPTER 2. ELLIPTIC FUNCTIONS 24
Ī is chosen so that the discriminant of EĪ» is nonzero. (Replacing y with 2y puts this inWeierstrass form, if one wishes to do that.) By the periodicity of ā and āā², this factorsthrough (C ā Ī)/Ī. We extend this map to C/Ī by sending Ī to the point at infinity onEĪ.
This map gives an analytic isomorphism
C/Ī ' EĪ
from the torus C/Ī to the elliptic curve EĪ. Furthermore, all elliptic curves over C (andconsequently all elliptic curves over any subfield of C) arise from some such lattice Ī. Con-sequently all (projective) elliptic curves over C are topologically tori. Now it is clear thatthe points on the (projective) elliptic curve EĪ naturally form an additive group becauseC/Ī is (with respect to addition on C, taken modulo Ī), and the identity of EĪ is the pointat infinity, which corresponds to the identity Ī of (C/Ī,+) by construction.
Hence elliptic functions (particularly ā and āā²) provide a link between lattices in C andcomplex elliptic curves. We will now sketch how elliptic functions and elliptic curves leadto the theory modular forms.
We say two lattices Ī and Īā² in C are equivalent if Īā² = Ī»Ī for some Ī» ā CĆ.
Exercise 2.5.3. Show that if Ī and Īā² are equivalent, the elliptic functions on Īā² are simplythe elliptic functions on Ī composed with a simple change of variables.
Furthermore, Ī and Īā² being equivalent is the same as the elliptic curves EĪ and EĪā²
being group isomorphic by an analytic map.Write Ī = ćĻ1, Ļ2ć. Clearly Ī is equivalent to the lattice ć1, Ļć where Ļ = Ļ2/Ļ1. Since
ć1, Ļć = ć1,āĻć and Ļ 6ā R, we may assume Im(Ļ) > 0, i.e., Ļ lies in the upper half-planeH = z ā C : Im(z) > 0.
For Ļ, Ļ ā² ā H, the lattices ć1, Ļć and ć1, Ļ ā²ć are equal if and only if Ļ ā² ā ć1, Ļć andĻ ā ć1, Ļ ā²ć, i.e., if and only if Ļ ā² = aĻ + b and Ļ = cĻ ā² + d for some a, b, c, d ā Z. It is easyto see this means Ļ ā² = Ļ + b for some b ā Z.
Hence we may assume Ļ ā H/Z, or more precisely, ā12 ā¤ Re(Ļ) < 1
2 . This means thatĀ±1,Ā±Ļ will be the four nonzero lattice points of ć1, Ļć closest to the origin. Hence theonly way two lattices ć1, Ļć and ć1, Ļ ā²ć can be equivalent is if Ī» Ā±1,Ā±Ļ = Ā±1,Ā±Ļ ā². It isnot hard to see this means either Ļ ā² = Ļ or Ļ ā² = ā 1
Ļ .
Exercise 2.5.4. Let Ļ, Ļ ā² ā H. Fill in the details in the above argument to show that ć1, Ļćand ć1, Ļ ā²ć are equivalent if and only if Ļ ā² = Ļ + b or Ļ ā² = ā 1
Ļ + b, for some b ā Z.
What the above shows is that equivalence classes of lattices are parametrized by the setof Ļ ā H subject to the conditions either ā1
2 ā¤ Re(Ļ) < 12 and |Ļ | > 1 or ā1
2 ā¤ Re(Ļ) ā¤ 0and |Ļ | = 1.
CHAPTER 2. ELLIPTIC FUNCTIONS 25
ā12
12
We will see in the next section that the (closure of the) above region is a fundamentaldomain for Ī\H, where Ī ' PSL(2,Z) is the group of (hyperbolic) isometries of H generatedby the transformations Ļ 7ā Ļ + 1 and Ļ 7ā ā 1
Ļ .What does all this have to do with modular forms? Well, if you are studying elliptic
curves (or even just lattices), one thing you want to look is invariants. For example, givenĻ ā H, consider the map
ā : Ļ 7ā Ī = ć1, Ļć 7ā EĪ 7ā ā(EĪ»),
sending a point in H to the discriminant of the associated elliptic curve. Equivalent (an-alytically group isomorphic) elliptic curves have essentially the same discriminant, so ā isa (holomorphic) function on H satisfying certain transformation properties under Ī. Theseproperties will make ā what is called a modular form of weight 12 and level 1.
Chapter 3
The PoincarƩ upper half-plane
There are two basic kinds of non-Euclidean geometry: spherical and hyperbolic. Sphericalgeometry is easy enough to imagine. Consider the sphere S2 =
(x, y, z) ā R3 : x2 + y2 + z2 = 1
.
Then the geodesics (paths which locally minimize distance) are simply the great circles. Inother words, for two points P and Q on S2, a shortest path in S2 between them is anarc lying on a great circle connecting P and Q (this great circle will be unique providedP 6= āQ), and the distance between P and Q in S2 is the (Euclidean) length of this arc.
Hyperbolic geometry is a bit harder to picture. Perhaps the easiest way to initiallyvisualize the hyperbolic plane is one sheet of the two-sheeted hyperboloid x2 + y2 ā z2 = 1,however this is not the easiest model to work with for many purposes. Just like with theEuclidean plane, the hyperbolic plane has translations, rotations and reflections. The twomost common models to use, both named after PoincarĆ©, are the PoincarĆ© disc model (dueto Beltrami) and the PoincarĆ© upper half-plane model (due to Riemann). The disc modelhas the advantage of making rotations easy to visualize, and the upper half-plane model hasthe advantage of making translations easy to visualize. When working with modular forms,one always uses the upper half-plane model.
3.1 The hyperbolic plane
Unless stated otherwise, in this chapter z, w ā H and x, y ā R>0.
Definition 3.1.1. The PoincarƩ upper half-plane, or hyperbolic plane, is the set
H = z ā C : Im(z) > 0
together with the metric given by the distance function
d(z, w) = coshā1
(1 +
|z ā w|2
2 Im(z) Im(w)
).
Angles in the hyperbolic plane H are just taken to be the usual Euclidean angles.We will often write
d(z, w) = coshā1
(1 +
u(z, w)
2
),
26
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 27
where
u(z, w) =|z ā w|2
Im(z) Im(w).
While modular forms will be functions on H satisfying certain transformation laws underisometry groups, we do not actually need to know too much about the geometry of H forour study of modular forms. Thus some basic facts about the geometry of H which we donot need, but may help conceptually to be aware of, I will just state without proof. Insteadone may refer to basic references on hyperbolic geometry, such as [And05] or [Kat92].
The first such fact is the following.
Lemma 3.1.2. d(z, w) is a metric on H.
To see why this is plausible, first notice the argument of coshā1 always lies in [1,ā).Recall cosh(x) = ex+eāx
2 so cosh(0) = 1 and cosh is increasing and concave up on [0,ā). Socoshā1(1) = 0 and coshā1 is increasing and concave down on [1,ā) (it grows logarithmicallyof course).
In particular, (i) d(z, w) : H Ć H ā [0,ā) with d(z, w) = 0 if and only if the argumentof coshā1 is 1, i.e., if and only if u(z, w) = 0, i.e., if and only if z = w. It is also clearthat (ii) d(z, w) = d(w, z) for z, w ā H since u(z, w) = u(w, z). Thus to show d(z, w) isa metric, it remains to show (iii) the triangle inequality, d(z, w) ā¤ d(z, v) + d(v, w), holdsfor all z, w, v ā H. This is somewhat technical (and more easily proven using a differentformulation of the hyperbolic distance), so Iāll leave it to you to either work out or look up.
Another fact one should know, though not formally used in our development of modularforms is the following.
Proposition 3.1.3. The geodesics of H are the vertical lines z ā H : Re(z) = x0 and thesemicircles z ā H : |z ā z0| = r0 in H which meet R orthogonally. In particular, given anyz, w ā H, there is a unique geodesic connecting them.
Here are some drawing of geodesics:
0
Donāt get confused by analogy with the spherical model here. While itās true that thegeodesic arc from z to w is the shortest path between them, the distance between themis not the Euclidean arc length of this arc, but rather the hyperbolic arc length given by(ds)2 = (dx)2+(dy)2
y2, where z = x + iy. In other words, the hyperbolic distance d(z, w) as
defined above does not give you the Euclidean length of geodesic arc from z to w. Forinstance, the distance from any point z ā H to the origin (i.e., limyā0+ d(iy, z)) is infinite.Similarly, the distance from any finite point z ā H to iā (i.e., limyāā d(iy, z)) is infinite.We compute the case of z = i below.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 28
Example 3.1.4. Let y ā R>0. Then
d(i, iy) = coshā1
(1 +
(y ā 1)2
2y
)In fact, since coshā1(x) = ln(x+
āx+ 1
āxā 1), we can write
d(i, iy) = ln
(y2 + 1
2y+|y2 ā 1|
2y
)= | ln y|.
In particular, we see iy and i/y are equidistant from i, and indeed d(i, 0) := limyā0+ d(i, iy) =ā = limyāā d(i, iy) =: d(i, iā). Applying the triangle inequality shows in fact d(z, 0) =d(z, iā) =ā for any z ā H.
Exercise 3.1.5. For x, y > 0, compute the hyperbolic distance d(ix, iy).
Exercise 3.1.6. For z = eiĪø ā H, compute the hyperbolic distance d(i, z).
3.2 Fractional linear transformations
In this section, z will denote an element of H, x = Re(z) and y = Im(z) > 0.One nice thing about the upper half-plane model is that there is a nice way to de-
scribe the isometries of H in terms of fractional linear transformations (also called Mƶbiustransformations), which you may have encountered in complex analysis.
Recall for a ring R, we may define the 2Ć 2 special linear group
SL2(R) :=
(a bc d
): a, b, c, d ā R, adā bc = 1
,
as well as the corresponding projective special linear group
PSL2(R) := SL2(R)/Z,
where Z denotes the (central) subgroup of scalar matrices of SL2(R), i.e.,
Z =
(a
a
)ā SL2(R)
=
(a
a
): a ā R, a2 = 1
.
(If youāre not familiar with these, check that these are groups.) The most important casesfor us in this course are R = R and R = Z; in both of these cases Z = Ā±I.
Definition 3.2.1. A fractional linear transformation of H is a map Hā C of the form(a bc d
)z =
az + b
cz + d, z ā H
where(a bc d
)ā SL2(R).
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 29
(More generally, one can consider fractional linear transformations(a bc d
)ā GL2(C),
but we will always mean elements of SL2(R) by the term fractional linear transformation.)First note that, for c, d ā R not both 0, z ā az+b
cz+d is a rational function of C with at mostone pole (of order 1) at ād/c ā R if c 6= 0, fractional linear transformations are holomorphicmaps from H to C.
Exercise 3.2.2. Show that the image of H under a linear fractional transformation is con-tained in H.
This allows us to compose fractional linear transformations on H.
Lemma 3.2.3. For Ļ, Ļ ā SL2(R), the fractional linear transformation Ļ Ļ : Hā H givenby their composition is equal to the fractional linear transformation ĻĻ given by their matrixproduct.
The proof is just a simple computation, which we relegate to
Exercise 3.2.4. Prove Lemma 3.2.3.
Lemma 3.2.5. Let Ļ ā SL2(R). The fractional linear transformation Ļ : Hā C in fact ananalytic automorphism Ļ : H ā H, i.e., Ļ : H ā H an analytic bijection whose inverse isalso analytic. Further, Ļ acts trivially on H if and only if Ļ = Ā±I.
Proof. Write Ļ =
(a bc d
). Then
Ļ(z) =ax+ aiy + b
cx+ ciy + dĀ· cx+ dā ciycx+ dā ciy
=(ad+ bc)x+ bd+ acy2 + iy(adā bc)
(cx+ d)2 + (cy)2
Since adā bc = 1, we have(a bc d
)z =
(ad+ bc)x+ bd+ acy2 + iy
(cx+ d)2 + (cy)2=
(ad+ bc)x+ bd+ acy2 + iy
|cz + d|2, (3.2.1)
which clearly has positive imaginary part, so Ļ(H) ā H.To see this is one-to-one, suppose Ļ(z) = Ļ(w) for z, w ā H. Then cross-multiplying
denominators and expanding out
az + b
cz + d=aw + b
cw + d
givesadz + bcw = adw + bcz,
which says(adā bc)z = (adā bc)w,
i.e., z = w.Note the identity of SL2(R) acts trivially on H. By the previous lemma, this means
Ļā1 Ļ acts trivially on H. In particular, Ļ maps onto H. Hence Ļ : H ā H is an analyticautomorphism.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 30
In order for Ļ to be the identity on H, looking at the imaginary part of Ļ(z) shows(cx + d)2 + (cy)2 = 1 for all x ā R and y > 0. This implies c = 0 and d = Ā±1. Then thereal part of Ļ(z) is simply adx+ bd. For this to always equal x, we need b = 0 and ad = 1.Hence Ļ = Ā±I are the only elements of SL2(R) which act trivially on H.
So from now on, we will view fractional linear transformations as analytic automorphismson H (i.e., holomorphic bijections from H to H whose inverses are also holomorphic).
Because composition respects group multiplication, and the only fractional linear trans-formations acting trivially are Ā±I, two elements Ļ, Ļ ā SL2(R) define the same map if andonly if Ļ = Ā±Ļ . This means we can identify the group of fractional linear transformationson H with PSL2(R). Despite the fact that elements of PSL2(R) are technically not 2 Ć 2matrices, we will still write elements of PSL2(R) as elements of SL2(R) with the conventionthat āĻ is identified with Ļ .
Now letās look at a few specific examples of fractional linear transformations. Let
Ļn =
(1 n
1
),
Ī“m2 =
(m
mā1
), and
Ī¹ =
(1
ā1
).
Then
Ļn(z) =
(1 n
1
)z = z + n
simply translates H to the right by n,
Ī“m2(z) =
(m
mā1
)z = m2z
dilates, or scales, H outward radially from the origin by m2, and
Ī¹(z) =
(1
ā1
)z = ā1
z
inverts H about the semicircle z ā H : |z| = 1. (Writing z = reiĪø may make these lattertwo transformations easier to visualize. The second then becomes obvious, and we seeĪ¹(reiĪø) = 1
rei(ĻāĪø).) Note all these transformations map geodesics to geodesics. Furthermore,
because they are holomorphic maps, they are conformal, i.e., they preserve angles.
Lemma 3.2.6. The group of fractional linear transformations PSL2(R) acts transitively onH, i.e., for any z, w ā H, there exists Ļ ā H such that Ļ(w) = z.
Proof. It suffices to show that for any z ā H, there exists Ļ ā PSL2(R) such that Ļ(i) = z.For then there also exists Ļ ā PSL2(R) such that Ļ(i) = w, so (ĻĻā1)(w) = z by theprevious lemma.
Write z = x+ iy with x, y ā R and let Ļ = ĻxĪ“y. Then Ļ(i) = Ļx(iy) = x+ iy = z.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 31
A basic fact, though we will not need it, is the following.
Proposition 3.2.7. The group PSL2(R) of fractional linear transformations on H is pre-cisely the group of orientation-preserving isometries of H.
(To obtain the orientation-reversing isometries, one considers 2Ć 2-matrices of determi-nant ā1.)
We note that since angles in H are simply Euclidean angles, the fact that holomorphicmaps are conformal implies fractional linear transformations preserve hyperbolic angles.
While we omit the proof, we remark that it is straightforward (though somewhat tedious)to check that all fractional linear transformations are isometries, i.e., they preserve distance,i.e., d(z, w) = d(Ļ(z), Ļ(w)) for z, w ā H and Ļ ā PSL2(R). However the computations aresimpler in the following case.
Exercise 3.2.8. Let Ļ ā PSL2(R) and z ā H. Show d(z, i) = d(Ļ(z), Ļ(i)). (Note it sufficesto show u(z, i) = u(Ļ(z), Ļ(i)).)
From now on, we will often write Ļ(z) simply as Ļz for Ļ ā PSL2(R).
3.3 The modular group
In this course, we will not work with the group of all fractional linear transformationsPSL2(R), but rather certain nice discrete subgroups Ī. First we will study the most basicdiscrete subgroup, the (full) modular group, PSL2(Z).
(Some authors call SL2(Z) the full modular group because they prefer to work in termsof matrices, but we prefer to think in terms of fractional linear transformations of H, soPSL2(Z) is more natural to work with. However, there is no difference between speciallinear groups and projective linear groups in terms of fractional linear transformations, sothis should not cause any confusion.)
Proposition 3.3.1. PSL2(Z) = ćS, T ć where S =
(1
ā1
)and T =
(1 1
1
).
Note that S and T are simply the inversion Ī¹ and translation Ļ1 defined in the previoussections, however the notation of S and T is standard for these specific matrices.
Proof. Let Ī = ćS, T ć. It is clear Ī ā PSL2(Z). Note also that T ā Ī implies
N =
(1 x
1
): x ā Z
ā Ī.
Similarly
N =
(1y 1
): y ā Z
ā Ī,
since N is generated by T = Sā1TS =
(1ā1 1
)ā Ī.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 32
Now let(a bc d
)ā PSL2(Z). We want to show
(a bc d
)ā Ī. First note that
Sā1
(a bc d
)S =
(d ācāb a
),
we may assume |a| ā¤ |d|. Next observe(a bc d
)(1 x
1
)=
(a ax+ bc cx+ d
).
Hence by multiplying(a bc d
)on the right by some element ofN , we may assume 0 ā¤ b < |a|.
Similarly, since (1y 1
)(a bc d
)=
(a b
ay + c by + d
),
we may also assume 0 ā¤ c < |a|. Hence ad and bc are integers such that ad ā bc = 1,|ad| ā„ a2 and |bc| < a2. So we must have ad = 1 and bc = 0. In particular, a = d = Ā±1,and 0 ā¤ b < |a| and 0 ā¤ c < |a| then means b = c = 0, i.e.,(
a bc d
)= Ā±
(1 0
1
)ā Ī.
Remark. One can show in fact ćS, T |S2 = (ST )3 = 1ć is a presentation for PSL2(Z).
We have already defined the notion of a fundamental domain for Ī©/Ī, where Ī© ā Cand Ī is a group of (Euclidean) isometries of Ī©. We define fundamental domains in thehyperbolic plane the same way. For formalityās sake, we write out the definition now.
Definition 3.3.2. Let F ā H be a closed set with connected interior, and Ī a subgroup ofPSL2(Z). We say F is a fundamental domain for Ī (or Ī\H) if
(i) any z ā H is Ī-equivalent to some point in F ;(ii) no two interior points of F are Ī-equivalent; and(iii) the boundary āF of F is a finite union of smooth curves in H ā© F .
To clarify for future use, by (ii) we mean that if z, zā² lie in the interior F0 of F such thatĪ³z = zā² for Ī³ ā Ī, then z = zā². We do not mean that Ī³z = zā² implies Ī³ = I. However, wewill see later (Lemma 3.5.1) that our interpretation of (ii) implies Ī³ = I.
LetF =
z ā H : |Re(z)| ā¤ 1
2, |z| ā„ 1
.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 33
0 12ā1
21ā1
F
i
Ī¶3 Ī¶6
(Here Ī¶k = e2Ļi/k.)
Proposition 3.3.3. F is a fundamental domain for PSL2(Z).
Proof. Condition (iii) of Defintion 3.3.2 (āF is a finite union of smooth curves) is obviouslysatisfied, so we just need to show the first two conditions.
First we show (i) any point z ā H is PSL2(Z)-equivalent to some point in F , i.e., for any
z ā H, there is some Ī³ =
(a bc d
)ā PSL2(Z) such that Ī³z ā F . Observe that by (3.2.1), we
have Im(Ī³z) = Im(z)|cz+d|2 . Since c, d ā Z and they are not both 0, |cz + d|2 attains a minimum
as Ī³ ranges over PSL2(Z). Consequently, we may choose Ī³ such that Im(Ī³z) is maximal.Replacing Ī³ by TnĪ³ for some n ā Z shows that we may further assume |Re(Ī³z)| ā¤ 1
2 .It remains to show |Ī³z| ā„ 1. Suppose not. Let w = Ī³z and write w = u+ iv. Then
Sw =āu+ iv
u2 + v2.
In particular, if |w| < 1, then Im(Sw) > Im(w), i.e., Im(SĪ³z) > Im(Ī³z), contradicting themaximality of Im(Ī³z). This proves (i).
Now we need to show (ii) no two interior points of F are PSL2(Z)-equivalent. Suppose
z and w lie in the interior of F , and Ī³z = w for some Ī³ =
(a bc d
)ā PSL2(Z). We may
assume Im(w) ā„ Im(z), or else we could switch z and w and replace Ī³ by Ī³ā1. SinceIm(w) = Im(z)
|cz+d|2 , this means |cz + d| ā¤ 1. In particular |Im(cz + d)| = |c|Im(z) < 1, but
z ā F implies Im(z) >ā
32 so |c| < 2.
First suppose c = 0. Then adā bc = ad = 1. Multiplying Ī³ by Ā±1 if necessary (recall weare working in PSL2(Z), so this does not change Ī³), we may assume a = d = 1 so Ī³ = Tn
for some n ā Z. In this case it is clear we cannot have Ī³z ā F .
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 34
Now suppose c = Ā±1. Multiplying Ī³ by Ā±1 if necessary, we may assume c = 1. Note
1 > |cz + d|2 = Re(z + d)2 + Im(z)2 > Re(z + d)2 +3
4.
Hence |Re(z + d)| < 12 , which implies d = 0 since |Re(z)| < 1
2 . Then the determinantcondition implies b = ā1. So
Ī³ =
(a ā11 0
),
i.e., Ī³z = aā 1z , which cannot lie in F . This proves (ii).
Recall when we defined the notion of a fundamental domain, we said some authorsrequire fundamental domains to be convex. Note that F is convex in H (with respectto the hyperbolic metric), i.e., given any two points in F , one can connect them with aunique geodesic segment, and that segmentāeither a vertical line segment or a segment ofa semicircle centered on the real lineālies entirely in F .
Exercise 3.3.4. Show two boundary points z, zā² of F are PSL2(Z)-equivalent if and only ifIm(z) = Im(zā²) and Re(z) = āRe(zā²).
The above exercise shows the quotient space PSL2(Z)\H parametrizes equivalence classesof lattices as described Section 2.5 (cf. figure at end of Section 2.5). (Note, in contrast toChapter 2, it is standard to write the quotient by the action of PSL2(Z) on the left herebecause we think of matrices as acting on H from the left.)
Since the fundamental domain F we have given above is universally used, it is calledthe standard fundamental domain for PSL2(Z). There is a trivial way to cook upother fundamental domainsānamely Ī³F is also a fundamental domain for PSL2(Z) for anyĪ³ ā PSL2(Z). This statement is slightly generalized in the following simple exercise.
Exercise 3.3.5. Let Ī ā PSL2(R) be discrete and suppose F ā² is a fundamental domain forĪ. Suppose Ī³ ā PSL2(R) such that Ī³ā1ĪĪ³ = Ī. Show Ī³F ā² is also a fundamental domain forĪ.
*Exercise 3.3.6. (i) Show that the set of Ī³F with Ī³ ā PSL2(Z) tile H, i.e., their unioncovers H and their interiors are pairwise disjoint.
(ii) Show the picture below of the partial tiling of H given by Ī³F is correct, where welabelled the region Ī³F simply by Ī³. (Hint: begin by showing S takes the vertical line x+iR>0
to the upper semicircle passing through 0 and ā 1x .)
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 35
0 12ā1
21ā1 2ā2 1
323
ITā1 T
STā1S TS
STā1STSTS TST
In the above diagram, note STS = Tā1STā1 and TST = STā1S.
3.4 Congruence subgroups
From our point of view so far, the simplest kinds of (meromorphic) modular forms (modularfunctions of level one) will be meromorphic functions f : Hā C satisfying
f(Ī³z) = f(z) for Ī³ ā PSL2(Z). (3.4.1)
Note this is a hyperbolic analogue of the definition of elliptic functions. Specifically, fixa lattice Ī ā C and let ĪĪ be the group of isometries of C given by Ī³Ļ(z) = z + Ļ where Ļranges over Ī. Then elliptic functions w.r.t. Ī were simply defined to be the (meromorphic)functions on C satisfying f(Ī³z) = f(z) for Ī³ ā ĪĪ.
So instead of being functions on C (with Euclidean geometry) invariant under an isometrygroup ĪĪ corresponding to a lattice Ī, modular functions will be functions on H (withhyperbolic geometry) invariant under a hyperbolic isometry group Ī, with the simplest casebeing Ī = PSL2(Z).
If one wants to think of a hyperbolic analogue of the lattice Ī, there is no issue here.Note one can recover the lattice Ī ā C from the group ĪĪ simply by looking at the ĪĪ-translates of the origin in C. In this way, we see any isometry group of C isomorphic toZĆ Z gives a lattice. Similarly, for any (noncyclic) ādiscreteā isometry group Ī ā PSL2(R)of H, one can think of the corresponding āhyperbolic latticeā as simply the Ī-translates ofi. (Note discrete here precisely means the Ī-translates of i form a discrete subset of H, andone excludes the case of cyclic groups Ī as they will correspond to ā1-dimensionalā latticesin H.)
Now the modular functions satisfying (3.4.1), while certainly very interesting, are notsufficient for most number theoretic purposes. There are two ways of generalizing (3.4.1)that will include many import functions arising naturally in number theory. One way is toloosen the invariance by only requiring f(Ī³z) = Ī±(Ī³, z)f(z) for some āweightā Ī±(Ī³, z) (thenone calls the resulting functions modular forms). The second way is to not require (3.4.1)
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 36
hold for all Ī³ ā PSL2(Z), but merely all Ī³ ā Ī, where Ī is a suitable subgroup of PSL2(Z).In fact, as alluded to earlier, one could just require that Ī be a noncyclic discrete subgroupof PSL2(R). However the most interesting cases of study for number theory is when Ī is acongruence subgroup of PSL2(Z).
Definition 3.4.1. Let N ā N. The modular group of level N is
Ī0(N) =
(a bc d
)ā PSL2(Z) : c ā” 0 mod N
.
Note when N = 1, we have Ī0(1) = PSL2(Z), and we sometimes call this the fullmodular group. It is also clear that
Ī0(N) ā Ī0(M) if M |N.
One also has the more refined congruence subgroups
Ī1(N) =
(a bc d
)ā PSL2(Z) :
(a bc d
)ā”(
1 b0 1
)mod N
and the principal congruence subgroups
Ī(N) =
(a bc d
)ā PSL2(Z) :
(a bc d
)ā”(
1 00 1
)mod N
.
All of these subgroups are finite index inside PSL2(Z) and have been studied classicallyin the context of modular forms. Note that
Ī(N) ā Ī1(N) ā Ī0(N).
In general, a congruence subgroup of PSL2(Z) is a subgroup containing some Ī(N).However, the most important congruence subgroups are the modular groups Ī0(N), and wewill restrict our focus in this course to them.
Unless explicitly stated otherwise, from now on, Ī will always denote a congruence sub-group of PSL2(Z).
In order to understand the coset space PSL2(Z)/Ī0(N), it will be helpful to use theāprojective line over Z/NZ.ā For (a, c), (aā², cā²) ā Z/NZĆ Z/NZ, we write (a, c) ā¼ (aā², cā²) if(aā², cā²) = (Ī»a, Ī»c) for some Ī» ā (Z/NZ)Ć. It is clear that ā¼ is an equivalence relation. Wedenote the equivalence class of (a, c) by (a : c), and define the projective line
P1(Z/NZ) = (a : c)|a, c ā Z/NZ, gcd(a, c) = 1 .
Here by gcd(a, c) = 1, we mean that there exist a, c ā Z in the congruence classes a mod Nand c mod N such that gcd(a, c) = 1); or, alternatively, there exist r, s ā Z/NZ such thatar + cs ā” 1 mod N .
Lemma 3.4.2. The coset space PSL2(Z)/Ī0(N) is finite, in bijection with P1(Z/NZ), and
a set of coset representatives is given by matrices of the form(a bc d
), where (a, c) ā ZĆ Z
such that (i) (a : c) ranges over P1(Z/NZ) and (ii) b, d are chosen such that adā bc = 1.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 37
Proof. First observe that(a bc d
)ā(aā² bā²
cā² dā²
)Ī0(N) āā
(dā² ābā²ācā² aā²
)(a bc d
)ā Ī0(N).
By definition, this holds if and only if the lower left hand coefficient of the product on theright, aā²cā cā²a, is divisible by N . In other words, we have(
a bc d
)ā(aā² bā²
cā² dā²
)Ī0(N) āā aā²c ā” acā² mod N.
Note that a pair of integers (a, c) can occur as a column of a matrix in PSL2(Z) if and onlyif adā bc = 1 is solvable, i.e., if and only if gcd(a, c) = 1.
Hence the space of cosets PSL2(Z)/Ī0(N) is in bijection with the set of pairs (a, c) āZ/NZĆZ/NZ such that gcd(a, c) = 1 under the equivalence that (a, c) ā” (aā², cā²) āā aā²c ā”acā² mod N . It suffices to show that aā²c ā” acā² mod N is equivalent to (aā², cā²) = (Ī»a, Ī»c) forsome Ī» ā (Z/NZĆ).
Clearly if (aā², cā²) = (Ī»a, Ī»c), then aā²c ā” acā² mod N . On the other hand, suppose(a, c), (aā², cā²) ā Z Ć Z such that gcd(a, c) = gcd(aā², cā²) = 1 and aā²c ā” acā² mod N . WriteN = pn1
1 Ā· Ā· Ā· pnkk . By the Chinese Remainder Theorem, Z/NZ ' Z/pn1
1 Z Ć Ā· Ā· Ā· Ć Z/pnkk Z.Under this isomorphism, write a = (a1, . . . , ak) where ai ā Z/pnii Z, and do similarly for c,aā², and cā².
Then aā²c ā” acā² mod N means aā²ici ā” aicā²i mod pnii for each i. By the condition gcd(a, c) =1, we know pi cannot divide both ai and ci. Assume pi - ai, and say gcd(ci, p
nii ) = peii .
Then aā²ici ā” aicā²i mod pnii means gcd(cā²i, p
nii ) = peii so pi - aā²i. Put Ī»i = aā²ia
ā1i . Then
aā²i = Ī»iai and cā²i = Ī»ici. Consequently Ī» = (Ī»1, . . . , Ī»k) ā (Z/NZ)Ć such that (aā², cā²) ā”(Ī»a, Ī»c) mod N .
Corollary 3.4.3. [PSL2(Z) : Ī0(N)] = Nāp|N
(1 + 1
p
), where p runs through the distinct
prime divisors of N .
Proof. First suppose N = pn. For any equivalence class (a : c), either a is divisible by p ornot. If not, we may rewrite this class as (Ī»a : Ī»c) = (1 : Ī»c) where Ī» ā” aā1 mod N . Thereare N choices for Ī»c mod N , and all inequivalent. If p|a then p - c, so by the same argumentwe may rewrite (a : c) = (Ī»a : 1). Since Ī»a must be a multiple of p, there are N/p choicesfor Ī»a, and they all give inequivalent classes. Thus the proposition holds when N = pn.
The proof for arbitraryN follows from the prime power case using the Chinese RemainderTheorem (see exercise below).
Corollary 3.4.4. Let p be a prime, and n ā N. Then a complete set of coset representativesof PSL2(Z)/Ī0(pn) is given by
T ipS =
(ip ā11 0
): 0 ā¤ i < pnā1
āŖST jS =
(1 0āj 1
): 0 ā¤ j < pn
.
Proof. First check T iS and ST jS are given by the expressions above. Then observe theproof of Corollary 3.4.3 actually tells us that
P1(Z/pnZ) =
(ip : 1) : 0 ā¤ i < pnā1āŖ (1 : j) : 0 ā¤ j < pn .
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 38
Hence by Lemma 3.4.2, we know S = T 0S, T pS, . . . T pnā1
S, I = ST 0S, STS, . . . , ST pnā1S
forms a set of coset representatives for PSL2(Z)/Ī0(pn).
Exercise 3.4.5. Complete the proof of Corollary 3.4.3 in the case where N is not a primepower.
Exercise 3.4.6. Find a complete set of coset representatives for PSL2(Z)/Ī0(15).
Exercise 3.4.7. Find [PSL2(Z) : Ī1(N)]. (Suggestion: determine the index inside Ī0(N).)
Exercise 3.4.8. Find [PSL2(Z) : Ī(N)]. (Suggestion: see previous suggestion.)
Lemma 3.4.9. Let F be a fundamental domain for PSL2(Z), and let Ī be a congruencesubgroup of PSL2(Z). Let Ī±i be a complete set of coset representatives for PSL2(Z)/Īsuch that F ā² =
āĪ±ā1i F has connected interior. Then F ā² is a fundamental domain for Ī.
Proof. Clearly the boundary of F ā² is a finite union of smooth curves, since the same is trueof each Ī±iF .
Let z ā H. We show z is Ī-equivalent to a point in F ā². First, there is some Ļ ā PSL2(Z)such that Ļz ā F . Write Ļ = Ī±iĪ³ for some Ī±i where Ī³ ā Ī. Then Ī±iĪ³z ā F , so Ī³z āĪ±ā1i F ā F ā².Now we want to show no two points z and zā² in the interior F ā²0 of F ā² can be Ī0(N)
equivalent. The basic idea is that the following. Suppose zā² = Ī³z for Ī³ ā Ī, and writez = Ī±ā1
i w, zā² = Ī±ā1j wā² where w,wā² ā F . Hence wā² = Ī±iĪ³Ī±
ā1j w, i.e., wā² and w are equivalent
in PSL2(Z). If w,wā² are actually interior points of F , this can only happen if wā² = w,and one can use this to conclude zā² = z. However it need not be that w,wā² are interiorpoints (e.g., examine the diagram in the example below), so instead we make the followingargument to reduce to this case.
Let Ī³ ā Ī and suppose U = F ā²0ā©Ī³F ā²0 is nonempty. For some Ī±i, Ī±iU ā©F0 is nonempty.Say w ā Ī±iU ā© F0. Then z = Ī±ā1
i w ā F ā²0 such that zā² = Ī³z ā F ā²0. There exists Ī±j suchthat wā² = Ī±jz
ā² ā F . Thus we have wā² = Ī±jĪ³Ī±ā1i w, i.e., w is PSL2(Z)-equivalent to wā². It is
a simple exercise (below) that no interior point of F is PSL2(Z)-equivalent to a boundarypoint, hence wā² ā F0.
Since no two interior points of F are PSL2(Z)-equivalent, we have wā² = w, i.e., w is afixed point of Ī±jĪ³Ī±ā1
i . However no interior points of F are fixed by a nontrivial element ofPSL2(Z) (either justify this to yourself, or see Lemma 3.5.1 below), whence Ī±jĪ³Ī±ā1
i = I.This implies Ī±jĪ = Ī±iĪ, i.e., i = j, which then implies Ī³ = I so zā² = z.
Exercise 3.4.10. Let Ī be a congruence subgroup of PSL2(Z) and F a fundamental domainfor Ī. Show that no interior point of F is Ī-equivalent to a boundary point.
Example 3.4.11. Consider Ī0(2) and let F be the standard fundamental domain for PSL2(Z).By Corollary 3.4.4, a set of representatives for PSL2(Z)/Ī0(2) is I, S, STS. Hence by theabove lemma, F āŖ SF āŖ STā1SF would be a fundamental domain for Ī0(2) if it had con-nected interior, but it does not (cf. diagram for Exercise 3.3.6). Instead, if we replace thecoset representative STS = Tā1STā1 with Tā1S, and then apply the above lemma, we see
F ā² = F āŖ SF āŖ STF ,
pictured below, is a fundamental domain for Ī0(2).
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 39
0 12ā1
2
F
SFSTF
Note that F ā² is convex in H.
*Exercise 3.4.12. Find a fundamental domain for Ī0(4) which contains the above funda-mental domain for Ī0(2).
3.5 Cusps and elliptic points
As we said in the previous section, in some sense the simplest kinds of modular forms(modular functions) will be (meromorphic) functions on the upper half-plane H which areinvariant under transformations in PSL2(Z) or some congruence subgroup Ī. This means wecan think of them as functions on the quotient space X = Ī\H, or in other words, functionson a fundamental domain F for Ī which satisfy certain conditions on the boundary. (Thespace X is simply F with certain boundary points identified.)
There are 2 special kinds of points for X. First, let us discuss elliptic points. We sayz ā F (or X) is an elliptic point if there exists Ī³ ā Ī ā I such that Ī³z = z. In otherwords, the elliptic points for F are the points fixed by some (non-identity) transformationin Ī. We have already seen these play a role in finding a fundamental domain (in the proofof Lemma 3.4.9). The Ī³ ā ĪāI which fix some point of X (or equivalently F) are calledthe elliptic elements of Ī.
For example if Ī = PSL2(Z) and F is the standard fundamental domain, then i isfixed by the transformation S. We will proceed to determine all elliptic points and ellipticelements for PSL2(Z). First we observe the following general result.
Lemma 3.5.1. Suppose F is a fundamental domain for Ī and z is an elliptic point of F .Then z lies on the boundary āF of F .
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 40
Proof. Suppose z lies in the interior F0 of F and Ī³ ā Ī such that Ī³z = z. Then there existsa neighborhood U of z in F0 such that Ī³U ā F0. Consequently, any w ā U is Ī-equivalentwith Ī³w. However, by the definition of fundamental domain, this means Ī³w = w for allw ā U , i.e., Ī³ must act trivially on U . Since two holomorphic functions agreeing on an openset agree everywhere, this gives Ī³ = I, i.e., z is not elliptic.
Lemma 3.5.2. The only elliptic points for the standard fundamental domain of PSL2(Z) arei, Ī¶3 and Ī¶6. The elliptic subgroup fixing i is I, S. The subgroup fixing Ī¶3 is
I, ST, Tā1S
.
The subgroup fixing Ī¶6 isI, STā1, TS
.
Observe by the diagram in Exercise 3.3.6, we can visually see the only possible ellipticelements of PSL2(Z) are S, ST , STā1, STS, TST , Tā1S, TS, Tā1 and T . Further T andTā1 clearly fix no points of H.
*Exercise 3.5.3. Prove the previous lemma (either using Exer 3.3.6 or not).
Lemma 3.5.4. Let Ī be a finite index subgroup of PSL2(Z) and F a fundamental domain.Then any elliptic point of F must be of the form Ī³z ā āF where Ī³ ā PSL2(Z) and z āi, Ī¶3, Ī¶6. i.e., it is a PSL2(Z)-translate of i, Ī¶3 and Ī¶6 lying on the boundary of F .
Further, such a Ī³z will be elliptic if and only if Ī³CĪ³ā1 ā Ī where C is the stabilizer ofz in PSL2(Z). In this case, Ī³CĪ³ā1 is the stabilizer subgroup of Ī³z in Ī.
In particular, there are only finitely many elliptic points and elliptic elements for Ī.
Proof. Suppose w ā āF has nontrivial stabilizer C in Ī. Let F0 be the standard fundamentaldomain for PSL2(Z). There is some Ļ ā PSL2(Z) such that z = Ļw ā F0. Consequently,ĻCĻā1 is a subgroup of PSL2(Z) stabilizing z, i.e., z is an elliptic point in PSL2(Z)
On the other hand, let z ā i, Ī¶3, Ī¶6 and C be the subgroup (of order 2 or 3) whichstabilizes z in PSL2(Z). If Ī³z ā āF for some Ī³ ā PSL2(Z), then the stabilizer of Ī³z inPSL2(Z) is Ī³CĪ³ā1. Hence Ī³z is elliptic for Ī if and only if Ī³CĪ³ā1 ā Ī.
If z is an elliptic point for Ī, we say it is elliptic of order n if the stabilizing subgroupof z in Ī has order n. From the above lemma, we see that any elliptic point must have order2 or 3 in Ī ā PSL2(Z)
Exercise 3.5.5. Determine the elliptic points for Ī0(2) and Ī0(4) (Use the fundamentaldomains from 3.4.11 and Exercise 3.4.12.
Exercise 3.5.6. Let p be a prime.(i) Show that the number of elliptic points of order 2 for Ī0(p) is
Īµ2(p) = 1 +
(ā1
p
)=
1 p = 2
2 p ā” 1 mod 4
0 p ā” 3 mod 4.
(ii) Show that the number of elliptic points of order 3 for Ī0(p) is
Īµ3(p) = 1 +
(ā3
p
)=
1 p = 3
2 p ā” 1 mod 3
0 p ā” 2 mod 3.
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 41
(iii) Write a table which determines the total number of elliptic points for Ī0(p) dependingonly on the values of p mod 12.
More generally (e.g., [DS05, Corollary 3.7.2]), the number of elliptic points of order 2(resp. 3) for Ī0(N) is
āp|N Īµ2(p) (resp.
āp|N Īµ2(p)).
We remark that elliptic points in F will be precisely the non-smooth points of theRiemann surface X = Ī\H. To get some idea of the geometry that goes on, consider thecase of F being the standard fundamental domain for PSL2(Z). Around a non-elliptic point,one needs to go around 2Ļ to make a closed loop. On the other hand, to make a closed looparound i, one only needs to go Ļ radians. Similarly, to make closed loops around Ī¶3 andĪ¶6, one needs to go around 2Ļ/3 radians. In fact, the subgroup fixing any elliptic point z iscyclic of order m, and this subgroup is generated by an elliptic element which can be viewedas a hyperbolic rotation around z by 2Ļ/m.
While elliptic points do play a role in the theory of modular forms, more important forus will be the notion of the cusps of Ī. In fact, we need to know what cusps are to even givea definition of modular forms.
Thinking of the Riemann surface, X = Ī\H goes off to infinity in certain places. For ex-ample, think of Ī = PSL2(Z) and the standard fundamental domain F . If we look at the defi-nition of distance in H, it is clear the width of the fundamental domain d
(ā1
2 + iy, 12 + iy
)ā
0 as y ā ā. (Alternatively, think about the fundamental domain SF .) Hence if wethink about some truncated fundamental domain, say F100 = z ā F : Im(z) > 100, itlooks like an half-infinite cylinder whose diameter is shrinking to an infinitesimal amountas Im(z) ā ā. This behavior at infinity is called a cusp. If we look at the fundamentaldomain for Ī0(2) in Example 3.4.11, it appears to have two cusps, one at iā and one at 0.(Remember, distance to the real line is infinite in H.)
We will not be overly concerned with the geometry of X at its cusps (geometrically, theyall look the same), but mainly in determining what they in terms of fundamental domains.
One way to think about what the cusps for X should be is to think about how tocompactify it. Let us think about the case of Ī = PSL2(Z) with standard fundamentaldomain F . By our remarks about the geometry of F with large imaginary part, it makessense to look at the one-point compactification F = F āŖ iā of F . In other words, welook at X = PSL2(Z)\H āŖ iā.
It would be nice to think of X itself as a quotient space, i.e., X = PSL2(Z)\H whereH is obtained from H by adding possible cusps. Precisely, we define the extended upperhalf-plane H = H āŖ iā āŖQ.
Now we extend the action of PSL2(Z) to H as follows:(a bc d
)iā = lim
yāā
aiy + b
ciy + d=
ac c 6= 0
iā c = 0,
and (a bc d
)p
q= lim
yā0+
a(pq + iy
)+ b
c(pq + iy
)+ d
=
ap+bqcp+dq cp+ dq 6= 0
iā cp+ dq = 0,
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 42
where pq ā Q. (Note some authors call iā by ā, though we feel the notation iā is
pictorially more suggestive. In fact, the natural compactifcation of Hāmost easily seenfrom the PoincarĆ© disc modelāis to simply take the one-point compactification of HāŖR, soeven if we think of the symbol ā meaning limxāā = +ā, we still have ā = iā.) Thatthis is indeed an action (i.e., that the action associates with the group multiplication inPSL2(Z)) follows formally from the action on H.
If one wants to specify the topology on H, we can start with defining a basis of openneighborhoods of iā in H to be Uy = iā āŖ z ā H : Im(z) > y where y > 0. Then forx ā Q, take Ī³x ā PSL2(Z) such that Ī³iā = x. A basis of open neighborhoods around x inH is then given by Ī³xUy where y > 0.
Exercise 3.5.7. Show the open neighborhood Ī³xUy is a Euclidean circle in H āŖ R which istangent to R at x.
Definition 3.5.8. Let Ī ā PSL2(Z). The cusps of Ī (or Ī\H) are the set of Ī-equivalenceclasses of iā āŖQ.
Example 3.5.9. From the definition, we see any element of Q is PSL2(Z)-equivalent to iā.Hence, PSL2(Z) has 1 cusp.
Example 3.5.10. Let(a bc d
)ā Ī0(2). Then c ā” 0 mod 2, so a ā” d ā” 1 mod 2. Hence
iā is Ī0(2) equivalent to a (nonzero) rational number ac in reduced form if and only if c is
even. Similarly, 0 is Ī0(2) equivalent to a nonzero rational number bd in reduced form if and
only if d is odd. Thus there are two cusps for Ī0(2), represented by 0 and iā. This agreeswith the picture of the fundamental domain in Example 3.4.11.
Lemma 3.5.11. For any prime p, Ī0(p) has precisely 2 cusps, represented by 0 and iā.
Proof. Simply replace 2 by p in the previous example.
Exercise 3.5.12. Draw a fundamental domain for Ī0(3) which has 0 and iā as limit points.
Lemma 3.5.13. For any Ī0(N) the number of cusps is finite.
This statement is true more generally for congruence subgroups, but we will restrict ourproof to the case of modular groups.
Proof. Consider any rational number pq in reduced form. Because we only want to show
the number of Ī0(N)-equivalence classes of iā āŖ Q is finite, we may assume pq is not
Ī0(N)-equivalent to iā.There exist cā², d ā Z relatively prime such that cā²pN + dq = gcd(pN, q), which equals
gcd(N, q) since gcd(p, q) = 1. Further cā²pN + dq = gcd(N, q) implies gcd(c, d) = 1 where
c = cā²N . Thus there exist a, b such that(a bc d
)ā Ī0(N). Hence
(a bc d
)p
q=ap+ bq
cp+ dq=
ap+ bq
gcd(N, q).
CHAPTER 3. THE POINCARĆ UPPER HALF-PLANE 43
Therefore replacing pq by something Ī0(N)-equivalent, we may assume q|N .
Further, since T ā Ī0(N), we see
T bp
q=
(1 b0 1
)p
q=p+ bq
q
is also Ī0(N)-equivalent to pq . Hence we can also assume 0 ā¤ p < q. This gives only a finite
number of possible non-equivalent pq .
One can refine the argument above to precisely count the number of cusps for Ī0(N).
Exercise 3.5.14. Show that the number of cusps for Ī0(N) is given byād|N
Ļ(gcd(d,N/d))
where Ļ is the Euler phi function.
*Exercise 3.5.15. Compute the cusps for Ī0(4).
Chapter 4
Modular Forms
4.1 Modular curves and functions
Definition 4.1.1. The modular curves of level N , Y0(N) and X0(N), are defined to be
Y0(N) = Ī0(N)\H
andX0(N) = Ī0(N)\H.
In particular, Y0(1) = PSL2(Z)\H and X0(1) = PSL2(Z)\H.Note X0(N) is the compactification of Y0(N) obtained by adding the cusps for Ī0(N),
as described in the last chapter.
Exercise 4.1.2. Show X0(N), with the quotient topology, is compact, using the topology onH described Section 3.5.
Before we go further, let us explain the terminology a bit. In Section 2.5, we saw thatY0(1) parameterized lattices in C, equivalently, isomorphism classes of (generalized) ellipticcurves. (Some lattices would give a curve with discriminant 0, and these degenerate curvesare called generalized elliptic curves). One can also define a generalized elliptic curve forthe cusp in X0(1), and thus view X0(1) as parametrizing the space of (generalized) ellipticcurves over C, up to analytic isomorphism. Similarly, X0(N) parametrizes isomorphismclasses of pairs (E,C) where E is a (generalized) elliptic curve over C and C is a cyclicsubgroup of order N .
In algebraic geometry, a moduli space is a geometric object (curve, surface, manifold,etc.) whose points parametrize a class of other geometric objects, in this case elliptic curveswith a distinguished cyclic subgroup. The Riemann surface X0(N) can also be viewed as analgebraic curve over C, hence the term modular curve. Realizing X0(N) as a moduli spaceis fundamental in the theory of elliptic curves, though we will not focus on this aspect inthis course.
For those familiar with Riemann surfaces, or complex manifolds, we can state the defi-nition of a modular function very simply.
44
CHAPTER 4. MODULAR FORMS 45
Definition 4.1.3. (Riemann surface version) A modular function of level N is a mero-morphic function f : X0(N)ā C.
For both readers familiar with Riemann surfaces and those not, let us translate this intoa statement about functions on H.
First, think about f restricted to Y0(N) = Ī0(N)\H. Being a meromorphic function onY0(N) means we can lift f to a function f : Hā C such that
(i) f(Ī³z) = f(z) for all Ī³ ā Ī0(N), and(ii) f is meromorphic on H.
Now we need to explain what it means for f to be meromorphic at the cusps of X0(N).We first consider the notion of meromorphy at the cusp iā. It may be helpful at this pointto recall the discussion at the end of Section 2.3 (though our notation here is differentāfand F instead of g and G, and q instead of Ī¶).
Since T =
(1 10 1
)ā Ī0(N), we know f(Tz) = f(z + 1) = f(z), i.e., f is periodic of
period 1. Consequently, this means f(z) has a Fourier expansion
f(z) =ānāZ
ane2Ļinz
where the n-th Fourier coefficient an is given by
an =
ā« 1
0f(z)eā2Ļinzdx, (z = x+ iy)
valid for all z ā H. (Our argument at the end of Section 2.3 will justify that an does notdepend upon y.) Put q = e2Ļiz. Thus the Fourier expansion becomes
f(z) =ānāZ
anqn. (4.1.1)
This is also called the q-expansion for f . Note that the map z = x+ iy 7ā q = eā2Ļye2Ļix isan analytic bijection from the vertical strip
z ā H : ā1
2 ā¤ Im(z) < 12
in H to the punctured
discDĆ = z ā C : 0 < |z| < 1. Therefore we may think of f(z), a priori a periodic functionon H, instead as a function F (q) on DĆ.
If z = iy, then q = eā2Ļy, so as z ā iā, q ā 0. Hence f(z) being meromorphic at thecusp z = iā precisely means that F (q) is meromorphic at q = 0. If F (q) is meromorphicat q = 0, then it has a Laurent expansion, which must equal the Fourier expansion (4.1.1).In other words F (q) is meromorphic at q = 0 if and only if the Fourier coefficients an of fare independent of y and
(iii-a) the Fourier coefficients an for all but finitely many n < 0.
By (2.3.1), this is equivalent to the condition that |f(z)| grows at most exponentially asy āā, i.e.,
(iii-aā) for y 0, there exists m such that |f(z)| < emy.
CHAPTER 4. MODULAR FORMS 46
Now that we know what it means for f(z) to be meromorphic at the cusp iā, we canuse this to say what it means to be meromorphic at any cusp. Namely consider a cusp forĪ0(N) represented by z0 ā Q, and let Ļ ā PSL2(Z) be an element which maps iā to z0.Consider the function
f |Ļ (z) = f(Ļz).
Since Ļ is an isometry of H, f |Ļ is also meromorphic on H. Thus we can view |Ļ as anoperator on meromorphic functions of H, called the slash operator .
Lemma 4.1.4. Suppose f(Ī³z) = f(z) for Ī³ ā Ī0(N). Let Ļ ā PSL2(Z). Then f |Ļ (z+N) =f |Ļ (z) for all z ā H.
Proof. First note that f |Ļ is left-invariant under Ļā1Ī0(N)Ļ : for Ī³ ā Ī0(N), we have
f |Ļ (Ļā1Ī³Ļz) = f(Ī³Ļz) = f(Ļz) = f |Ļ (z).
On the other hand, note that the principal congruence subgroup
Ī(N) = Ī³ ā PSL2(Z) : Ī³ ā” I mod N ā Ī0(N)
is a normal subgroup of PSL2(Z). In particular Ī(N) ā Ļā1Ī0(N)Ļ , so f |Ļ is invariantunder TN ā Ī(N). This proves the claim.
This lemma means that f |Ļ is also periodic with period N . (It may actually have smallerperiod if a smaller power of T lies in Ļā1Ī0(N)Ļ .) Consequently, f |Ļ has a Fourier expansionof the form
f |Ļ (z) =ānāZ
aĻ,nqnN , (4.1.2)
whereqN = e2Ļiz/N .
(If, for instance, f |Ļ actually has period 1, we can write f |Ļ (z) =ācnq
n, i.e., the aĻ,n = 0unless N |n.) What f(z) to be meromorphic at the cusp z0 means is precisely that f |Ļ ismeromorphic at iā. Now our discussion about what it meant for f(z) to be meromorphicat iā goes through for functions with period N just as well, and we see f(z) is meromorphicat the cusp z0 if and only if
(iii-b) the Fourier coefficients aĻ,n = 0 for all but finitely many n < 0;
or equivalently,
(iii-bā) for y 0, there exists m such that |f |Ļ (z)| < emy.
Observe that (4.1.2) can be used to give another series expansion for f . Namely, sincef(z) = f |Ļ (Ļā1z), we can write
f(z) =ānāZ
aĻ,nqnĻ , qĻ = e2ĻiĻā1z/N . (4.1.3)
This expansion, again valid for all z ā H, is called a Fourier (or qā) expansion at z0
(with respect to Ļ) for f(z). Consequently, we sometimes refer to the usual Fourier expansion(4.1.1) as the Fourier expansion at iā.
CHAPTER 4. MODULAR FORMS 47
A priori, the Fourier expansion (4.1.3) at z0, and consequently the conditions (iii-b) and(iii-bā), depend upon the choice of Ļ ā PSL2(Z) we used to send iā to z0. We would like tosay all of these are independent of the choice of Ļāand in fact the choice of z0 representingthe given cuspāand this is essentially what the following lemma tells us.
Lemma 4.1.5. Suppose a meromorphic f : Hā C satisfies f(Ī³z) = f(z) for all Ī³ ā Ī0(N).Let z0, z
ā²0 ā iāāŖQ represent the same cusp for Ī0(N), i.e., zā²0 = Ī³z0 for some Ī³ ā Ī0(N).
Now suppose Ļ, Ļ ā² ā PSL2(Z) such that Ļ Ā·iā = z0 and Ļ ā² Ā·iā = zā²0. Then f |Ļ (z) = f |Ļ ā²(z+j)for some j ā Z.
Consequently, the Fourier coefficients with respect to Ļ and Ļ ā² are related by
aĻ ā²,n = aĻ,ne2Ļij/N
for all n; in particular |aĻ ā²,n| = |aĻ,n| for all n.
Proof. First observe that Ī³Ļ is an element of PSL2(Z) which sends iā to zā²0. Therefore
(Ļ ā²)ā1Ī³Ļ =
(a bc d
)ā PSL2(Z)
which preserves iā. By the definition of our action of PSL2(Z) on iā āŖ Q, this meansc = 0, whence a = c = Ā±1 so (Ļ ā²)ā1Ī³Ļ = T j for some j ā Z, i.e., Ļ ā² = Ī³ĻTāj . Consequently
f |Ļ ā²(z) = f(Ļ ā²z) = f(Ī³ĻTājz) = f(ĻTājz) = f |Ļ (Tājz) = f |Ļ (z ā j).
This gives the first part of the lemma.Since we may also the last equation as f |Ļ ā² = f |ĻTāj , the Fourier expansion at zā²0 (w.r.t.
Ļ ā²) for f(Ī³z) is
f(z) = f(Ī³z) =ānāZ
aĻ ā²,ne2ĻiT jĻā1Ī³ā1Ī³z/N =
ānāZ
aĻ ā²,ne2ĻiT jĻā1z/N =
ānāZ
aĻ ā²,ne2Ļij/Ne2ĻiĻā1z/N .
Comparing this with the Fourier expansion at z0 (w.r.t. Ļ) for f(z) gives the second assertion.
In other words, f(z) may technically have more than one Fourier expansion at a givencusp, but any Fourier expansion is obtained from a given one by replacing the parameterqĻ = e2Ļiz/N with a parameter of the form qĪ³ĻTāj = e2Ļi(ĻĪ³ā1z+j) for j ā Z and Ī³ ā Ī0(N)
and multiplying all the Fourier coefficients aĻ,n by a fixed root of unity e2Ļij/N .
Now we can recast Definition 4.1.3 in the language it is typically presented in mostmodular forms texts.
Definition 4.1.6. (Upper half-plane version) We say a function f : Hā C is a modularfunction of level N if
(i) f(Ī³z) = f(z) for all Ī³ ā Ī0(N);(ii) f is meromorphic on H; and(iii) f is meromorphic at each cusp of Ī0(N); i.e., for each Ļ ā PSL2(Z), the Fourier
coefficients aĻ,n as defined in (4.1.3) satisfy aĻ,n = 0 for all but finitely many n < 0.
CHAPTER 4. MODULAR FORMS 48
In practice one only needs to check the condition on negative Fourier coefficients aĻ,n asĻ ranges over a finite set of elements of PSL2(Z) such that Ļ ā iā runs over all cusps ofĪ0(N). Further, our discussion above of course implies that (iii) is equivalent to
(iiiā) for each Ļ ā PSL2(Z), the function f |Ļ is of moderate growth in y = Im(z), i.e.,for y large, there exists m such that |f |Ļ (z)| < emy.
Exercise 4.1.7. Suppose f is a modular function of level 1 with Fourier expansion f(z) =āāām anq
n. Show that for any Ļ ā PSL2(Z), the Fourier coefficients with respect to Ļ arethe usual Fourier coefficients, i.e., aĻ,n = an for all n.
Note that our above discussion goes through if we replace Ī0(N) by an arbitrary con-gruence subgroup Ī, as Ī ā Ī(N) for some N . Thus for any congruence subgroup Ī ofPSL2(Z), we can say f : H ā C is a modular function for Ī by simply replacing Ī0(N)by Ī in the definition above.
Exercise 4.1.8. Suppose f is a modular function for a congruence subgroup Ī(N).(i) For Ļ ā PSL2(Z), show f |Ļ is a modular function for Ļā1Ī(N)Ļ .(ii) Deduce that |Ļ : f 7ā f |Ļ operates on the space of modular functions for Ī(N).(iiii) Show f |Ļ = f |Ļ ā² if Ļ ā² ā ĪĻ . Hence at most |Ī(N)\PSL2(Z)| of the operators |Ļ can
be distinct.(iv) Show (f |Ļ )|Ļ ā² = |ĻĻ ā² , hence these operators give a right action of the quotient group
Ī(N)\PSL2(Z) on the space of modular functions for Ī(N).
There are a couple obvious things we can say now about modular functions.
Example 4.1.9. Any constant function on H is a modular function for any congruencesubgroup Ī.
Example 4.1.10. Suppose Ī, Īā² are two congruence subgroups such that Īā² ā Ī. If f is amodular function for Ī, then f is a modular function for Īā². In particular, if f is a modularfunction of level N and N |M , then f is also a modular function of level M .
Now of course we would like to know if some interesting (non-constant) modular functionsexist. It turns out they are not trivial to construct. For example, let us try to naivelyconstruct a modular function for PSL2(Z), i.e., a modular function of level 1. Let g(z) = 1
zn
for n ā Z. Then we can try to average over the PSL2(Z)-translates
f(z) =ā
Ī³āPSL2(Z)
g(Ī³z) =1
(Ī³z)n.
We can see this diverges just by considering Ī³ of the form ST j . Such Ī³ contributeājāZ
1
(ST jz)n=ā 1
(S(z + j))n=ā 1
(ā1/(z + j))n=āj
(ā1)n(z + j)
to the average, but this already diverges.An alternative approach might be the following. Since PSL2(Z) is generated by S (z 7ā
ā1/z) and T (z 7ā z+ 1), we just need to find a function invariant under both of them. We
CHAPTER 4. MODULAR FORMS 49
know e2Ļiz is invariant under T , so we could average this over the group ćSć = I, S. Thiswould give us
e2Ļiz + e2ĻiSz = e2Ļiz + eā2Ļi/z.
Unfortunately, because of the second term, this is no longer invariant under T . (And if onetries to average over enough of PSL2(Z) to make the sum formally PSL2(Z)-invariant, thenit will diverge as in the first attempt.)
The easiest way to construct nontrivial modular functions will be to use modular forms.We will discuss this in the next section. First, we will present one result about nontrivialmodular functionsāthey cannot be holomorphic!
Recall the following result of complex analysis.
Theorem 4.1.11. (Open Mapping Theorem) Let U ā C be an open set, and f : U ā Cbe holomorphic and nonconstant. Then f is an open map, i.e., f maps open sets to opensets.
Corollary 4.1.12. Let X be a compact Riemann surface, and f : X ā C holomorphic.Then f is constant.
Proof. What f : X ā C holomorphic means is the following. Around any point p ā X,there is an open set U such that there is an analytic isomorphism Ī¹U ā D with the openunit disc D ā C. For each such p, the map f Ī¹ā1 : D ā C is holomorphic at p.
Consequently, the Open Mapping Theorem says if f is not constant, f(X) is an opensubset of C. However, since f is continuous and X is compact, f(X) is also compact. Butthere are no open compact subsets of C.
Since X0(N) is compact (as is Ī\H for any congruence subgroup Ī), there are no non-constant holomorphic modular functions of level N (or for any congruence subgroup Ī).In fact, since the torus C/Ī is also a compact Riemann surface for a lattice Ī, the abovecorollary also tells us there are no nonconstant holomorphic elliptic functions with respectto Ī.
4.2 Eisenstein series
Even though we couldnāt construct modular functions by an averaging process like we couldfor elliptic functions, the theory of elliptic functions does provide a subtle hint for construct-ing modular functions. Recall that for a lattice Ī ā C, the field of elliptic functions withrespect to Ī was generate by the associate ā-function and its derivative āā². We constructedāā² by an simple averaging technique, but the construction of ā was more complicated.
Consequently, one might ask if the derivatives of modular functions are simpler to con-struct than modular functions themselves. Suppose f is a modular function for Ī. Let Ī³ ā Ī
and write Ī³ =
(a bc d
). Then
d
dzf(z) =
d
dzf(Ī³z) = f ā²(Ī³z) Ā· d
dz(Ī³z) = f ā²(Ī³z) Ā· d
dz
az + b
cz + d=
1
(cz + d)2f ā²(Ī³z).
CHAPTER 4. MODULAR FORMS 50
In other words, f ā² is no longer exactly invariant by Ī, but satisfies the transformation law
f ā²(Ī³z) = (cz + d)2f ā²(z).
Similarly if g(z) = f (k)(z), we have
g(Ī³z) = (cz + d)2kg(z).
As I have suggested, it is easier to construct functions with these transformation propertiesthan those which are strictly invariant under some Ī.
Definition 4.2.1. Let f : Hā C be meromorphic and Ī a congruence subgroup of PSL2(Z).Let k ā 2N āŖ 0. If
f(Ī³z) = (cz + d)kf(z), Ī³ =
(a bc d
)ā Ī, z ā H,
then f is weakly modular (or a weak modular form) of weight k for Ī. If Ī = Ī0(N)we say f is weakly modular of weight k and level N .
Note the factor cz + d is not quite determined uniquely by a(a bc d
)ā PSL2(Z), but
only up to Ā±1. However since k is assumed to be even, this creates no ambiguity in thedefinition above.
The adjective āweakā here refers to the fact that there is no condition at the cusps, likewe had for modular functions.
(One can define weak modular forms, and modular forms, of odd weight k, just byreplacing PSL2(Z) with SL2(Z). However for the same reason, they will not exist unlessĪ ā SL2(Z) is a sufficiently small congruence subgroup such that Ī³ ā Ī =ā āĪ³ 6ā Ī, orequivalently āI 6ā Ī. This rules out the case of modular groups Ī0(N) viewed as subgroupsof SL2(Z), but not Ī1(N) or Ī(N). Since our main focus in this course is modular forms forĪ0(N), we will not have much reason to consider these forms of odd weight.)
Lemma 4.2.2. Suppose f (resp. g) is weakly modular of weight k (resp. l) for Ī.(i) If k = l and c ā C, then cf +g is weakly modular of weight k = l for Ī. Hence weakly
modular functions of weight k form a complex vector space.(ii) f Ā· g is weakly modular of weight k + l for Ī.(iii) f
g is weakly modular of weight k ā l for Ī provided g 6ā” 0.
The proof is immediate.
Consequently, if we can construct two different weak modular forms of weight k for somek, then their quotient will be a nontrivial weak modular form of weight 0, which is a modularfunction provided the condition of meromorphic at the cusps is satisfied. We will come backto the condition at the cusps later, and use this to define meromorphic and holomorphicmodular forms. For now, we will show how to construct weak modular forms of even weightk > 2.
CHAPTER 4. MODULAR FORMS 51
The idea is basically to take a weighted average of some function g(z) : H ā C overa congruence subgroup Ī. To explain this, for Ī³ ā PSL2(Z), we define the automorphyfactor j(Ī³, z) : PSL2(Z)Ć Hā C to be
j(Ī³, z) = cz + d, Ī³ =
(a bc d
). (4.2.1)
(As we remarked after Definition 4.2.1, j(Ī³, z) is only defined up to Ā±1, but as we will raisethis to an even power, this causes no ambiguity.) For k even, consider the average
f(z) =āĪ³āĪ
1
(cz + d)kg(Ī³z) =
āĪ³āĪ
j(Ī³, z)ākg(Ī³z).
Then we formally havef(Ī³0z) =
āĪ³āĪ
j(Ī³, Ī³0z)ākg(Ī³Ī³0z).
Now we will need the following property of the automorphy factor.
Lemma 4.2.3. For Ī³, Ī³ā² ā PSL2(Z), we have
j(Ī³Ī³ā², z) = Ā±j(Ī³ā², z)j(Ī³, Ī³ā²z).
Proof. Observe that (3.2.1) implies
Im(Ī³z) =Im(z)
|j(Ī³, z)|2.
HenceIm(z)
|j(Ī³Ī³ā², z)|2= Im(Ī³Ī³ā²z) =
Im(Ī³ā²z)
|j(Ī³, Ī³ā²z)|2=
Im(z)
|j(Ī³, Ī³ā²z)j(Ī³ā², z)|2.
This shows the lemma is true up to taking absolute values.Thus the lemma follows from the following claim: Suppose f and g are meromorphic
functions with |f(z)|2 = |g(z)|2. Then f(z) = Ī¶g(z) where |Ī¶| = 1. We may assume g 6ā” 0.The hypothesis just says ff = gg, i.e., f/g = g/f . Hence the image of the meromorphicfunction f/g is contained in the set of z ā C such that z = 1/z, i.e., |z|2 = 1. By the OpenMapping Theorem (restrict to an open set where f/g is holomorphic), this is impossibleunless f/g is constant. Then the absolute value condition implies f(z) = Ī¶g(z) where|Ī¶| = 1.
This meansj(Ī³Ī³ā², z) = Ī¶j(Ī³ā², z)j(Ī³, Ī³ā²z)
with |Ī¶ = 1|. Then the fact that j(Ī³, z) is of the form cz + d where c, d ā Z (so e.g.,j(Ī³, z) ā Z for z ā Z) implies that Ī¶ = Ā±1.
By this lemma, we have
f(Ī³0z) = j(Ī³0, z)kāĪ³āĪ
j(Ī³Ī³0, z)ākg(Ī³Ī³0z) = j(Ī³0, z)
kāĪ³āĪ
j(Ī³, z)ākg(Ī³z) = j(Ī³0, z)kf(z),
CHAPTER 4. MODULAR FORMS 52
where we replaced Ī³ by Ī³Ī³ā10 in the middle step. In other words, provided f(z) converges
and is meromorphic, f(z) is weakly modular of weight k.However, as alluded to in the last section, averaging over all of Ī will be too much for
f(z) to converge. We also briefly toyed with a second ideaāstart with a periodic functionand average over just what we need to yield the weak modularity condition for Ī. Letāsrestrict to the case Ī = Ī0(N) so T ā Ī. (Arbitrary Ī ā Ī(N) can be treated similarlysince TN ā Ī(N).) Let P be the subgroup of PSL2(Z) generated by T , i.e.,
P = ćT ć =
(1 n0 1
): n ā Z
ā Ī0(N).
Note for Ī³ ā P , j(Ī³, z) = 1, so any periodic function g(z) with period 1 satisfies the weakmodularity condition for T . Thus averaging over P\Ī0(N) should, at least formally, yieldsomething which is weakly modular.
First letās describe this coset space.
Lemma 4.2.4. A complete set of representatives for P\Ī0(N) is parametrized by the set(c, d) ā Z2 : c ā” 0 mod N, gcd(c, d) = 1
/Ā± 1,
where a coset representative with parameter (c, d) can be taken uniquely in the form(a bc d
)ā
Ī0(N) with 0 ā¤ b < |d|, or(a bc d
)= S if (c, d) = Ā±(1, 0) and N = 1.
Proof. Let Ī³ =
(a bc d
)ā Ī0(N), so adā bc = 1 and c ā” 0 mod N . We can replace Ī³ with
the element (1 n0 1
)(a bc d
)=
(a+ cn b+ dnc d
)which lies in the same right P -coset of Ī0(N) as Ī³. First suppose d = 0. Then ābc = 1so we may assume b = āc = 1, which can only happen if N = 1. Choosing n = āa shows
Ī³ ā P(
0 1ā1 0
).
Now suppose d 6= 0. Then we may (uniquely) choose n such that 0 ā¤ b + dn < |d|.Consequently we may assume 0 ā¤ b < |d|. On the other hand, the determinant conditionsays a = 1+bc
d ā Z, i.e., bc ā” ā1 mod d. This determines b, and consequently a, uniquely.
So one idea might be to average the function e2Ļiz over this coset space, but again thereare some issues with convergence. However, thereās a simpler periodic function we coulduseāa constant function! In the k = 0 case, this obviously diverges, but if k ā„ 4, it willconverge.
Definition 4.2.5. Let k ā„ 4 be even. The (normalized) Eisenstein series of weight kand level N is defined to be
Ek,N (z) =ā
Ī³āP\Ī0(N)
j(Ī³, z)āk =1
2
ā(c,d)āZ2
gcd(c,d)=1cā”0 mod N
1
(cz + d)k. (4.2.2)
CHAPTER 4. MODULAR FORMS 53
If N = 1 we often write Ek for Ek,N .
Note the second expression for Ek,N in the definition follows (formally) from the abovelemma, and the factor of 1
2 comes from the fact that we include both (c, d) and (āc,ād) inthe sum.
Proposition 4.2.6. The Eisenstein series Ek,N converges absolutely and uniformly on com-pact subsets of H. Therefore, Ek,N is a weak modular form of weight k and level N which isholomorphic on H.
Proof. The discussion above shows that Ek,N transforms formally under Ī0(N) as it should.Specifically, for Ī³0 ā Ī0(N), we have
Ek,N (Ī³0z) =ā
Ī³āP\Ī0(N)
j(Ī³, Ī³0z)āk = j(Ī³0, z)
kā
Ī³āP\Ī0(N)
j(Ī³Ī³0, z)āk.
Since right multiplying P\Ī0(N) by Ī³ā10 simply permutes the cosets P\Ī0(N), we may
replace Ī³ by Ī³Ī³ā10 in the sum to see
Ek,N (Ī³0z) = j(Ī³0, z)kEk,N (z). (4.2.3)
Now letās deal with convergence. Since the series Ek,N for N > 1 is a sum over a strictlysmaller set of terms than the series for Ek = Ek,1, it suffices to prove absolute convergencefor Ek(z).
First suppose z lies in the standard fundamental domain F for PSL2(Z). Then
|cz + d|2 = (cz + d)(cz + d) = c2|z|2 + 2cdRe(z) + d2.
Since z ā F , |z| ā„ 1 and Re(z) ā¤ 12 so
|cz + d|2 ā„ c2 ā |cd|+ d2 =
|cĪ¶3 + d|2 cd > 0
|cĪ¶3 ā d|2 cd < 0.
Hence for z ā F ,
|Ek(z)| ā¤āc,dā„0
gcd(c,d)=1
1
|cĪ¶3 + d|k+
ācā„0, dā¤0
gcd(c,d)=1
1
|cĪ¶3 ā d|k= 2
āc,dā„0
gcd(c,d)=1
1
|cĪ¶3 + d|k.
Note that cĪ¶3 + d : c, d ā Z form the lattice in C generated by 1 and Ī¶3. Now we wantto solve the lattice point problem of estimating how many (c, d) ā Z2 there are such thatcĪ¶3 +d lies inside the open disc Dr of radius r centered at the origin, i.e., bound the numberof (c, d) such that
|cĪ¶3 + d| < r.
Observe that if |c| ā„ 2r, then |Im(cĪ¶3 + d)| = |Im(cĪ¶3)| = |c|ā
32 > r, i.e., cĪ¶3 + d 6ā Dr.
Similarly, looking at real parts show if |c| < 2r then we must have |d| < 2r if cĪ¶3 + d ā Dr.
CHAPTER 4. MODULAR FORMS 54
I.e., the number of (c, d) such that |cĪ¶3 + d| ā Dr is at most 4r2. In particular, the numberof (c, d) such that |cĪ¶3 + d| ā Dr+1 āDr is at most 4(r + 1)2, and we can bound
|Ek(z)| ā¤ā
(c,d)āZ2
(c,d)6=(0,0)
1
|cĪ¶3 + d|kā¤
ā(c,d)āZ2
rā¤|cĪ¶3+d|<r+1
1
rkā¤āār=1
4(r + 1)2
rk,
which converges for k > 3. This establishes absolute convergence on F .Then the formal identity (4.2.3) implies we have absolute convergence at any point in H.
Namely we can write any point of H as Ī³z for some Ī³ ā PSL2(Z) and z ā F . Thus (4.2.3)says |Ek(Ī³z)| = |j(Ī³, z)kEk(z)|.
Since the bound above in F is independent of z, i.e., Ek(z) is a bounded function onF , it also implies uniform convergence of Ek(z) and Ek,N (z) on F . For Ek(z) this impliesuniform convergence on compact sets of H by the transformation property of Ek(z) sincefor a fixed Ī³, j(Ī³, z) is uniformly bounded on compact sets. Because the tail of a seriesfor Ek,N (z) can be bounded by the tail of a series for Ek(z), this also implies Ek,N (z) isuniformly bounded on compact subsets.
The theory of normal families or normal convergence in complex analysis says that ifa series of analytic functions converges absolutely and uniformly on compact subsets, thelimit function is also analytic. This finishes the proposition.
For the future, let us record one more thing we got out of the above proof.
Corollary 4.2.7. Ek,N (z) is bounded on the standard fundamental domain for PSL2(Z).In particular, Ek,N (z) is bounded as z ā iā.
Proof. The first statement comes directly from the proof. On the other hand, if Imz > 1,then T jz ā F for some j. Therefore Ek,N (z) = j(T j , z)ākEk,N (T jz) = Ek,N (T jz) is alsobounded.
This says Ek(z) (as well as Ek,N (z)) should be āholomorphic at iā.ā Once we defineholomorphy at the cusps in the next section, we will see these Eisenstein series are holomor-phic at all cusps, making them holomorphic modular forms.
One interesting and elementary aspect of Eisenstein series is their Fourier expansion.Since Ek,N (z + 1) = Ek,N (z), it has a Fourier expansion
Ek,N (z) =āān=0
anqn, q = e2Ļiz.
Here the sum starts from n = 0 since Ek,N (z) is holomorphic at iā. The idea is to use thefollowing.
Lemma 4.2.8. (Lipschitzā formula) Let k ā„ 2 and z ā H. ThenānāZ
1
(z + n)k= Ck
āān=1
nkā1qn
where
Ck =(ā2Ļi)k
(k ā 1)!.
CHAPTER 4. MODULAR FORMS 55
Proof. Recall the product formula for sine:
sinĻz = Ļz
āān=1
(1ā z2
n2
).
Taking the logarithmic derivative of this gives
Ļ cotĻz =1
z+āān=1
(1
z + n+
1
z ā n
). (4.2.4)
(Writing the cotangent formula this way as opposed toā
nāZ1
z+n gives a series whichis absolutely convergent.) On the other hand, since sin(Ļz) = Im(Ļz) = eĻizāeāĻiz
2i andcos(Ļz) = Re(Ļz) = eĻiz+eāĻiz
2 , we have
Ļ cotĻz = ĻieĻiz + eāĻiz
eĻiz ā eāĻiz= Ļi
e2Ļiz + 1
e2Ļiz ā 1= Ļi
q + 1
q ā 1= Ļiā 2Ļi
1ā q= Ļiā 2Ļi
āān=0
qn.
Taking the (kā1)-st derivative of each of these expressions for Ļ cotĻz gives the lemma.
Comparing
Ek,N (z) =1
2
ā(c,d)āZ2
gcd(c,d)=1cā”0 mod N
1
(cz + d)k
with Lipschitzā formula, it would appear easier to compute the Fourier expansion for Ek,N (z)without the condition that gcd(c, d) = 1 in the above sum.
For simplicity, letās first consider the case of full level, i.e., N = 1. Define
Gk(z) =ā
(0,0)6=(c,d)āZ2
1
(cz + d)k(4.2.5)
=āān=1
ā(c,d)āZ2
gcd(c,d)=n
1
(cz + d)k
=
āān=1
1
nk
ā(c,d)āZ2
gcd(c,d)=1
1
(cz + d)k= 2Ī¶(k)Ek(z),
where Ī¶(s) =ā 1
ns for Re(s) > 1 is the Riemann zeta function. Since Gk(z) is just a scalarmultiple of Ek(z), we also call Gk(z) an (unnormalized) Eisenstein series. (In fact, in mostclassical treatments, one defines Gk(z) before Ek(z).)
CHAPTER 4. MODULAR FORMS 56
Now applying the Lipschitz formula, we have
Gk(z) =ā
(0,0) 6=(c,d)āZ2
1
(cz + d)k
=ād 6=0
1
dk+ 2
āāc=1
ādāZ
1
(cz + d)k
= 2Ī¶(k) + 2Ck
āāc=1
āān=1
nkā1e2Ļiczn
= 2Ī¶(k) + 2Ck
āāc=1
āān=1
nkā1qcn.
Then what is the coefficient of a given qm? There is a contribution from nkā1qcn whenevercn = m, hence it is 2CkĻkā1(m) where Ļk(m) is the classical divisor function
Ļk(m) =ād|m
dk.
This establishes the following Fourier expansion.
Proposition 4.2.9. Let k ā„ 4 be even. Then we have the following Fourier expansion forthe Eisenstein series of weight k.
Gk(z) = 2Ī¶(k) + 2Ck
āān=1
Ļkā1(n)qn.
In terms of the normalized Eisenstein series, we can write this as
Ek(z) = 1ā 2k
Bk
āān=1
Ļkā1(n)qn,
where Bk is the k-th Bernoulli number.
We recall the Bernoulli numbers can be defined as the coefficients in the expansion
x
ex ā 1=āāk=0
Bkxk
k!.
Alternatively we can define them recursively by B0 = 1 and
Bk = ākā1āj=0
(kj
)Bj
k ā j + 1.
Exercise 4.2.10. Check these two definitions of Bk are equivalent.
CHAPTER 4. MODULAR FORMS 57
Proof. We already derived the expansion for Gk(z). Dividing by 2Ī¶(k) and using Eulerāsformula
Ī¶(k) = āCkBk2k
gives the expansion for Ek(z).
The fact that the Fourier coefficients of Eisenstein series are divisor functionsāobjectsof study in elementary number theoryāis our first real evidence that modular forms andfunctions are important in number theory. Later, we can use the theory of modular forms toprove results about these divisor functions, as well as relate them to problems in quadraticforms, as we discussed in the introduction.
At this point we can say why the Ek are called normalized Eisenstein seriesāthey arenormalized so that their leading Fourier coefficient is 1.1 Since z = iā corresponds to q = 0,the 0-th Fourier coefficient gives us the āvalueā of a periodic function at iā, i.e., Ek(iā) = 1.
Example 4.2.11.
E4(z) = 1 + 240
āān=1
Ļ3(n)qn.
E6(z) = 1ā 504āān=1
Ļ5(n)qn.
E8(z) = 1 + 480āān=1
Ļ7(n)qn.
Exercise 4.2.12. From the first few examples, it looks like the Fourier coefficients for Ekmight always be integers. This is not true (though they are clearly rational from Proposition4.2.9). Find the first even k such that 2k
Bk6ā Z.
To come back to the original question we asked as motivationāare derivatives of mod-ular functions easier to construct directly than modular functions themselves?āweāve onlyconstructed holomorphic things which behave like derivatives of modular functions. Sincenon-constant modular functions are non-holomorphic, their derivatives are non-holomorphicand cannot be things like these Eisenstein series. However, we can still use these Eisensteinseries (or more generally holomorphic modular forms, which we define below) to constructmodular forms by taking quotients! (Of course, all this was just motivation for the funnytransformation law in the definition of weak modular formsāweāre really interested in mod-ular forms in this course, not modular functions.) Here is an example that is important inthe theory of elliptic curves.
*Exercise 4.2.13. We define the j-invariant to be
j(z) = 1728E4(z)3
E4(z)3 ā E6(z)2.
Use the Fourier expansions for Ek to show j is a nonconstant modular function for PSL2(Z).1One can instead normalize so that a1 = 1. This will have the benefit that the n-th Fourier coefficient is
just Ļkā1(n), so the anās with n ā„ 1 are multiplicative. We will consider this other normalization in (6.0.1)and again in Chapter 8.
CHAPTER 4. MODULAR FORMS 58
Since isomorphism classes of (generalized) elliptic curves are parametrized by X0(1) =PSL2(Z)\H, this gives a (nontrivial) invariant for elliptic curves which varies smoothly inthe parameters defining the curve.
Remark. While it is obvious that Ek cannot converge if k = 0, it is natural to ask whathappens if k = 2. The sum does not converge absolutely, but if we specify the order ofsummation, we can get a convergent function.
Exercise 4.2.14. Define
E2(z) =1
2Ī¶(2)
āāc=āā
( āāā²
d=āā
1
(cz + d)2
),
where the prime on the inner sum means we omit the term d = 0 when c = 0.(a) Using Lipshitzā formula on the inner sum first, show the resulting expression for
E2(z) converges absolutely.(b) Show
E2(z) = 1ā 24āān=1
Ļ1(n)qn.
However E2(z) is not quite modular. It can be shown (e.g., [Kob93, p. 113]) that
E2(ā1/z) = z2E2(z) +12
2Ļiz.
Now let us consider the case of higher level. The naive generalization to level N wouldbe to consider
Gk,N (z) =ā
(0,0)6=(c,d)āZ2
cā”0 mod N
1
(cz + d)k=āān=1
ā(c,d)āZ2
cā”0 mod Ngcd(c,d)=n
1
(cz + d)k, (4.2.6)
but we canāt write this in terms of Ek,N (z) as simply as we did when N = 1. The reasonis that if we rewrite the inner sum on the right as gcd(c, d) = 1 and factor out a nāk, webreak the condition c ā” 0 mod N if gcd(n,N) 6= 1.
Letās see what happens when N = p is prime. Then
Gk,p(z) =ā
(0,0)6=(c,d)āZ2
cā”0 mod p
1
(cz + d)k
=ā
(c,d)āZ2
cā”0 mod pgcd(d,p)=1
1
(cz + d)k+
ā(0,0) 6=(c,d)āZ2
cā”dā”0 mod p
1
(cz + d)k
= Gāk,p(z) +1
pkGk(z),
CHAPTER 4. MODULAR FORMS 59
where
Gāk,p(z) =ā
(c,d)āZ2
cā”0 mod pgcd(d,p)=1
1
(cz + d)k
=ān>0
gcd(n,p)=1
ā(c,d)āZ2
cā”0 mod pgcd(c,d)=n
1
(cz + d)k
=ān>0
gcd(n,p)=1
1
nk
ā(c,d)āZ2
cā”0 mod pgcd(c,d)=1
1
(cz + d)k
= 2L(k, 1p)Ek,p(z)
and L(s, 1p) is the Dirichlet series associated to the trivial character mod p, i.e., for Re(s) >1,
L(s, 1p) =ān>0
gcd(n,p)=1
1
ns=
āān=1
1
nsāāān=1
1
(pn)s=
(1ā 1
ps
)Ī¶(s).
In summary, we have
Ek,p(z) =
(1ā 1
pk
)ā1 1
2Ī¶(k)Gāk,p(z)
andGāk,p(z) = Gk,p(z)ā
1
pkGk(z).
Since we already know the Fourier expansion for Gk(z), it suffices to determine the Fourierexpansion for Gk,p(z). This is similar to the expansion for Gk(z):
Gk,p(z) =ā
(0,0) 6=(c,d)āZ2
cā”0 mod p
1
(cz + d)k
=ād 6=0
1
dk+ 2
āāc=1
ādāZ
1
(cpz + d)k
= 2Ī¶(k) + 2Ck
āāc=1
āān=1
nkā1e2Ļicpzn
= 2Ī¶(k) + 2Ck
āāc=1
āān=1
nkā1qcpn
= 2Ī¶(k) + 2Ck
āān=1
Ļkā1(n)qpn = Gk(pz),
Here the last equality follows from Proposition 4.2.9 together with the observation that iff(z) has Fourier expansion
āanq
n, then f(pz) has Fourier expansion anqpn.
CHAPTER 4. MODULAR FORMS 60
Hence the above relations between Ek,p, Gāk,p, Gk,p, Gk and Ek allow us to write Ek,p interms of Ek:
Ek,p(z)
(1ā 1
pk
)ā1(Ek(pz)ā
1
pkEk(z)
). (4.2.7)
The above calculations also give the following Fourier expansions:
Proposition 4.2.15. Let N = p be prime, and k ā„ 4 even. Then
Gāk,p(z) = 2
(1ā 1
pk
)Ī¶(k) + 2Ck
āān=1
(Ļkā1(n/p)ā Ļkā1(n)
pk
)qn
and
Ek,p(z) = 1ā 2k
Bk
(1ā 1
pk
)ā1 āān=1
(Ļkā1(n/p)ā Ļkā1(n)
pk
)qn.
(Here Ļkā1(n/p) = 0 if n/p 6ā Z.)
*Exercise 4.2.16. Work out the Fourier expansion for Ek,N (z) when N = 4 and k ā„ 4even. Specifically, show
Gk,4(z) = Gāk,4(z) +1
2kGk,2(z)
whereGāk,4(z) =
ācā”0 mod 4d odd
1
(cz + d)k= 2
(1ā 1
2k
)Ī¶(k)Ek,4(z).
Deduce that
Ek,4(z) = 1ā 2k
Bk
(1ā 1
2k
)ā1 āān=1
(Ļkā1(n/4)ā 1
2kĻkā1(n/2)
)qn. (4.2.8)
Comparing Fourier expansions, conclude
Ek,4(z) = Ek,2(2z).
To treat Eisenstein series of arbitrary level N , we need to break up the inner sum inthe right of (4.2.6) into various pieces depending upon gcd(n,N). The trick is to use theMƶbius function Āµ(n), i.e., Āµ(n) = ā1 (resp. 1) if n is squarefree and has an odd (resp.even) number of prime factors, and Āµ(n) = 0 if it is not squarefree. This is an importanttool in the study of multiplicative functions2 because of Mƶbius inversion: if f and g aremultiplicative functions,
g(n) =ād|n
f(d)
impliesf(n) =
ād|n
Āµ(d)g(n/d).
2Recall f : Nā C is multiplicative if f(mn) = f(m)f(n) whenever gcd(m,n) = 1.
CHAPTER 4. MODULAR FORMS 61
(This is like an arithmetic Fourier transform.)For the sake of time and simplicity, we will not work out the Fourier expansions for
arbitrary N , but simply state them and refer to [Sch74] (see also [Boy01], [DS05], [Kob93])for details. For k ā„ 3, let
Ļk,N (n) =ād|n
Āµ(N/ gcd(d,N))
Ļ(N/ gcd(d,N))dk.
Then
Ek,N (z) = 1ā 2kĻ(N)
NkBk
āp|N
(1ā 1
pk
)ā1 āān=1
Ļkā1,N (n)qn. (4.2.9)
Exercise 4.2.17. Check (4.2.9) agrees with Proposition 4.2.15 when N is prime.
Exercise 4.2.18. Using the identity E2(ā1/z) = z2E2(z)+ 122Ļiz mentioned in Exercise 4.2.14,
show that for N > 1 prime, E2,N (z) := E2(z)āNE2(Nz) is a weak modular form of weight2 and level N . Determine the Fourier expansion of E2,N (z). (It is not necessary that N beprime here, but we will define E2,N differently when N is not prime in Chapter 8.)
We remark that for level N > 1 (or general Ī), one can construct a different Eisensteinseries for each cusp of Ī0(N) (all holomorphic if k ā„ 4). Weāll discuss this some in thenext chapter, but here is what it means in prime level. Recall from Lemma 3.5.11 thatĪ0(p) has two cusps. This means means there should be a 2-dimensional space of Eisensteinseries associated to Ī0(p). This space is generated by Ek and Ek,p (or by (4.2.7), Ek(z) andEk(pz))ācf. Example 4.3.10 below.
4.3 Modular forms
It is finally time for us to get to the full definition of modular forms. What we need to do isrefine the notion of weak modularity to include a condition of meromorphy or holomorphyat the cusps.
First letās think about the Eisenstein series Ek (k ā„ 4 even) of full level defined in theprevious section, which we said should be modular forms. In this case, the relevant group oftransformations is Ī0(1) = PSL2(Z) and there is only one cusp for X0(1). In Corollary 4.2.7,we saw that Ek is bounded as z ā iā, which means that Ek should be āholomorphic atiā.ā Or, thinking in terms of Fourier expansions, we saw Ek(z) = 1ā 2k
Bk
āān=1 Ļkā1(n)qn,
so Ek can be thought of as having the value 1 at z = iā (ā q = 0).On the other hand, any rational number also represents the cusp for X0(1). What
happens to Ek(z) as z tends to an element of Q? The weak modularity condition actuallyforces Ek(z) to be unbounded as z ā H approaches a rational number z0. To see this, say
Ī³ Ā· iā = z0 and write Ī³ =
(a bc d
). Then
limzāz0
Ek(z) = limzāiā
Ek(Ī³z) = limzāiā
(cz + d)kEk(z) = (c Ā· iā+ d)k Ā· 1 =ā,
CHAPTER 4. MODULAR FORMS 62
since one cannot have c 6= 0 if z0 ā Q. In other words, while Ek is āholomorphicā at iā, theweak modularity property (when k 6= 0) forces it to have a āpoleā (which should be of orderk) at all z0 ā Q.
While it perhaps makes sense to say that Ek(z) is meromorphic at the cusp for PSL2(Z)(it has at most a pole of finite order at each element of iā āŖ Q), it is not clear whetherwe should think of it as being holomorphic at the cusp. In fact, from what I said above,one might be inclined not to. However, Ek(z) does have a Fourier expansion with nonegative terms, and it is as close as possible to being holomorphic at the cusp given theweak modularity transformation property.
Bearing this in mind, one defines holomorphy at a cusp as follows. Similar to the |Ī³operator we defined in Section 4.1, we can deal with the automorphy factor by consideringthe weight k slash operator |Ī³,k, defined for Ī³ ā PSL2(Z) and k ā 2Zā„0, which sends anyf : Hā C to
f |Ī³,k(z) = j(Ī³, z)ākf(Ī³z) = (cz + d)ākf(Ī³z), (4.3.1)
where Ī³ =
(a bc d
). Note |Ī³,0 is precisely the operator |Ī³ from Section 4.1. Observe that
the weight k weak modular transformation property, f(Ī³z) = j(Ī³, z)f(z) = (cz + d)kf(z)for Ī³ ā Ī, is equivalent to the statement
f |Ī³,k(z) = f(z), Ī³ ā Ī. (4.3.2)
In other words, a meromorphic f : H ā C is a weak modular form of weight k if and onlyif (4.3.2) holds.
Consequently, while Ek(z) is not holomorphic at any z0 ā Q, it is true that Ek|Ī³,k(z) isholomorphic at iā for each Ī³ ā PSL2(Z). We will call this being holomorphic at the cusp.Or, more generally, we make the following definition.
Recall if f is a meromorphic, periodic function on H with a period N ā R, f has aFourier expansion f(z) =
ānāZ anq
nN where qN = e2Ļiz/N . In Section 4.1, we said such
an f is meromorphic at at iā if an = 0 for all but finitely many n < 0, or alternatively,there exists m such that |f(z)| < emy for y = Im(z) 0. Similarly, we say such an f isholomorphic at iā if an = 0 for all n < 0, or equivalently, if f(z) is bounded for y 0.
Definition 4.3.1. Let f : Hā C be a weak modular form of weight k for Ī ā PSL2(Z). Wesay f is meromorphic (resp. holomorphic) at the cusps if f |Ļ,k is meromorphic (resp.holomorphic) at iā for each Ļ ā PSL2(Z).
To see this definition makes sense, we need to know that f |Ļ,k is also periodic with realperiod.
Exercise 4.3.2. Let Ī ā Ī(N) be a congruence subgroup of PSL2(Z) and suppose f isweakly modular of weight k for Ī. Show that f |Ļ,k(z +N) = f |Ļ,k(z) for all Ļ ā PSL2(Z).
We would also like to know that to check meromorphy/holomorphy at the cusps, itsuffices to check it for a single representative of each cusp. This follows from the followinggeneralization of Lemma 4.1.5.
CHAPTER 4. MODULAR FORMS 63
For Ļ ā PSL2(Z) and f a weak modular form of weight k for Ī ā Ī(N), write the Fourierexpansion (at iā) of f |Ļ,k as
f |Ļ,k(z) =ānāZ
aĻ,nqnN , qN = e2Ļiz/N .
We call the aĻ,nās the Fourier coefficients for f with respect to Ļ . This is perhaps aslight misuse of notation, as, unlike in the case of modular functions, this does not actuallygive us a series expansion for f(z) in qĻ = e2ĻiĻā1z/N , but rather just an expansion forf |Ļ,k(z), or if one wishes, j(Ļ, z)ākf(z). Indeed, f(Ļz) will not generally have a real period,so there will not typically be a series expansion for f in terms of powers of qĻ .
Exercise 4.3.3. Suppose a meromorphic f : H ā C is weakly modular of weight k for acongruence subgroup Ī ā Ī(N). Let z0, z
ā²0 ā iā āŖ Q represent the same cusp for Ī, i.e.,
zā²0 = Ī³z0 for some Ī³ ā Ī. Now suppose Ļ, Ļ ā² ā PSL2(Z) such that Ļz0 = Ļ ā²z0 = iā.(i) Show f |Ļ,k(z) = f |Ļ ā²,k(z + j) for some j ā Z.(ii) Deduce the Fourier coefficients with respect to Ļ and Ļ ā² are related by
aĻ ā²,n = aĻ,ne2Ļij/N
for all n; in particular |aĻ ā²,n| = |aĻ,n| for all n.
Both of these exercises are really no more than verifying the proofs of Lemma 4.1.4 andLemma 4.1.5 go through in the setting k > 0.
Definition 4.3.4. Let f : Hā C be meromorphic, k even, and Ī a congruence subgroup ofPSL2(Z). We say f is a meromorphic modular form of weight k for Ī if
(i) f is weakly modular of weight k for Ī, i.e., f |Ļ,k(z) = f(z) for all Ļ ā Ī; and(ii) f is meromorphic at the cusps.
Definition 4.3.5. Let f : Hā C be meromorphic, k even, and Ī a congruence subgroup ofPSL2(Z). We say f is a (holomorphic) modular form of weight k for Ī if
(i) f is weakly modular of weight k for Ī, i.e., f |Ļ,k(z) = f(z) for all Ļ ā Ī; and(ii) f is holomorphic at the cusps.
The space of modular forms of weight k for Ī is denoted Mk(Ī).
In other words, if we just say āmodular formā we mean holomorphic modular form. Thisis (fairly) standard terminology (an older term you may sometimes see is āentire modularformā), though terminology for meromorphic modular forms varies and is less standardized.For example, they are called modular functions (even though they are not functions on Ī\H!)by [Kob93] and automorphic forms (even though this does not agree with the standard usageof the term!) by [DS05].
As we have mentioned earlier, the modular forms we will be most interested in aremodular forms for the modular groups Ī0(N). Unless otherwise specified, we will assumethe weight k ā„ 2 is even, so that nonzero modular forms exist on Ī0(N), and even non-constant ones provided N > 1 if k = 2 (see below).
Definition 4.3.6. If f is a modular form of weight k for Ī0(N), we say f is a modularform of weight k and level N . The space of such forms is denoted Mk(N).
CHAPTER 4. MODULAR FORMS 64
In other words, the notation Mk(N) just means Mk(Ī0(N)).
Example 4.3.7. For Ī a congruence subgroup, any constant function lies in M0(Ī).
Note the level of a modular form is not uniqueāthe above example says a constantfunction is level N for any N . More generally, if M |N , then Ī0(M) ā Ī0(N) so it is clearfrom the definition that Mk(M) ā Mk(N). (In fact, we will see later that there are otherembeddings besides the obvious one.) Yet more generally, if Ī ā Īā², then Mk(Ī
ā²) āMk(Ī).
Proposition 4.3.8. For k ā„ 4 even, the Eisenstein series Ek,N is a modular form of weightk and level N .
We remark this is also true for k = 2 and N > 1.
Proof. By Proposition 4.2.6, we know Ek,N is weakly modular of weight k and level N , andholomorphic on H. Hence we just need to show Ek,N is holomorphic at the cusps. (In fact,we already know from Section 4.2 that it is holomorphic at the cusp given by iā, and thatEk(iā) = Ek,p(iā) = 1.)
Let Ļ =
(r st u
)ā PSL2(Z) and consider
Ek,N |Ļ,k(z) =1
2
1
(tz + u)k
ā(c,d)āZ2
gcd(c,d)=1cā”0 mod N
1(c rz+stz+u + d
)k=
1
2
ā(c,d)āZ2
gcd(c,d)=1cā”0 mod N
1
(c(rz + s) + d(tz + u))k
=1
2
ā(c,d)āZ2
gcd(c,d)=1cā”0 mod N
1
((cr + dt)z + (cs+ du))k
=1
2
ā(c,d)āZ2
gcd(c,d)=1cā”0 mod N
1
(cā²z + dā²)k,
where in the last sum we have put cā² = cr + dt and dā² = cs + du. Note we can viewT Ļ =
(r ts u
)as an element of GL2(Z) acting on Z Ć Z, and T Ļ
(cd
)=
(cā²
dā²
). Hence the
final sum runs over a certain subset of pairs (cā², dā²) ā ZĆ Zā (0, 0). Thus
|Ek,N |Ļ,k(z)| ā¤1
2
ā(cā²,dā²)āZ2ā(0,0)
1
|cā²z + dā²|k.
However, we already showed this sum on the right is bounded in the proof of Proposition4.2.6.
CHAPTER 4. MODULAR FORMS 65
Proposition 4.3.9. For Ī a congruence subgroup and k ā„ 0 even, Mk(Ī) is a complexvector space.
Proof. We show Mk(Ī) is a subspace of the space of holomorphic functions on H.Suppose f ā Mk(Ī) and c ā C. Then it is clear that cf ā Mk(Ī). Similarly, if
f, g ā Mk(Ī), then f + g clearly satisfies the weak modularity condition (cf. Lemma 4.2.2,and f |Ļ,k, g|Ļ,k being bounded at iā for each Ļ ā PSL2(Z) implies (f + g)|Ļ,k = f |Ļ,k + g|Ļ,kis also bounded at iā, i.e., f + g is also holomorphic at the cusps.
Example 4.3.10. From Proposition 4.3.8 and the observation that Mk(1) ā Mk(p), wehave Ek(z), Ek,p(z) āMk(p). Hence by (4.2.7) we also have Ek(pz) āMk(p). The subspaceof Mk(p) generated by two of these Eisenstein series is called the Eisenstein subspace ofMk(p). We will see later that for k small the Eisenstein subspace makes up all of Mk(p) butfor k large there will be other forms in Mk(p) due to the existence of cusp forms.
There are two main reasons we work with holomorphic forms, rather than meromorphicforms. The first is that restricting to holomorphic forms, the spaces Mk(Ī) are finite-dimensional vector spaces. (This is also why one needs the condition of being holomorphicat the cusps.) One of the main applications of modular forms, say to the theory of quadraticforms as presented in the introduction, is to construct a modular form in two different waysand show that the constructions are equal. Knowing thatMk(Ī) is finite dimensional meansthat one can check two modular forms are identical by checking that a finite number of theirFourier coefficients agree. We will explore these ideas some in the rest of this chapter, andexplain this precisely in the next chapter.
One simple but important way to construct modular forms is the following.
Lemma 4.3.11. Let Ī be a congruence subgroup, and f ā Mk(Ī), g ā Ml(Ī). Thenfg āMk+l(Ī).
Proof. We already observed that f + g is weakly modular of weight k + l in Lemma 4.2.2.Then, as in the previous proof, if f |Ļ, k and g|Ļ,l are bounded at iā, we see (fg)|Ļ,k+l =(f |Ļ,k) (g|Ļ,l) is also bounded at iā, i.e., fg is also holomorphic at the cusps.
The second reason to restrict our study to holomorphic modular forms is that meromor-phic modular forms can be simply constructed as quotients of holomorphic modular forms(cf. exercise below), and, at least on the full modular group (cf. [FB09, Exercise VI.3.5]), allmeromorphic modular forms arise this way. (If you find a reference which discusses this forother congruence subgroups, or has the complete proof for PSL2(Z), please let me know.)If one restricts to meromorphic modular forms of weight 0 for Ī, i.e., modular functions onĪ\H, then basic function theory for compact Riemann surfaces implies any modular functionis a quotient of two elements of Mk(Ī) for some k. We will prove something more precise inthe case of Ī = PSL2(Z) later.
Exercise 4.3.12. Let Ī be a congruence subgroup, f āMk(Ī) and g āMl(Ī). Show fg is a
meromorphic modular form of weight k ā l provided g 6ā” 0.
Now letās look at an example of the above lemma.
CHAPTER 4. MODULAR FORMS 66
Example 4.3.13. Consider the Eisenstein series
E4(z) = 1 + 240āān=1
Ļ3(n)qn āM4(1).
Then
E24(z) = 1 + 480
āām=1
Ļ3(m)qm + (240)2āā
m,n=1
Ļ3(m)Ļ3(n)qm+n
= 1 + 480
āān=1
(Ļ3(n) + 120
nā1ām=1
Ļ3(m)Ļ3(nām)
)qn āM8(1).
On the other hand,
E8(z) = 1 + 480
āān=1
Ļ7(n)qn āM8(1).
Computing the first few Fourier coefficients of E24 ,
E4(z)2 = 1 + 480(q + 129q2 + 2188q3 + Ā· Ā· Ā·
),
we see the first several Fourier coefficients of E24 match with those of E8. This suggests
E24 = E8,
which implies the sum-of-divisors identity
Ļ7(n) = Ļ3(n) + 120
nā1ām=1
Ļ3(m)Ļ3(nām).
In particular, this would mean
Ļ7(n) ā” Ļ3(n) mod 120.
In the next chapter, we will see thatM8(1) is 1 dimensional, so just knowing the 0-th Fouriercoefficients (the āconstant termsā) match is enough to deduce E2
4 = E8. While one can provethis and similar sum-of-divisors identities with just elementary number theory, it is not atall simple.
4.4 Theta series
The motivation we presented in the introduction for studying modular forms is the applica-tion to the theory of quadratic forms, though we have also seen that they arise in connectionwith the function theory of the modular curves X0(N). (The reader may want to reread theintroduction at this point.) The passage from quadratic forms to modular forms comes viatheta series.
Jacobi seems to have begun the study theta functions in connection with elliptic func-tions. Specifically, in 1829, he obtained an expression for the Weierstrass ā-functions in
CHAPTER 4. MODULAR FORMS 67
terms of theta functions. This has become important in mathematical physics. However,our interest in theta functions will be in relation to modular and quadratic forms.
Recall from the introduction, Jacobiās theta function
Ļ(z) =āā
n=āāqn
2, (4.4.1)
where, as usual, q = e2Ļiz. Thinking of this as a power series in the parameter q, it isclear this series converges absolutely and uniformly on compact sets in H, hence it is ameromorphic function on H with period 1 (w.r.t. z). Then
Ļk(z) =kāi=1
( āāni=āā
qn2i
)=
ān1,...,nkāZ
qn21+n2
2+Ā·Ā·Ā·+n2k =
ānā„0
rk(n)qn, (4.4.2)
where rk(n) is the number of ways to write n as a sum of k squares, i.e.,
rk(n) = #
(x1, . . . , xk) ā Zk : x21 + x2
2 + Ā· Ā· Ā·+ x2k = n
.
In other words, the Fourier coefficients of Ļk tell us the number of ways n can be written asa sum of k squares. We would like to say this is a modular form.
Proposition 4.4.1. Jacobiās theta function satisfies the transformation laws
Ļ(z + 1) = Ļ(z), Ļ
(ā1
4z
)=
ā2z
iĻ(z). (4.4.3)
Proof. The first equation is obvious from the definition as q is invariant under z 7ā z + 1.The idea to prove the second equation is to use Poisson summation, which says for any
Schwartz function (i.e., rapidly decreasing function) f : Rā R, we haveānāZ
f(n) =ānāZ
f(n),
wheref(t) =
ā« āāā
e2Ļistf(s)ds
is the Fourier transform of f . (See any introduction to Fourier analysis for a proof of Poissonsummation, which is a simple consequence of Fourier inversion.)
Note that we can write (for fixed z ā H)
Ļ(z) =ānāZ
f(n)
wheref(t) = e2Ļizt2 .
Now we will assume z = iy with y > 0, so that f(t) = eā2Ļyt2 is Schwartz.
CHAPTER 4. MODULAR FORMS 68
We compute
f(t) =
ā« āāā
e2Ļi(st+s2z)ds.
Completing the square, we see
s2z + st = z
(s+
t
2z
)2
ā t2
4z
Let u = (s+ t/2z). Then
f(t) = eāĻit2/2z
ā« āāā
e2Ļizu2du = eĻt2/2y
ā« āāā
eā2Ļyu2du =
ā1
2yeĻt
2/2y.
Here the last equality follows from the substitution v =ā
2yu, which reduces the problemto the well known Gaussian integral ā« ā
āāeāĻv
2dv = 1.
Now applying Poisson summation proves Ļ(ā1
4z
)=ā
2zi Ļ(z) when z = iy. However,
since both sides are holomorphic functions, knowing they agree on a set with an accumulationpoint implies they agree everywhere.
Lemma 4.4.2. Ī0(4) is generated by T and T =
(1 04 1
).
Proof. Suppose Ī³ =
(a bc d
)ā Ī0(4). Note
Ī³T k =
(a b+ kac d+ kc
).
Since gcd(a, c) = 1 but c ā” 0 mod 4, we know a is odd. In particular a 6= 0, so by replacingĪ³ by some Ī³T k, we may assume |b| ā¤ |a|2 . Further, a odd means |b| 6= |a|
2 , i.e., |b| < |a|2 .
On the other hand,
Ī³T k =
(a+ 4kb bc+ 4kd d
).
If |b| 6= 0, note |a + 4kb| < 2|b| for some k ā Z, so we upon replacing Ī³ by Ī³T k, we mayassume |a| < 2b.
Continuing in this manner of replacing Ī³ by elements of the form Ī³T k and Ī³T k alter-nately, we eventually reduce to the situation b = 0 by Fermat descent. (Observe in theabove replacements, when we reduce |b| we do not change a, and when we reduce |a| we donot change b, so this process terminates in a finite number of steps.)
But if b = 0, by the determinant condition and the definition of Ī0(4), we know that (upto Ā±1) Ī³ must be of the form (
1 04k 1
)= T k.
CHAPTER 4. MODULAR FORMS 69
The following exercise is a simple consequence of Lemma 4.2.3.
Exercise 4.4.3. Suppose Ī = ćĪ³1, . . . , Ī³rć and let f : Hā C be meromorphic. Show that if
f(Ī³iz) = j(Ī³i, z)kf(z)
for i = 1, . . . , r, then f is weakly modular of weight k for Ī.
Proposition 4.4.4. Let k ā N such that k ā” 0 mod 4. Then Ļk āMk/2(4).
Proof. We know Ļk is holomorphic on H because Ļ is. By the above lemma and exercise, toshow Ļk is weakly modular of weight k
2 for Ī0(4), it suffices to show
Ļk(z + 1) = Ļk(Tz) = j(T, z)k/2Ļk(z) = Ļk(z)
andĻk(
z
4z + 1
)= Ļk(T z) = j(T , z)k/2Ļk(z) = (4z + 1)k/2Ļk(z).
The former is clear, since Ļ(z + 1) = Ļ(z). To see the second, put w = z4z+1 and observe
the second part of (4.4.3) implies
Ļk(
z
4z + 1
)= Ļk(w) =
(i
2w
)k/2Ļk(ā1
4w
)=
(i
2w
)k/2Ļk(ā1ā 1
4z
)=
(i
2w
)k/2Ļk(ā1
4z
)=
(i
2w
)k/2(2z
i
)k/2Ļk(z)
= (4z + 1)k/2Ļk(z).
Thus it remains to show Ļk is holomorphic at the cusps. By Corollary 3.4.4, we can takefor a set of representatives of PSL2(Z)/Ī0(4) the elements
I, S, STS =
(1 0ā1 1
), T 2S =
(2 ā11 0
).
We know Ļ is holomorphic at iā by its Fourier expansion, so we need to show f |Ī³,k/2 isholomorphic at iā for Ī³ = S, STS and TS. Note by (4.4.3) we have
Ļk|S,k/2(z) =
(1
z
)k/2Ļk(ā1
z
)=
(1
z
)k/2 ( z2i
)k/2Ļk(z
4
)=
(1
2i
)k/2Ļk(z
4
),
which is holomorphic at iā since Ļk(z) is. We leave the cases of Ī³ = STS and Ī³ = TS asan exercise.
CHAPTER 4. MODULAR FORMS 70
Exercise 4.4.5. Complete the proof of the above proposition by showing that Ļk|Ī³,k/2 isholomorphic at iā for Ī³ = STS and Ī³ = T 2S. (Here k ā N with k ā” 0 mod 4.)
One might like to say that Ļk is a modular form of odd weight when k ā” 2 mod 4, and amodular form of half-integral weight when k is odd. These notions can be made precise, butthe transformation group can no longer be Ī0(4) (see parenthetical remarks after Definition4.2.1). We will not treat the theory of modular forms of odd or half-integral weights in thiscourse (see, e.g., [Kob93]), but leave the case of Ļk when k ā” 2 mod 4 as an exercise.
Exercise 4.4.6. Let Ī be a finite index subgroup of SL2(Z). Then for Ī³ ā Ī, j(Ī³, z) is welldefined (not just defined up to Ā±1).
Let k ā Z. We say a holomorphic function f : H ā C is a modular form of weight kfor Ī, if f |Ī³,k(z) = f(z) for all Ī³ ā Ī and z ā H, and f |Ļ,k(z) is holomorphic at iā for allĻ ā SL2(Z).
(a) Suppose āI ā Ī. Show there are no nonzero modular forms of odd weight for Ī.(b) Consider
Ī =
(a bc d
)ā”(
1 b0 1
)mod 4
ā SL2(Z),
i.e., Ī = Ī1(4) viewed as a subgroup of SL2(Z) (not PSL2(Z)). For any k ā„ 2 even, showĻk is a modular form of weight k/2 for Ī.
Example 4.4.7. From the above proposition, we know
Ļ8(z) = 1 + 16q + 112q2 + 448q3 + Ā· Ā· Ā· āM4(Ī0(4)).
(There is 1 way to write 0 as a sum of 4 squares, 16 ways to write 1 = (Ā±1)2+02+02+Ā· Ā· Ā·+02
as a sum of 8 squares counting symmetries, and so on.)From Exercise 4.2.16, we know
E4,4(z) = 1 + 16
āān=1
(16Ļ3(n/4)ā Ļ3(n/2)) qn = 1ā 16q2 + 112q4 ā Ā· Ā· Ā· āM4(4).
We also have
E4,2(z) = 1+16āān=1
(16Ļ3(n/2)ā Ļ3(n)) qn = 1ā16q+112q2ā448q3 + Ā· Ā· Ā· āM4(2) āM4(4)
and
E4(z) = 1 + 240
āān=1
Ļ3(n)qn = 1 + 240(q + 9q2 + 28q3 + Ā· Ā· Ā·
)āM4(1) āM4(4).
Assuming these three Eisenstein series give a basis for M4(4) (they do, as we will see in thenext chapter), so a little linear algebra on the first three coefficients of the Fourier expansionsshows
Ļ8(z) =1
16E4(z)ā 1
16E4,2(z) + E4,4(z).
Comparing Fourier expansions shows
r8(n) = 16 (Ļ3(n)ā 2Ļ3(n/2) + 16Ļ3(n/4)) .
CHAPTER 4. MODULAR FORMS 71
Besides completing a proof of the above result in the next chapter, we will study r2k(n)for other values of k later in the course.
Another classical example of a theta series comes from asking about representing numbersas sums of triangular numbers. Recall the n-th triangular number is n(n+1)
2 , which is thenumber of entries in the first n rows of Pascalās triangle. Let Ī“k(n) be the number of waysn is a sum of k triangular numbers, i.e.,
Ī“k(n) = #
(x1, Ā· Ā· Ā· , xk) ā Zkā„0 :
x1(x1 + 1)
2+x2(x2 + 1)
2+ Ā· Ā· Ā·+ xk(xk + 1)
2= n.
Let
Ļ(z) = q1/8āān=0
qn(n+1)
2 .
Then
Ļk(z) = qk/8āān=0
Ī“k(n)qn.
Proposition 4.4.8. Suppose k ā” 0 mod 8. Then Ļk(z) āMk/2(4).
In the interest of time, we will not prove this now, but may give a proof at a later time.However, the point for now is it allows one to do the following exercise.
*Exercise 4.4.9. (i) Compute Ī“8(n) for n = 0, 1, 2, 3.(ii) Assuming E4, E4,2 and E4,4 is a basis for M4(4), determine a, b, c ā Q such that
Ļ8(z) = aE4 + bE4,2 + cE4,4.
(iii) Using (ii), show Ī“8(n) = Ļ]3(n+ 1) where
Ļ]m(n) =ād|n
n/d odd
dm.
We remark that for a general positive definite quadratic form Q in r variables over Z,one can associate the theta series
ĪQ(z) =āān=0
rQ(n)qn,
where rQ(n) denotes the number of solutions to Q(x1, . . . , xr) = n in Zr. One can write
Q(x1, . . . , xr) =1
2
(x1 x2 Ā· Ā· Ā· xr
)A
x1
x2...xr
,
where A ā MrĆr(Z) is symmetric with detA > 0, and the diagonal entries of A are even.Let N ā N be minimal such that NAā1 ā MrĆr(Z) with even diagonal entries. Then onecan show ĪQ(z) āMk/2(Ī1(N)).
CHAPTER 4. MODULAR FORMS 72
4.5 Ī· and ā
About half a century after Jacobiās began the theory of theta functions, Dedekind introduceda similar kind of function, which has played an important role in the theory of modular forms.
Definition 4.5.1. The Dedekind eta function Ī· : Hā C is given by
Ī·(z) = q1/24āān=1
(1ā qn) .
The factor q1/24 may seem curious at first, as with the factor q1/8 in the definition of Ļin the previous section, but it is needed to get the desired transformation laws. Namely, wehave
Proposition 4.5.2. The function Ī· is a holomorphic function on H satisfying the transfor-mation properties
Ī·(z + 1) = Ī·(z), Ī·
(ā1
z
)=
āz
iĪ·(z).
Proof. To see that it is holomorphic, it suffices to show the product converges absolutelyand uniformly on compacts sets in H, which is equivalent to doing the same for
log Ī·(z) = log q1/24 +
āān=1
log(1ā qn) =Ļiz
12+
āān=1
log(1ā qn).
Recall z ā H corresponds to q = e2Ļixeā2Ļy in the region 0 < |q| < 1. Using the Taylorexpansion for log(1ā x) = ā
āāj=1
xj
j , we see if x = |q| thenā£ā£ā£ā£ā£āān=1
log(1ā qn)
ā£ā£ā£ā£ā£ ā¤āān=1
āāj=1
xnjj =āān=1
Ļā1(n)xn
where Ļā1(n) is the sum of the reciprocals of divisors of n. In particular, this is less thanāān=1 nx
n, which we know converges (say by the ratio test) absolutely with a uniform boundfor x lying in a compact subset of (0, 1). This establishes holomorphy.
The transformation Ī·(z + 1) = Ī·(z) is evident since q is invariant under z 7ā z + 1.There are many ways to obtain the second identity, and we follow [Els06], which gives a
proof suggested by Petersson. Since both sides are analytic, it suffices to prove logarithmicderivative of both sides of the desired identity are equal, i.e.,
1
z2
Ī·ā²(ā1/z)
Ī·(ā1/z)=
d
dzlog Ī·
(ā1
z
)=
d
dzlog
āz
iĪ·(z) =
āzi Ī·ā²(z)ā i
2
āizĪ·(z)ā
zi Ī·(z)
=Ī·ā²(z)
Ī·(z)+
1
2z.
Let Ļ ā H and consider the lattice Ī = ć1, Ļć. One defines the associated Weierstrasszeta function
Ī¶(s; Ī) =1
s+
ā(m,n)āZ2
(m,n)6=(0,0)
(1
s+mĻ + nā 1
mĻ + n+
s
(mĻ + n)2
)
CHAPTER 4. MODULAR FORMS 73
Using Eulerās expression (4.2.4) for cotangent, and a similar one for csc2, we have
Ī¶(s; Ī) =Ļ2
3s+ Ļ cotĻs+
ām 6=0
(Ļ cotĻ(mĻ + s)ā Ļ cotĻmĻ + Ļ2s csc2 ĻmĻ
).
Consequently, we have
Ļ1(Ļ) := Ī¶
(1
2; Ī
)=
(Ļ2
6+ Ļ2
āām=1
csc2 ĻmĻ
)
andĻ2(Ļ) := Ī¶
(Ļ2
; Ī)
= ĻĻ1(Ļ) +1
2.
On the other hand, the definition of Ī¶(z; Ī) implies
Ļ2(Ļ) =1
ĻĻ1
(ā1
Ļ
).
Comparing the previous two equations yields
1
ĻĻ1
(ā1
Ļ
)= ĻĻ1(Ļ) +
1
2.
Note our expression for Ļ1 combined with the Fourier expansion csc2 Ļz = ā4āā
n=1 nqn
gives
Ļ1(z) = ā2ĻiĪ·ā²(z)
Ī·(z)
(cf. Exercise below). But now the previous transformation property for Ļ1 gives us preciselythe desired identity for Ī·ā²(z)
Ī·(z) .
Exercise 4.5.3. (a) Determine the q-expansion for Ī·ā²(z)Ī·(z) .
(b) Check Ļ1(z) = ā2ĻiĪ·ā²(z)Ī·(z) as asserted in the proof above.
We remark that had we proved the functional equation for the (pseudo-)Eisenstein se-ries E2 stated in Exercise 4.2.14, one can derive the transformation law for Ī· in a morestraightforward manner from that. See, e.g., [Kob93] or [DS05].
Just like Ļ and Ļ, one can relate Ī· to quadratic forms. Specifically, one can ask aboutrepresenting an integer n as a sum of k pentagonal numbers. Here the n-th pentagonalnumber is 3n2ān
2 for n ā„ 0, and this has a geometric interpretation just like square andtriangular numbers do (draw a few). Then Eulerās pentagonal number theorem (Euler hadpreviously considered the formal product
āān=1(1ā xn), not actually Ī· as a function of H)
states
Ī·(z) = q1/24āā
n=āā(ā1)nq
3n2ān2 .
Note this is not exactly the analogue of Ļ for pentagonal numbers, due to both the factorof (ā1)n and the appearance of the ānegative pentagonal numbersā 3n2ān
2 for n < 0.
CHAPTER 4. MODULAR FORMS 74
However, what is more interesting, and Eulerās motivation for the pentagonal numbertheorem, is that
1
Ī·(z)= qā1/24
āān=0
p(n)qn, (4.5.1)
where p(n) denotes the partition function. Recall a partition of n is a way of writing n asa sum of positive integers (order does not matter), e.g., the partitions of 3 are 3 = 2 + 1 =1 + 1 + 1. Then p(n) is the number of partitions of n, so, e.g., p(3) = 3 (we count 3 itself asthe trivial partition of 3). Here is a table of some partition numbers
n 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Ā· Ā· Ā· 100p(n) 1 2 3 5 7 11 15 22 30 42 56 77 101 135 176 231 Ā· Ā· Ā· 190569292
The relation (4.5.1) follows immediately from the product expansion (our definition) forĪ·, and the following exercise.
Exercise 4.5.4. Show the formal identityāān=1
1
1ā xn=āān=0
p(n)xn,
where p(0) = 1.
A classical problem in number theory and combinatorics was to find a simple expressionfor p(n). (Partition numbers come up often in combinatorial problems.) There is no (known)simple (finite rational) closed form expression for p(n). As you can see from the table, thenumbers p(n) start off growing quite slowly, but then appear to have exponential growth (anasymptotic was given by Hardy and Ramanujan). However, by Eulerās relation (4.5.1), onecan use Ī· and the theory of modular forms to get many results about the partition function.In fact, just in January 2011, Brunier and Ono gave a finite algebraic expression for p(n) interms of Ī·, E2 and certain binary quadratic forms of discriminant 1ā 24n.
So what precisely is the relation with modular forms? Just like with Ļ and Ļ, you mightexpect certain powers of Ī· to be modular forms. From the q1/24 factor appearing, it seemsclear that such an exponent would need to be a multiple of 24. (This is the same reason weneeded to consider Ļk where k ā” 0 mod 8.)
Proposition 4.5.5. Let k ā” 0 mod 24. Then Ī·k āMk/2(1).
In fact, since these forms are of level 1, the proof is easier than that for Ļ.
*Exercise 4.5.6. Prove Proposition 4.5.5.
There are a couple of things to remark here on apparent differences with powers of Ļand Ļ.
First, we are getting modular forms on the full modular group, as opposed to Ī0(4) inthe case of powers of Ļ and Ļ. What this means is that Ī· has slightly more refined symmetrythan Ļ and Ļ. In fact one can write Ļ and Ļ in terms of Ī·:
Ļ(z) =Ī·(2z)5
Ī·(z)2Ī·(4z)2, Ļ(z) =
Ī·2(2z)
Ī·(z)
CHAPTER 4. MODULAR FORMS 75
One reason why Ī· is quite special, is that all modular forms of āsmall levelā can be generatedby āeta quotients.ā This is useful in giving product expansions for modular forms.
Second, one can try to write Ī·24 a linear combination of Eisenstein series on M12(1).However there is only one Eisenstein series here, namely E12 and just looking at the constantterm (i.e., the value when z = iā or q = 0) shows Ī·24 is not a constant multiple of E12.(When we worked on M4(4) we could use Eisenstein series from lower levels in addition toE4,4, but there is no lower level to work with now.) In other words, the Fourier coefficientsfor Ī·24 do not satisfy such a simple expression in terms of divisor functions as those for Ļ8
or Ļ8. However, this issue not so much from any inherent difference between Ī· versus Ļ andĻ, but rather from the difference in the exponents of the functions, or put another way, theweights of the modular forms.
Except in the case of low weights k, the space Mk(Ī) will not be generated (as a vectorspace) by only Eisenstein series, but also require cusp forms. The Fourier coefficients of thesecusp forms, are on one hand more mysterious than the Fourier coefficients of Eisenstein series(not easily expressible in terms of elementary functions like Ļk(n)), but on the other hand aremuch better behaved (asymptotically they satisfy better bounds and are in a sense āevenlydistributedā.)
Definition 4.5.7. Let f ā Mk(Ī). We say f is a cusp form (of weight k for Ī) if fvanishes at the cusps, i.e., if, for each Ļ ā PSL2(Z), f |Ļ,k(z)ā 0 as Im(z)āā. The spaceof cusp forms of weight k for Ī is denoted by Sk(Ī), or simply Sk(N) if Ī = Ī0(N).
Note the S in Sk(Ī) stands for Spitzenform, which is German for cusp form. The Frenchdo not like this notation. (Cusp form is forme parabolique in French, but no one uses Pk(Ī).)
Lemma 4.5.8. Let f ā Mk(Ī). Then f vanishes at the cusps if and only if f(z) ā 0 asz ā z0 for any z0 ā Q āŖ iā.
Contrast this with the case of Eisenstein series, where holomorphic at the cusps does notmean that Ek,N (z) is bounded as z tends to a rational number.
Proof. First suppose f vanishes at the cusps. Let z0 ā QāŖiā, and Ļ =
(a bc d
)ā PSL2(Z)
such that Ļiā = z0. Since f |Ļ,k(z+N) = f |Ļ,k(z) for some N , we have a Fourier expansion
f |Ļ,k(z) =ānā„0
aĻ,nqnN , qN = e2Ļiz/N .
The fact that this tends to 0 as z ā iā implies aĻ,0 = 0. Consequently f |Ļ,k(z) = qNh(z)where h(z) =
ānā„0 aĻ,n+1q
nN is a holomorphic function. On the other hand
f(Ļz) = (cz + d)kf |Ļ,k(z) = (cz + d)kqNh(z)ā 0
as z ā iā because qN = e2Ļix/Neā2Ļy/N ā 0 faster than (cz + d)k ā ā. This impliesf(z)ā 0 as z ā Ļiā = z0.
Conversely, suppose f(z)ā 0 as z ā z0 for all z0 ā QāŖiā. Take Ļ ā PSL2(Z). Then
f |Ļ,k(z) = (cz + d)ākf(Ļz)ā 0 as z ā iā
since both (cz + d)āk ā 0 and f(Ļz)ā 0 as z ā iā.
CHAPTER 4. MODULAR FORMS 76
As with holomorphy at the cusps, to check the condition of vanishing at the cusps givenabove, it suffices to check the vanishing condition at each cusp, rather than each element ofQ āŖ iā.
Lemma 4.5.9. Let f ā Mk(Ī). Let z1, . . . zr ā Q āŖ iā be a set of representatives forthe cusps for Ī. If f(z)ā 0 as z ā zi for 1 ā¤ i ā¤ r, then f vanishes at the cusps.
Proof. Let z0 ā Q āŖ iā and write zi = Ī³z0 for some 1 ā¤ i ā¤ r and Ī³ =
(a bc d
)ā Ī.
Since f(z)ā 0 as z ā zi, we have f(Ī³z)ā 0 as z ā z0. Therefore, we also have
f(z) = f |Ī³,k(z) = (cz + d)ākf(Ī³z)ā 0
as z ā z0 since here (cz + d)āk ā 0.
Definition 4.5.10. We define the discriminant modular form ā := Ī·24 āM12(1). TheRamanujan Ļ-function defined by the Fourier expansion of ā so that
ā(z) =āān=1
Ļ(n)qn.
It is clear from the definition of Ī· that Ī·24 has no constant (n = 0) term in the Fourierexpansion, so ā is holomorphic at iā, which is the only cusp for PSL2(Z). Thus ā is acusp form, i.e., ā ā S12(1).
The reason for the terminology of ā is that is gives the discriminant for elliptic curves.Namely if E is the elliptic curve corresponding to the lattice ć1, zć, and ā(E) is the appro-priately normalized discriminant, then ā(E) = ā(z).
In 1916, Ramanujan computed the first 30 values of Ļ(n) and noticed various arithmeticalcuriosities. Here are the first several values
n 1 2 3 4 5 6 7 8 9 10 11 12Ļ(n) 1 -24 252 -1472 4830 -6048 -16744 84480 -113643 -115920 534612 -370944
The most important property Ramaujan numerically observed is that Ļ is multiplicative,i.e., Ļ(mn) = Ļ(m)Ļ(n) whenever gcd(m,n) = 1. This alone means that Ļ must havesome interesting arithmetical content. He also observed (again numerically) that Ļ(p2) =Ļ(p)2āp11. Both these statements were proved by Mordell in 1917, and this was generalizedby Hecke. We will prove these results later using the theory of Hecke operators.
Ramanujan also conjectured that |Ļ(p)| ā¤ 2p5āp. Compare this with the p-th Fouriercoefficient for E12 which is, up to a constant, Ļ11(p) = 1 + p11. In other words, the Fouriercoefficients for the cusp form ā of weight 12 should grow at most like the square root ofthe growth of the Eisenstein series of weight 12. This was proved by Deligne in 1974 as aconsequence of his Fields-medalāwinning proof of the Weil conjectures. While we obviouslycanāt prove this in our course, there is a weaker bound due to Hecke which is easy to provethat we will treat later. Heckeās bound states |Ļ(p)| = O(p6), i.e., |Ļ(p)| ā¤ Cp6 for someconstant C. In fact, the proof so simple this bound is often called the trivial bound, andany improvement in the exponent was considered substantial progress.
Ramanujan also noticed some remarkable congruences, such as the following.
CHAPTER 4. MODULAR FORMS 77
Example 4.5.11. We will see in the next chapter that dimM12(1) = 2. Observe that
E12(z) = 1 +65520
691
āān=1
Ļ11(n)qn = 1 +65520
691q + Ā· Ā· Ā·
and
E4(z)3 =
(1 + 240
āān=1
Ļ3(n)qn
)3
= 1 + 720q + Ā· Ā· Ā· .
The fact that dimM12(1) = 2 implies any cusp form (any form with zero constant term)must be a multiple of ā, so comparing coefficients of q, we see
691E12(z)ā 691E4(z)3 = ā432000q + Ā· Ā· Ā· = ā432000ā.
Since the coefficients of E4(z)3 are integers, taking this mod 691 gives
566āān=1
Ļ11(n)qn ā” 691E12(z) ā” 566ā ā”āān=1
Ļ(n)qn mod 691,
i.e.,Ļ(n) ā” Ļ11(n) mod 691.
Note, while we have expressed ā as an algebraic (polynomial) combination of Eisensteinseries, our comments before the definition of cusp form were stating that not all modularforms will be linear combinations of Eisenstein series. Roughly, it is the cusp forms whichwill not be. In fact, once we have the notion of an inner product on Mk(Ī), we will see thatthe space of cusp forms is orthogonal to the space generated (linearly) by Eisenstein series.
While we could, from the previous example, write Ļ(n) as a polynomial expression inĻ3 and Ļ11 akin to Example 4.3.13 (in fact, we will see later that one can write Ļ(n)āandmore generally the Fourier coefficients of any modular form of full levelāas a polynomialexpression in Ļ3 and Ļ5), this is qualitatively different than being able to write the r8(n) andĪ“8(n) as linear expressions in Ļ3 as in Example 4.4.7 and Exercise 4.4.9. This is reflected inthe very different behavior of Fourier coefficients for forms which are linear combinations ofEisenstein series and cusp forms.
Chapter 5
Dimensions of spaces of modularforms
As indicated in the introduction and the last chapter, we would like to know that a givenspace of modular forms Mk(Ī) is finite dimensional. Then, for example, we may be able tofind a basis for the space in terms of Eisenstein series, and use this to study questions aboutquadratic forms, as Example 4.4.7 and Exercise 4.4.9. In order to do this, we will need toknow the dimension of the relevant space Mk(Ī).
In this chapter, we will first handle the case of full level, proving finite dimensionalityand giving a simple formula for the dimension of Mk(1). Then, as in [Kil08], we will proveSturmās bound, which will allow us to deduce finite dimensionality for Mk(Ī). One canprove dimension formulas for Mk(N) (and more generally Mk(Ī)), but in the interest oftime we will not prove them in general, but merely state them for reference. However, it willbe a consequence of Sturmās bound that one can, for instance, rigorously verify the resultsof Example 4.4.7 and Exercise 4.4.9 without actually checking the dimension of M4(4).
5.1 Dimensions for full level
To study the dimension of Mk(1), we will need the residue theorem from complex analysis,which we now recall. If f is meromorphic with a pole at s, then the residue of f at s isResz=sf(z) = aā1 where f(z) =
āan(z ā s)n, i.e., the residue is the coefficient of the 1
zāsterm in the Laurent expansion of f at s.
Theorem 5.1.1. (Cauchyās Residue Theorem) Let U be an open set in C whose bound-ary is a simple closed curve C. Let f : U ā C be meromorphic and let sj denote the setof singularities of f . If f extends to a holomorphic function at each z ā C, thenā«
Cf(z)dz = 2Ļi
āj
Resz=sjf(z)
By convention, the integral around a closed curve will be in the counterclockwise direc-tion. The condition that f must be holomorphic on C essentially means that there shouldbe no poles along our path of integration.
78
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 79
Example 5.1.2. Suppose f is holomorphic on a region U . In this case there are no residuesbecause there are no poles, so
ā«C f(z)dz = 0 for any simple closed curve C ā U .
Example 5.1.3. Consider f(z) = 1zk, and let C be any simple closed curve containing the
origin. Here there is just one pole, at z = 0, and Resz=0f(z) = 1 if k = 1 and 0 otherwise.Thus ā«
C
1
zkdz =
2Ļi k = 1
0 else.
Meromorphic functions are controlled by their zeroes and poles. For a meromorphicfunction f on U , we define for any point p ā U , the order of f of f at p to be vp(f) = mwhere the Laurent (or Taylor) expansion of f at p is f(z) =
āān=m an(zā p)n with am 6= 0,
i.e., the order tells you where the Laurent/Taylor series expansion at p starts. In otherwords, if vp(f) = m > 0 means f has zero of order m at p, vp(f) = 0 means f(p) 6= 0, andvp(f) = m < 0 means f has a pole of order ām at p.
Aside: in algebraic geometry, one associates to f a divisor div(f) defined to be a formalsum
div(f) =āpāU
vp(f)p.
This tells you where the zeroes and the poles are and what their order is, and if f is a rationalfunction, then this sum is finite. For example, if we consider the meromorphic function
f(z) = 12086(z ā 1)3(z ā 4)
z2(z + 2i)
on C = C āŖ ā, then
div(f) = 3 Ā· 1 + 1 Ā· 4ā 2 Ā· 0ā 1 Ā· (ā2i)ā 1 Ā· ā.
In other words f has zeroes of order 3 and 1 at z = 1 and z = 4, poles of order 2, 1 and 1at z = 0, 2i and ā. I.e., v1(f) = 3, v4(f) = 1, v0(f) = ā2, vā2i(f) = ā1 and vā(f) = ā1(and vp(f) = 0 for any other p ā C. Note replacing f with a nonzero constant multiple doesnot change the divisor, and it is not hard to see that div(f) determines f up to a constant.
Observe that for any rational function f ,āpāC
vp(f) = 0. (5.1.1)
(If f = qr where q and r are polynomials, div(f) = div(q) ā div(r). Now the number of
zeroes of q is simply the degree, and q has no poles in C, but a pole of order deg(q) atā. Therefore the sum
āvp(q) of coefficients of div(q) is 0, and the same is true for r, and
therefore f .) Our proof of dimension formulas for Mk(1) will rely on an analogue of thisformula for modular forms.
We remark that (5.1.1) is analogous to the formulaāv
|x|v = 1
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 80
from algebraic number theory, where x ā Q and v runs over the places of Q. (And bothof these formulas are trivial consequences of the definition of the order of a function at apoint and the p-adic valuation of a rational number. The analogy between algebraic numbertheory and algebraic geometry runs much deeper than this of course.)
Theorem 5.1.4. Let f be a nonzero meromorphic modular form of weight k for PSL2(Z).For p ā H, let Cp be the elliptic subgroup of PSL2(Z) stabilizing p (cf. Section 3.5), and let
Fā =
z ā H : ā1
2ā¤ Re(z) <
1
2, |z| ā„ 1, and |z| > 1 if Re(z) ā„ 0
ā Ī¶3 ,
so Fā āŖ i, Ī¶3 is in bijection with PSL2(Z)\H. ThenāpāPSL2(Z)\H
1
|Cp|vp(f) = viā(f) +
1
2vi(f) +
1
3vĪ¶3(f) +
āpāFā
vp(f) =k
12. (5.1.2)
Fā
iĪ¶3
ā12
12
We wrote the expressionā 1|Cp|vp(f) on the left of (5.1.2) both to give a more uniform
presentation of the sum and make more clear the analogy with (5.1.1). However it is clearthis expression equals viā(f) + 1
2vi(f) + 13vĪ¶3(f) +
āpāFā vp(f) from Section 3.5, and this
latter expression is what we will use in the proof.
Proof. Note that we can relate the orders of zeroes and poles of f to residues of the loga-rithmic derivative f ā²
f . Namely if f(z) =ā
n=m an(z ā p)n with am 6= 0, then
f ā²(z)
f(z)=mam(z ā p)mā1 + (z ā p)m ((m+ 1)am + Ā· Ā· Ā·)
am(z ā p)m (1 + Ā· Ā· Ā·)=
m
z ā p+ c0 + z(c1 + c2z
2 + Ā· Ā· Ā· ),
soResz=p
f ā²
f= vp(f).
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 81
Consequently, if f is holomorphic on U and C ā U is a simple closed curve which misses allzeroes and poles of f (i.e., misses all poles of f
ā²
f ), then the residue theorem saysā«C
f ā²
fdz = 2Ļi
āvp(f), (5.1.3)
where the sum is over all points in the region enclosed by C.First we assume that f has no zeroes or poles on the boundary āFā of Fā. Note
āFā = āF where F is the standard fundamental domain for PSL2(Z)); here, we mean theboundary inside H, so a zero or pole at iā is not ruled out. In this case we will considerour simple closed curve C to be of the following type.
The curve C is mostly easily described in the diagram below, but here is how I woulddescribe it in words also. The curve C, oriented counterclockwise, will begin at some pointa high up on the left boundary Re(z) = ā1
2 of Fā, travel down the vertical line Re(z) = ā12 ,
stopping at a point b just short of Ī¶(3), then arcing inside Fā to travel along the bottomboundary |z| = 1, making a small arc inside Fā to avoid i, then continuing along thebottom boundary, again making a small arc to avoid Ī¶6, then climbing up the right boundaryRe(z) = 1
2 to a point e such that Im(e) = Im(a), then traveling along the horizontal lineIm(z) = Im(a) to end back at a.
Fā
ā12
12
a
b
bā²dā²
dc cā²
e
C
Note that this curve C is made in such a way that no zeroes of poles of f inside F āi, Ī¶3, Ī¶6 lie outside of C. This is possible since there are only finitely many zeroes and polesinside F . To see this, recall the open balls Br(iā) = iā āŖ z ā H : Im(z) > r form a
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 82
basis of neighborhoods of iā. So f being meromorphic at iā means we can choose a ballBr(iā) such that f has no zeroes or poles in Br(iā) except possibly at iā. Consequentlyall zeroes and poles of f inside F must lie inside a truncated fundamental domain Fr =z ā F : Im(z) ā¤ r. But the number of zeroes and poles inside Fr must be finite becauseFr is compact.
We also assume C is symmetric under reflection about iR+.By (5.1.3), we know ā«
C
f ā²
fdz = 2Ļi
āpāFā
vp(f), (5.1.4)
On the other hand, we can compute the integral along C piecewise. Since f(z+1) = f(z)we have ā« b
a
f ā²
fdz +
ā« e
dā²
f ā²
fdz = 0.
Thus ā«C
f ā²
fdz =
ā« bā²
b
f ā²
fdz +
ā« c
bā²
f ā²
fdz +
ā« cā²
c
f ā²
fdz +
ā« d
cā²
f ā²
fdz +
ā« dā²
d
f ā²
fdz +
ā« a
e
f ā²
fdz.
First note the change of variables z 7ā q transforms the integral from e to a to a circlearound q = 0 with negative orientation. Then Cauchyās Residue Theorem givesā« a
e
f ā²
fdz = ā2Ļiviā(f).
Next, if we integrate f ā²
f around the circle Ī±r of radius r containing the arc from b to bā², weget 2ĻivĪ¶3(f). (By making the arc of small enough radius, we may assume f has no zerosor poles inside this circle except possibly at Ī¶3.) If we take the radius r going to 0, its angletends to Ļ
3 , so you might guess ā« bā²
b
f ā²
fdz ā āĻi
3vĪ¶3(f).
This is true, and it follows from applying the following lemma to g(z) = f ā²(zāĪ¶3)f(zāĪ¶3) , which has
at most a simple pole at 0 whose residue is vĪ¶3(f).
Lemma 5.1.5. Suppose g is meromorphic with at most a simple pole at the origin. LetSr be the circle of radius r around the origin, 0 ā¤ Īø1 ā¤ Īø2 ā¤ 2Ļ, and consider the pointsar = reiĪø1 and br = reiĪø2 on Sr. Then
limrā0
ā« br
ar
g(z)dz = (Īø2 ā Īø1)iResz=0 g(z),
where the integral is taken along the arc in Sr from ar to br. In particular, in the limit asr ā 0, the integral only depends upon the length of the arc.
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 83
Proof. First write g(z) = aā1zā1+h(z) where h(z) is holomorphic at 0 and aā1 = Resz=0 g(z).
It is clear that limrā0
ā« brarh(z)dz = 0, so it suffices to consider
limrā0
ā« br
ar
aā1
zdz.
Write z = reiĪø so dz = ireiĪødĪø. Then
limrā0
ā« br
ar
aā1
zdz = lim
rā0
ā« Īø2
Īø1
aā1
reiĪøireiĪødĪø = (Īø2 ā Īø1)iaā1.
Similarly, we have ā« cā²
c
f ā²
fdz ā āĻivi(f)
and ā« dā²
d
f ā²
fdz ā āĻi
3vĪ¶6(f) = āĻi
3vĪ¶3(f).
Putting all this together gives
2Ļi
viā(f) +1
2vi(f) +
1
3vĪ¶3(f) +
āpāFā
vp(f)
= limCāāF
ā« c
bā²
f ā²
fdz +
ā« d
cā²
f ā²
fdz.
By our symmetry assumption on C, S =
(0 1ā1 0
)takes the arc from bā² to c to the arc from
d to cā² (reverses orientation). Recall Sz = ā1z , so differentiating f(Sz) = zkf(z) gives
f ā²(Sz)d
dzSz = f ā²(Sz)
d
dz
ā1
z=
1
z2f ā²(Sz) = kzkā1f(z) + zkf ā²(z).
Therefore,f ā²(Sz)
f(Sz)=kzk+1f(z) + zk+2f ā²(z)
zkf(z)= kz + z2 f
ā²(z)
f(z).
so if z = Sw, then dz = 1w2dw andā« c
bā²
f ā²(z)
f(z)dz =
ā« cā²
d
f ā²(Sw)
f(Sw)
dw
w2=
ā« cā²
d
(k
w+f ā²(w)
f(w)
)dw
Since the path from d to cā² is taken along |w| = 1,ā« c
bā²
f ā²
fdz +
ā« d
cā²
f ā²
fdz = k
ā« cā²
d
dw
wā k
ā« Ļ/2
Ļ/3dĪø = k
Ļi
6,
as C ā āF .This proves the theorem with our assumptions about the zeroes and poles along the
boundary of āF . Now suppose there are m points p1, . . . , pm, along āF āi, Ī¶3, Ī¶6 at which
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 84
f is zero or infinite. (There are at most finitely many by meromorphy.) Then we can use theabove argument, by suitably modifying the type of our curve C so as to just avoid p1, . . . , pm.Note that if pi is such a point along the left boundary Rez = ā1
2 (resp. bottom left boundaryRez < 0, |z| = 1), then Tpi (resp. Spi) is also such a point along the right (resp. bottomright) boundary. For example if there are only two such points, say p1 on the left boundary,and p2 = Tp1 along the right boundary, we can take our curve of the following type.
Fā
ā12
12
a
b
bā²dā²
dc cā²
e
p1 p2
C
Here our previous argument goes through unchanged, since the integrals from a to b anddā² to e still cancel each other out. This obviously works for any (finite) number of zeroesand poles along the side boundary. It is also easy to see one can treat zeroes and poles onthe lower boundary similarly (cf. exercise below).
Exercise 5.1.6. With notation as in the previous theorem, suppose f has a zero or poleat some point p1 lying on the lower left boundary
z ā H : |z| = 1,ā1
2 < Re(z) < 0of āF .
Assume f has no zeroes or poles on āF āi, Ī¶3, Ī¶6 except at p1 and p2 = Sp1. Check (5.1.2)still holds.
Now we can use this to start computing dimensions of spaces of modular forms.
Example 5.1.7. While we have been assuming k ā„ 0 even up to now, one can considerthe same definitions for k < 0 even, and (5.1.2) still holds. If k < 0, then (5.1.2) saysf must have a pole. Consequently, there are no (nonzero) holomorphic modular forms ofnegative weight, justifying our āomissionā of negative weights. (Meromorphic modular forms
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 85
of negative weights do of course existājust take 1f for any nonzero holomorphic modular
form f of positive weight.)
Proposition 5.1.8. We have
Mk(1) =
C k = 0
0 k = 2
CEk k = 4, 6, 8, 10, 14.
This will complete the proof of the relation between Ļ7 and Ļ3 in 4.3.13.
Proof. We already know M0(1) = C from Corollary 4.1.12.Next note any nonzero term on the left of (5.1.2) is at least 1
3 , so (5.1.2) has no solutionsof k = 2.
If k = 4, there is precisely one solution to (5.1.2), namely vĪ¶3(f) = 1 and all othervp(f) = 0. Similarly, one easily sees (5.1.2) has only one solution for k = 6, 8, 10 or 14.However, we already know Ek is one nonzero modular form in Mk(1). Consequently iff ā Mk(1) for k = 4, 6, 8, 10 or 14, then (5.1.2) having only one solution means the zeroesof f must agree with the zeroes of Ek. Consequently, f/Ek is a meromorphic modular formof weight 0 with no zeroes or poles, i.e., a constant since M0(1) = C.
Note if k = 12, then there are infinitely many solutions to (5.1.2), and we need to useanother argument to determine M12(1) (and similarly for Mk(1) for k > 14). We alreadyknow two forms in this space, the Eisenstein series E12, and the cusp form ā.
Lemma 5.1.9. Let k ā„ 4 even. Then Mk(1) = Sk(1)ā CEk.
Proof. Let f āMk(1). We know Ek āMk(1) is 1 at iā. Thus g(z) = f(z)ā f(iā)Ek(z) āMk(1) and vanishes at iā, i.e., g ā Sk(1).
In other words, every modular form in Mk(1) is the sum of a cusp form and a multipleof the Eisenstein series Ek. In particular, to determine dimMk(1) it suffices to determinedimSk(1) = dimMk(1) ā 1. Note that by the previous proposition, there no nonzero cuspforms of level 1 for weights 0 through 10 or 14, so ā is the first (smallest weight) instanceof a cusp form.
Proposition 5.1.10. The space S12(1) = Cā, i.e., M12(1) = Cāā CE12.
This will complete the proof that Ļ(n) ā” Ļ11(n) mod 691 in Example 4.5.11.
Proof. Note that with k = 12, (5.1.2) only has one solution with viā(f) > 0, namelyviā(f) = 1 and vp(f) = 0 for any p 6= iā. This applies to any nonzero f ā S12(1), sincesuch an f must vanish at iā. I.e., any nonzero f ā S12(1) has a simple (order 1) zero atiā and no zeroes in H. In particular this is true for ā. Then, as above, for any f ā S12(1),we have f/ā āM0(1), i.e., f is a scalar multiple of ā.
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 86
For k = 16, 18, 20, 22, 26, again one can observe there is only one solution to (5.1.2) withviā(f) > 0 to see dimSk(1) = 1 and dimMk(1) = 2. However, we would prefer to be ableto explicitly construct the cusp forms.
How can we construct cusp forms? One way would be to construct a form in Mk(1)which isnāt a multiple of Ek and subtract off an appropriate multiple of Ek, as in the proofof Lemma 5.1.9. More suitable for our present purposes is just to multiply a cusp form(namely ā) with another form.
Exercise 5.1.11. For any congruence subgroup Ī, let f ā Mk(Ī) and g ā S`(Ī). Thenfg ā Sk+`(Ī).
Now we show that any cusp form can be constructed as a multiple of ā and a modularform of smaller weight.
Lemma 5.1.12. Let k ā„ 16. Then Sk(1) = ā Ā·Mkā12(1). In particular, dimMk(1) =1 + dimMkā12(1).
Proof. Let f ā Sk(1). Then viā(f) ā„ 1. On the other hand, ā has a simple zero at iā andis nonzero on H, therefore f/ā is a holomorphic modular form of weight k ā 12.
We can put all this information together in the following theorem.
Theorem 5.1.13 (Dimension formula in level 1). Let k ā„ 0 even. Then
dimMk(1) =
b k12c+ 1 k 6ā” 2 mod 12
b k12c k ā” 2 mod 12.
By Lemma 5.1.9, this also tells us the dimensions of Sk(1). Note the reason for thedifference in the case k ā” 2 mod 12 is because E2 6āM2(1).
In fact, we know more than the dimensions. What we did above, allows us to a basis forany Mk in terms of Eisenstein series and ā.
Example 5.1.14. Consider M40(1). We can write
S40(1) = ā Ā·M28(1) = ā Ā· (CE28 ā S28(1)) = ā Ā· (CE28 āā (CE16 āāE4)) .
Consequently, a basis for M40(1) isE40,āE28,ā
2E16,ā3E4
.
Exercise 5.1.15. Write a basis for M36(1) in terms of Eisenstein series and ā.
Since we can write any modular form of level 1 in terms of Eisenstein series and ā, wecan write any modular form of level 1 simply in terms of Eisenstein series by the relation(now justified)
ā =691
432000
(E3
4 ā E12
)that we obtained in Example 4.5.11. In fact we can write ā in terms of strictly lower weightEisenstein series, which is in many places how ā is defined.
Exercise 5.1.16. Show that
ā =E3
4 ā E26
1728.
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 87
Note the above relation shows the j-invariant defined in Exercise 4.2.13 is simply
j(z) =E4(z)3
ā(z).
In other words, the j-invariant of an elliptic curve (which is actually an invariant of analyticisomorphism classes) is closely related to the discriminant ā of the elliptic curve (which isnot an invariant for isomorphism classes).
In fact all modular forms for PSL2(Z) can be written in terms of E4 and E6.
Proposition 5.1.17. Let f āMk(1). Then f is polynomial in E4 and E6.
Proof. Since the above procedure shows any f is a polynomial in ā =E3
4āE26
1728 andE4, E6, . . . , Ek,it suffices to show each Ej is a polynomial in E4 and E6. We use induction. Suppose it istrue for Ej with j ā¤ k ā 2, and consider Ek.
We may assume k ā„ 8 even. Then there exist r, s ā Zā„0 such that k = 4r + 6s. HenceE = Er4E
s6 ā Mk(1). Since neither E4 nor E6 vanish at infinity, E this is not a cusp form.
Further, because Mk(1) = CEk ā Sk(1), there exists g ā Sk(1) such that E ā g = cEk forsome c 6= 0. (Note the Fourier expansions of E ā g and Ek both start with 1, so in factc = 1). On the other hand, since g = āgā² for some gā² ā Mkā12(1), by induction g is apolynomial in E4 and E6. Therefore Ek also is.
Exercise 5.1.18. (i) Write E10 and E14 as polynomials in E4 and E6.(ii) Using a relation between E14, E10 and E4, show
Ļ13(n) ā” 11Ļ9(n)ā 10Ļ3(n) mod 2640.
5.2 Finite dimensionality for congruence subgroups
As you can see from the proof of Theorem 5.1.13, trying to carry out the same approachfor an arbitrary congruence subgroup does not seem so appealing. Of course, for a givencongruence subgroup, one can fix a fundamental domain and mimic the proof of Theorem5.1.4 and use this to compute dimension formulas.
However, there is an general approach using some Riemann surface theory and algebraicgeometry, namely the RiemannāRoch theorem. We will not cover this here (see, e.g., [DS05]),but simply remark that one can prove an analogue of Theorem 5.1.4, which states if f is ameromorphic modular form of weight k on Ī, thenā
pāĪ\H
1
|Cp|vp(f) =
k
2
(1
2Īµ2 +
2
3Īµ3 + Īµā + 2g ā 2
), (5.2.1)
where Cp is the stabilizer of p in Ī, Īµ2 (resp. Īµ3) is the number of elliptic points of order 2for Ī, Īµā is the number of cusps for Ī, and g is the genus of Ī\H. Note for Ī = PSL2(Z),one has Īµ2 = 1, Īµ3 = 1, Īµā = 1 and g = 0 so this agrees with Theorem 5.1.4.
Consequently, one can show if k ā„ 0 even, then
dimMk(Ī) =
(k ā 1)(g ā 1) + bk4cĪµ2 + bk3cĪµ3 + k
2 Īµā k ā„ 2
1 k = 0,(5.2.2)
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 88
and
dimSk(Ī) =
(k ā 1)(g ā 1) + bk4cĪµ2 + bk3cĪµ3 +
(k2 ā 1
)Īµā k ā„ 4
g k = 2
0 k = 0.
(5.2.3)
For Ī = Ī0(N), explicit formulas for g, Īµ2, Īµ3 and Īµā in terms of N are given in [DS05](cf. Exercise 3.5.6 and Exercise 3.5.14), making the formulas for dimMk(N) and dimSk(N)quite explicit. For instance, we remark
dimS2(p) =1
12(p+ 1)ā 1
4
(1 +
(ā1
p
))ā 1
3
(1 +
(ā3
p
))(5.2.4)
and
dimS2(p2) =
112(p+ 1)(pā 6) + 1ā 1
4
(1 +
(ā1p
))ā 1
3
(1 +
(ā3p
))p ā„ 5
0 p = 2, 3 (or 5).(5.2.5)
We tabulate some explicit dimensions at the end of this chapter.
Instead, we will take a simpler, and in some sense more practical, approach (as in [Kil08])using Sturmās bound, which will allow us to conclude finite dimensionality with a bound fordimensions, which in some cases will be sharp.
Theorem 5.2.1. (Sturmās bound) Let Ī be a congruence subgroup and f ā Mk(Ī). LetĻ1, . . . , Ļt be the cusps of Ī. If ā
Ļj
vĻj (f) >k[PSL2(Z) : Ī]
12,
then f = 0.
Proof. First suppose Ī = PSL2(Z) and f ā Mk(1) with r := viā(f) > k12 . Since ā has
only a simple zero at iā and is nonzero on H, fār is a holomorphic modular form of weight
k ā 12r < 0. Since there are no nonzero holomorphic modular forms of negative weight(Example 5.1.7), this means f = 0.
Now suppose Ī 6= PSL2(Z) and put M = [PSL2(Z) : Ī] and f ā Mk(Ī) with viā(f) >kM12 . We will use f to construct a modular form for PSL2(Z).
Let Ļi1ā¤iā¤M be a set of coset representatives for Ī\PSL2(Z) with Ļ1 ā Ī. Recall
f |Ļ,k(z) = j(Ļ, z)ākf(Ļz) = (cz + d)ākf(Ļz),
where Ļ =
(a bc d
), so f |Ļ1,k = f . Let
F (z) =
Māi=1
f |Ļi,k(z) = f(z)
Māi=2
f |Ļi,k(z).
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 89
Notef |Ī³Ļi,k = (f |Ī³,k) |Ļi,k = f |Ļi,k
for any Ī³ ā Ī, i.e., f |Ļi,k does not depend upon the choice of coset representative.Consider any Ļ ā PSL2(Z). Note Ļ ā²i := ĻĻi is just another set of coset representatives
for Ī\PSL2(Z), hence
F |Ļ,kM =
(Māi=1
f |Ļi,k
)|Ļ,k =
Māi=1
(f |Ļi,k) |Ļ,k =Māi=1
f |ĻĻi,k = F,
i.e., F ā MkM (1). Since viā(F ) =āM
i=1 vĻā1i Ā·iā
(f) > kM12 , by the PSL2(Z) case we know
F = 0, which implies f = 0.
Remark 5.2.2. In the course of the proof we have shown how to construct a modular formof full level at the expense of increasing the weight. The same argument evidently shows thefollowing.
Let Īā² ā Ī ā PSL2(Z) be congruence subgroups, f ā Mk(Īā²) and M = [Ī : Īā²]. Let
Ļi1ā¤iā¤M be a set of coset representatives for Īā²\Īā². Then
F (z) =Māi=1
f |Ļi,k āMkM (Ī).
What Sturmās bound really means is the following.For a congruence subgroup Ī, we put q = e2Ļiz as usual if T ā Ī. If not, then TN ā Ī
for some N , and we set q = e2Ļiz/N for the minimal such N ā N. Then f ā Mk(Ī) hasperiod N so we can write the Fourier expansion as f(z) =
āanq
n.
Corollary 5.2.3. Let f(z) =āanq
n, g(z) =ābnq
n ā Mk(Ī). If an = bn for n ā¤k[PSL2(Z):Ī]
12 , then f = g.
Proof. Apply Sturmās bound to f ā g.
Corollary 5.2.4. For a congruence subgroup Ī,
dimMk(Ī) ā¤ k
12[PSL2(Z) : Ī] + 1.
In particular,
dimMk(N) ā¤ k
12Nāp|N
(1 +
1
p
)+ 1.
Proof. Sturmās bound says the linear map given by
f(z) =ānā„0
anqn 7ā (a0, a1, . . . , ar)
where r = k12 [PSL2(Z) : Ī] is an injective map from Mk(Ī) to Cr+1.
The second statement follows from Corollary 3.4.3.
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 90
We remark that the bound for dimMk(N) is asymptotically of the right order (for fixedk and N large)āthe āmain termā in the general dimension formulas for both dimMk(N)
and dimSk(N) is kā112 N
āp|N
(1 + 1
p
).
Example 5.2.5. Let k = 0, then we see for any Ī, dimM0(Ī) ā¤ 1. On the other hand anyconstant function lies in M0(Ī), so M0(Ī) = C. This provides another proof that there areno nonconstant holomorphic modular functions (cf. Corollary 4.1.12).
Example 5.2.6. Let N = 4. Then we see
dimMk(4) ā¤ k
124 Ā· 3
2+ 1 =
k
2+ 1.
In particular
dimM4(4) ā¤ 4
2+ 1 = 3.
On the other hand, we have constructed 3 linearly independent elements of M4(4) (none ofwhich are cusp forms), namely E4,4, E4,2 and E4 (cf. Example 4.4.7). Hence
dimM4(4) = 3.
This justifies the formulas for the number of representations of n as a sum of 8 squares(resp. triangular numbers) in Example 4.4.7 (resp. Exercise 4.4.9).
Exercise 5.2.7. Show dimMk(Ī0(2)) = 2 for k = 4, 6.
Observe that even in small weights, dimMk(N) > 1 for higher levels, in contrast to thelevel 1 case, where the dimension did not jump to 2 until the occurrence of the cusp form āin weight 12. The reason for this difference is that there is an Eisenstein series Ek,d āMk(N)for each d|N , and these are all different. In general Ī will have multiple cusps, and one canconstruct an āEisenstein series for each cuspā (which turn out to be the same as the Ek,d ford|N when Ī = Ī0(N) and N is a product of distinct primes or N = 4).
While we havenāt shown this yet, E4 ā E4,4 (or anything in M4(4)) is not a cusp form,even though it vanishes at iā. We point this out to emphasize that in higher level, onereally needs to check a form vanishes at all cusps in order to verify it is a cusp form.
However, one sometimes has 1-dimensional spaces in higher level (besides in the trivialweight k = 0).
Exercise 5.2.8. (a) Consider the Eisenstein series E2,2(z) = E2(z)ā2E2(2z) from Exercise4.2.18. Show E2,2(z) āM2(2) and E2,2(2z) āM2(4).
(b) Deduce dimM2(2) = 1 and dimM2(4) = 2.
(c) Show r4(n) =
8Ļ1(n) n odd24Ļ1(n0) n = 2rn0, n0 odd.
One can show the first cusp form for Ī0(4) occurs at weight 6.
Exercise 5.2.9. (a) Show FĪ·(z) = Ī·12(2z) ā S6(4).(b) Using Sturmās bound, show dimS6(4) = 1.(c) For k ā„ 6 even, show Sk(4) = FĪ· Ā·Mkā6(4).(d) Show for k ā„ 0,
dimMk(4) =k
2+ 1.
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 91
Example 5.2.10. Let us now consider the number r12(n) of representations of n as a sumof 12 squares. By Proposition 4.4.4, we know
Ļ12(z) =
āān=0
r12(n)qn = 1 + 2 Ā· 12q + 4 Ā· 66q2 + 8 Ā· 220q3 + Ā· Ā· Ā· āM6(4).
From the previous exercise, we know M6(4) = ćE6, E6,2, E6,4, FĪ·ć. Note
E6(z) = 1ā 504āān=1
Ļ5(n)qn = 1ā 504(q + 33q2 + 244q3 + Ā· Ā· Ā·
)
E6,2(z) = 1ā 504
31
āān=1
(32Ļ5(n/2)ā Ļ5(n)) qn = 1 +504
31
(q + q2 + 244q3 + Ā· Ā· Ā·
)E6,4(z) = E6,2(2z) = 1ā 504
31
āān=1
(32Ļ5(n/4)ā Ļ5(n/2)) qn = 1 +504
31q2 + Ā· Ā· Ā·
FĪ·(z) = Ī·12(2z) = q
āān=1
(1ā q2n)12 =ān>0
anqn = q ā 12q3 + Ā· Ā· Ā· .
One can verify that
Ļ12 = ā 5
336E6 +
31
1008E6,2 +
62
63E6,4 + 16FĪ·.
Consequently, one gets a formula for r12(n) in terms of Ļ5 and the Fourier coefficients anof the cusp form FĪ·:
r12(n) = 8Ļ5(n)ā 512Ļ5(n/4) + 16an.
Because of the appearance of cusp forms of level 4 starting at weight 6, for 2k ā„ 12 withk even, one no longer gets as elementary an expression for r2k(n) as the Fourier coefficientsof cusp forms are in some sense more mysterious. (While weāve avoided modular forms ofodd weight, there is also a cusp form of weight 5 for Ī1(4), so the formula for r10(n) alsoinvolves a cusp form.) However, as mentioned before, Heckeās bound will show the Fouriercoefficients for cusp forms grow at a much slower rate, so one is at least able to get a niceasymptotic for r2k(n) just in terms of the Fourier coefficients of Eisenstein series. We willsay a little more about this once we get to Heckeās bound.
We remark that in specific cases, it is still possible to get elementary formulas for r2k(n)with more work. For instance, Glaisher in fact derived an explicit elementary formula forr12(n) as
r12(n) = (ā1)nā18ād|n
(ā1)d+n/dd5 + 4ā
N(Ī±)=n
Ī±4,
where Ī± runs over all Hurwitz integers with norm n.
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 92
Appendix: Dimension Tables
Explicit dimensions for small k,N are given in Table 5.1 and Table 5.2. Note that, for a fixedN and k ā„ 4, dimMk(N)ādimSk(N) is constant. One may see this from comparing (5.2.2)and (5.2.3), and one sees dimMk(N) ā dimSk(N) = Īµā for k ā„ 4. This corresponds tothe fact that the various cusps give rise to linearly independent Eisenstein series on Mk(N),which are all holomorphic if k ā„ 4, and these Eisenstein series together with the cusp formslinearly generate Mk(N).
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 93
Table 5.1: Dimensions of spaces of modular forms for 2 ā¤ k ā¤ 8
N M2(N) S2(N) M4(N) S4(N) M6(N) S6(N) M8(N) S8(N)
1 0 0 1 0 1 0 1 02 1 0 2 0 2 0 3 13 1 0 2 0 3 1 3 14 2 0 3 0 4 1 5 25 1 0 3 1 3 1 5 36 3 0 5 1 7 3 9 57 1 0 3 1 5 3 5 38 3 0 5 1 7 3 9 59 3 0 5 1 7 3 9 510 3 0 7 3 9 5 13 911 2 1 4 2 6 4 8 612 5 0 9 3 13 7 17 1113 1 0 5 3 7 5 9 714 4 1 8 4 12 8 16 1215 4 1 8 4 12 8 16 1216 5 0 9 3 13 7 17 1117 2 1 6 4 8 6 12 1018 7 0 13 5 19 11 25 1719 2 1 6 4 10 8 12 1020 6 1 12 6 18 12 24 1821 4 1 10 6 16 12 20 1622 5 2 11 7 17 13 23 1923 3 2 7 5 11 9 15 1324 8 1 16 8 24 16 32 2425 5 0 11 5 15 9 21 1526 5 2 13 9 19 15 27 2327 6 1 12 6 18 12 24 1828 7 2 15 9 23 17 31 2529 3 2 9 7 13 11 19 1730 10 3 22 14 34 26 46 3831 3 2 9 7 15 13 19 1732 8 1 16 8 24 16 32 2433 6 3 14 10 22 18 30 2634 6 3 16 12 24 20 34 3035 6 3 14 10 22 18 30 2636 12 1 24 12 36 24 48 3637 3 2 11 9 17 15 23 2138 7 4 17 13 27 23 37 3339 6 3 16 12 26 22 34 3040 10 3 22 14 34 26 46 38
CHAPTER 5. DIMENSIONS OF SPACES OF MODULAR FORMS 94
Table 5.2: Dimensions of spaces of modular forms for 10 ā¤ k ā¤ 16
N M10(N) S10(N) M12(N) S12(N) M14(N) S14(N) M16(N) S16(N)
1 1 0 2 1 1 0 2 12 3 1 4 2 4 2 5 33 4 2 5 3 5 3 6 44 6 3 7 4 8 5 9 65 5 3 7 5 7 5 9 76 11 7 13 9 15 11 17 137 7 5 9 7 9 7 11 98 11 7 13 9 15 11 17 139 11 7 13 9 15 11 17 1310 15 11 19 15 21 17 25 2111 10 8 12 10 14 12 16 1412 21 15 25 19 29 23 33 2713 11 9 15 13 15 13 19 1714 20 16 24 20 28 24 32 2815 20 16 24 20 28 24 32 2816 21 15 25 19 29 23 33 2717 14 12 18 16 20 18 24 2218 31 23 37 29 43 35 49 4119 16 14 20 18 22 20 26 2420 30 24 36 30 42 36 48 4221 26 22 32 28 36 32 42 3822 29 25 35 31 41 37 47 4323 19 17 23 21 27 25 31 2924 40 32 48 40 56 48 64 5625 25 19 31 25 35 29 41 3526 33 29 41 37 47 43 55 5127 30 24 36 30 42 36 48 4228 39 33 47 41 55 49 63 5729 23 21 29 27 33 31 39 3730 58 50 70 62 82 74 94 8631 25 23 31 29 35 33 41 3932 40 32 48 40 56 48 64 5633 38 34 46 42 54 50 62 5834 42 38 52 48 60 56 70 6635 38 34 46 42 54 50 62 5836 60 48 72 60 84 72 96 8437 29 27 37 35 41 39 49 4738 47 43 57 53 67 63 77 7339 44 40 54 50 62 58 72 6840 58 50 70 62 82 74 94 86
Chapter 6
Hecke operators
Recall the conjecture of Ramanujan that Ļ(mn) = Ļ(m)Ļ(n) form,n relatively prime. Whilethis specific result was shown by Mordell, it was Hecke who developed a general theory todetermine which modular forms have multiplicative Fourier coefficients (i.e., amn = amanwhenever gcd(m,n) = 1ānote this imposes no condition on a0). Note that if the Fouriercoefficients an of some modular form are multiplicative, then they are determined by a0 andthe prime power Fourier coefficients apr .
The following elementary exercise shows the Eisenstein series series Ek essentially havemultiplicative Fourier coefficients.
Exercise 6.0.1. If gcd(m,n) = 1, show Ļk(m)Ļk(n) = Ļk(mn).
Namely, if we write
Ek(z) = 1ā 2k
Bk
ānā„1
Ļkā1(n)qn =ā
anqn,
the above exercise shows
aman =
(2k
Bk
)2
Ļkā1(m)Ļkā1(n) =
(2k
Bk
)2
Ļkā1(mn) = ā 2k
Bkamn = a1amn.
Or, if we simply consider the renormalized Eisenstein series,
Eāk(z) = āBk2k
(z)Ek(z) = āBk2k
+ānā„1
Ļkā1(n)qn, (6.0.1)
we see Eāk has multiplicative Fourier coefficients. Now we can say that by āessentially mul-tiplicativeā Fourier coefficients, we mean some nonzero multiple of our modular form hasmultiplicative Fourier coefficients.
Heckeās idea for determining if a modular form has (essentially) multiplicative Fouriercoefficients, very roughly, is the following. Consider an operator
Up
(āanq
n)
=ā
apnqn. (6.0.2)
95
CHAPTER 6. HECKE OPERATORS 96
If this operator preserves S12(1), then
Upā = Ī»pā
for some Ī»p ā C, since S12(1) = ćāć. On the other hand,
Upā = Up(q ā 24q2 + 252q3 ā Ā· Ā· Ā· ) = Ļ(p)q ā 24Ļ(p)q2 + Ā· Ā· Ā·
so Ī»p = Ļ(p) and therefore we would have Ļ(pn) = Ļ(p)Ļ(n) for all n.However, Ļ is not totally multiplicative, i.e., Ļ(p2) 6= Ļ(p)2, so Up does not preserve
ā. Instead, since Ramanujan predicted Ļ(p2) = Ļ(p)2 ā p11, one would expect Ļ(pn) =Ļ(p)Ļ(n) ā p11Ļ(n/p) for p|n. Thus one can guess the correct operator (for weight 12)should be
Tp
(āanq
n)
=āp-n
apnqn +
āp|n
(apn + p11an/p
)qn. (6.0.3)
Then one just needs to show Tp preserves S12(1).
6.1 Hecke operators for Ī0(N)
While we could define the Hecke operators directly by their action on Fourier expansionsalong the lines of (6.0.3), it will be helpful (and more motivated) to think of them as actingon lattices.
First letās consider the case of PSL2(Z). Recall PSL2(Z)\H parameterizes the space oflattices up to homothety (equivalence by CĆ). Hence a weak modular form of weight 0 issimply a meromorphic function on equivalence classes of lattices.
What about weight k? Given f ā Mk(1), consider the lattice Ī = ć1, Ļć with Ļ ā H.Put F (Ī) = f(Ļ). Since ćĻ1, Ļ2ć = ćĻā²1, Ļā²2ć if and only if(
a bc d
)(Ļ1
Ļ2
)=
(Ļā²1Ļā²2
)one easily sees that ć1, Ļć = ć1, Ļ ā²ć (with Ļ ā² ā H also) if and only if Ļ ā² = Ļ + d for somed ā Z. Thus f having period 1 implies F (Ī) is well-defined on lattices of the form ć1, Ļć.
Now consider an arbitrary lattice Ī = ćĻ1, Ļ2ć. We can write Ī = Ļ1ć1, Ļ2/Ļ1ć = Ī»ć1, Ļćwhere Ī» = Ļ1 and Ļ = Ļ2/Ļ1. Thus, to see how to extend F to a function on all lattices itsuffices to determine how F should behave under multiplying a lattice Ī = ć1, Ļć by a scalarĪ». Itās reasonable to ask that a scalar Ī» should just transform F by some factor, i.e.,
F (Ī»Ī) = c(Ī»)F (Ī).
The only condition we have is that this should be compatible with our definition of F (ć1, Ļć) =f(Ļ). In other words, if Ī»ć1, Ļ ā²ć = ć1, Ļć, we need to ensure F (Ī»ć1, Ļ ā²ć) = F (ć1, Ļć).
This boils down to the case where Ī» = Ļ and Ļ ā² = ā 1Ļ . Here we require
c(Ļ)f
(ā1
Ļ
)= F (Ļć1, 1
Ļć) = F (ć1, Ļć) = f(Ļ).
CHAPTER 6. HECKE OPERATORS 97
But now the modularity of f implies c(Ļ) = 1Ļk. In other words, the weight k modular form
f can be viewed as a homogeneous function F of degree āk on the space of lattices of C,i.e., a function F such that
F (Ī»Ī) =1
Ī»kF (Ī)
for Ī» ā CĆ. Call the space of such F by L(k).Conversely, if F is a function of lattices such that F (Ī»Ī) = Ī»ākF (Ī) with k ā„ 0 even,
then one can check f(z) = F (ć1, zć) āMk(1). Hence there is a bijection betweenMk(1) andL(k).
Then for PSL2(Z) we can define the n-th Hecke operator in terms of F ā L(k):
Tn(F (Ī)) = F
āĪā²nāĪ
Īā²
,
where the sum is over all Īā² ā Ī of index n. In other words Tn averages F over all sublatticesof index n. It is clear that TnF is still a function of lattices, and it is homogeneous of degreeāk. Clearly T1F = F .
Thus the correspondence between modular forms f ā Mk(1) and F ā L(k) induces anaction of the Hecke operators
Tn : Mk(1)āMk(1).
Letās see how Tn translates to an operator on Mk(1).First observe that the sublattices Īā² of Ī = ć1, Ļć of index n are precisely Īā² = ć1, Ļ ā²ć,
where (Ļ ā²
1
)=
(a bc d
)(Ļ1
)for
(a bc d
)āMn(Z)
whereMn(Z) denotes the 2Ć 2 integer matrices of determinant n. For such a
Īā² = ćaĻ + b, cĻ + dć = (cĻ + d)ćaĻ + b
cĻ + d, 1ć,
we have
F (Īā²) = F
((cĻ + d)ćaĻ + b
cĻ + d, 1ć)
= (cĻ + d)ākf
(aĻ + b
cĻ + d
)= f |Āµ,k(Ļ)
where we extend the slash operator
f |Āµ,k(Ļ) = (cĻ + d)ākf
(aĻ + b
cĻ + d
)for Āµ =
(a bc d
)āMn(Z).
(Note in general for Āµ =
(a bc d
)ā GL+
2 (Q) (the superscript + means positive determinant)
one typically defines the slash operator with a factor of det(Āµ)k/2, so our notation departsfrom the standard here, but seems the most reasonable for our present purpose.)
CHAPTER 6. HECKE OPERATORS 98
Given Āµ1, Āµ2 ā Mn(Z), one can check Āµ1ć1, Ļć = Āµ2ć1, Ļć if and only if Āµ2 = Ī³Āµ1 forsome Ī³ ā SL2(Z). Hence on the level of modular forms, we have
Tnf =ā
ĀµāSL2(Z)\Mn(Z)
f |Āµ,k.
For our arithmetic purposes, it is better to introduce a normalization factor of n1āk in theHecke operator, which we will do below. The above discussion was just motivation for howto define Hecke operators for PSL2(Z).
While we will not go through the details (cf. [Kob93]), a similar argument can be madefor the modular groups Ī0(N) (as well as for Ī1(N) and Ī(N)). The idea is that Ī0(N)\Hparameterizes pairs (Ī, C) where Ī is a lattice in C and C is a cyclic subgroup of Ī of orderN . Then one can identify modular forms f āMk(N) with homogeneous functions of degreeāk on pairs (Ī, C) and define Hecke operators Tn similarly, though some care must be takenwhen gcd(n,N) > 1.
Let
Mn,N (Z) =
(a bc d
)āMn(Z)|c ā” 0 mod N
/ Ā±I .
Definition 6.1.1. Suppose gcd(n,N) = 1. We define the n-th Hecke operator Tn onMk(N)by
Tnf = nkā1ā
ĀµāĪ0(N)\Mn,N (Z)
f |Āµ,k.
We will see the normalization factor nkā1 will make the action on Fourier coefficientsnice.
Observe that for f āMk(N), Āµ āMn,N (Z) and Ī³ ā Ī0(N) we have
f |Ī³Āµ,k = (f |Ī³,k)|Āµ,k = f |Āµ,k,
so the above sum over coset representatives is well defined. It is also easy to see that Tn islinear. Now we show it actually is an operator on Mk(N), as well as Sk(N).
Unless otherwise specified, we assume gcd(n,N) = 1 in what follows.
Proposition 6.1.2. We have Tn : Mk(N)āMk(N) and Tn : Sk(N)ā Sk(N).
Proof. Let Ī³ ā Ī0(N). Then
(Tnf)|Ī³,k = nkā1
āĀµāĪ0(N)\Mn,N (Z)
f |Āµ,k
|Ī³,k = nkā1ā
ĀµāĪ0(N)\Mn,N (Z)
f |ĀµĪ³,k
= nkā1ā
ĀµāĪ0(N)\Mn,N (Z)
f |Āµ,k = Tnf
since right multiplication by Ī³ ā Ī0(N) preservesMn,N (Z), and therefore simply permutesthe cosets Ī0(N)\Mn,N (Z).
Note that each f |Āµ,k is holomorphic on H, therefore Tnf is. To see Tnf is holomor-phic at each cusp, let Ļ ā PSL2(Z) and consider f |Āµ,k|Ļ,k = f |ĀµĻ,k = f |Āµā²,k where Āµā² =
CHAPTER 6. HECKE OPERATORS 99
ĀµĻ =
(a bc d
)ā Mn(Z) (but not necessarily Mn,N (Z)). As Im(z) ā ā, f |Āµā²,k(z) ā
limIm(z)āā1
(cz)kf(ac
)which has a finite limit because f is holomorphic at the cusp a
c . Thuseach f |Āµā² is holomorphic at iā, and Tnf is holomorphic at the cusps.
The same argument we used for holomorphy at the cusps shows that, if f vanishes atthe cusps, so does Tnf .
Lemma 6.1.3. A set of coset representatives for Ī0(N)\Mn,N (Z) is(a b0 d
): a, d > 0, ad = n, 0 ā¤ b < d
.
Proof. Take(a bcN d
)ā Mn,N (Z) representing some coset in Ī0(N)\Mn,N (Z). Since
gcd(n,N) = 1, we know gcd(a, cN) = 1. Then there exist x, y ā Z such that(x ycN āa
)ā
Ī0(N). Now observe(x ycN āa
)(a bcN d
)=
(ax+ cyN bx+ dy
0 bcN ā ad
),
which means we may assume c = 0 for our coset representative.
Further, given(a b0 d
)āMn,N (Z), we may replace it by the coset representative
(1 x0 1
)(a b0 d
)=
(a b+ dx0 d
)where we choose x so that 0 ā¤ b + dx < d, i.e., we may just assume 0 ā¤ b < d. Sincen = ad > 0, a and d have the same sign, we may multiply by āI if necessary to assumea, d > 0. This shows any coset has a representative of the desired form.
Now consider(a b0 d
)and
(aā² bā²
0 dā²
)such that ad = aā²dā² = n, 0 ā¤ b < d and 0 ā¤ bā² < dā².
Assume they represent the same coset, i.e.,(aā² bā²
0 dā²
)=
(r stN u
)(a b0 d
)=
(ra rb+ sdtNa tNb+ ud
)
for some(r stN u
)ā Ī0(N). First we see this means t = 0, which means r = u = 1 (up to
Ā±1). This means aā² = a dā² = d, and bā² = b + sd. However 0 ā¤ bā² < dā² = d implies we need
s = 1, i.e.,(r stN u
)= I, and therefore any two distinct matrices of the prescribed form
represent distinct cosets.
Theorem 6.1.4. Let f(z) =āanq
n āMk(N) and suppose gcd(m,N) = 1. Then
(Tmf)(z) =ā
bnqn
CHAPTER 6. HECKE OPERATORS 100
wherebn =
ād| gcd(m,n)
dkā1amn/d2 .
In particular, if m = p is prime, then
bn =
apn p - napn + pkā1an/p p|n
so(Tpf)(z) =
ān6ā”0 mod p
apnqn +
ānā”0 mod p
(apn + pkā1an/p
)qn.
Proof. By the previous lemma, we have
(Tmf)(z) = mkā1ā
f |a b0 d
,kwhere the sum runs over a, b, d ā Zā„0 such that ad = m and 0 ā¤ b < d. Note
f |a b0 d
,k(z) =1
dkf
(az + b
d
)=
1
dk
āān=0
ane2Ļia
dzne2Ļi b
dn =
1
dk
āān=0
Ī¶bnd anqadn.
For fixed a, d we will be considering the sum
ā0ā¤b<d
1
dkf
(az + b
d
)=
1
dk
āān=0
ā0ā¤b<d
Ī¶bnd
anqadn.
Note that the inner sumā
0ā¤b<d Ī¶bnd will just be a sum over all d-th roots of unity, and
therefore vanish, unless d|n, in which case it is just d. Thus
ā0ā¤b<d
1
dkf
(az + b
d
)=
1
dkā1
āān=0
adnqan.
Hence
(Tmf)(z) = mkā1āad=m
ā0ā¤b<d
1
dkf
(az + b
d
)= mkā1
āad=m
(1
dkā1
āān=0
adnqan
)
=āān=0
āa|m
(akā1amn/aq
an)
=āān=0
ād|m
dkā1amn/d2qn.
Corollary 6.1.5. The Ramanujan tau function satisfies Ļ(mn) = Ļ(m)Ļ(n) whenevergcd(m,n) = 1 and Ļ(pr) = Ļ(p)Ļ(prā1)ā p11Ļ(prā2).
CHAPTER 6. HECKE OPERATORS 101
Proof. Consider
(Tpā)(z) =āp-n
Ļ(pn)qn +āp|n
(Ļ(pn) + p11Ļ(n/p)
)qn.
On the other hand, Tp acts on S12(1) = Cā so
(Tpā)(z) = Ī»ā(z)
for some Ī» ā CĆ. The 1st Fourier coefficients of ā and Tpā are just 1 and Ļ(p), so Ī» = Ļ(p).Comparing the n-th Fourier coefficients, we see
Ļ(pn) = Ī»Ļ(n) = Ļ(p)Ļ(n) p - nĻ(pn) + p11Ļ(n/p) = Ī»Ļ(n) = Ļ(p)Ļ(n) p|n.
The former equation proves the multiplicativity when m = p and the latter equation provesthe recursion relation for Ļ(pr).
To obtain the general multiplicativity law Ļ(mn) = Ļ(m,n) for gcd(m,n) = 1, we simplyuse the same argument as above with Tm instead of Tp.
The above argument applies in a more general setting.
Exercise 6.1.6. Suppose f(z) =āanq
n ā Sk(N) (resp. Mk(N)) and dimSk(N) = 1 (resp.dimMk(N) = 1). Show
(i) If gcd(m,N) = gcd(n,N) = gcd(m,n) = 1, then a1amn = aman. In particular,if f(z) 6= 0 we must have a1 6= 0, and if we normalize f such that a1 = 1, the Fouriercoefficients are multiplicative.
(ii) If a1 = 1 and p - N , then apn = apapnā1 ā pkā1apnā2 for n ā„ 2.
What this means is we can use Hecke operators to compute values of Fourier coefficientsfrom just knowing what the Tpās are.
Exercise 6.1.7. Using the fact that dimS16(1) = 1, use the previous exercise to help com-pute the first 10 Fourier coefficients of E4ā.
Now you might ask, for what modular forms f are the Fourier coefficients multiplicativein the sense a1amn = aman for gcd(m,n) = 1? By Exercise 6.0.1, we know this is true forthe Eisenstein series Ek, and now we have seen it is true for ā, and by Exercise 6.1.6, anyEkā where k = 4, 6, 8, 10, 14.
In general, if you have two power series (or Fourier expansions) f and g with multiplica-tive coefficients, the formal product fg will not have multiplicative Fourier coefficients, sothere is no reason to expect all modular formsāor even products of modular forms withmultiplicative Fourier coefficientsāto have multiplicative coefficients. For instance, startingin weight 24 for the full modular group, we have a space of cusp forms of dimension greaterthan 1, namely S24(1) = ćā2, E12āć. Since
ā(z) = q ā 24q2 + 252q3 ā 1472q4 + 4830q4 ā 6048q6 + Ā· Ā· Ā·
CHAPTER 6. HECKE OPERATORS 102
we computeā2(z) = q2 ā 48q3 + 1080q4 ā 2944q5 + 143820q6 + Ā· Ā· Ā· .
Right away, we see two things: 1) the Fourier coefficients of q2 and q3 do not multiply togive the Fourier coefficient of q6, and 2) a1 = 0 so the multiplicative property a1an = amanfor gcd(m,n) = 1 would mean an = 0 whenever n is divisible by two different primes, whichis obviously not the case. (In fact something stronger is true, see Corollary 6.1.11 below.)
One can also check that
E12 = 1 +65520
691
(q + 2049q2 + 177148q3 + 4196353q4 + 48828126q5 + Ā· Ā· Ā·
)so
E12ā = q +65520
691
(2039
2730q2 +
527191
260q3 +
525013709
4095q4 + Ā· Ā· Ā· ā 666536766
65q6 + Ā· Ā· Ā·
)While the first Fourier coefficient is 1 here, again one sees the products of the q2 and q3
coefficients does not give the q6 coefficient (itās not even the right sign!). So one cannothope that the Fourier coefficients of Ekā are multiplicative in general.
You might now ask, are there any elements of S24(1) with multiplicative Fourier coeffi-cients or does this property of having multiplicative Fourier coefficients only occur in smallweight? We will see there always are cusp forms with multiplicative Fourier coefficients.First, letās make an observation in the weight 12 case: while 691
65520E12 and ā have multi-plicative Fourier coefficients, one canāt expect this for any modular form in M12(1). Namelyif f and g have multiplicative Fourier coefficients, f + g will generally not. So while mostforms in M12(1) do not have multiplicative Fourier coefficients, M12(1) is generated by twoāniceā forms, 691
65520E12 and ā, which do.Using the theory of Hecke operators, we will show that there exists such a āniceā basis
for Mk(N) and Sk(N). The basic idea is to show that the Hecke operators Tm and Tncommute for gcd(m,n) = 1 (and m,n prime to N). Then one defines an inner productćĀ·, Ā·ć, called the Petersson inner product, on Mk(N) and shows each Tn is Hermitian withrespect to ćĀ·, Ā·ć, i.e., ćTnf, gć = ćf, Tngć. Then a well known theorem in linear algebra saysthat a commuting family of operators which are Hermitian with respect to ćĀ·, Ā·ć can besimultaneously diagonalized by some orthonormal basis with respect f1, . . . , fr to ćĀ·, Ā·ć Inother words, each fi is an eigenform for Tn, i.e., Tnfi = Ī»fi for some Ī» ā C.
Definition 6.1.8. Let f ā Mk(N) be nonzero. We say f is a (Hecke) eigenform if,for each n relatively prime to N , there exists a (Hecke) eigenvalue Ī»n ā C such thatTnf = Ī»nf .
We say f is a normalized eigenform if the first nonzero Fourier coefficient is 1.
The arguments from Corollary 6.1.5 and Exercise 6.1.6 apply to show any Hecke eigen-form has multiplicative Fourier coefficients. Precisely, work out
*Exercise 6.1.9. Let f =āanq
n ā Mk(N) be a Hecke eigenform. Suppose gcd(m,n) =gcd(m,N) = gcd(n,N) = 1. Then a1amn = aman.
CHAPTER 6. HECKE OPERATORS 103
In other words, the basis f1, . . . , fr which diagonalizes the Tnās asserted above is a basisof Hecke eigenforms, and therefore a basis of Mk(N) with multiplicative Fourier coefficientsin the sense of the previous exercise.
We remark that there is a technicality here we have ignored, namely that the Peterssoninner product ćf, gć is only well defined when at least f or g is a cusp form. So the aboveargument will only technically show that Sk(N) has a basis of eigenforms but, at least whenN = 1, we will see how this implies one can extend the basis of eigenforms of Sk(N) to abasis of eigenforms for Mk(N).
We remark that for a eigencusp form (a Hecke eigenform which is a cusp form) the firstnonzero Fourier coefficient should be a1 in order for the Hecke operators to not force everyFourier coefficient to be zero.
Lemma 6.1.10. Let f =āanq
n āMk(1) be a Hecke eigenform and k > 0. Then a1 6= 0.
Proof. Write Tmf = Ī»mf =ābnq
n. Then b1 = am. Consequently
am = Ī»ma1.
In other words, if a1 = 0, then am = 0 for all n ā„ 0. Consequently f(z) = a0 āM0(1).
Corollary 6.1.11. Let f āMk(1). Then ā2f is not a Hecke eigenform.
In particular, this shows ā2 is not an eigenform.
Now letās get to the first step in showing the existence of Hecke eigenforms, which isshowing the Hecke operators are commutative.
Lemma 6.1.12. Suppose gcd(m,N) = gcd(n,N) = 1 and gcd(m,n) = 1. Then the Heckeoperators on Mk(N) satisfy TmTn = Tmn.
Proof. Take f(z) =āarq
r ā Mk(N). Write Tnf(z) =ābrq
r and Tm(Tnf)(z) =ācrq
r.Then
br =ā
d| gcd(n,r)
dkā1anr/d2
socr =
āe| gcd(m,r)
ekā1bmr/e2 =ā
e| gcd(m,r)
ekā1ā
d| gcd(n,mr/e2)
dkā1amnr/d2e2 .
Since gcd(m,n) = 1, d runs over the divisors of gcd(n, r/e), and therefore dā² = de runs overthe divisors of mn and
cr =ā
dā²| gcd(mn,r)
(dā²)kā1amnr/(dā²)2
so TmTn = Tmn.
Lemma 6.1.13. Suppose p - N . The Hecke operators on Mk(N) satisfy
TprTps =ā
d| gcd(pr,ps)
dkā1Tpr+s/d2 . (6.1.1)
CHAPTER 6. HECKE OPERATORS 104
Proof. The proof here is modeled on the one in [Apo90, Theorem 6.13], but there appear tome to be errors in the r = 1 case of loc. cit. I believe I have corrected them.
We may as well assume r ā¤ s.First letās show the r = 1 case, which just says
TpTps = Tps+1 + pkā1Tpsā1 .
For f āMk(n),
Tpsf(z) = ps(kā1)ā
0ā¤iā¤s
ā0ā¤b<pi
f |psāi b0 pi
,k= ps(kā1)
ā0ā¤iā¤s
pāikā
0ā¤b<pif
(psāiz + b
pi
). (6.1.2)
In particular
Tpg(z) = p(kā1)g(pz) + pā1ā
0ā¤bā²<pg
(z + bā²
p
)so
TpTpsf(z) = p(s+1)(kā1)ā
0ā¤iā¤spāik
ā0ā¤b<pi
f
(ps+1āiz + b
pi
)(6.1.3)
+ pā1ps(kā1)ā
0ā¤bā²<p
ā0ā¤iā¤s
pāikā
0ā¤b<pif
(psāi(z + bā²) + pb
pi+1
). (6.1.4)
Note the i = s term from (6.1.4) is
pā1āsā
0ā¤bā²<p
ā0ā¤b<ps
f
(z + bā² + pb
ps+1
)= pā1ās
ā0ā¤b<ps+1
f
(z + b
ps+1
).
Thus adding the i = s term from (6.1.4) to (6.1.3) gives (6.1.2) with s replaced by s + 1,i.e., they sum to Tps+1f(z).
Now the remaining terms, i.e., the i < s terms from (6.1.4), sum to
pā1ps(kā1)ā
0ā¤bā²<p
ā0ā¤iā¤sā1
pāikā
0ā¤b<pif
(psā1āiz + b+ psā1āibā²
pi
)
If i ā¤ sā12 , then psā1āibā²
piā Z, so by periodicity of f there is no dependence on bā² and the
contribution for such a fixed i is
ps(kā1)pāikā
0ā¤b<pif
(psā1āiz + b
pi
). (6.1.5)
In fact, for any i, b + psā1āibā² mod pi runs over the set of residue classes mod pi exactly ptimes, so the contribution for any i is given again by (6.1.5). This proves the r = 1 case.
CHAPTER 6. HECKE OPERATORS 105
Now we suppose (6.1.1) is true up to some fixed r. By the r = 1 case we have
Tpr+1Tps = (TpTpr)Tps ā pkā1Tprā1Tps .
By the inductive assumption that (6.1.1) is true for r, we also have
Tp (TprTps) =ā
0ā¤iā¤rpi(kā1)TpTpr+sā2i .
Comparing these, and making another use of the r = 1 case, gives
Tpr+1Tps =ā
0ā¤iā¤rpi(kā1)TpTpr+sā2i ā pkā1Tprā1Tps
=ā
0ā¤iā¤r
(pi(kā1)Tpr+s+1ā2i + p(i+1)(kā1)Tpr+sā1ā2i
)ā pkā1Tprā1Tps .
Expanding out the last term using the r ā 1 case of (6.1.1) gives
Tpr+1Tps =ā
0ā¤iā¤r+1
pi(kā1)Tpr+s+1ā2i .
The previous two lemmas combine to give the following.
Corollary 6.1.14. The Hecke operators Tngcd(n,N)=1 on Mk(N) commute.
The former lemma also shows knowing what the prime power Hecke operators do tellsyou what all Hecke operators do, and the latter lemma shows that the prime power Heckeoperators are determined by the prime Hecke operators. In particular, we see
Corollary 6.1.15. If f ā Mk(N) is an eigenform for each Tp with p - N prime, i.e.,Tpf = Ī»pf for some Ī»p ā C, then f is a Hecke eigenform.
Proof. The r = 1 case of (6.1.1) says, for j = sā 1 ā„ 2,
Tpj = TpTpjā1 ā pkā1Tpjā2 .
If Tpif = Ī»pif for some Ī»pi whenever i < j, then we see
Tpjf = Tp(Ī»pjā1f)ā pkā1Ī»pjā2f = Ī»pjf
whereĪ»pj =
(Ī»pĪ»pjā1 ā pkā1Ī»pjā2
). (6.1.6)
Hence by induction, we see if Tpf = Ī»pf then Tpjf = Ī»pjf for some Ī»pjāprecisely, with Ī»jpsatisfying the recursion in (6.1.6).
Now let n be relatively prime to N , and write n = pe11 . . . perr . By Lemma 6.1.12, we have
Tnf =
(rāi=1
Tpeii
)f =
rāi=1
Ī»peiif.
CHAPTER 6. HECKE OPERATORS 106
6.2 Petersson inner product
We know thatMk(Ī) is a finite dimensional complex vector space, and therefore can be madeinto a Hilbert space, i.e., an inner product space. The standard way to make a functionspace into a Hilbert space is with the L2 inner product. Namely, if f, g ā L2(X) for somespace X, then the inner product is given by
ćf, gć =
ā«Xf(x)g(x) dx.
While modular forms are not L2 on H, they are essentially L2 on relevant Riemannsurface X = Ī\H. We say essentially here, because they are of course not actually functionson Ī\H due to the automorphy transformation factor except in the uninteresting (constant)case of k = 0.
Letās see what happens when we naively try to make f ā Mk(Ī) into a function onX = Ī\H. For simplicity, consider Ī = PSL2(Z) = ćS, T ć. We already have f(Tz) =f(z + 1) = f(z) so f descends to a function on the infinite cylinder ćT ć\H. We would liketo modify f to a function F = Ff which still satisfies F (z + 1) = F (z) but also satisfiesF (Sz) = F (ā1/z) = F (z).
Since the invariance under T is a transformation rule in x = Re(z) which we donāt wantto mess up, we might just try to impose invariance under S by modifying y = Im(z). Sincef(S Ā· iy) = f(ā1/iy) = f(i/y) = (iy)kf(iy), it makes sense to consider the function (whichwill be called a Maass form)
F (z) := yk/2f(z) = Im(z)k/2f(z). (6.2.1)
Then we still have F (z + 1) = F (z) for all z and now
F (ā1/iy) = F (i/y) = yāk/2f(i/y) = Ā±ikyk/2f(iy) = (ā1)k/2F (iy),
so at least F (ā1/iy) = F (iy) when k ā” 0 mod 4. (One could let F (z) = (iy)k/2f(z) sothat we actually have F (ā1/iy) = F (iy), but one typically considers F as defined in (6.2.1),since the sign (ā1)k/2 will not matter in the end.) It doesnāt quite satisfy F (ā1/z) = F (z)for all z but it will be close, and we can say how close.
In general, for any f ā Mk(Ī), we can associate to f the Maass form F defined by(6.2.1). Noting that
|j(Ī³, z)|2 =Im(z)
Im(Ī³z), Ī³ ā SL2(R) (6.2.2)
(cf. (3.2.1)), we see
F (Ī³z) = Im(Ī³z)k/2f(Ī³z) = Im(Ī³z)k/2j(Ī³, z)kf(z) = j0(Ī³, z)kIm(z)k/2f(z) = j0(Ī³, z)kF (z)
for Ī³ ā Ī, where
j0(Ī³, z) =j(Ī³, z)
|j(Ī³, z)|.
CHAPTER 6. HECKE OPERATORS 107
In other words, the Maass form F (not holomorphic) is not quite invariant under Ī, but it isup to a factor j0(Ī³, z) of absolute value 1. This is good enough for our purposes.1 Namely, ifalso g āMk(Ī) and G(z) = Im(z)k/2g(z) is the associated Maass form, it means the product
F (Ī³z)G(Ī³z) = j0(Ī³, z)kj0(Ī³, z)kF (z)G(z) = F (z)G(z)
is invariant under Ī.Consequently, we can define the inner product
ćf, gć =
ā«Ī\H
F (z)G(z) dĻ,
where dĻ is an area measure on the Riemann surface X = Ī\H. Now in practice, you wantto express this as an integral on a fundamental domain for Ī inside H, so we need to knowwhat the hyperbolic measure should be.
Lemma 6.2.1. The measure dx dyy2
= dz dzIm(z)2
is an invariant measure on the hyperbolic planeH, i.e., for Ī³ ā PSL2(R) we haveā«
Hf(Ī³z)
dx dy
y2=
ā«Hf(z)
dx dy
y2,
for any integrable (w.r.t. to this measure) function f on H.
Proof. Write Ī³ =
(a bc d
), and let
w = Ī³z =(ad+ bc)x+ bd+ acy2 + iy
c2x2 + c2y2 + d2= u+ iv.
Notedw
dz=
1
|cz + d|2=āu
āx+ i
āv
āx=āv
āyā iāu
āy.
Then the Jacobian determiantā£ā£ā£ā£āxāu āyāu
āxāv
āyāv
ā£ā£ā£ā£ =
ā£ā£ā£ā£ā£āuāx āuāy
āvāx
āvāy
ā£ā£ā£ā£ā£ā1
=
ā£ā£ā£ā£|cz + d|ā2 00 |cz + d|ā2
ā£ā£ā£ā£ā1
= |j(Ī³, z)|4.
Since |j(Ī³, z)|2 = Im(z)Im(Ī³z) = y
v , we haveā«Hf(Ī³z)
dx dy
y2=
ā«Hf(w)|j(Ī³, z)|4 du dv
y2=
ā«Hf(w)
du dv
v2.
1We remark that one can make f invariant under Ī at the expense of working on a larger space thanH. Namely, one has the surjective map PSL2(R) ā H given by Ī³ 7ā Ī³ Ā· i. Since SO(2) stabilizes i, onecan view PSL2(R)/SO(2) ' H. Thus one can lift f to a function on PSL2(R) by f(Ī³) = f(Ī³ Ā· i) and definethe automorphic form Ļf (Ī³) = j(Ī³, i)k/2f(Ī³). This makes Ļf invariant under Ī so it is a function onĪ\PSL2(R). Hence, viewed as functions on PSL2(R), the passage from modular forms to automorphic formstrades right SO(2) invariance for left Ī invariance.
CHAPTER 6. HECKE OPERATORS 108
Definition 6.2.2. Let f, g āMk(Ī). The Petersson inner product of f with g is definedto be
ćf, gć :=1
[PSL2(Z) : Ī]
ā«Ī\H
ykf(z)g(z)dx dy
y2(6.2.3)
whenever the integral converges.
The above discussion shows that the integrand is invariant under Ī, so the definingintegral makes sense, provided it converges. Consequently, we can also write this integral as
ćf, gć :=1
[PSL2(Z) : Ī]
ā«Fykf(z)g(z)
dx dy
y2(6.2.4)
where F is any fundamental domain for Ī.The normalization factor [PSL2(Z) : Ī]ā1 roughly cancels out the volume of Ī\H, defined
to bevol(Ī\H) =
ā«Ī\H
dx dy
y2.
Lemma 6.2.3. Let Ī be a congruence subgroup in PSL2(Z). Then vol(Ī\H) is finite.
Proof. Let F ā² be a fundamental domain for Ī. By Lemma 3.4.9,
F ā² =[PSL2(Z):Ī]ā
i=1
Ī±iF ,
where F is the standard fundamental domain for PSL2(Z) and Ī±i ā PSL2(Z). Since thearea measure is invariant under the action of PSL2(R),
vol(F ā²) =ā
vol(Ī±iF) = [PSL2(Z) : Ī]vol(F),
so it suffices to show F has finite volume. Note
vol(F) =
ā« 1/2
ā1/2
ā« āā
1āx2
dx dy
y2ā¤ā« 1/2
ā1/2
ā« ā1/2
dx dy
y2= 2
ā« 1/2
ā1/2dx = 2.
The proof is of course valid not just for congruence subgroups, but any finite indexsubgroup of PSL2(Z), and consequently any subgroup of PSL2(R) commensurable withPSL2(Z).
Exercise 6.2.4. Compute vol(X0(1)) = vol(PSL2(Z)\H).
Note the above lemma implies the Petersson inner product converges for (the admittedlynot very interesting case of) f, g ā M0(Ī) = C. However for general weight k we do notalways have convergence. For instance, in the standard fundamental domain of PSL2(Z),the yk can grow too fast for the Petersson inner product to converge unless f(z)g(z)ā 0 asy ā ā. Similarly, if we tend to a cusp in Q, f(z)g(z) may grow too fast (cf. Section 4.3)for the inner product to converge. This suggests that when k > 0 we need f or g to be acusp form for ćf, gć to converge.
CHAPTER 6. HECKE OPERATORS 109
Proposition 6.2.5. Let f, g ā Mk(Ī). The Petersson inner product ćf, gć converges ifeither f ā Sk(Ī) or g ā Sk(Ī).
Proof. We assume f ā Sk(Ī). The case g ā Sk(Ī) is similar.Let F be a fundamental domain for Ī\H and F be its closure in H. Write F = KāŖ
āUi,
where this is a finite union of subsets with K compact in H and each Ui ā H containingexactly one cusp, say zi. Since ykā2f(z)g(z) is bounded on K it suffices to show this isbounded on each Ui.
As in the proof of Lemma 4.5.8, we know f(z)ā 0 exponentially fast as z ā zi, whereasykā2g(z) has at most a finite order pole at zi. Thus ykā2f(z)g(z) is bounded on each Ui,and because vol(F) = vol(F) <ā, the Petersson inner product converges.
What this means is that the Petersson inner product defines an inner product (it is clearlysesquilinear, i.e., linear in the first vector and anti, or conjugate, linear in the second) onthe space of cusp forms Sk(N). This inner product almost extends to an inner product onMk(N), but ćf, gć need not converge when both f and g are not cusp forms.
While we have failed to make Mk(N) a Hilbert space, we have at least made Sk(N) one.Most importantly, the inner product behaves symmetrically with respect to the action ofthe Hecke operators.
Lemma 6.2.6. There exists a complete set of representatives Āµi for Ī0(N)\Mn,N (Z) suchthat
nĀµā1
i
is also a complete set of representatives for Ī0(N)\Mn,N (Z).
Proof. First we note that both Ī0(N)\Mn,N (Z) andMn,N (Z)/Ī0(N) (which equals Ļ1(n) =ād|n d by Lemma 6.1.3). This follows because the map Āµ 7ā nĀµā1, which mapsMn,N (Z) to
itself, induces a bijection of the right cosets Ī0(N)\Mn,N (Z) with the left cosetsMn,N (Z)/Ī0(N).Then we can write
Mn,N (Z) =
rāi=1
Ī0(N)Ī±i =
rāi=1
Ī²iĪ0(N),
for some collections, Ī±i and Ī²j inMn,N (Z). It is easy to see there must be a permutationĻ : 1, . . . , r ā 1, . . . , r such that Ī0(N)Ī±i ā© Ī²Ļ(i)Ī0(N) 6= ā for all i (otherwise there isa proper subset of indices i such that Ī0(N)Ī±i =Mn,N (Z)).
Take Āµi ā Ī0(N)Ī±i ā©Ī²Ļ(i)Ī0(N) for i = 1, . . . , r. Then Āµi forms a complete set of rep-resentatives both for Ī0(N)\Mn,N (Z) and forMn,N (Z)/Ī0(N). The latter fact composedwith the map Āµ 7ā nĀµā1 shows that
nĀµā1
i
is also a complete set of representatives for
Ī0(N)\Mn,N (Z).
Note: even for n = p, the representatives given in Lemma 6.1.3, do not satisfy theconditions given in the above lemma. At least some places in the literature are sloppyabout this, and as a consequence have a gap or error in the proof of the following theorembelow (this includes an earlier version of these notes, as I originally copied this mistake fromelsewhere, and I thank Roberto Miatello for pointing this out to meāthe approach of usingthe above lemma can be found in [Miy06]).
CHAPTER 6. HECKE OPERATORS 110
Theorem 6.2.7. The Hecke operators Tn on Sk(N) are hermitian with respect to thePetersson inner product, i.e.,
ćTnf, gć = ćf, Tngć
for f, g ā Sk(N).
Proof. Fix a fundamental domain F for Ī0(N). For simplicity of notation, we extend ćf, gćby (6.2.4) to any functions on H for which the integral converges.
First we claim that for any Ļ =
(a bc d
)ā PSL2(R), we have
ćf |Ļ,k , gć = det(Ļ)ākćf, g|Ļā1,kć,
where we extend |Ļ,k to Ļ ā GL2(R)+ = Ī± ā GL2(R) : det(Ī±) > 0 by (4.3.1). (The con-dition of positive determinant ensures Ļ maps H to H. We remark that if we had followedstandard notation and included a factor of det(Ļ)k/2 in our definition of the slash operator,the determinant factor would not appear in the above identity.)
This follows since
ykf |Ļ,k(z) g(z) =Im(z)k
(cz + d)kf(Ļz)g(z) = det(Ļ)ākIm(w)k(cz + d)
kf(w)g(Ļā1w).
where we put w = Ļz and used the identity
|j(Ļ, z)|2 = det(Ļ)Im(z)
Im(Ļz), Ļ ā GL2(R)+,
which is a simple generalization of (6.2.2). Then observing (cz+d)k = j(Ļ, z)k = j(Ļā1, w)āk
shows this equalsdet(Ļ)ākIm(w)kf(w)g|Ļā1,k(w),
which implies our claim.Recall
Tnf = nkā1ā
ĀµāĪ0(N)\Mn,N (Z)
f |Āµ,k = nkā1rāi=1
f |Āµi,k,
where Āµ1, . . . , Āµr is a set of representatives for Ī0(N)\Mn,N (Z), which we take to be as inLemma 6.2.6. Now
ćTnf, gć = nkā1āćf |Āµi,k, gć = nā1
āćf, g|Āµā1
i ,kć
Since g|nĻ,k = nākg|Ļ,k, this implies
ćTnf, gć = nkā1āćf, g|nĀµā1
i ,kć = ćf, Tngć,
by our choice of Āµi.
Corollary 6.2.8. The space Sk(N) has a basis consisting of Hecke eigenforms.
CHAPTER 6. HECKE OPERATORS 111
Proof. Since Tn is a family of commuting operators which are hermitian with respect toćĀ·, Ā·ć, a well known theorem in linear algebra tells us that an orthonormal basis with respectto ćĀ·, Ā·ć is a basis of eigenvectors for all Tn.
By Exercise 6.1.9, we know the normalized (a1 = 1) Hecke eigenforms in Sk(N) haveFourier coefficients an which are multiplicative for n relatively prime to the level N . Hencethe cusp forms are generated by forms whose Fourier coefficients are some kind of arithmeticsequences.
There are some more questions we can ask at this point to push our theory further.
ā¢ Does the whole space of modular forms Mk(N) actually have a basis of Hecke eigen-forms?
ā¢ Can we define the Hecke operators Tn on Mk(N) when gcd(n,N) > 1?
ā¢ Can we actually find a basis for Sk(N) (or Mk(N)) whose Fourier coefficients an aremultiplicative for all n?
Here we will just briefly discuss these questions.First, observe that Theorem 6.2.7 is actually valid for f, g ā Mk(N) when ćTnf, gć
converges. One can use this to deduce that Mk(N) does have a basis of eigenforms, and infact we already know it at least one case.
Example 6.2.9. The space Mk(1) has a basis of Hecke eigenforms. To see this, recallMk(1) = CEk ā Sk(1). By Exercise 6.0.1, we know Ek is an eigenform, hence the basis ofeigenforms for Sk(1) can be extended to a basis of eigenforms for Mk(1).
In fact, the Eisenstein series (for Mk(N), or more generally for Mk(Ī)) are orthogonalto the space of cusp forms in the sense that ćE, fć = 0 whenever E is an Eisenstein seriesand f is a cusp form. Often one uses this relation to define Eisenstein series: the space ofEisenstein series for Mk(Ī) will be the elements E ā Mk(Ī) such that ćE, fć = 0 for allf ā Sk(Ī).
Next, one can define Hecke operators Tn on Mk(N) when gcd(n,N) > 1, however adefinition in terms ofMn,N (Z) is somewhat trickier as the analogue Lemma 6.1.3 is not asnice. For simplicity, letās just talk about Tp where p|N , since all the Tnās can be constructedout of just the Tpās. One can define
Tpf =ā
0ā¤j<pf |1 j
0 p
,k,
and if f(z) =āanq
n, one obtains
Tpf(z) =ā
apnqn.
The definition of Hecke operators can be made more uniform (including for arbitrary con-gruence subgroups Ī) by working in terms of double cosets (see, e.g., [DS05] or [Kob93]).
While the Hecke operators Tp for p|N can be defined without much trouble, one does notin general have a basis of eigenforms for Sk(N) for all Tn. Roughly the issue is the following.
CHAPTER 6. HECKE OPERATORS 112
Say N = pq with p, q distinct primes and f ā Sk(q) is a Hecke eigenform. In particular, f isan eigenform for Tp (on Sk(q)) Then automatically f ā Sk(N). However Tp on Sk(N) actsdifferently, so f is no longer an eigenform for Tp.
More generally, if d|N and f(z) ā Sk(N/d), then f(z), f(dz) ā Sk(N) and we thesubspace generated by all such elements the space Sold
k (N) of oldforms. Define the spaceof newforms Snew
k (N) to be its orthogonal complement in Sk(N) (w.r.t. the Petersson innerproduct). Then it is true that Snew
k (N) has a basis of eigenforms for all Tp, as opposedto just for p - N . However, it can happen that oldforms are also eigenform for all Tp,or that oldforms have multiplicative Fourier coefficients but are not eigenforms for all Tp.(Conversely, if f is an eigenform for all Tp, the Fourier coefficients may not be multiplicative,but they essentially areāthey are if a1 = 1.)
The notion of newforms versus oldforms leads to another interesting question:
ā¢ Given some form f ā Sk(N), how can we determine if it comes from a form of smallerlevel or if it is a newform?
We will study all of these issues in Chapter 8.Finally we remark that the theory of Hecke operators for an arbitrary congruence sub-
group Ī is similar. See [Ste07] for how to computationally find a basis of Hecke eigenformsfor a given space Mk(Ī).
Chapter 7
L-functions
One of the main themes in modern number theory, is to associate to various objects (num-ber fields, Dirichlet characters, Galois representations, elliptic curves, modular forms) ananalytic gadget called an L-function. The idea comes from the theory of the Riemann zetafunction, so before we introduce L-functions for modular forms, we recall some basic factsabout the zeta function and Dirichlet L-functions.
NOTE: This chapter is readable, but not as complete as I would like. Specifically, I didnāthave enough time in my mind to say why you should care about L-functions of modularforms, e.g., the relation with elliptic curves and Fermatās last theorem. I hope to somedayflesh out this chapter more.
7.1 Degree 1 L-functions
Letās begin by reviewing the zeta function. Then we will review some facts about DirichletL-functions. Both of these are considered ādegree 1ā L-functions, which will be explainedlater.
7.1.1 The Riemann zeta function
First recall the Riemann zeta function is defined by
Ī¶(s) =ā 1
ns(7.1.1)
for s ā C with Re(s) > 1. Then Ī¶(s) can be (uniquely) meromorphically continued to afunction of the whole complex plane with only a simple pole at s = 1. Euler (who consideredĪ¶(s) for s real) observed
Ī¶(s) =ā 1
ns=āp
1
1ā pās. (7.1.2)
Since each factor on the right converges at s = 1, the fact that Ī¶(s) has a pole at s = 1implies there are infinitely many prime numbers. This is the first hint that the analyticbehaviour of Ī¶(s) can encodes deep arithmetic information.
113
CHAPTER 7. L-FUNCTIONS 114
One important feature of Ī¶(s) is its functional equation relating the values at s to thevalues at 1ā s. Namely, recall the gamma function is defined by
Ī(s) =
ā« ā0
tsā1eāsdt (7.1.3)
for Re(s) > 0. Then Ī(s) has meromorphic continuation to C with simple poles at s =0,ā1,ā2, . . .. Further Ī satisfies the functional equation
Ī(s+ 1) = sĪ(s)
and one easily computes from the definition that Ī(1) = 1. Hence, by induction, thefunctional equation implies Ī(n+ 1) = n! for n ā N.
Then the functional equation for Ī¶(s) can be written
Ī¶(s) = 2sĻsā1 sin(Ļs
2
)Ī(1ā s)Ī¶(1ā s).
One can rewrite this functional equation more simply in terms of the completed zetafunction
Z(s) =s(sā 1)
2Ļs/2Ī(s
2
)Ī¶(s)
asZ(s) = Z(1ā s).
Either way, the functional equation means one can obtain the values for Ī¶(s) from the valuesof Ī¶(1 ā s). In particular, if Re(s) < 0, then Re(1 ā s) > 1 so one can also use (7.1.1) toevaluate Ī¶(s) on the left half-plane Re(s) < 0. The in-between region, 0 ā¤ Re(s) ā¤ 1 iscalled the critical strip and line of symmetry Re(s) = 1
2 is called the critical line.As is now famous, the zeros of the Riemann zeta function are intimately connected with
prime numbers. Specifically, letĻ(x) =
āpkā¤x
log p
be the Chebyshev function, which counts the number of prime powers up to x. Then onecan show the astounding explicit formula
Ļ(x) = xāāĻ
xĻ
Ļā log 2Ļ, (7.1.4)
where Ļ runs over the zeroes of Ī¶(s) (including the ātrivial zeroesā to the left of the thecritical strip). Consequently, if we knew where all the zeroes of Ī¶(s) lay, we would knowexactly where all the prime powers are. Regardless, one can already use (7.1.4) to prove theprime number theorem
Ļ(x) := # p : p ā¤ x ā¼ Ļ(x)
log xā¼ x
log x.
The very deep Riemann hypothesis asserts that all nontrivial zeroes (all zeroes inside thecritical strip) actually lie on the critical line, and this would give an optimal bound on theerror term in the prime number theorem.
CHAPTER 7. L-FUNCTIONS 115
In another direction, special values of Ī¶(s) tell us interesting arithmetic information. Forinstance, Euler showed that for n ā N,
Ī¶(2n) = (ā1)n+1B2n Ā· (2Ļ)2n
2(2n)!. (7.1.5)
One can interpret 1/Ī¶(2n) as the probability that 2n integers chosen at random are relativelyprime (this is a simple consequence of (7.1.2)). Based on this expression, one might ask,when n is odd, if Ī¶(n) = cĻn for some c ā Q? While this is not known (even for Ī¶(3)), it isexpected the answer is no.
One heuristic reason why values at the even and odd integers should be different comesby looking the functional equation. Namely, the expression (7.1.5) simplifies to
Ī¶(1ā 2n) = āB2n
2n.
Thus it appears the arithmetic of the special values of Ī¶(2n) is made more clear by lookingat Ī¶(1 ā 2n). On the other hand by the function equation and that Ī(s) has poles atnegative integers, in order for Ī¶(s) to be holomorphic we need Ī¶(ā2n) = 0. (These are thetrivial zeroesānote the functional equation does not produce zeroes at negative odd integersbecause then the vanishing of sin
(Ļs2
)cancels the pole from Ī(1 ā s).) Since Ī¶(2n + 1)
corresponds to Ī¶(ā2n) = 0, there is no indication that Ī¶(s) at positive odd integers shouldfollow the same sort of arithmetic behaviour as at positive even integers. (In light of theprobabilistic interpretation of 1/Ī¶(n), this is perhaps analogous to the difference betweenrk(n), i.e., Ļk(z), for k odd and even.)
7.1.2 Dirichlet L-functions
Definition 7.1.1. We say Ļ : Zā C is a Dirichlet character mod N if(i) Ļ(n) = Ļ(n+N) for all n ā Z;(ii) Ļ(n) = 0 āā gcd(n,N) > 1;(iii) Ļ(1) = 1; and(iv) Ļ(mn) = Ļ(m)Ļ(n) for all m,n ā Z.
The standard way to construct a Dirichlet character is to take a character Ļ of (Z/NZ)Ć,extend it to Z/NZ by setting it to 0 on all the noninvertible residue classes, then lifting itto a function of Z. It is easy to see this satisfies the above definition, and conversely anyDirichlet character arises in such a way.
Then one defines the Dirichlet L-function for a Dirichlet character Ļ to be
L(s, Ļ) =āān=1
Ļ(n)
ns, (7.1.6)
for Re(s) > 1. (Condition (i) in the definition implies Ļ is bounded, so the series convergesabsolutely for Re(s) > 1.) Since Ļ is multiplicative, again one an Euler product expansion
L(s, Ļ) =āān=1
Ļ(n)
ns=āp
1
1ā Ļ(p)pās. (7.1.7)
CHAPTER 7. L-FUNCTIONS 116
As with Ī¶(s), one can show L(s, Ļ) has meromorphic continuation to C, but this time it willin fact be entire whenver Ļ is a nontrivial character (i.e., does not come from the trivialcharacter of (Z/NZ)Ć).
Because the Dirichlet characters mod N allow one to distinguish among the differentresidue classes mod N , these Dirichlet L-functions provide a way to study primes lying inarithmetic progressions. Specifically, Dirichlet showed that L(1, Ļ) 6= 0 for each nontrivialĻ, and used this to prove that for any b relatively prime to N , there are infinitely manyprimes of the form aN + b, a, b ā N.
Again in analogy with the Riemann zeta function, Dirichlet L-functions have functionalequations. If Ļ is a āprimitiveā Dirichlet character mod N , the completed Dirichlet L-function is
Ī(s, Ļ) =
(N
Ļ
) s+Īµ2
Ī
(s+ Īµ
2
)L(s, Ļ) (7.1.8)
where Īµ ā 0, 1 is the order of Ļ(ā1), i.e., Īµ = 0 if Ļ(ā1) = 1 and Īµ = 1 if Ļ(ā1) = ā1.Then the functional equation reads
Ī(s, Ļ) = (āi)ĪµāN
(nān=1
Ļ(n)e2Ļin/N
)Ī(1ā s, Ļ). (7.1.9)
Note here the functional equation does not relate L(s, Ļ) with L(1 ā s, Ļ) in general, butrather with L(1 ā s, Ļ), where Ļ is the complex conjugate of Ļ (also a Dirichlet charactermod N). However if Ļ is real valued, which is equivalent to Ļ2 is trivial, then Ļ = Ļ andthis functional equation relates L(s, Ļ) with L(1ā s, Ļ).
In any case, one can still recover the values of L(s, Ļ) for Re(s) < 0 using the functionequation (7.1.9) together with the series for L(1ās, Ļ). The generalized Riemann hypothesis(GRH) asserts that any zeroes of L(s, Ļ) inside the critical strip 0 ā¤ Re(s) ā¤ 1 in fact lie onthe critical line Re(s) = 1
2 . As one might expect from analogy with the Riemann hypothesis,GRH would tell us, up to a very precise error, how many primes ā¤ x lie in a given arithmeticprogression. (An asymptotic is already known by the Chebotarev density theorem.) What isperhaps more surprising, is that GRH also implies the weak Goldbach conjecture that everyodd number > 7 is a sum of 3 odd primes.1
Again, just like with Ī¶(s), there are formulas for special values of L(s, Ļ). Let d besquarefree, and consider the quadratic extension K = Q(
ād) of Q of discriminant ā (here
ā = 4d if d ā” 2, 3 mod 4 and ā = d if d ā” 1 mod 4). and the Dirichlet character mod āgiven by
Ļā(n) =
(ā
n
),
where(
ān
)is the Kronecker symbol. In particular, if p is an odd prime not dividing ā,
Ļā(p) = 1 if ā is a square mod p and Ļā(p) = ā1 otherwise.If d < 0, i.e., K is imaginary quadratic, the Dirichlet class number formula says
L(1, Ļā) =2ĻhK
wā|ā|
,
1Since writing these notes, the weak Goldbach conjecture was proved by my buddy Harald Helfgott!Thanks, Harald!
CHAPTER 7. L-FUNCTIONS 117
where hK is the class number of K and w is the number of roots of unity in K, i.e., w = 6if d = ā3, w = 4 if d = ā1 and w = 2 otherwise. If d > 0, i.e., K is a real quadratic field,then Dirichletās class number formula says
L(1, Ļā) =2 log Ī·hKā
ā,
where Ī· > 1 is the fundamental unit in the ring of integers of K. Since the class numberis a fundamental, yet mysterious, invariant of a number field, we see that special values (inthis case at s = 1) of L-functions encode interesting arithmetic information.
7.2 The philosophy of L-functions
In general, what is an L-function? There is no widely accepted answer yet as to whatprecisely constitutes an L-function, or what objects give rise to L-functions. However basedon examples, there are certain properties L-functions should satisfy, similar to Ī¶(s) and theDirichlet L-functions.
Suppose one has an object X described by some data a1, a2, a3, . . ., one can study thesequence (an) forming the (formal) L-series (or Dirichlet series) for X
L(s,X) =āān=1
anns.
Exercise 7.2.1. Suppose |an| = O(nm) for some m, i.e. there exists a constant C such that|an| ā¤ Cnm for all n. Show L(s,X) converges absolutely for Re(s) > 1 +m.
No matter what the original object X was, the L-series only depends on the sequence(an), so for the present purposes we could just assume the object X is the sequence (an),and so we will sometimes write X = (an) below.
An L-series is just a series of the formā an
ns , but just requiring convergence on a righthalf plane does not give all the properties one would like to have an L-function. Based onthe theory of Dirichlet L-functions and Ī¶(s), in order for L(s,X) to be called an L-function,one might also ask that
ā¢ L(s,X) has meromorphic continuation to C;
ā¢ L(s,X) has a functional equation L(s,X) = Ļ(s,X)L(k ā s, X) for some āsimpleāfunction Ļ(s,X), some k ā R and some related object X;
ā¢ L(s,X) should have an Euler product L(s,X) =āp Lp(s,X), for some āsimpleā local
factors Lp(s,X).2
Example 7.2.2. Let X = K be a number field, i.e., a finite extension of Q. Let OK be itsring of integers, and an be the number of integral ideals of norm n. In particular, if K = Q,then an = 1 for all n so L(s,Q) = Ī¶(s). More generally, L(s,K) = Ī¶K(s) is the Dedekindzeta function for K. The Dedekind zeta function also satisfies all properties listed above.
2One may not be able to make L(s,X) have an Euler product for all X in the space of interestāe.g.,modular formsābut only certain āniceā Xāe.g., Hecke eigenforms.
CHAPTER 7. L-FUNCTIONS 118
Example 7.2.3. Let Ļ be a Dirichlet character and an = Ļ(n). Then L(s, Ļ) is the DirichletL-function we defined earlier.
Letās elaborate on the Euler product condition a little more. How did we get the Eulerproduct for Dirichlet L-functions? It came from the fact that Ļ(n) is multiplicativeāin facttotally multiplicative. Recall a sequence (an) is multiplicative (resp. totally multiplica-tive) if amn = aman for gcd(m,n) = 1 (resp. for all m,n).
Exercise 7.2.4. Let X = (an) be a multiplicative sequence such that |an| = O(nm), anddefine the local L-factor
Lp(s,X) = 1 +apps
+ap2
p2s+ap3
p3s+ Ā· Ā· Ā·
Show Lp(s,X) converges absolutely for Re(s) > 1 +m, and that on this half-plane, we havethe product formula
L(s,X) =āān=1
anns
=āp
Lp(s,X).
A priori, there is no reason for Lp(s,X) to have a simple expression. In the case ofDirichlet L-functions, the local L-factor Lp(s, Ļ) = 1
1āĻ(p)pās simply because Ļ(pj) = Ļ(p)j .More generally, we need, for each p, a relation among the apj ās in order for Lp(s,X) tosimplify. Suppose, for a fixed p, the terms apj satisfy a degree d linear recurrence relation:
apj+d = c0apj + c1apj+1 + Ā· Ā· Ā·+ cdā1apj+dā1 .
Then one can show the local L-factor is of the form
Lp(s,X) =1
F (pās)
where F is a polynomial (depending on c0, . . . , cdā1, which depend upon p) of degree ā¤ d.For instance, for a Dirichlet character Ļ and a prime p, then the polynomial F (t) = 1āĻ(p)t.For Lp(s,X) of this form, call the degree of F the degree of Lp(s,X).
If X = f is a Hecke eigenform and the anās are its Fourier coefficients, then the theory ofHecke operators tells us the Fourier coefficients apj satisfy a degree 2 linear recursion, so thelocal L-factors should be reciprocals of (typically) quadratic polynomials in pās. (For a finitenumber ābadā primes, the L-factor may be trivial or (a reciprocal of a) linear (polynomial).)
Typically, the ādegreeā of almost all (all but finitely many) local L-factors, Lp(s,X),will be the same, and we will call this the number the degree of the L-function L(s,X).This explains why the Riemann zeta function and Dirichlet L-functions are called degree 1L-functions, and the L-functions of modular forms are called degree 2 L-functions.
The properties above are some properties we want L-functions to satisfy, but are probablynot strong enough to say any function with these properties should be the L-function of someobject. In the next section, we will describe some conditions that are sufficient to ensure anL-series is the L-function of a modular form.
Just as with Dirichlet L-functions, when one can construct an L-function L(s,X) forsome object X, one should expect the following.
CHAPTER 7. L-FUNCTIONS 119
ā¢ The analytic propertiesānamely the location of the zeroes and polesāof L(s,X)should reveal deep information about X.
ā¢ For certain special points s = s0, there should be a meaningful formula for specialvalue L(s0, X).
7.3 L-functions for modular forms
As was suggested in the previous section, if f(z) =āā
n=0 anqn is a modular form, we will
define its L-series by L(s, f) =āā
n=1anns . Obviously replacing f with a multiple cf replaces
L(s, f) with cL(s, f), so one often assumes f is normalized so its first nonzero Fouriercoefficient is 1.
For convergence of L(s, f), we need the following simple estimate on Fourier coefficientsof cusp forms, due to Hardy and Hecke.
Proposition 7.3.1. (The Hecke, or trivial, bound) Let f(z) =āanq
n ā Sk(N). Then,|an| = O(nk/2), i.e., for some constant C, we have
|an| ā¤ Cnk/2. (7.3.1)
Proof. From our discussion at the beginning of Section 6.2, we know f(z)f(z)yk = |f(z)yk/2|2is Ī0(N) invariant. If F is a fundamental domain for Ī0(N), f being a cusp form meansf(z) ā 0 as z ā F tends to a boundary point in Q āŖ iā. Consequently, as in the proofof Proposition 6.2.5, we see |f(z)yk/2| is bounded on F , and by invariance, bounded on H.Say |f(z)yk/2| ā¤ C1 for all z ā H.
Then for fixed y > 0,
aneā2Ļny =
ā£ā£ā£ā£ā« 1
0f(x+ iy)eā2Ļinzeā2Ļnydx
ā£ā£ā£ā£ =
ā£ā£ā£ā£ā« 1
0f(x+ iy)eā2Ļinxdx
ā£ā£ā£ā£ ā¤ C1yāk/2.
Setting y = 1n gives the desired bound.
The Ramanujan conjecture (proved by Deligne) is that one actually has |an| ā¤ Cn(kā1)/2,for some C (depending on f but not n). This is very deep and it is the best general boundpossible. (In analytic number theory, knowing the exponents in such bounds is crucialfor applications, and even a small improvement in the exponent is often considered majorprogress.) We note there is also a more explicit version of Deligneās bound which says|ap| ā¤ 2p(kā1)/2 and therefore |an| ā¤ d(n)n(kā1)/2, where d(n) is the number of divisors ofn, for suitable cusp forms f (namely f should be a normalized eigennewform, as defined inthe next chapterāfor now just think an eigenform with a1 = 1, which is all that is actuallymeant in the case of full level N = 1.).
Compare these bounds with those for Eisenstein series, which are simple to obtain. Weexplained this earlier, but it may be good to go over it on your own now.
Exercise 7.3.2. Write Ek(z) =āanq
n. Show |an| = O(nkā1) and that this exponent issharp, i.e., |an| 6= O(nkā1āĪµ) for any Īµ > 0.
CHAPTER 7. L-FUNCTIONS 120
Example 7.3.3. Recall from Example 5.2.10, we have
r12(n) = 8Ļ5(n)ā 512Ļ5(n/4) + 16an,
where the anās are the Fourier coefficients of the cusp form FĪ·. Hence Heckeās bound tellsus
r12(n) = 8Ļ5(n)ā 512Ļ5(n/4) +O(n3),
or more simply for n 6ā” 0 mod 4,
r12(n) = 8Ļ5(n) +O(n3).
As in the above exercise, we see that Ļ5(n) roughly grows at the rate of n5.The Ramanujan conjecture says in fact the bound on the error is O(n5/2).
Definition 7.3.4. Let f(z) =āanq
n ā Mk(N). The (Hecke) L-function associated tof is given by
L(s, f) =
āān=1
anns,
where this sum converges. The completed L-function associated to f is given by
Ī(s, f) =
(āN
2Ļ
)sĪ(s)L(s, f).
By Exercises 7.2.1 and 7.3.2, we see if f = Ek ā Mk(1) then L(s, f) converges forRe(s) > k. In fact this is true for any f āMk(N). If f ā Sk(N), then by Heckeās bound wesee L(s, f) converges for Re(s) > 1 +k/2. In fact, for cusp forms the Ramanujan conjecturetells us L(s, f) converges in the slightly larger right half-plane Re(s) > 1 + (k ā 1)/2.
We remark that unless f is an eigenform, L(s, f) will not have an Euler product in gen-eral, but we will still call these L-functions as they at least have meromorphic continuationand functional equation.
Lemma 7.3.5. Let f āMk(N) and consider the Fricke involution
f(z) :=1
Nk/2zkf
(ā1
Nz
).
Then f āMk(N). Moreover, if f ā Sk(N), then so is f .
Proof. We can write
f(z) =(detĻ)k/2
j(Ļ, z)kf(Ļz) = (detĻ)k/2f |Ļ(z),
where
Ļ =
(0 ā1N 0
).
CHAPTER 7. L-FUNCTIONS 121
One sees
Ļ
(a bc d
)Ļā1 =
(d āc/NābN a
)so Ļ normalizes Ī0(N). Then if Ī³ ā Ī0(N), we compute
f |Ī³ = (detĻ)k/2f |ĻĪ³ = (detĻ)k/2f |Ī³ā²Ļ = (detĻ)k/2f |Ļ = f ,
where Ī³ā² = ĻĪ³Ļā1 ā Ī0(N). Hence f satisfies the correct modular transformation law.Holomorphy and holomorphy at the cusps follow automatically. Similarly if f is zero at
the cusps it is easy to see f is also.
Note if N = 1, then f = f .
Theorem 7.3.6. Let f(z) =āanq
n ā Mk(N). The completed L-function Ī(s, f) can becontinued to a meromorphic function in s satisfying
Ī(s, f) = (ā1)k/2Ī(k ā s, f),
with at most simple poles at s = 0 and s = k. Furthermore, if f ā Sk(N), then Ī(s, f) isentire and bounded in vertical strips, i.e., Ī(s, f) is bounded when Re(s) is.
Proof. (Sketch) The key to the proof comes from using an integral representation for Ī(s, f).Namely, we look at the Mellin transform of f for Re(s) > k,ā« ā
0f(iy)ysā1dy =
ā« ā0
āaneā2Ļnyysā1dy =
Ī(s)
(2Ļ)sL(s, f) = Nās/2Ī(s, f).
Write f(z) =ābnq
n. Then one can show
Ī(s, f) +a0
s+ (ā1)k/2
b0k ā s
=
ā« ā1
f(iy/āN)āa0)ysā1dy+
ā« ā1
(f(iy/āN)ā b0)ykāsā1dy.
Since f(iy/āN)āa0 and (f(iy/
āN)āb0 are of rapid decay when y āā, the integrals on
the right converge absolutely, analytically continue to entire functions of s, and are boundedon vertical strips. A similar expression for Ī(s, f) gives the functional equation.
Theorem 7.3.7. Suppose f(z) =āanq
n ā Sk(1) be a Hecke eigenform. Then
L(s, f) =āp
1
1ā aāsp + pkā1pā2s.
Theorem 7.3.8. (Heckeās converse theorem) Let (an) be a sequence such that |an| =O(nm) for some m, and k ā 2N. Suppose
Ī(s) =Ī(s)
(2Ļ)s
āān=1
anns
has meromorphic continuation to C,
Ī(s) = (ā1)k/2Ī(k ā s)
CHAPTER 7. L-FUNCTIONS 122
andĪ(s) +
a0
s+ (ā1)k/2
a0
k ā sis entire and bounded in vertical strips. Then
f(z) =āān=0
anqn āMk(1).
This provides an analytic way to show something is a modular form! Roughly it says,given a sequence (an) of polynomial growth, the Dirichlet series L(s) =
ā anns is the L-
function of a modular form L(s) = L(s, f) if it satisfies certain nice properties (stated abovein terms of Ī(s)). In other words, if the Dirichlet series is nice, then the sequence (an) isactually the Fourier coefficients for a modular form!
There is also a converse theorem for Mk(N), due to Weil, but it requires more than justthe niceness of a single function L(s). It roughly says, given a sequence (an) of polynomialgrowth, the Dirichlet series L(s) =
ā anns is the L-function of a modular form in Mk(N) if
the set of twisted L-seriesLĻ(s) =
ā anĻ(n)
ns
satisfy certain nice properties for all Hecke characters Ļ. (Hecke characters are a general-ization of Dirichlet chacters.)
To put this in context, we will summarize the primary methods of constructing modularforms:
ā¢ Use an averaging procedure (e.g., as with our Eisenstein series and Hecke operators).
ā¢ Construct a suitable Ī·-quotient.
ā¢ Use theta series.
ā¢ Use the converse theorem.
ā¢ Use the trace formula (which we will not describe).
Both the method of the converse theorem and the trace formula are not truly construc-tive, but they provide a way of showing something you suspect should be a modular formactually is a modular form.
The simplest instance of this is the following. Since the L-function L(s, f) of a modular(eigen)form is a degree 2 Euler product, one might ask if you can construct some such f byasking that L(s, f) is a product of 2 known degree 1 L-functions.
Example 7.3.9. PutL(s) = Ī¶(s)Ī¶(sā k + 1)
for k ā„ 4 even. One can see by Heckeās converse theorem, that L(s) is the L-function of amodular form of weight k. In fact
L(s) = L(s, Ek).
CHAPTER 7. L-FUNCTIONS 123
More generally, if Ļ1 and Ļ2 are primitive Dirichlet characters mod N1 and N2, then
L(s, Ļ1)L(sā k + 1, Ļ2) = L(s, f)
for somef āMk(Ī1(N1N2)).
If Ļ1(ā1) = Ļ2(ā1), then in fact f āMk(N1N2) = Mk(Ī0(N1N2)). Explicitly,
f =ānā„1
anqn, an =
āad=n
Ļ1(a)Ļ2(d)dkā1.
From the Euler product for L(s, f), one can deduce that f is a Hecke eigenform. However,this construction will not yield a cusp form (even though a0 = 0, f does not vanish at allcusps). In fact, one can construct all Eisenstein series on Ī1(N) in this way by includingdegenerate cases (when Ļ1 and Ļ2 are trivial, so non-primitive if N1, N2 > 1, then f = Ek).Thus the space of Eisenstein series, and therefore Mk(Ī1(N)), and by restriction Mk(N),also has a basis consisting of eigenforms.
125
This part is still in progress, and currently only has a mostly finished chapter on new-forms, and the beginning of a chapter on Hilbert modular forms, and is also likely to containerrors. After finishing the chapter on Hilbert modular forms, I will probably work on achapter on half-integral weight forms.
In any event, as remarked in the preface, it is not intended to contain complete proofs.
Chapter 8
Newforms and oldforms
Recall the main points of Chapter 6 were: (i)Mk(N) has a basis of eigenforms for the Heckeoperators Tp, p - N (or equivalently, Tn, gcd(n,N) = 1); and (ii) if f is an eigenform, thenits Fourier coefficients an are multiplicative for n relatively prime to N .
In this chapter we will try to resolve some of the issues brought up at the end of Chapter 6,principally:
ā¢ Can we find modular forms in Mk(N) whose Fourier coefficients are multiplicative forall n?
ā¢ To what extent do such forms make up the space Mk(N)?
ā¢ Given a modular form f āMk(N), how can we tell if it comes from a form of smallerlevel?
One application of the first question is that if f ā Mk(N) has multiplicative Fouriercoefficients, then the L-series L(s, f) will have an Euler product.
To put the third question in context, recall that Mk(d) ā Mk(N) if d|N , i.e., anymodular form of level N is also of level Nm for any m ā N. Thus given a modular form f ,itās natural to ask what its true level? (This is often called the exact level of f .) Forms thatcan be constructed (as linear combinations) of forms of smaller level are called oldforms,and the forms orthogonal to all of these are called newforms. Then the third question canbe essentially be restated as: how can we distinguish newforms from oldforms?1
Because of the connection of multiplicativity of Fourier coefficients and being eigenformsof Hecke operators, it makes sense to try to answer at least the first two questions byextending our theory of Hecke operators Tn to all n. It will turn out that the space ofnewforms is spanned by forms which are simultaneous eigenforms for all Tn. This is not truefor the oldforms, though some oldforms may still be eigenforms for all of the Tn. However,following Atkin and Lehner [AL70], we can pick out the new eigenforms by introducing someadditional operators Wp (for p|N) which also commute with all of the Tnās (and therefore
1More precisely, these questions are equivalent when restricted to eigenforms, but not quite equivalentin general. For instance, say f has exact level 3 and g has exact level 5, so f + g has level 15. It is not anewform because it is just a linear combination of forms of smaller level, though f + g itself does not comefrom level 3 or level 5.
126
CHAPTER 8. NEWFORMS AND OLDFORMS 127
the space spanned by simultaneous eigenforms for all of these operators will be precisely thespace of newforms).
Another important application of these AtkināLehner operators is that they will allowus to distinguish eigenforms. Itās possible for two eigenforms (which are not scalar multiplesof each other) to have the same Hecke eigenvalues for all Tn. Often when proving theoremsabout modular forms, you want to use Hecke operators to isolate individual eigenforms. Thisis possible sometimes, e.g., for S2(p) with p prime (all forms will be newforms in this case),but in general one has to use something like these AtkināLehner operators in conjunctionwith the Hecke operators.
Our first goal will be to define Hecke operators Tn on Mk(N) for all n. As we remarkedin Chapter 6, our earlier approach of using lattices to define Hecke operators leads to compli-cations when gcd(n,N) > 1, so we begin by explaining the double coset approach to Heckeoperators.
8.1 Hecke operators via double cosets
Going back to our definition of Eisenstein series, we saw one of the basic ideas for makingmodular forms on a congruence subgroup Ī is to take the weighted average of a functionover Ī. For instance, (4.2.2) told us that
Ek,N (z) =ā
Ī³āP\Ī0(N)
j(Ī³, z)āk =ā
Ī³āP\Ī0(N)
1|Ī³,k(z),
where P = ćT ć is the unipotent subgroup of PSL2(Z) and 1 denotes the constant function.Now letās suppose we have two congruence subgroups Ī1,Ī2, and f ā Mk(Ī1). If we
want to make a modular form for Ī2 out of f , we can try to do it by averaging, i.e., a sumof the form
āf |Ī³,k(z). Of course if we let Ī³ run over all of Ī2, this will not convergeāfor
instance f is already invariant by |Ī³,k for Ī³ ā Ī1 ā© Ī2, which is infinite. This is why wealready had to mod out by P in the definition of Ek,N . In the present case, it makes senseto sum over Ī³ ā (Ī1 ā© Ī2)\Ī2. Alternatively, we can sum over Ī³ ā Ī1\Ī1Ī2. This latterform generalizes to the double coset operators.
For Ī± ā GL2(Q)+, we define the associated weight k double coset operator
[Ī1Ī±Ī2]kf(z) =ā
Ī³āĪ1\Ī1Ī±Ī2
det Ī³kā1f |Ī³,k(z)
Lemma 8.1.1. For f āMk(Ī1), [Ī1Ī±Ī2]kf(z) āMk(Ī2).
Proof. See [DS05, Sec 5.1].
Depending on your terminology, this may or may not technically be an operator whenĪ1 6= Ī2 (itās not a map from a space to itself), but our interest is in the case whereĪ1 = Ī2 = Ī0(N).
For any prime p, define the p-th Hecke operator on Mk(N) to be
Tpf = [Ī0(N)
(1
p
)Ī0(N)]k.
CHAPTER 8. NEWFORMS AND OLDFORMS 128
When we compute our Hecke operators below, we will see that this agrees with the previousdefinition of Tp when p - N .
One can similarly define more general Hecke operators Tn in terms of double cosets,however the Tpās are the crucial part, and for simplicity, we will avoid this and just definethe Tnās using the Tpās and the Hecke operator relations we already found in Chapter 6.2
We inductively define
Tpr+1 =
TpTpr ā pkā1Tprā1 p - NTpTpr p|N,
and then extend by multiplicativity: set
Tmn = TmTn
when gcd(m,n) = 1. The only part thatās unmotivated is the case of Tpr+1 when p|N (sowe simply have Tpr = (Tp)
r for p|N). Basically the difference arises from the difference inthe double coset decompositions:
Lemma 8.1.2. For p - N ,
Ī0(N)
(1
p
)Ī0(N) =
pā1āb=0
Ī0(N)
(1 b0 p
)āĪ0(N)
(p 00 1
).
For p|N ,
Ī0(N)
(1
p
)Ī0(N) =
pā1āb=0
Ī0(N)
(1 b0 p
).
Proof. It is simple to derive this from Lemma 6.1.3. We remark [DS05, Sec 3.8] also provesan analogue for Ī1(N) in (3.16).
We have the following generalization of Theorem 6.1.4.
Theorem 8.1.3. Let f(z) =āanq
n āMk(N). If p - N , then
(Tpf)(z) =ā
n6ā”0 mod p
apnqn +
ānā”0 mod p
(apn + pkā1an/p
)qn.
If p|N , then
(Tpf)(z) =āān=0
apnqn.
Proof. The proof is just a calculation similar to the proof of Theorem 6.1.4 using the doublecoset decomposition in the last lemma. See also [DS05, Prop 5.2.2(a)].
2For instance when p|N , Tpr is given by the double coset operator for Ī0(N)
(1
pr
)Ī0(N). See, e.g.,
[KL06, Sec 3.6] for the general definition of Tn in terms of double cosets.
CHAPTER 8. NEWFORMS AND OLDFORMS 129
In fact, one can generalize the operation of Tp when p|N to get the Up transformation(not assuming p|N)
Up : Mk(N)āMk(pN)
(Upf)(z) =
āān=0
apnqn.
(You may recall Up from (6.0.2).) Namely, if f ā Mk(N), then f ā Mk(pN) by inclusionand Upf = Tpf (as an element of Mk(pN)). The fact that Tp operates on Mk(pN) provesthe image of Up lands in Mk(pN). We remark that some authors use Up to denote the Tpoperators in the case p|N , and reserve the notation Tp for the case p - N . (E.g., Atkin andLehner [AL70], who also use p only for primes prime to N , and q for primes dividing N , soyou may hear people refer to Tpās and Uqās.)
Corollary 8.1.4. If f ā Mk(N) is an eigenform for every Tp, then f is an eigenform forevery Tn.
Proof. For p|N , since Tpr = (Tp)r it is evident that if if f is an eigenform for Tp it is also an
f is an eigenform for Tpr . Now apply multiplicativity of the Tnās and Corollary 6.1.15.
As for terminology, for f āMk(N), if we just say f is an eigenform, then to be consistentwith previous terminology this means f is an eigenform for all Tp with p - N , i.e., for all Tnwith gcd(n,N) = 1. We will sometimes say f is a complete eigenform to mean f is aneigenform for all Tp, i.e., for all Tn. (This terminology is probably not standard.)
What it arithmetically means to be an eigenform for Tp with p|N (or Up) is easier todescribe than for Tp with p - N .
Corollary 8.1.5. Suppose f(z) =āanq
n āMk(N) and p|N .(i) Suppose f is an eigenform for Tp with eigenvalue Ī»p. Then apr = Ī»rpa1. If a1 6= 0,
then Ī»p =apa1. If a0 6= 0, then Ī»p = 1.
(ii) Suppose the anās are multiplicative and f 6= 0 so a1 = 1. If a0 6= 0, then f is aneigenform for Tp if and only if apr = 1 for all r ā„ 0. If a0 = 0, then f is an eigenform forTp if and only if apr = (ap)
r for all r ā„ 0.
Proof. The theorem says that f is an eigenform for Tp if and only if there exists Ī» such thatapn = Ī»an for all n.
(i) The first two assertions follow from apr = Ī»paprā1 for r ā„ 1. The latter from the factthat apĀ·0 = a0 = Ī»a0.
(ii) If f is an eigenform, the conditions on apr follow from (i). Conversely, suppose apr =(ap)
r. Then for n = prm with p - m, we have apn = apr+1m = apr+1am = apapram = apan,i.e., f is an eigenform for Tp with eigenvalue ap.
Compare this with Exercise 6.1.6 and the following exercise.
Exercise 8.1.6. Suppose f(z) =āanq
n āMk(N) and p - N .(i) Show that if a0 6= 0 and Tpf = Ī»pf , then Ī»p = 1 + pk.(ii) If the anās are multiplicative, determine a necessary and sufficient condition on the
apr ās for f to be an eigenform of Tp.
CHAPTER 8. NEWFORMS AND OLDFORMS 130
Recall that if an is a multiplicative sequence, it is determined by just the apr ās. If theFourier coefficients an of f ā Mk(N) are multiplicative, being an eigenform for any Tp isequivalent to the apr ās satisfying a certain recurrence relation, which depends on whetherp|N and on k for p - N , and therefore that the anās are determined by just the apās.
The following result tells us conversely that being an eigenform for all Tnās (or equiva-lently by the first corollary, for all Tpās) implies the anās are multiplicative.
Proposition 8.1.7. Let f(z) =āanq
n āMk(N) be an eigenform for all Tn with eigenvalueĪ»n. Then an = Ī»na1. Hence, if f is a normalized (so a1 = 1) eigenform for all Tn, then,the anās are multiplicative.
This is why we defined the Hecke operators so that Tpr = (Tp)r for p|N . From the above
calculation of Tp, this is the right definition to make this proposition true.
Exercise 8.1.8. Prove the above proposition.
The following exercise explains the relation between āraising the levelā of f at p and theHecke operator Tp.
Exercise 8.1.9. Suppose f(z) =āanq
n āMk(N). The form g(z) = f(pz) lies in Mk(pN)and satisfies Tpg = f . Moreover, if f is an eigenform for Tn with p - n, then so is g (onMk(pN)) with the same eigenvalue.
In particular, if p|N , then the map f(z) 7ā f(pz) gives a modular form g(z) =ābnq
n =āanq
pn obtained by āspreading outā the Fourier coefficients from N āŖ 0 to pN āŖ 0 andraising the level by p. The Tp operator reverses this process. Since Tp takes the linearlyindependent forms f and g to the 1-dimensional space generated by f (for f 6= 0), we seethat the kernel of Tp : Mk(pN) ā Mk(pN) is nontrivial, i.e., Tp is not an invertible lineartransformation on Mk(pN). Furthermore, if f is a complete eigenform on Mk(N), p|N andTpf = Ī»pf , then f and f ā Ī»pg are complete eigenforms on Mk(N) where f ā Ī»pg will haveTp-eigenvalue 0.
8.2 Hecke operators on Eisenstein series
Here we investigate some details of Hecke operators on Eisenstein series. This will providesome concrete examples as well as motivation for the theory of newforms.
First we give an example where Fourier coefficients are multiplicative, but the form maynot be an eigenform for all Tp.
Example 8.2.1. For k ā„ 4 be even, recall the renormalized Eisenstein series Eāk(z) =āanq
n āMk(1) from (6.0.1). It has multiplicative Fourier coefficients an = Ļkā1(n) for n ā„1, and is an eigenform of all Tp on Mk(1). However, we can also regard Eāk ā Mk(N). OnMk(N), Eāk is not an eigenform for Tp with p|N as ap = Ļkā1(p) 6= 1 (Corollary 8.1.5(ii)).
Now letās see what happens for our Hecke operators for Eisenstein series with level. Forsimplicity, we will focus on the case k = 2.
CHAPTER 8. NEWFORMS AND OLDFORMS 131
For N = p prime, we can consider
E2,p(z) = E2(z)ā pE2(pz) (8.2.1)
as in Exercise 4.2.18. This is holomorphic and E2,p ā M2(p). From Exercise 4.2.18, youshould have gotten the Fourier expansion
E2,p(z) = 1ā pā 24
āān=1
Ļ1,p(n)qn (8.2.2)
where Ļ1,p(n) = Ļ1(n)ā pĻ1(n/p) and we interpret Ļ1(n/p) to be 0 if p - n.
Exercise 8.2.2. Check that Ļ1,p(pr) = 1, Ļ1,p(pn) = Ļ1,p(n) if p - n and Ļ1,p is multiplica-
tive. Conclude Ļ1,p(pn) = Ļ1,p(n) for any n.
This exercise tells us that, on M2(p), we have
TpE2,p(z) = 1ā pā 24āān=1
Ļ1,p(pn)qn = E2,p(z)
and for ` 6= p prime
T`E2,p(z) = (1 + `)(1ā p)ā 24
āān=1
(Ļ1,p(`n) + `Ļ1,p(n/`)) qn
= (1 + `)E2,p(z).
For the last equality, you can write n = `rm with ` - m and use the simple calculationĻ1,p(`
r+1) + `Ļ1,p(`rā1) = (1 + `)Ļ1,p(`
r). Thus for all primes `, T`E2,p = Ļ1,p(`)E2,p, andthen by the above proposition, for all n,
TnE2,p = Ļ1,p(n)E2,p.
Hence E2,p is an eigenform for all Hecke operators on M2(p).If we renormalize our Eisenstein series as
Eā2,p =ā1
24E2,p(z) =
pā 1
24+āān=1
Ļ1,p(n)qn,
then we see Eā2,p has multiplicative Fourier coefficients.
Example 8.2.3. Recall from Exercise 5.2.8 that M2(4) is 2-dimensional and generated bythe Eisenstein series
f(z) = E2,2(z) = 1 + 24āān=1
Ļ1,2(n)qn = 1 + 24(q + q2 + 4q3 + q4 + 6q5 + Ā· Ā· Ā·
)and
g(z) = E2,2(2z) = 1 + 24āān=1
Ļ1,2(n)q2n = 1 + 24(q2 + q4 + 4q6 + q8 + 6q10 + Ā· Ā· Ā·
).
CHAPTER 8. NEWFORMS AND OLDFORMS 132
Since f is a complete eigenform onM2(2), Exercise 8.1.9 tells us that f and g are eigenformson M2(4), but g is not a complete eigenform. (Moreover, since f and g have the sameeigenvalues for Tn with 2 - n, any form in M2(4) is a (not necessarily complete) eigenform.)Moreover T2g = f so T2(f ā g) = T2f ā f = f ā f = 0. Indeed, the Fourier expansion off ā g,
(f ā g)(z) = 24(q + 4q3 + 6q5 + 8q7 + Ā· Ā· Ā·
)contains no even powers of q. (We remark that even though the constant term of f ā g is 0,it is not a cusp form because it does not vanish at all other cusps.)
Hence f and f ā g give a basis of M2(4) of complete eigenforms. Moreover, we cannormalize them to see 1
24f and 124(f ā g) yield a basis of M2(4) consisting of forms with
multiplicative Fourier coefficients.
To get a weight 2 Eisenstein series of a general level N > 1, you might consider E2(z)āNE2(Nz). This is indeed in M2(N) and one can work out the Fourier expansion. However,it is not as nice as E2,p because it is not an eigenform for all Tpās (though it will be forp - N), as the following exercise shows.
Exercise 8.2.4. Suppose N = p1p2, where p1 and p2 are two not necessarily distinct primes,and let E(z) = E2(z) ā NE2(Nz) =
āanq
n. Compute ap1, ap2 and ap1p2. Show E is notan eigenform for Tp1 or Tp2 but is an eigenform for Tp with p - N .
Let us first define a weight 2 Eisenstein series for squarefree N = p1 Ā· Ā· Ā· pr. Set
Eā2,N (z) =(ā1)r+1
24
ād|N
Āµ(N
d)dE2(dz),
where Āµ is the Mƶbius function defined in Section 4.2 and r is the number of distinct primedivisors of N . One can check Eā2,N (z) ā M0(N). Observe that if N = p is prime, we haveEā2,N (z) = ā 1
24(āE2(z) + pE2(pz)) = Eā2,p, which coincides with our previous definition.To compute the Fourier coefficients, it will be convenient to use some facts about Dirichlet
convolution. Given two functions f, g : Nā C, we define their Dirichlet convolution (orDirichlet product) by f ā g(n) =
ād|n f(n)g(nd ).
Exercise 8.2.5. (i) Show that if f and g are multiplicative, then f ā g is multiplicative.(ii) Show that id ā Āµ = Ļ, where id : N ā N is the identity map and Ļ is the Euler phi
function.
We wonāt need this, but in case youāre interested, you can check that Dirichlet convolutionmakes the space of functions f : Nā C into a commutative ring.
Lemma 8.2.6. For N > 1 squarefree, we have the Fourier expansion
Eā2,N (z) = (ā1)r+1Ļ(N)
24+
āān=1
Ļ1,N (n)qn,
where Ļ1,N is the multiplicative function given by Ļ1,N (n) =ā
d| gcd(n,N) Āµ(d)dĻ1(nd ).
CHAPTER 8. NEWFORMS AND OLDFORMS 133
Proof. The constant term of Eā2,N is (ā1)r
24 (id ā Āµ)(N) = (ā1)r
24 Ļ(N). For n ā„ 1, the qn
coefficient of Eā2,N is
(ā1)rād|N
Āµ(N
d)dĻ1(
n
d) = (ā1)r
ād| gcd(n,N)
Āµ(N
d)dĻ1(
n
d)
= (ā1)rĀµ(N)ā
d| gcd(n,N)
Āµ(d)dĻ1(n
d)
=ā
d| gcd(n,N)
Āµ(d)dĻ1(n
d) = Ļ1,N (n).
It remains to show Ļ1,N (n) is multiplicative. Write n = nNm where gcd(nN , N) =gcd(n,N) and gcd(m,N) = 1. Since Ļ1 is multiplicative, we have Ļ1,N (n) = Ļ1(m)Ļ1,N (nN ).It remains to show Ļ1,N is multiplicative for powers of primes dividing N . Note
Ļ1,N (nN ) =ād|nN
Āµ(d)dĻ1(n
d) = ((Āµ Ā· id) ā Ļ1)(nN )
since Āµ is 0 on any pe with e > 1. Since Āµ Ā· id and Ļ1 are multiplicative, so is their Dirichletconvolution by the exercise above, and we are done.
Since Ļ1,N behaves like Ļ1 for primes not dividing N , this means Eā2,N is an eigenfunctionof each Tp with p - N on M2(N). (Alternatively, one can write E2,N as in terms of E2,pāsfor p|N and apply Exercise 8.1.9.) And, for p|N , we see
Ļ1,N (pe) =ād|pe
Āµ(d)dĻ1(pe
d) = Ļ1(pe)ā pĻ1(peā1) = 1.
From the previous section, this means Eā2,N is a complete eigenfunction on M2(N).What about when N > 1 is not squarefee? We can inductively define Eā2,N for cube-free
N > 1 as follows. If p|N but p2 - N , set
Eā2,pN (z) = Eā2,N (z)ā Eā2,N (pz)
Then by Exercise 8.1.9, we inductively see that Eā2,pN is an eigenfunction for all Tn withp - n and
TpEā2,pN (z) = TpE
ā2,N (z)ā TpEā2,N (pz) = Eā2,N (z)ā Eā2,N (z) = 0.
Thus we have shown
Proposition 8.2.7. For any cube-free N > 1, Eā2,N is a eigenform on M2(N), and acomplete eigenform. Moreover, if N = p2M is cube-free, then Eā2,pM and Eā2,N are bothcomplete eigenforms on M2(N), with Eā2,pM having Tp-eigenvalue 1 and Eā2,N having Tp-eigenvalue 0. These complete eigenforms are normalized so they have multiplicative Fouriercoefficients.
CHAPTER 8. NEWFORMS AND OLDFORMS 134
The issue with p3|N is as follows. For simplicity, assume N = p3. Then we have
TpEā2,p(z) = Eā2,p(z)
TpEā2,p(pz) = Eā2,p(z)
TpEā2,p(p
2z) = Eā2,p(pz).
Thus, taking these forms as a basis of this 3-dimensional subspace of M2(p3), Tp acts as thematrix
Tp =
1 1 00 0 10 0 0
.
This matrix has eigenvalue 1 with multiplicity 1 and eigenvalue 0 with multiplicity 2. Butthe eigenspace corresponding to eigenvalue 0 is 1-dimensional. Hence there is no basisof eigenforms for Tp for this 3-dimensional subspace of M2(p3). This by itself does notautomatically imply that M2(p3) has no basis of complete eigenforms for general p, but weat least see it in the following example.
Example 8.2.8. The space M2(8) is 3-dimensional, generated by Eā2,2(z), Eā2,2(2z) andEā2,2(4z). By the above argument, it does not have a basis of complete eigenforms.
One can similarly work out the action of the Hecke operators on Eisenstein series Ek,Nfrom Section 4.2 of higher weight. See, e.g., [DS05, Prop 5.2.3], which includes Eisensteinseries coming from other cusps and gives criteria for these Eisenstein series to be completeeigenforms.
8.3 AtkināLehner operators
Let p|N , and say pr is the highest power of p dividingN . The p-thAtkināLehner operatorWp on Mk(N) is given by
Wpf = pkr/2f |apr bcN dpr
,
where a, b, c, d ā Z and the above matrix has determinant adp2r ā bcN = pr.
Exercise 8.3.1. Show Wp is independent of the choice of a, b, c, d. (This is [AL70, Lem10].)
Denote by WN the Fricke involution
WNf = f = Nk/2f | 0 ā1N 0
introduced in Lemma 7.3.5. Note if N = p, then WN = Wp.
Theorem 8.3.2. The operators WN and Wp on Mk(N) for p|N commute with Tm forgcd(m,N) = 1.
Proof. For Wp, see [AL70, Lem 11].
CHAPTER 8. NEWFORMS AND OLDFORMS 135
8.4 New and old forms
The problem with finding a basis of Mk(N) which are eigenforms for all Tn is that someforms really come from smaller level, and the Tnās depend upon not just n but N .
To be precise, if d|N , then we get forms of level N from forms of level d just by inclusionMk(d) ā Mk(N), simply because Ī0(d) ā Ī0(N). Let f(z) =
āanq
n be a form in Mk(N)which is also in Mk(d), and suppose p is a prime dividing N but not d. To distinguish theHecke action in level d and level N , write T dp (resp. TNp ) for the p-the Hecke operator onMk(d) (resp. Mk(N)). Then, by Theorem 8.1.3, we have
(T dp f)(z)ā (TNp f)(z) = pkā1ā
nā”0 mod p
an/pqn = pkā1
āān=0
anqpn = pkā1f(pz).
Supposing f is a normalized eigenform on Mk(d), we see
(Upf)(z) = (TNp f)(z) = (T dp f)(z)ā pkā1f(pz) = apf(z)ā pkā1f(pz).
(Note the similarity to (8.2.1).) Hence f is also an eigenform for TNp only if there exists Ī»such that f(pz) = Ī»f(z) for all z, i.e., only if f is constant. Therefore, when we raise thelevel at p (via simple inclusion Mk(d) āMk(N)), eigenforms for Tp do not generally remaineigenforms at p.
So, if we want to try to find an basis of eigenforms of level N for all Tn, it looks likewe should try to get rid of all forms that come from smaller level.3 Now you might ask ifthere are any other ways to construct modular forms of level N from those of smaller levelsbesides simple inclusion. Weāve already seen one related to the Up transformations. Notethat with d,N , and p as above, we have that for f āMk(d),
(Upf)(z) = (TNp f)(z) = (T dp f)(z)ā pkā1f(pz).
Since T dp f āMk(d) āMk(N), we see Upf = TNp f āMk(N) implies the function z 7ā f(pz)also lies in Mk(N).
We can generalize this construction to replace p by an arbitrary integer:
Proposition 8.4.1. Suppose f(z) =āanq
n āMk(N). Then
g(z) = f(mz) =ā
anqmn āMk(mN).
Further, if f ā Sk(N), show g ā Sk(mN).
Exercise 8.4.2. Prove the above proposition.
The next exercise tells us more about the eigenspaces of Up = TNp in our earlier setup.
3We havenāt shown this is strictly necessary, except in M2(8) with Eisenstein series earlier. In factExercise 8.4.3 and Remark 8.4.4, together with the AtkināLehner decomposition below, indicate that forf ā Sk(d) a eigenform, f not being an eigenform for TNp in Sk(N) should not be an honest obstruction tofinding a basis of complete eigenforms for Sk(N) at least when p2 - N .
CHAPTER 8. NEWFORMS AND OLDFORMS 136
Exercise 8.4.3. Let p, d|N with p - d. Suppose f(z) =āanq
n is a normalized eigenform,let g(z) = f(pz), and Vf be the 2-dimensional subspace of Mk(N) generated by f =
āanq
n
and g.(i) Show TNp g = f .(ii) Prove that Vf has a basis of eigenforms for TNp if and only if |ap| 6= 2
āpkā1.
In this exercise you should observe that Up restricted to the image of Mk(d) in Mk(N)under inclusion is simply the inverse of this inclusion map.
Remark 8.4.4. You may recall we earlier mentioned Deligneās bound for cusp forms whichsays |ap| ā¤ 2
āpkā1 in the above setting. The SatoāTate conjecture (now a theorem) says
that the apās are, in a suitable sense, uniformly distributed in this range (for p fixed and fvarying). Thus we very rarely expect equality, and in fact it is conjectured that Deligneāsinequality is really a strict inequality. This means that, in our earlier discussion, while aneigenform f ā Sk(d) is no longer an eigenform for Tp in Sk(pd), it can conjecturally beexpressed as a linear combination of two eigenforms for Tp in Sk(pd).
Now we come to the definition of newforms and oldforms.
Definition 8.4.5. We define the space of oldforms of Sk(N) by
Soldk (N) = Span f(mz) : f ā Sk(d), dm|N, d < N .
Definition 8.4.6. We define the space of newforms of Sk(N) to be the orthogonal com-plement of Sold
k (N) in Sk(N), i.e.,
Sk(N) = Soldk (N)ā Snew
k (N).
Let us say a newform f ā Sk(N)new is an eigennewform4 if it is an eigenform of allthe Tpās (and thus all the Tnās), of all Wpās for p|N , and of WN .
Theorem 8.4.7 (AtkināLehner). Let f(z) =āanq
n ā Snewk (N) be an eigennewform. Then
1. the eigenvalues Ī»pās and Ī»N for f under the action of the Wpās (p|N) and WN are Ā±1;
2.āp|N Ī»p = Ī»N ;
3. if p|N but p2 - N , then ap = āĪ»ppk2ā1;
4. if p2|N , then ap = 0.
Theorem 8.4.8 (AtkināLehner). The space Snewk (N) has a basis consisting of eigennew-
forms.4In the literature, the term newform usually means a normalized eigennewform, so in this terminology
the set of newforms are a basis for Snewk (N) rather than the whole space. For instance, in [AL70], the term
newform means what we call eigennewform. You should probably stick to standard terminology and just pitymy deep-seated need to call all the elements of Sk(N)new newforms coming from psychological inadequaciesformed in childhood. I blame a clown poster in my bedroom.
CHAPTER 8. NEWFORMS AND OLDFORMS 137
Both of these theorems are contained in [AL70, Thm 3] (together with the observationthat
āp|N Wp = WN for 2 ).
We remark the following consequence of the above theorems (along with multiplicativityof Fourier coefficients of eigennewforms), which gives an (only if) test for whether f lies inthe space newforms of non-squarefree level, i.e., an easy way to detect oldforms in certaincases.
Corollary 8.4.9. Let f =āanq
n ā Sk(N) and suppose p2|N for some prime p. Thenf ā Snew
k (N) only if an = 0 for all n ā” 0 mod p.
Note there is no analogous test when N is squarefree. For instance, if k = 2 and p|Nbut p2 - N , then there are typically eigennewforms f, g ā Snew
2 (N) where ap(f) = +1 andap(g) = ā1, so we can make the p-th Fourier coefficient of some linear combination of fand g arbitrary. Of course if one restricts to complete normalized eigenforms, we can useTheorem 8.4.7 directly.
Atkin and Lehner also describe the full space of cusp forms in terms of newforms ofsmaller levels. For d|N and m|Nd , we have a map Ī¹m : Sk(d) ā Sk(N) via f(z) 7ā f(mz).When m = p, this is just the map given by the Up operator.
Proposition 8.4.10. We have
Sk(N) =ād|N
ām|N
d
Ī¹m(Snewk (d)). (8.4.1)
Example 8.4.11. For prime and prime squared levels, the above decomposition explicitlyreads
Sk(p) = Sk(1)ā Ī¹p(Sk(1))ā Snewk (p)
and
Sk(p2) = Sk(1)ā Ī¹p(Sk(1))ā Ī¹p2(Sk(1))ā Snew
k (p)ā Ī¹p(Snewk (p))ā Snew
k (p2).
Note that in the case of Sk(p), this means the old space can be decomposed as a directsum Sold
k (p) =āVf , where Vf = Cf + CĪ¹pf and f runs over a basis of eigennewforms for
Sk(1). Then by Exercise 8.4.3 and Remark 8.4.4, this means the full cuspidal space Sk(p)conjecturally has a basis of complete eigenforms.
Each Snewk (d) has a basis consisting of eigennewforms (of level d), and their Fourier
coefficients are multiplicative for all n, so this gives a basis of Sk(N) consisting of formswhich have this multiplicativity property. Note there is a difference between the Fouriercoefficients of f being multiplicative for all n and being an eigenform for all Tn.
We can use the dimension formulas given in Section 5.2 to compute dimSk(N)new. Thegeneral case is treated in [Mar05]. We just illustrate a couple of special cases:
Corollary 8.4.12. For k ā„ 2 even, we have
dimSnewk (p) = dimSk(p)ā 2 dimSk(1)
anddimSnew
k (p2) = dimSk(p2)ā 2 dimSk(p) + dimSk(1).
CHAPTER 8. NEWFORMS AND OLDFORMS 138
In particular, for k = 2, we have
dimSnew2 (p) = dimS2(p) =
p+ 1
12ā 1
4
(1 +
(ā1
p
))ā 1
3
(1 +
(ā3
p
)),
and
dimSnew2 (p2) =
112(p+ 1)(pā 8) + 1 + 1
4
(1 +
(ā1p
))+ 1
3
(1 +
(ā3p
))p ā„ 7,
0 p ā¤ 5.
Proof. The first two dimension formulas follow immediately from the explicit decompositionsin the previous example. When k = 2, note that S2(1) = 0, so Snew
2 (p) = S2(p), and thatdimension formula was given in (5.2.4). The explicit k = 2 for level p2 then follows from(5.2.5).
We remark that while we canāt find a basis of Sk(N) consisting of complete eigenformsin general, Atkin and Lehner showed one can find a basis of eigenforms which are alsoeigenfunctions of the Wqās.
Proposition 8.4.13 ([AL70], Lem 27). There exists a basis of Sk(N) consisting of functionswhich are eigenforms for all Tp with p - N and Wq with q|N .
Note that the decomposition (8.4.1) also induces a canonical basis of eigenforms of Sk(N).Namely for each d|N , one takes the set of normalized eigennewforms f of Sk(d). Then theunion of the sets
f(mz) : m|Nd
for all such d, f gives a uniquely defined basis for Sk(N).
However, the oldforms in this basis are not eigenfunctions of the Wqās. See [AL70, Sec 5] fora precise description of a basis as in Proposition 8.4.13 in terms of eigennewforms of Sk(d),d|N , including a description of the eigenvalues of Wq on the oldforms.
Now we say describe a refinement of the decomposition in (8.4.1), related to such bases.For d|N and f an eigennewform of Sk(d), we consider the subspace of Sk(N) generated byf =
āanq
n:ćfćNd :=
ām|N
d
Ī¹m(Cf) =ām|N
d
Cf(mz).
Exercise 8.1.9 tells us that for p - m, f(mz) is an eigenform for Tp with eigenvalue ap, i.e., Ī¹mpreserves Hecke eigenvalues prime to m. Consequently Tp acts on ćfćNd by scaling by ap forp - N , and thus Tn acts on this space by an when gcd(n,N) = 1. Thus the decomposition(8.4.1) can be refined to
Sk(N) =ād|N
āf
ćfćNd , (8.4.2)
where f runs over eigennewforms of Sk(d). Moreover this latter decomposition decom-position is a decomposition into the collection of common eigenspaces for the Tnās withgcd(n,N) = 1:
Theorem 8.4.14 (AtkināLehner). Suppose g ā Sk(N) is an eigenfunction of all Tpās forp - N . Then there exists a eigennewform f ā Sk(d) for some d|N such that g ā ćfćNd .Furthermore, if g 6= 0, there is a unique such f .
CHAPTER 8. NEWFORMS AND OLDFORMS 139
In fact, since Sk(N) is finite dimensional, there is some bound B (depending on k,N)such that the theorem is true if we only require g to be an eigenfunction of all Tpās withp - N and p ā¤ B. Namely, we can take B as in Sturmās bound (Theorem 5.2.1).
This first part of this theorem is [AL70, Thm 4], and the proof comes essentially byusing the decomposition (8.4.2) (which then implies the uniqueness of f) and comparingL-functions of different eigennewforms.
We remark this theorem is also a special case of a theorem known as strong multi-plicity one for automorphic representations. The representation theoretic connection isthat each eigennewform f determines a unique cuspidal automorphic representation (thesubject of another course) Ļf , whose representation space restricted to Sk(N) is preciselyćfćNd . Which is to say that these spaces ćfćNd are quite natural to look at from the pointof view of representation theory, and the Strong Multiplicity One Theorem says that onecan distinguish different cuspidal automorphic representations Ļf , Ļf ā² (here f ā² is some othereigennewform of possibly a different level, and even possibly a different weight) by lookingat sufficiently many Hecke eigenvalues prime to the levels of f and f ā².
What this theorem means in practice is that eigennewforms are determined by a finitecollection of Hecke eigenvalues (once we specify the weight and bound the level, though infact strong multiplicity one says it suffices to merely bound the weight and level), so formany purposes to understand Sk(N) it suffices to determine (finite) systems of Hecke eigen-values. Computing systems of Hecke eigenvalues Ī»nās for gcd(n,N) = 1 is then equivalentto obtaining the decomposition (8.4.2). There are known ways to compute the actions ofHecke operators Tn on Sk(N) without knowing in advance the spaceMk(N) (say as given byFourier expansions), e.g., using modular symbols or Brandt matrices on quaternion algebras.5
These provide effective ways to compute Sk(N).We illustrate this by means of an example.
Example 8.4.15. We can compute in Sage some matrices for Hecke operators on S2(55):
T2 =
0 2 0 ā1 11 0 ā2 3 ā10 0 ā2 3 ā30 1 ā2 3 ā30 0 0 0 ā2
, T3 =
0 0 2 ā2 00 ā2 2 ā2 01 ā2 5 ā4 ā20 ā2 4 ā4 ā10 0 0 0 ā1
,
and
T7 =
ā1 1 0 ā1 1
0 ā2 0 0 0ā1 ā1 ā2 1 ā1ā1 ā1 0 ā1 ā1
0 0 0 0 ā2
5Actually Brandt matrices compute Hecke operators on a slightly smaller space, which can be both
computationally and theoretically advantageous. E.g., when k ā„ 4 and N = p is prime, the Brandt matricesnaturally give the Hecke operators on Snew
k (p).
CHAPTER 8. NEWFORMS AND OLDFORMS 140
Then T2, T3 and T7 have as bases of eigenvectors
v1 =
10ā1ā10
, v2 =
1
2 +ā
23
3 +ā
20
, v3 =
1
2āā
23
3āā
20
, v4 =
2ā2011
, v5 =
00111
.
For both T2 and T3, v1, v2 and v3 generate distinct eigenspaces, but v4 and v5 span a 2-dimensional eigenspace. In this case, T7 only has 2 eigenspaces. Specifically, with respect tothis basis of eigenvectors, these Hecke matrices become
T ā²2 =
1 0 0 0 0
0 1 +ā
2 0 0 0
0 0 1āā
2 0 00 0 0 ā2 00 0 0 0 ā2
, T ā²3 =
0 0 0 0 0
0 ā2ā
2 0 0 0
0 0 2ā
2 0 00 0 0 ā1 00 0 0 0 ā1
,
and
T ā²7 =
0 0 0 0 00 ā2 0 0 00 0 ā2 0 00 0 0 ā2 00 0 0 0 ā2
,
From looking at the eigenvalues for T2 and T3, we know the first three eigenvectors mustcorrespond to newforms (because Hecke eigenvalues oldforms will appear multiple times),and we might suspect that the 2-dimensional space ćv4, v5ć corresponds to a 2-dimensionaloldspace ćfć55
p where p = 5 or p = 11.Indeed, S2(1) = S2(5) = 0 but dimS2(11) = 1. So the decomposition theorem says
S2(55) = Snew2 (55)ā ćf4ć55
11,
with the latter space being 2-dimensional, and
f4(z) = q ā 2q2 ā q3 + 2q4 + q5 + 2q6 ā 2q7 ā 2q9 + Ā· Ā· Ā·
(Here we know the Fourier coefficients a2, a3 and a7 by the above calculations. One canseparately check T5 on S2(11), and get the other listed coefficients by multiplicativity andrecurrence relations.) The above calculations, together with the calculation of T5, tells us
Snew2 (55) = Cf1 ā Cf2 ā Cf3,
where
f1(z) = q + q2 ā q4 + q5 ā 3q8 ā 3q9 + Ā· Ā· Ā·f2(z) = q + (1 +
ā2)q2 ā 2
ā2q3(1ā 2
ā2)q5 ā q5 ā (4 + 2
ā2)q6 ā 2q7 + Ā· Ā· Ā·
f3(z) = q + (1āā
2)q2 + 2ā
2q3(1 + 2ā
2)q5 ā q5 ā (4ā 2ā
2)q6 ā 2q7 + Ā· Ā· Ā·
(Here for i = 1, 2, 3, fi is the eigennewform with Fourier coeffcient ap equal to the eigenvaluefor vi under the action of Tp.)
CHAPTER 8. NEWFORMS AND OLDFORMS 141
Let us end with some remarks on the original question about whetherMk(N) has a basisconsisting of forms whose Fourier coefficients are multiplicative for all n (i.e., amn = amanwhenever gcd(m,n) = 1).
By (8.4.2), for the space of cusp forms Sk(N), this boils down to determining whether,for every newform f ā Snew
k (d), the subspace Vf,m := ćĪ¹m(f) : m|Nd ć has a basis consistingof forms with multiplicative coefficients. As in Example 8.4.11, we expect this to be true forSk(p) because conjecturally there is a basis of complete eigenforms.
On the other hand, the AtkināLehner decomposition easily gives a positive answer to amore generalized notion of multiplicative coefficients:
Proposition 8.4.16. The space Sk(N) has a basis of forms f(z) =āanq
n whose Fouriercoefficients are essentially multiplicative in the following sense: for each such f , there existsm ā N (in fact m|N) such that an = 0 if m - n and amnnā² = amnamnā² for all n, nā² ā N suchthat gcd(n, nā²) = 1.
Note that the essential multiplicativity condition is simply multiplicativity when m = 1.
Proof. Note Ī¹m(f) satisfies this property for a newform f ā Sk(d), and apply the decompo-sition (8.4.2).
If one wants to extend this to Mk(N), one needs a newform theory for Eisenstein series.This was not done by Atkin and Lehner, but worked out in the thesis of Weisinger (Harvard,1977). We wonāt explain this, but just mention it is possible. The obstruction to definingnewforms in Mk(N) in the same way as for Sk(N) is that the Petersson inner product is notdefined on all of Mk(N).
Chapter 9
Hilbert modular forms
The modular forms we have been studying up until now are often called elliptic modularforms, due to the connection with classical elliptic functions (and elliptic curves). In thischapter, we give a brief exposition of the theory of Hilbert modular forms, which is one kindof generalization of our usual elliptic modular forms. We will assume some familiarity withbasic algebraic number theory in this chapter.
We motivated the theory of modular forms in the introduction via quadratic formsand theta series. Namely, the number of ways rk(n) is a sum of k-squares is just the n-th coefficient in the q-expansion of Ļk(z). Observing that Ļ satisfies some transformationproperties, we consider the space of functions with similar properties, and are led to the thedefinition of a modular form of weight k/2.
One can similarly motivate the theory of Hilbert modular forms by considering quadraticforms over more general number fields. For simplicity, suppose F = Q(
ād) is with d > 0
the discriminant and having class number one. This has ring of integers OF = Z[ād] if
d ā” 0 mod 4 and OF = Z[1+ād
2 ] if d ā” 1 mod 4. Here the right analogue of Jacobiās thetafunction from (4.4.1) is
ĻOF (z1, z2) =ā
a+bādāOF
e2Ļi(z1(a+bād)2+z2(aāb
ād)2) =
āĪ±āOF
qĪ±2
1 qĪ±2
2 ,
where qi = e2Ļizi . The reason one considers a theta function of 2 variables is to account forthe two embeddings of F into R. Note this contains the more naive generalizationā
Ī±āOF
qĪ±2
= ĻOF (z, 0)
of Jacobiās theta function, where, as usual q = e2Ļiz. However, one gets a nicer theory byaccounting for the different embedding of F into R (just as one does in algebraic numbertheory). Using ĻOF , one can study the number of representations of n in OF as a sum of ksquares. Again, one can proceed from the transformation properties of ĻOF to define Hilbertmodular forms over F . Notice that our āFourier expansionā is also more complicated in thiscaseāit runs not over Z but over OF .
While quadratic forms provided our main motivating problem to study modular forms,we built up the definition more geometrically, by considering the surfaces Y0(N) = Ī0(N)\H
142
CHAPTER 9. HILBERT MODULAR FORMS 143
(or rather their compactifications X0(N)) and studying functions on them. Recall thatwhile there are no nonconstant holomorphic functions on X0(N), the derivatives of modu-lar functions (weak modular forms) have different transformation properties and for mosteven weights (just exclude k = 2 when N = 1), there are non-constant holomorphic forms.Consequently one can construct modular functions by constructing holomorphic modularforms and taking appropriate products and ratios to get something of weight 0 (a mod-ular function). This gives a construction of the famous j-invariant modular function inExercise 4.2.13, which is important in the theory of elliptic curves.1
In our real quadratic case F = Q(ād), the analogue will be to look at functions on a
quotient Ī\(HĆ H), where Ī is a group of isometries of HĆ H. Note that since PSL2(R) isthe isometry group of H, PSL2(R)ĆPSL2(R) acts by isometries on HĆH. The embeddingĪ± 7ā (Ī±, Ī±) of F into R Ć R that one usually considers in algebraic number theory inducesan embedding of PSL2(F ) into PSL2(R)ĆPSL2(R). The analogue of looking at congruencesubgroups of PSL2(Z) in the case of modular forms is to look at congruence subgroups
Ī ā PSL2(OF ) ā PSL2(R)Ć PSL2(R).
In fact one often replaces PSL2(OF ) by slightly more general arithmetic groups.Either of these approaches lead one to define Hilbert modular forms over a totally real
number field F (i.e., all embeddings of F into C lie in R, so Q( 3ā
2) is real, but not totallyreal).
Hilbertās motivation for studying modular forms over number fields was as follows. TheKroneckerāWeber theorem says that any abelian extension of Q can be generated by rootsof unity, i.e., special values of the function f(x) = eix. The theory of complex multiplicationdoes something analogous for imaginary quadratic extensions K/Q, realizing KroneckerāsJugendtraum (ādream of youthā). Specifically, one can generate any abelian extension of Kusing special values of the j-invariant special values of the Weierstrass ā function. Hilbertāstwelfth problem asks more generally for a determination of the the abelian extensions ofa general number field K. Consequently, one wants analogues of special functions likeexponential functions, the modular function j(Ļ) and the ā function. Hilbertās idea was toconsider modular forms over number fields to find analogues of these special functions.
Some references are [Fre90] for full level, [Bru08] or [vdG88] for F real quadratic, or[Gar90] or [Shi78] in full generality.
9.1 Basic definitions and results
Let F be a totally real number field of degree r, i.e., [F : Q] = r and there are r embeddings,Ī¹1, . . . , Ī¹r, of F into R. Denote the ring of integers of F by OF . We consider all of theseembeddings at once via
Ī¹ : F ā Rr, Ī¹(x) = (Ī¹1(x), . . . , Ī¹r(x)).
1In retrospect, I suppose this was a bit incoherentāour motivation for modular forms was to studyquadratic forms, and we use modular functions as motivation for the definition, but then we essentially dropthe problem of constructing modular functions once we have the definition of modular forms apart from thatone exercise, and just study modular forms with applications to quadratic forms. If I ever add a chapter onelliptic curves, I will hopefully say more about the j-invariant so that there is some payoff of spending allthat time on modular functions.
CHAPTER 9. HILBERT MODULAR FORMS 144
The advantage of doing this is that then Ī¹(OF ) is a lattice (discrete Z-module) in Rr, which isanalogous to Z being a discrete subgroup of R. (Note if we just consider a single embedding,e.g., Z[
ā5] ā R, we do not get a discrete subset of R when r > 1.)
Elliptic modular forms arose from looking at the modular curves Ī\H, where Ī is afinite-index subgroup of PSL2(Z), which acts discretely on H (i.e., the orbits under thisaction are discrete subsets of H). The analogue should be to look at finite-index subgroupsof PSL2(OF ). But since PSL2(OF ) ā PSL2(R) does not act discretely on H. Just as OF canbe viewed as a lattice in Rr, we can view PSL2(OF ) acting discretely on the r-fold productof upper-half planes Hr.
For g = (g1, . . . , gr) ā PSL2(R)r, we let g act on Hr via
g Ā· z = (g1z1, . . . , grzr), z = (z1, . . . , zr).
Then we extend
Ī¹ : GL2(F )ā GL2(R)r, Ī¹(Ī³) = (Ī¹1(Ī³), . . . , Ī¹r(Ī³)),
whereĪ¹i(Ī³) : GL2(F )ā GL2(R) is just given by coordinate-wise application of Ī¹i. Restrictingto SL2(F ) and quotienting out by Ā±I gives gives an embedding Ī¹ : PSL2(F )ā PSL2(R)r. So
we let PSL2(F ) act on Hr via composition with Ī¹. Notationally, if Ī³ =
(a bc d
)ā PSL2(F )
and z = (z1, . . . , zr), we will denote this action by(a bc d
)z =
az + b
cz + d:=
(Ī¹1(a)z1 + Ī¹1(b)
Ī¹1(c)z1 + Ī¹1(d), . . . ,
Ī¹r(a)z1 + Ī¹r(b)
Ī¹r(c)z1 + Ī¹r(d)
).
Proposition 9.1.1. PSL2(OF ) acts discretely on Hr.
We call PSL2(OF ) the (full) Hilbert modular group. The analogue of the congruencesubgroups Ī0(N) inside PSL2(Z) are
Ī0(N) =
(a bc d
)ā SL2(OF ) : c ā N
/ Ā±I ,
where N is a nonzero integral ideal of OF . We call these congruence subgroups ofPSL2(OF ). (As when r = 1, there are more congruence subgroups than just the Ī0(N)ās,but these are all we will consirer.) Note when F = Q, Ī0(NZ) = Ī0(N) for N ā N. Ingeneral, one can check that Ī0(N) has finite index in PSL2(OF ).
As in the case of r = 1, one adjoins cusps to Hr. Namely, we define the extended r-foldproduct of upper half-planes Hr to be the union of Hr together with the boundary pointsĪ¹(F ) ā Rr and the point at infinity iā = (iā, . . . , iā). Then Hr ā H āŖ R āŖ iār. Theelements of Hr ā Hr are the cusps of Hr. There is a natural way to put a topology on Hr
as well as extend the action of PSL2(F ) to Hr. Thus for a subgroup Ī of PSL2(OF ), we cantalk about the cusps of the quotient Ī\Hr, which are the Ī-orbits of the cusps of Hr.
Unlike the case of F = Q, there may be many cusps for PSL2(OF )\Hr, but one can showthere are finitely many:
Theorem 9.1.2. The number of cusps of PSL2(OF )\Hr is the class number hF of F .
CHAPTER 9. HILBERT MODULAR FORMS 145
For c, d ā F , k ā Zr and z = (z1, . . . , zr) ā Hr, we denote by
(cz + d)k = (Ī¹1(c)z1 + Ī¹1(d))k1 Ā· Ā· Ā· (Ī¹r(c)z1 + Ī¹r(d))kr .
Definition 9.1.3. Let k = (k1, . . . , kr) ā Zr and Ī be a finite index subgroup of PSL2(OF ).A (holomorphic) Hilbert modular form of weight k on Ī\Hr is holomorphic* functionf : Hr ā C satisfying
f(Ī³z) = (cz + d)kf(z), Ī³ =
(a bc d
)ā Ī0(N). (9.1.1)
If also f vanishes at the cusps, we say f is a cusp form. Denote the space of weight kHilbert modular forms and cusp forms on Ī\Hr respectively by Mk(Ī) and Sk(Ī).
If Ī = Ī0(N), then we write Mk(N) = Mk(Ī) and Sk(N) = Sk(Ī). A form in Mk(N)is said to have level N.
For F = Q, we have Mk(NZ) = Mk(N) and Sk(NZ) = Sk(N). When F 6= Q, by aholomorphic function of Hr we mean a function f : Hr ā C which is holomorphic in each ofthe r complex variables z1, . . . , zr. It should also extend to be āholomorphic at the cusps,āwhich is a notion one can define precisely, and once one defines it one can show itās notactually needed! (That is, itās not needed for the definition of holomorphic Hilbert modularforms, but it is used in the theory.)
Theorem 9.1.4. (Koecherās principle) Suppose [F : Q] > 1. If f : Hr ā C is holomor-phic and satisfies (9.1.1), then f is holomorphic at the cusps.
The proof of Koecherās principle uses Fourier expansions, which we describe next (thatis, we describe Fourier expansions, but not the proof of Koecherās principle).
The inverse different of F is the fractional ideal
Oā„F =x ā F : trF/Q(xOF ) ā OF
.
Here the trace from F to Q is defined as the sum of the conjugates, i.e., trF/Q(x) =āĪ¹i(x).
So if we define tr : Cr ā C via tr(z1, . . . , zr) =āzi, then tr(Ī¹(x)) = trF/Q(x), which we
also denote simply as tr(x).
Proposition 9.1.5. Given f āMk(N), we have the Fourier expansion (at ā)
f(z) =āĪ¾āOā„F
cĪ¾e2Ļitr(Ī¾z) =
āĪ¾āOā„F
cĪ¾e2Ļi(Ī¹1(Ī¾)z1+Ā·Ā·Ā·+Ī¹r(Ī¾)zr),
for some (uniquely determined) collection of Fourier coefficients cĪ¾ ā C.
As when F = Q, there are also Fourier expansions around other cusps.To specify f , we sometimes denote the Fourier coefficient cĪ¾ by cĪ¾(f). The idea of the
proof is to use that fact that f is invariant under action by(
1 x0 1
)for all x ā OF .
We call x ā F totally positive if Ī¹i(x) > 0 for all x. Koecherās principle in fact tells us:
CHAPTER 9. HILBERT MODULAR FORMS 146
Theorem 9.1.6. (Koecherās principle, 2nd version) For f āMk(N), cĪ¾(f) = 0 unlessĪ¾ = 0 or Ī¾ is totally positive.
As in the case of F = Q, a necessary condition for f to be a cusp form is c0(f) = 0.
Corollary 9.1.7. Let f āMk(N), where k = (k1, . . . , kr). If some ki 6= kj, then f ā Sk(N).
We call the weight of the form k = (k, . . . , k) parallel weight k. Thus the corollarytells us that we can only have non-cuspidal holomorphic Hilbert modular forms (e.g., theEisenstein series discussed below) in parallel weights. Another nice thing about parallelweights is restriction to the diagonal yields elliptic modular forms:
Exercise 9.1.8. Let f ā Mk(NOF ) where N ā N and k = (k, . . . , k). Show g : H ā Cgiven by g(z) = f(z, . . . , z) is an elliptic modular form in Mk(N).
Theorem 9.1.9. dimMk(N) <ā.
Bibliography
[Ahl78] Lars V. Ahlfors, Complex analysis, third ed., McGraw-Hill Book Co., New York,1978, An introduction to the theory of analytic functions of one complex variable,International Series in Pure and Applied Mathematics. MR 510197 (80c:30001)
[AL70] A. O. L. Atkin and J. Lehner, Hecke operators on Ī0(m), Math. Ann. 185 (1970),134ā160. MR 0268123 (42 #3022)
[And05] James W. Anderson, Hyperbolic geometry, second ed., Springer UndergraduateMathematics Series, Springer-Verlag London Ltd., London, 2005. MR 2161463(2006b:51001)
[Apo90] Tom M. Apostol, Modular functions and Dirichlet series in number theory, seconded., Graduate Texts in Mathematics, vol. 41, Springer-Verlag, New York, 1990.MR 1027834 (90j:11001)
[Boy01] Matthew Boylan, Swinnerton-Dyer type congruences for certain Eisenstein series,q-series with applications to combinatorics, number theory, and physics (Urbana,IL, 2000), Contemp. Math., vol. 291, Amer. Math. Soc., Providence, RI, 2001,pp. 93ā108. MR 1874523 (2002k:11063)
[Bru08] Jan Hendrik Bruinier, Hilbert modular forms and their applications, The 1-2-3 ofmodular forms, Universitext, Springer, Berlin, 2008, pp. 105ā179. MR 2447162
[Bum97] Daniel Bump, Automorphic forms and representations, Cambridge Studies in Ad-vanced Mathematics, vol. 55, Cambridge University Press, Cambridge, 1997. MR1431508 (97k:11080)
[CSS97] Gary Cornell, Joseph H. Silverman, and Glenn Stevens (eds.), Modular formsand Fermatās last theorem, Springer-Verlag, New York, 1997, Papers from theInstructional Conference on Number Theory and Arithmetic Geometry held atBoston University, Boston, MA, August 9ā18, 1995. MR 1638473 (99k:11004)
[DS05] Fred Diamond and Jerry Shurman, A first course in modular forms, GraduateTexts in Mathematics, vol. 228, Springer-Verlag, New York, 2005. MR 2112196(2006f:11045)
[Els06] JĆ¼rgen Elstrodt, A very simple proof of the eta transformation formula,Manuscripta Math. 121 (2006), no. 4, 457ā459. MR 2283473 (2007g:11051)
147
BIBLIOGRAPHY 148
[FB09] Eberhard Freitag and Rolf Busam, Complex analysis, second ed., Universitext,Springer-Verlag, Berlin, 2009. MR 2513384
[Fre90] Eberhard Freitag, Hilbert modular forms, Springer-Verlag, Berlin, 1990. MR1050763
[Gar90] Paul B. Garrett, Holomorphic Hilbert modular forms, The Wadsworth &Brooks/Cole Mathematics Series, Wadsworth & Brooks/Cole Advanced Books &Software, Pacific Grove, CA, 1990. MR 1008244
[Iwa97] Henryk Iwaniec, Topics in classical automorphic forms, Graduate Studies in Math-ematics, vol. 17, American Mathematical Society, Providence, RI, 1997. MR1474964 (98e:11051)
[Kat92] Svetlana Katok, Fuchsian groups, Chicago Lectures in Mathematics, University ofChicago Press, Chicago, IL, 1992. MR 1177168 (93d:20088)
[Kil08] L. J. P. Kilford, Modular forms, Imperial College Press, London, 2008, A classicaland computational introduction. MR 2441106 (2009m:11001)
[KL06] Andrew Knightly and Charles Li, Traces of Hecke operators, Mathematical Surveysand Monographs, vol. 133, American Mathematical Society, Providence, RI, 2006.MR 2273356
[Kob93] Neal Koblitz, Introduction to elliptic curves and modular forms, second ed., Grad-uate Texts in Mathematics, vol. 97, Springer-Verlag, New York, 1993. MR 1216136(94a:11078)
[Lan95] Serge Lang, Introduction to modular forms, Grundlehren der Mathematischen Wis-senschaften [Fundamental Principles of Mathematical Sciences], vol. 222, Springer-Verlag, Berlin, 1995, With appendixes by D. Zagier and Walter Feit, Correctedreprint of the 1976 original. MR 1363488 (96g:11037)
[Lan99] , Complex analysis, fourth ed., Graduate Texts in Mathematics, vol. 103,Springer-Verlag, New York, 1999. MR 1659317 (99i:30001)
[Mar05] Greg Martin, Dimensions of the spaces of cusp forms and newforms on Ī0(N) andĪ1(N), J. Number Theory 112 (2005), no. 2, 298ā331. MR 2141534
[Mil] J.S. Milne, Modular functions and modular forms, Online course notes.http://www.jmilne.org/math/CourseNotes/mf.html.
[Miy06] Toshitsune Miyake, Modular forms, english ed., Springer Monographs in Mathe-matics, Springer-Verlag, Berlin, 2006, Translated from the 1976 Japanese originalby Yoshitaka Maeda. MR 2194815 (2006g:11084)
[Sch74] Bruno Schoeneberg, Elliptic modular functions: an introduction, Springer-Verlag,New York, 1974, Translated from the German by J. R. Smart and E. A. Schwandt,Die Grundlehren der mathematischen Wissenschaften, Band 203. MR 0412107 (54#236)
BIBLIOGRAPHY 149
[Ser73] J.-P. Serre, A course in arithmetic, Springer-Verlag, New York, 1973, Translatedfrom the French, Graduate Texts in Mathematics, No. 7. MR 0344216 (49 #8956)
[Shi78] Goro Shimura, The special values of the zeta functions associated with Hilbertmodular forms, Duke Math. J. 45 (1978), no. 3, 637ā679. MR 507462
[Sta09] John Stalker, Complex analysis, Modern BirkhƤuser Classics, BirkhƤuser BostonInc., Boston, MA, 2009, Fundamentals of the classical theory of functions, Reprintof the 1998 edition. MR 2547082 (2010i:30001)
[Ste07] William Stein, Modular forms, a computational approach, Graduate Studies inMathematics, vol. 79, American Mathematical Society, Providence, RI, 2007, Withan appendix by Paul E. Gunnells. MR 2289048 (2008d:11037)
[vdG88] Gerard van der Geer, Hilbert modular surfaces, Ergebnisse der Mathematik undihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], vol. 16,Springer-Verlag, Berlin, 1988. MR 930101
[Zag08] Don Zagier, Elliptic modular forms and their applications, The 1-2-3 of modularforms, Universitext, Springer, Berlin, 2008, pp. 1ā103. MR 2409678 (2010b:11047)
Index
Bk, 56E(Ī), 21Ek, 53Ek,N , 53Gk, 55Gk,N , 58L(s, Ļ), 115L(s, f), 120Mk(N), 145Mk(N), 63Mk(Ī), 63S, 31Sk(N), 145Sk(N), 75Sk(Ī), 75T , 31Tn, 98, 128Up, 129Wp, 134Y0(N), 44Z(s), 114ćf, gć, 108ā(z), 76Ī(N), 36Ī0(N), 36Ī1(N), 36H, 24, 26Ī(s, f), 120PSL2(R), 28SL2(R), 28H, 41f , 120Ī“k(n), 71Ī·(z), 72C, 15P1(Z/NZ), 36Ļk(m), 56
Ļ(n), 76Ļ(z), 67ā, 22Ī¶(s), 113cĪ¾(f), 145j-invariant, 57j(Ī³, z), 51p(n), 74q-expansion, 45, 46qN , 46qĻ , 46vp(f), 79
analytic, 13AtkināLehner operator, 134automorphy factor, 51
Bernoulli number, 56
Cauchyās residue theorem, 78CauchyāRiemann equations, 12completed L-function, 120completed zeta function, 114congruence subgroup, 36, 144critical line, 114critical strip, 114cusp, 144cusp form, 75cusps, 41
Dedekind eta function, 72Dedekind zeta function, 117degree, 118dimension formula, 86Dirichlet L-function, 115Dirichlet character, 115Dirichlet convolution, 132discriminant, 23
150
INDEX 151
discriminant modular form, 76
eigenform, 102, 129eigennewform, 136Eisenstein series, 52elliptic curve, 23elliptic element, 39elliptic function, 21elliptic modular form, 142elliptic point, 39entire, 12equivalent (lattices), 24Euler product, 115explicit formula, 114extended upper half-plane, 41
Fourier coefficient, 19, 145Fourier coefficients, 63Fourier expansion, 46, 145fractional linear transformation, 28Fricke involution, 120, 134functional equation, 114fundamental domain, 18, 32
gamma function, 114
Hecke L-function, 120Hecke converse theorem, 121Hecke operator, 98, 127Hilbert cusp form, 145Hilbert modular form, 145Hilbert modular group, 144holomorphic, 12holomorphic at iā, 62holomorphic at cusps, 62hyperbolic plane, 26
inverse different, 145
Jacobi theta function, 67
Koecherās principle, 145, 146
Laurent series, 15level, 50, 63, 145Lipschitzā formula, 54local L-factor, 118
Mƶbius inversion, 60Maass form, 106Mellin transform, 121meromorphic, 15meromorphic at ā, 20meromorphic at cusps, 62meromorphic modular form, 63moderate growth, 48modular curve, 44modular form, 63modular function, 45, 47modular group, 31, 36multiplicative, 118
newform, 136
oldform, 136open mapping theorem, 49order, 79order (of elliptic point), 40order (of pole), 15order (of zero), 14
parallel weight, 146pentagonal number, 73Petersson inner product, 108principal congruence subgroup, 36projective line, 36projective special linear group, 28
Ramanujan tau function, 76residue, 78Riemann sphere, 15Riemann zeta function, 113
slash operator, 46, 62, 97special linear group, 28standard fundamental domain, 34Sturmās bound, 88
totally multiplicative, 118totally positive, 145trace, 145triangular number, 71
upper half-plane, 24, 26
weak modular form, 50