]]>

Looking around, it seems this is kind of a “folklore proof,” in that I haven’t found a fully rigorous version in any book, but everybody knows about the idea. I liked the proof a lot, so here I’ll present a short account of it.

Implicitly, this idea takes the nice correspondence of nonsingular curves with Riemann surfaces as the origin of genus of curves. Riemann surfaces, once you forget the complex structure, just become smooth orientable real surfaces, and that’s pretty much the appeal – genus for smooth orientable surfaces is very geometric (“number of handles”), and probably the first place people see “genus,” so it’s an intuitive way to correspond it to the algebraic case. Of course, in modern treatments, the first principles of genus (arithmetic and geometric) are usually taken to be cohomological, but who cares.

So let’s say we have a projective plane curve of degree – hence, defined by a homogenous polynomial of degree . The degenerate case is just a union of lines. Generically, each pair of lines intersects at a point; taking the analytification correspondences, this is the wedge sum of spheres.

The idea is that by shifting the coefficients slightly to make the curve nonsingular, the global topology stays the same besides at the singular wedge points, which open up into holes between the spheres. Then, if we take one sphere as a “central sphere,” it’s clear that each pair of the remaining spheres creates a handle, so for the Riemann surface, we get , as desired. (Actually, this first stage can be replaced by ANY exhibition of a curve with genus , so there are many ways to proceed, but this one is most in line with the spirit of the proof.)

This is the first step which is usually skipped: proving that this is indeed what happens to the topology upon perturbing the coefficients to get a nonsingular curve. To rigorize this argument, we use the Milnor fibration.

Let our curve be ; initially take it to be hyperplanes intersecting generically, with the consequent polynomial being a product of linear factors. What’s more, we can take all intersections to be away from, say , so we affinize in and to , defining an affine slice of the curve where all the singularities happen. Fix one singularity; WLOG it is at the origin . If we take a ball of sufficiently small radius about the origin of , then for every sufficiently small disc of radius about the origin of the complex plane, the map is a locally trivial fibration.

I won’t go into the general theory of Milnor fibers here; consult chapter 6 of Wall’s book for a treatment. It is clear in this case that the Milnor fiber is a cyllinder (without the ends); see lemma 6.3.3. To visualize why this is true, just picture what is happening locally as the hyperbola .

But the Milnor fiber above is precisely the curve defined by in the small ball . This same logic applies to every singularity, so if we make a sufficiently small perturbation to the constant term, all the singularities indeed get resolved into connecting tubes. For sufficiently small , we have to check that the global geometry away from the singularities does not change.

Notice on the full projective curve, what we have done is replace with . In particular, there can clearly be no new singularities on the hyperplane , so for the global geometry, we can continue restricting ourselves to the affine case. In this case, the Milnor map at infinity, the same as above where we take to be an arbitrary **large** ball, is also a locally trivial fibration for a certain class of polynomials, known as tame polynomials; see Nemethi and Zaharia. This class includes all polynomials with isolated critical points, including our . Hence *globally* the analytic surfaces corresponding to , parametrized by , are all topologically the same for in some punctured disc in the complex plane. In particular, no new singularities can appear, and the global geometry away from the singularities remains the same. Thus we get spheres pairwise joined by little tubes, and consequently the desired genus of

Now that we have our “base case,” we work inside the parameter space: the homogenous polynomials of degree in the coordinate ring , which we will denote , given the obvious topology as a -vector space – finite dimensional, too, since there are finitely many monomials of fixed degree. We have the subspace of polynomials yielding nonsingular curves .

First, we will show that genus is locally constant on this subspace. This is the other rigorizing section typically skipped. Let be the monomials of degree , so that they are a basis for . Then take the projective variety defined by the homogenous (in ) coordinate ring – here we are affine in , projective in . There is an obvious rational map (affine space over ), whose fiber over a point is precisely the degree curve with those coefficients on the monomials. Every curve is equivalent to some curve in this family because the coefficient of can always be made nonzero via a projective transformation.

Notice that it’s clear that the only singular points of are singular points on the individual curves. Further, it’s not hard to see that the image of the singular curves on affine space is Zariski closed, since it ultimately consists of polynomial conditions on the coefficients, by the theory of resultants. Hence if we take the analytification of the morphism , yielding , by local compactness we can find an (analytic/Euclidean) compact neighborhood of a point on whose fiber is a nonsingular curve. The preimage of this is compact, as a closed subset of , since every Proj construction yields a compact manifold and closed subsets of compact spaces are compact.

Then restricted to this preimage is a surjective map of a smooth manifold onto its image, which is clearly a submersion by checking algebraic tangent spaces. It is also a (analytic) proper map by construction. Hence by We will use Ehresmann’s theorem, it’s a locally trivial fibration, so every fiber is the same topologically, hence has the same genus. By identifying with -scaling orbits in , we have our desired result.

Finally, it’s a snap to show that is connected. Indeed, inside the vector space , it’s the complement of . A polynomial defines a singular curve if and only if there is a common zero of with its partial derivatives in , , and simultaneously. Since the partial derivative is just a linear transformation of the coefficients (hence of the vector space), by the theory of resultants, this corresponds to a codimension surface in considered as an affine space in the coefficients.

Thus, is indeed connected as the complement of a codimension , so genus is constant. The result follows.

One reason I like this proof a lot is that it uses proto-versions of two big ideas in algebraic geometry. First, family arguments, where we turn an easy case into a hard one while keeping something invariant, hinting towards cycle classes, flat families, and deformation theory. Indeed, the second step is trivial with the theory of flat families, just a specialization of the fact that the Hilbert polynomial is constant. Second, working with a geometric space which parametrizes the objects we’re talking about (a proto-moduli space).

]]>

When we have a quotient of a finite-dimensional vector space by a vector subspace, instead of a looking at an element in the quotient space, one way is to consider the orthogonal complement of the vector subspace provided we have an inner product in the ambient space. So every element in the quotient space is represented by a unique element in the orthogonal complement of the subspace instead of an equivalence class in the original ambient space. This is the same as using the element with minimum length in its equivalence class to represent the equivalence class.

In our case, the vector space of all -closed smooth -valued -forms on , in general, is an infinite-dimensional vector space. Suppose we give a Hermitian metric and also give a Hermitian metric along its fibers. So there is a natural inner product on the vector space over all -closed smooth -valued -forms on . If we use this method of orthogonal complement, or equivalently, the method of finding an element with minimum length in its equivalence class, we run into the trouble of closedness of the subspace, or equivalently, the convergence of a sequence of elements in an equivalence class minimizing the length function. We know that the vector space of all -closed smooth -valued -forms on is not complete in the natural inner product. In order to talk about closedness of subspaces or the convergence of a minimizing sequence of elements, we should first consider the vector space of all -closed smooth -valued -forms on to a Hilbert space. So we should consider the set of all -closed -valued -forms on .

- The first question that arises in considering the set of all -closed -valued -forms on is the definition of the operator on -valued -forms on .
- The second question is the closedness of the image of after it is defined.

Let denote the set of all -valued -forms on , and let be the set of all -valued -forms on . The operator from to is defined on the dense subset of . In order to make the image of closed, we should define it on as large a set as possible. So we consider the closure of the graph of in the product space of and . Given an element of and an element of , the pair is in the graph if and only if there exists a sequence converging in the norm of to so that converges in the norm of to . We have to check that this closure is the graph of a map. The trouble is that for the closure of the graph, we may have two distinct elements of given as the image of the same element of . By taking the difference of these two distinct elements of , we can assume that there exists a sequence converging in the norm of to 0 so that converges in the norm of to some nonzero element of . Take an arbitrary element of . We have the process of integration by parts. So we get

where means the inner product in and is a first-order linear partial differential operator. Letting , we conclude that

for any , contradicting that is nonzero. Hence, we know that the closure of the graph of is also a graph. This argument works in general for differential operators because we have integration by parts.

There is another way to extend the definition of from . For any , the expression makes sense when differentiation is done in the sense of distributions. So in general, would be a current. If this current can be represented by an element of , then we say that belongs to the domain of and define as . The earlier extension done by using the closure of the graph of is known as the strong extension. At first, it seems that the weak extension may have a bigger domain than the strong extension. It turns out that the two extensions are the same. This is known as the *Friedrichs Extension Lemma *(K.O. Friedrichs, “The identity of weak and strong extensions of differential operators”. *Trans. Amer. Math. Soc.* 55 (1944), 132-151). First, one observes that by using a partition of unity, we can assume that all forms involved are supported in a single coordinate chart. Then in that coordinate chart, one uses smoothing by convolution with a cutoff function. Suppose

is a first-order differential operator with smooth coefficients. Let be a nonnegative function supported on the unit open ball of . Let

Suppose is an function. Then

in norm as . We assume that is taken in the sense of distributions in . Then

in norm as . It suffices to show that

in norm as . This is clearly true when belongs to the dense subset of smooth functions. So it suffices to show that

is bounded in norm when belongs to a set bounded in norm. The zero-order part of clearly has bounded contribution. So we can assume, without loss of generality, that . Then

Clearly, the second term on the right-hand side is bounded. So we can drop it. We have

In the last integral,

because . We have

Hence,

and the norm of

is bounded by

which is bounded if the norm of is bounded.

We want to show that the image of

is closed, and the kernel of

quotiented by the image of

is isomorphic to the set of all -closed smooth -valued -forms on . First, we handle the question of the closedness of the image of

This means that if we have so that converges to some in , then we want to show that for some . Let . If we are able to solve the equation so that or a subsequence of it converges to some in , then we are done.

We are going to consider the operator

First, we would like to discuss the motivation for this operator . We look at the equation , where is in and . We would like to solve it so that has a good bound when is bounded. The equation is equivalent to for all in , i.e. . Now, is the unknown. To know is the same as knowing the linear functional . Now, we know this linear functional is , and is known. The set of all is a subset of . If we can extend this known linear functional from the set to a bounded linear functional on all of , then we know whose norm is bounded by that of the bounded linear functional. Such an extension is possible if we have an estimate

We write so that and . Since , we have . From , we have and . Hence,

is equivalent to

We can assume for that inequality that . By the Schwarz Inequality, it suffices to have an estimate

Since to have the inequality that

This inequality can be rewritten as

This shows that the operator is closely related to the solution . If we do have the inequality

then our preceding discussion shows that the equation admits a solution with

We do not expect in general to have the inequality

because this inequality would imply that vanishes. However, we have a related inequality that can serve our purpose of proving the closedness of the image of . This related inequality is *Gårding’s Inequality*. It is says that

where the norm means a norm that is equivalent to the sum of the -norm of and the norm of the first derivative of .

Before we prove Gårding’s Inequality, let us look at its consequences. Rewrite Gårding’s Inequality in the form

We know that the operator admits an inverse whose norm is , because the operator is nonnegative. Replacing by in Gårding’s Inequality, we get

Since

it follows that

This means that the map from to is continuous, where is the Hilbert space of all -valued -forms whose first derivatives are also . Since by the *Rellich-Kondrachov Theorem* the inclusion map

is a compact operator, it follows that the map from to is a compact operator. A compact operator means that it maps a bounded set of the domain space to a relatively compact subset of the target space. We will prove the Rellich-Kondrachov Theorem later. By the spectral theorem for self-adjoint compact operators, we know that the eigenvalues of the operator from to are in , the only possible limit point of is 0, the eigenspace for each is finite-dimensional, and the eigenfunctions of span . The eigenvalues are limited to , because is positive and its norm is . An equation

is equivalent to

So we conclude that the eigenvalues of are in , the eigenspace for each is finite-dimensional, and the eigenfunction of span . If 0 is an eigenvalue, we call it . By allowing some positive eigenvalues () to be counted more than once, we can assume that the eigenspace for each positive eigenvalue is only -dimensional, and the totality of the eigenfunction for () is an orthonormal basis of the orthonal complement of in . We know that is finite-dimensional. On , we can define the inverse of by

We extend to all of by defining to be zero on . We call *Green’s operator*. The reason for the name is that is the inverse of the Laplace operator . Let be the orthogonal projection from onto . Then we have the identity

Now, we are ready to prove the closedness of . We go back to our earlier notation of in so that . Suppose with . We have

By applying to both sides, we get

Taking the inner product with , we obtain

Hence,

This shows that though in general we can not solve the equation because may not vanish, we can still always solve the equation

From

and Gårding’s Inequality, we conclude that

because

We go back to our earlier notation of in so that . Since , we have

An element of belongs to if and only if

because

This implies every element of is perpendicular to both and . Since

belongs to , it follows from that . Hence, . Since is bounded independent of , by the Rellich-Kondrachov Theorem, we can select a subsequence of converging to some in . Then , because

Hence, the image of

Now, we want to show that the cohomology group can be calculated by using -valued forms instead of -valued forms. For this we need a regularity result for the equation in . More precisely, we want to show that if the norm , which is equivalent to the sum of all the norms of derivatives of of order is finite, then is finite. Note that we assume already that the norms of both and are finite. First, we observe that if and and for some , then . The case is given right away by Gårding’s Inequality. For the general case, we take any vector field and apply the argument to and use induction on . Now, we come back to the equation in . Again, we look at first the case . By Gårding’s Inequality,

Likewise,

Hence, by the above observation, is finite. For the case of a general , again we take any vector field and apply the argument to and use induction on . In these arguments, we have been quite sloppy above justifying such things as integration by parts even though our forms are not smooth. The rigorous way to justify such arguments is to smooth out the forms first *in the graph norm* of the first-order operators by using the result of Friedrichs and get estimates on the approximating smooth forms using constants dependent only on our original nonsmooth form and then take limits at the end. A form is in if and only if it can be approximated by smooth forms in the norm of .

Now that we have the regularity result for , we consider the map from the space

to the space

We want to show that is an isomorphism. First, we show that it is surjective. Take with . We have

As we argued before, when we apply to both sides and take the inner product with , we conclude that

From and the regularity of , we know that is smooth. Since differs from the -closed smooth form by the -exact form , we conclude that is surjective. Now, we look at the injectivity of . Suppose and for some . Take the decomposition

It is clear that the three spaces , , are mutually orthogonal, because

From

we conclude that

So

Since is smooth, by the regularity of and

we know that is smooth and is smooth. Since is the of the smooth form , we conclude that is injective. From the above argument, we also see that both spaces

and

are isomorphic to . An element of is called a *harmonic form*. So we have proved that the cohomology group is isomorphic to the space of all harmonic -valued -forms. Since is finite-dimensional from the spectral theorem, we know that is always finite-dimensional for a compact manifold .

]]>

It turns out both Dirichlet’s unit theorem and the finiteness of class number, when looked at the right way, have all their “deepness” hidden in a notion of compactness in a certain lattice, and this can be made explicit using the adeles (roughly, a way to glue together all the places of a global field). This is the beginning of many different long, strange stories, none of which I really understand yet; hopefully, this will be the first of many posts detailing my exploration.

This post already assumes a basic familiarity with -adics, places, and valuation theory. A secondary aim of this post is to serve as an introduction to adeles and the adelic perspective. It is somewhat atypical, in that it will not prove or even discuss many of the elementary properties of adeles other guides will, but I think that this introduction via application and intuition is in many ways preferable. Optimally, it would be read in conjunction with a more conventional introduction to adelic number theory, which will march through all the basic properties I gloss over.

**I. Finiteness of class number and Dirichlet’s unit theorem, revisited**

Let’s reiterate the proofs of finiteness of class number and Dirichlet’s unit theorem quickly, phrasing them in a manner so as to emphasize the role of compactness. We’ll work over the (hopefully) familiar case of a number field , though will keep an eye towards the end goal of generalizing to any global field.

**Aside:** a *global field*, if you’re unfamiliar with the term, is either a number field, i.e. a finite extension of , or a function field of an algebraic curve over a finite field, i.e. finite extension of . This might seem pretty arbitrary, but there’s a reason for it beyond the (in)famous finite function field/number field analogy: the completions of such fields at their places give rise to all the local fields, which are precisely the locally compact topological fields. Working backwards, we can use this to give an abstract characterization of global fields in terms of the product formula for valuations. Indeed, I think this is probably the strongest justification for the number field/finite function field analogy in the first place.

**Theorem.** Let be a number field, with ring of numbers , and let be the associated fractional ideal group. Let the class group be defined as ; i.e. fractional ideals modulo principal ones. Then is finite.

**Aside:** this is also a familiar construction in algebraic geometry: the *divisor group* of formal linear combinations of height-1 prime ideals (so, all of them, in a Dedekind domain, but in general, codimension 1 subvarieties, geometrically speaking) by the subgroup of * principal divisors*, formal linear combinations arising from an element of (*principal divisors*), where we include in the linear combination each prime appearing in the factorization of the ideal generated by the element, multiplied by its multiplicity in the factorization. It is easy to see how this matches up with the more number theoretic definition. For “nice” schemes, including the ones we’re working with, the class group is actually naturally isomorphic to , the Picard group of “line bundles” (locally free rank one modules) over the structure sheaf of with tensor products, so the two are sometimes used interchangeably. In the global field context, “line bundles” are associated with fractional ideals, linear equivalence by multiplication by a principal ideal, and all ideals are flat modules, so tensor products are just ideal multiplication, so this correspondence is obvious.

**Proof.** There are two cleanly separated steps: to show that every ideal class contains an integral ideal with uniformly bounded norm, and to show that there are finitely many integral ideals with bounded norm. The latter is obvious for number fields, since a prime ideal lying over has norm divisible by .

The former is (generally) proven using Minkowski’s theorem, applied to the lattice created by the embedding into Minkowski space (considered as -algebras).

**Aside:** You will also see Minkowski space described more explicitly in a non-canonical way involving choosing one of each pair of complex embeddings, which is workable but less neat – the volume calculations of the fundamental parallelpiped in terms of the Lebesgue measure on the resulting space introduces an awkward power of two, which of course cancels out in the end, but is still bothersome. A canonical alternate description from Wikipedia is the pithy “fixed subspace of the embedding into under complex conjugation” (where is the degree of the extension), which is pretty misleading, since it means complex conjugation acting simultaneously on the coordinates themselves, and to permute the coordinates by acting on the associated embeddings. It is an easy exercise to see how this identifies with the tensor definition, and that the natural volume forms on the two agree.

Then it is straightforward to see that the square root of the determinant is the volume of the fundamental parallelpiped (considering as the lattice), and the lattice determined by any given ideal can be calculated in terms of its index (ideal norm). The standard way to finish from here is to take the lattice associated to the inverse of an arbitrary fractional ideal, then to use a suitably scaled convex body to find a lattice point corresponding to an element of that multiplies the ideal to become integral, yet has sufficiently small norm to uniformly bound the norm resulting representative of the ideal class. The technical details are boring and meaningless.

So what’s actually happening with this Minkowski space business? The perspective we need to take is that each coordinate corresponds to a distinct real/complex *place* of – that is, an equivalence class (equivalence under induced topology) of absolute values on it. In this case, we are only considering the *archimedean* (non--adic, with the Archimedean property) places, so by Ostrowski’s theorem there is a unique one on , corresponding to the usual absolute value. By extension, each real embedding and each Galois orbit of complex embeddings corresponds to a unique absolute value and hence place on general . Then looking at Minkowski space as a product of s and s, in general, the space should be a product of completions of with respect to its various places, with canonically embedded inside.

We will need to extend our theory of “norm” here, but the already-present absolute values are suggestive enough of the general idea that we can await for this to arise naturally in the adelic perspective later. More pressing is the need to connect this to the class group. Heuristically, the idea that we can find ideal class representatives with bounded norm will correspond to compactness of the class group, and finiteness of integral ideals with bounded norm will correspond to discreteness; the proper way to think of finiteness of , then, is as a general topological consequence of compactness and discreteness.

To precisely give the correct link between as a topological group and Minkowski space, we require adelic language and concepts, but we can give a preview here. Elements of the fractional ideal group, since we can write them as unique products of prime ideals, can be identified with a kind of “-adic lattice” of their valuations at every prime (or, their nonarchimedean places). (Equivalently, the fractional ideal group is the free abelian group generated by the primes – again, this is basically the idea of associated divisors from algebraic geometry.) So this group is already discrete! (Discrete under what topology? Well, I did say this would be an imprecise preview.) And one can clearly see the link between the discreteness of the prime lattice, and the argument we saw above for finiteness of ideals with bounded norm.

To get the class group, we just have to quotient out by the sublattice of principal ideals, induced by the embedding of . If we can prove the result is compact, we’re done – so we have isolated the compactness argument.

To do this, we find a continuous surjection from a compact space onto it: we use the subset of our lattice which corresponds to ideals with bounded norm! The role of the Minkowski’s theorem argument in our original proof is to demonstrate we can find this subset; this will be replaced by a similar adelic result later. The oddly circumspect route (finding a lattice point in Minkowski space corresponding to inverse ideal, then multiplying and seeing that the dependence on ideal norms cancels) of the classical proof will be streamlined in the adelic result, which considers the archimedean topology of Minkowski space and the non-archimedean properties of the ideals simultaneously, as we will see.

Adopting a slightly more general perspective on the ring of integers and the class group will help throw this into better relief, by way of explicitly involving the non-archimedean (-adic) places of . Let be a finite set of places of , containing all the archimedean places. Then analogously to , we define to be the ring of integers of localized at all the places of , so that the primes corresponding to non-archimedean become units. ( exactly the set of archimdean places corresponds to the usual .) We can analogously define the class group or . It is not hard to demonstrate the following exact sequence, reproduced from Neukirch’s wonderful book:

(Note: the middle term has a more natural expression adelically, which we’ll see later. Each summand is also isomorphic to , by the associated valuation; here’s our “-adic lattice”!) This sequence by itself shows that is finite given the result for the usual case. But our compactness approach generalizes easily to prove the result for all simultaneously. This is really just two slightly different algebraic manifestations of the same point of view, because can be identified with a quotient of the “-adic lattice” we talked about earlier, eliminating the dimensions corresponding to the places of . Hence in fact there is a surjection from the normal class group into the -class group, as we see in the above exact sequence, and we are given another hint of what its kernel, the middle term in the exact sequence, is adelically.

Notice this makes sense, since the -ring of integers has fewer ideals, since we turned some elements into units.

**Aside:** You may be wondering, as I did: is there a way for this to make sense if does not contain all the archimedean places? No, obviously not, stupid.

Our treatment of Dirichlet’s unit theorem will be brief by contrast, since the compactness and adelic viewpoint there are practically begging to be discovered.

**Theorem.** Given a number field with real embeddings and pairs of complex embeddings, is a finitely generated abelian group; in particular, it is the product of a finite torsion group (the roots of unity) with a free abelian group of rank .

We again have the classical embedding of and hence of into Minkowski space; take log absolute values of each coordinate (the kernel is clearly the roots of unity, which we can prove are finite in number with a separate argument). By considering the norm condition on a unit, the image lies on a codimension hyperplane. If we can prove that the fundamental domain associated to the lattice (the cokernel of the map) is compact, we have that it is a full lattice on this dim- hyperplane, so we’re done.

The usual way to prove this using Minkowski’s theorem, again. Every algebraic number theory text I’ve seen does this in a somewhat different (though, modulo convolution, basically equivalent) way, but many of them obscure what’s actually going on behind seemingly arbitrary ideas and strange constructions – except for Neukirch, another reason it’s the only ANT book worth its salt. The idea which Neukirch lays bare is that we directly show that the fundamental domain is compact by showing it lies in the image of a compact set, the same trick as our earlier proof.

To see how this works, let’s go back and look at our embedding more closely. What we did was form the chain , where the second arrow is taking logarithms of absolute values (and also where the roots of unity disappear into the kernel). Inside we have our norm-one hyperplane , whose preimage in is a codimension-one hypersurface , where the unit group is injectively embedded – indeed, we could have done the entire thing (besides throwing out roots of unity) sans log-linearizing by thinking of the unit group as a “lattice” (discrete abelian subgroup) on the abelian variety . So we construct our preimage as the compact “fundamental domain” of (modulo the unit “lattice”) inside Minkowski space, as this is the more natural setting to work in.

**Aside:** this is the reason why most books’ treatment of this material is very bad: the log-linearization is nice because it handles the roots of unity, and we get the lattice and hyperplanes, etc. to work with so we don’t need a theory of multiplicative abelian varieties and the “lattice” on it. So books just work everything out inside , often using superfluous linear algebraic ideas. But Minkowski space is the more natural setting to make the geometric/number-theoretic connections apparent.

The idea is that we simply take a “box” in where the absolute value of each coordinate is bounded by a certain number, such that the product of the bounds is large enough so we can apply Minkowski’s theorem. Then we take the union of translates of the box over a collection of algebraic integers which can correspond to every possible norm less than that product. (Again, we see the finiteness of integral ideals of a given norm in action.) The intersection of these boxes with our surface gives us our bounded covering of the fundamental domain, after a little technical legwork, and we are done.

To generalize to -units , the idea is very easy: just add on the part of the “-adic lattice” corresponding to the non-archimedean places of onto Minkowski space, and repeat an almost identical argument. Indeed, this is basically captured by the exact sequence from earlier: there is an injection , whose image is the kernel of . This latter map has a finite kernel since the class group is finite, so the kernel only affects the torsion part of . Hence the rank of the free part is just (since and account for the archimedean valuations, and the primes account for the non-archimedean).

In summary, we saw that finiteness of class number and Dirichlet’s unit theorem have proofs which can be seen in a very similar way, as the compactness of a quotient of the “-adic lattice” and the compactness of a quotient of a hypersurface in Minkowski space, respectively. Look back at our exact sequence and otice that extending these results to -rings entails “subtracting dimensions” from the former and “adding dimensions” to the latter corresponding to the non-archimedean places of , a clear complementary relationship. In the adelic formulation, which glues together the archimedean places of Minkowski space with the non-archimedean -lattice, then, we should expect these to correspond to the two ends of an exact sequence in which everything is compact, which is indeed precisely what will happen.

**II. Adeles**

Let be a global field. The **adele ring** is simply what we get when we take a product of all the completions of with respect to its various places – almost. Actually, we have to take a restricted product in the following sense: let be the completion of with respect to place/valuation ; then the restricted product is the set of all elements of the full product for which all but finitely many of the coordinates are in that coordinate’s respective discrete valuation ring (with nonnegative valuation; so e.g. the -adic integers for the place of ). So it’s somewhere in between a direct product and a direct sum.

Note this restriction only makes sense for non-archimedean places, because archimedean places don’t give rise to discrete valuations rings. But this is fine, because there are always only finitely many archimedean places: this is well-known for number fields, and for function fields of algebraic curves over finite fields, there are no archimedean places, and indeed the concept is not even really applicable.

**Aside:** Here’s a somewhat involved explanation of what’s going on with these various valuations; this is very skippable. The fact that there are no archimedean places of function fields over a finite field is an immediate consequence of the general version of Ostrowski’s theorem, which says that all such places are inherited from an embedding into , i.e. there are none when the base field has nonzero characteristic. Indeed, in general, when we have a function field over , we enforce valuations of the transcendental extension to be trivial on the latter (though this is only an extra restriction in the zero-characteristic case). What happens is that we have the typical places corresponding to the prime ideals of (generated by irreducible polynomials), and then a number of infinite places.

The easiest way to see the infinite places is when our function field is precisely . Then there is just one infinite place, which is given by the total degree (numerator minus denominator) of a rational function. This might seem out of place and similar to the exceptional archimedean places of a number field, but in fact it is no different in character from the other places! To see this, consider how the map permutes the places – one can consider the infinite place generated by the “prime” ! This is legitimate because taking amounts to just choosing a different valuation ring inside , which was an arbitrary choice to begin with. Heuristically, in the algebraically closed case, can swap the infinite place with any one of the finite places (though this doesn’t actually work in practice unless we pass to completions); the infinite place is hence associated to any of the “primes” , since they are the same asymptotically. In the case, we can actually take as the uniformizing parameter of the associated DVR.

To put this precisely, when is algebraically closed, the projective linear group acts freely and transitively on the places of . In fact, we can see this very geometrically when : the places of trivial on can be identified with plus a point at infinity – nothing more than the projective line ! This can be extended to (projective) complex algebraic curves to give an abstract nonsingular model of all such curves in terms of their discrete places, the work of Andre Weil in the pre-EGA era. We can push this to higher-dimension varieties with function fields of higher transcendence dimension, but the theory becomes significantly less nice.

It is harder to get a geometric intuition, but this applies just as well to positive characteristic. If we take , the result still applies and we obtain projective curves over . The places over a finite field, like the places over e.g. , have a more complex structure since there are places of different “degrees,” corresponding to irreducible nonlinear polynomials. Then the resulting “geometry” has not just the base projective curve, but various higher-degree points which correspond to orbits under the absolute Galois group. This is again a spectral space; in fact still the projective closure of of the function field. Notice in analogue with the classical theory of algebraic curves that this interpretation in terms of projective closure gives a geometric reason why we need to take $S$ to be nonempty even in the function field case: if $S=\emptyset$, then we just get the full divisor class group of a projective curve over a field, which is $\mathbb{Z}$ by the degree homomorphism, so our result about the finiteness of class number doesn’t actually hold.

The asymmetry of archimedean places in the two types of global fields makes the arithmetic case more difficult to work with (among other things). Intuitively, the archimedean-ness of exceptional places of means unlike the function field case, in which we can take all the non-archimedean places in the nice form of a projective closure, it isn’t easy to make a complete scheme. This precludes us from using certain tools which exist in the highly-structured, close-to-analytic, projective context. (This is a common theme in the global field analogy; e.g. there is no known arithmetic derivative with nice properties either, which is a major roadblock to e.g. the abc conjecture. I wonder if there’s actually a way to connect these ideas of lack of analyticity.) This line of thinking is the beginning of Arakelov theory.

This allows us actually to give a tensor-product definition of adeles, as , where is the discrete valuation ring associated to the completion at , and the product is split up into the archimedean and non-archimedean places. This isn’t particularly important; I just thought I’d include it to show the usefulness of tensor products and demonstrate that we can give a definition without the somewhat arbitrary-seeming restricted product.

The adeles have a natural topology as a profinite group, since before tensoring with , the non-archimedean portion of the product is naturally the profinite completion of . Then we take the product topology with the usual topology on the archimedean completions (if applicable); this gives the **integral adeles**. In the full adele ring, after tensoring with , we consider the integral adeles to be an open subring. Note this is much finer than the subspace topology of the product topology on the unrestricted product.

Another way to look at this for the categorically is to see as glued together from products where a fixed finite subset of the terms are allowed to be the whole field, which is a useful perspective because it allows us to reintroduce the concept of – finite sets of places containing the archimedean ones. Indeed, we can set the **-adeles** , with the product topology. Then there are obvious inclusion morphisms from -adeles to -adeles, where , so we get a directed system, and can set as the colimit .

**Aside:** If you’re unfamiliar with the colimit (a more general categorical term for what is referred to as a direct or inductive limit in narrower settings), just think of it as “gluing together”. All the objects in the directed system (the partial -adeles) are like pieces of the whole adele ring, and the further down the chain you go (the bigger is) the closer you are to the full ring, since we want *any* finite collection of places to be the full field, not just fixed . There is a dual categorical concept called the limit (or the projective limit, or the inverse limit) which “glues together” in a slightly different way, by a kind of approximation. The best references in this case are actually Wikipedia: 1, 2. An important element (or, really, the defining element) of these two constructions is their universal property: for an inverse limit, every object with maps into the objects of the directed system (which commute with the maps between them) sees all those maps factor through the inverse limit, and for a direct limit, every object with maps from the inverse system sees all those maps factor through the direct limit.

If you’re familiar with the colimit, you might be wondering about the strange juxtaposition between finding the adele ring to be the colimit of the -adeles, while simultaneously having the “main chunk” of it be a profinite group (and hence a limit). There’s not really a reason here; the two limit/colimit constructions are basically unrelated. In some sense, the -adele direct limit is a pretty trivial gluing which just propagates the -coordinates to whatever finite coordinates we want, while the profinite completion of structure actually constructs the heart and soul of the adeles by bringing out the non-archimedean places.

However, there is a great example of limit-colimit adjunction related to the adeles, in the identification of with the solenoid on one hand and with the character group of on the other. See Keith Conrad’s notes for an exposition of the latter identification.

In this way, the adeles become a locally compact topological group; this is not difficult to prove with a general topology argument. Multiplication is also continuous, so they are in fact a topological ring. Notice we have a natural embedding given by taken the image in each completion, which sits inside the restricted product roughly because there are only finitely many places in the denominator of an element of .

The following result is fundamental. Not strictly necessary for our purposes, so it will only be a sketch, but it’s a good illustration of the kinds of arguments the topology of the adeles give us:

**Theorem.** The image of is discrete and cocompact in . (Cocompact: the quotient is compact.)

**Proof.** For the discreteness, it suffices to find an open neighborhood isolating , since we can translate it using the group operation. This is very simple; bound the norm of each coordinate by ; strictly in the -norm case. Check that this is open, and that this isolates zero (break it into the number field and function field case).

For cocompactness, we need to find a compact set which maps surjectively to the quotient in ; that is, a compact set with a representative of each equivalence class. (How familiar!) We can take actually the same open set as above but with the -norm restricted to instead of , non-strictly, since then we have a product of compact balls coordinate-wise which is hence compact by Tychonoff’s theorem. This amounts to saying that there is an element of which matches all the valuations fairly well, which is a consequence of the well-known weak approximation theorem for Dedekind domains. See here for an exposition of this theorem which mixes a sorta-pre-adelic viewpoint with a classical perspective.

There are many other properties of adeles (self-duality with character group, realization as a solenoid, strong approximation, etc.) which are important in the theory, but we will not cover them here. Consult Pete Clark’s notes for a sample.

Now we pass to the idele group. Element-wise, it’s just the unit group , but the topology is *not* the subspace topology, since, again, inversion is not continuous. The standard way to remedy this is give it the finer induced topology as a subspace of , identified with ordered pairs with .

This seems rather arbitrary, but actually it’s just a case of poor exposition – this is actually the natural topology of the ideles when identified with .

embeds into in a very understandable way.

**Theorem. (Product formula)** Let the adelic (or idelic) norm on be given as , and let be the subgroup of given by elements of norm . Then the natural embedding of into (coordinatewise, since there are of course natural embeddings ) lands in .

(Note that this makes sense for global fields, since by construction there are only countably many places, and eventually all the norms are at most .)

(Note two: the absolute value on -adic completions is familiar for number fields, being on , and being the normalized version of this (extension of norm with “denominators cleared”) in general. For function fields, it’s similarly defined as some constant , but the constant is uniform across all primes, and basically arbitrary. It is usually taken as the size of the finite field we are working over – I’m not sure if there’s any time this choice is actually significant.)

**Proof.** This is of course just a fancy way of saying that (notice this is actually always essentially a finite product) for all , where the product is over all places. A moment’s thought on the case of will give an idea of “why” this is true – the -adic valuations will switch the powers of between the numerator and the denominator, so together they will cancel out with the infinite place.

There are two common ways to prove this: the elementary method is to use the theory of norms of field extensions to reduce the problem to simply the cases of and , which are then obvious by explicit reasoning as above. There is subtlety involved in the calulations; recall the note above on the correct normalizations of the absolute values involved.

The other method is using something called the Haar measure, which you may have encountered before if you have done any study of locally compact topological groups. It is simply a measure (in the sense of measure theory) adapted to groups in the most natural way. This is the last piece of set-up we need.

**Definition.** A **Haar measure** on a topological group is a measure on the Borel algebra of its open subsets which is invariant under translation, i.e. for all open . (Notice : we’re dealing with an abelian group under addition, which makes our life a bit easier, since right vs. left translates aren’t a thing.)

The key thing that one needs to know about Haar measure is Haar’s theorem, which states that for locally compact , there is a unique (up to multiplication) nontrivial Haar measure satisfying some nice properties – inner and outer regularity and finiteness on compact subsets. The only one really relevant to us is the last one. There are more thorough treatments of the (in my opinion, tedious) theory behind Haar measure readily available; I like this more algebraic treatment by Terry Tao more than some of the others, but you can look around if you’re interested in technical tedium.

The point is, there is a Haar measure with these nice properties on , and it is an extremely important part of adelic theory. We’re not going to really do too much with it, but the ability to integrate over the adeles allows one to do quite a bit of impressive (cf. “beautiful” – Jack) analysis in the unified adelic context, which is the the basis of Tate’s famous thesis.

Denote by the Haar measure on , scaled (since uniqueness is only up to a constant) so that the induced measure on the compact quotient (“fundamental domain”) is , so that the measure “agrees with the counting measure on ” in a nice way. (Think in analogy with a measure on with the typical lattice points with fundamental domain the unit square, if it helps.)

The following lemma is the last piece of “niceness” we need to know about adelic constructs to proceed to the main proof:

**Lemma.** For any , .

**Proof.** This almost follows from the analogous statement, of scaling by , on the natural Haar measure (think -adic metric topology in the non-archimedean case, think Euclidean topology in the archimedean case) of each factor, which is easy to prove directly – you should work this out yourself if this doesn’t seem natural to you. This is, intuitively, breaking the problem into local cases at each place. (“Local” in the same sense as “local field”!) If were a true product space so that the Haar measure could be taken as the product of the individual Haar measures, we would be done – however, recall it’s actually a restricted product, so there is subtlety involved.

Instead of naively taking the product of the measures associated to our familiar -adic and Euclidean topologies all at once, we have to build it up through finite approximations, by using our construction from earlier, each of which is a genuine product space of locally compact topological groups, where the factors simply are simply given the restricted Haar measure, where they have volume . Hence we have natural Haar measures on each of them which are genuine product measures. Then it is a theorem (which is not actually difficult to prove, given uniqueness of Haar measure!) that the pullback of the measure by the inclusion restricts it to . More conceptually, this is saying is that we can construct the Haar measure through the direct limit, because the category of measured locally compact topological spaces has all limits.

In particular, for any particular , simply consider some “big enough” so that all the places where has nonzero valuation are included – possible by definition of the adeles. In particular, we can choose so that . Then the scaling effect of in is clearly (because ), and since and is an open subgroup of the whole adele ring, this coincides with the general scaling effect.

The product formula is immediately a trivial consequence of this lemma. (This is the other, more conceptual way to prove it.)

But more importantly, with the topological and metric machinery all set up, we’re ready for the main course.

**IV. Compactness, and two finiteness theorems**

The ultimate result will be this:

**Theorem.** The following statements are equivalent, for any nonempty finite set of places containing all the archimedean ones:

1) is compact.

2) is finite and the rank of is .

This begs the question: what exactly is the topology we’re considering ? Is it inherited from the ideles or the adeles? It turns out that the subspace topologies inherited from the ideles and the adeles coincide, and this is important. But more on this later.

Let’s work backwards and put the finiteness of class number and Dirichlet’s unit theorem into adelic terms.

Let be a nonempty finite set of places, containing all the archimedean ones. (By automorphism, we can actually assume that it contains the infinite place of a function field.) To get a handle on , we identify the mythical “-adic lattice” we discussed so much earlier: it is nothing more than ! Indeed, we need a group in which each element is identified with a certain valuation on every place not in . In , the places in are indeed quotiented out, since the -adeles have in the product for . On the other hand, the places not in only have a factor in the -adeles, which becomes just the -units upon passing to the unit group. Hence the quotient on those coordinates can be identified simply by giving the valuation of the element. Finally, finiteness is assured by the definition of the adeles.

But then immediately, from our earlier discussion, we obtain the isomorphism . (Exercise: put our informal discussion rigorously by giving the explicit isomorphism.) We thus need this group to be compact and discrete.

It will help to write the group a different way. It is in fact isomorphic to ! (Here, we work *a priori* with the inherited topology from the ideles, since we don’t know that the induced topologies are the same yet.) Indeed, we need only prove that modulo , we can always find a idelic-norm- representative. This is obvious for number fields, since we can alter the archimedean places however we want to achieve overall norm . It is only slightly less obvious for function fields. Since the absolute values in the function field case are all just exponentiated valuations with the same base, all we really need is to have the valuations sum to zero, which is of course possible: since is nonempty, alter any given representative of a class to have whatever valuation you want at a place of by multiplying by an adele in with the appropriate valuation there, and units everywhere else.

**Aside:** A not entirely obvious point is that we can find a uniformizing parameter at each non-archimedean place: that is, given non-archimedean , some with . This is of course necessary to get arbitrary-valuation elements in . Think about the case of a number field for simplicity: if our place corresponds to a non-principal prime like , it’s not trivial that there is a uniformizing parameter.

To see that this is indeed the case, think about it this way: for a non-archimedean place corresponding to a prime , we have a ring of integers with respect to in , . It’s not hard to prove that this will be a Dedekind domain for any global field. Then to get a uniformizing parameter, it suffices to obtain a uniformizing parameter for the localization , since the field of fractions of the completion of this ring (under the adic topology) is . Hence we simply need to prove that this is a DVR, or equivalently a PID since it’s a local ring. But the fact that localizations of Dedekind domains are DVRs is a standard algebraic fact, e.g. see here, so we’re done.

An intuitive way to think of this is as follows: in a Dedekind domain, we have unique ideal factorization. If we have an element so that the exponent of in is , once we pass to the localization, will be a uniformizing parameter! So now it suddenly seems very plausible that we can find such a , since we just need to make principal by combining it with other primes. But given finiteness of class number of global fields, every prime is torsion with respect to becoming principal, so we can just find a bunch of other ideals not divisible by but in its ideal class, which when multiplied together, give our desired principal ideal and uniformizing parameter.

Obviously this is not a way we should actually prove it, because it is circular logic (in the context of this blog post), and more generally, uses a more powerful result to prove a less powerful one. (A specific case of a less powerful one, no less – notice this proof doesn’t work for general Dedekind domains, which can have infinite class group!) But hopefully it is a useful way to think about it.

**Aside double combo:** The group is, importantly, not necessarily compact. It is known as the idele class group (notice the analogy with the ideal class group, which is of course a quotient group), and is very important in global class field theory. Indeed, the (arguably) fundamental theorem of the theory is that finite-index closed/open (equivalent, since the topology is profinite) subgroups of the idele class group are in bijection with finite abelian extensions of (inside some separable closure of the field). The quotients by these subgroups are the Galois group of the corresponding extension. In fact, the -class groups provide examples of these; each of them gives an extension of ! What a world.

**Aside triple combo:** A little more about arithmetic surfaces: taking the number field-function field dictionary even further, the fact that we have the isomorphism correspond to the fact that *for global fields, the connected component of the identity in the Picard group is the entire group*. This connected component is generally denoted for a scheme . This connected component on traditional algebraic curves is also known as the Jacobian , which classifies degree line bundles. So what we are really seeing is that ideal classes are isomorphism classes of line bundles, and that all of them are of degree . This is, heuristically, the product formula for global fields.

Disclaimer: terminology is used loosely in this aside, and I’m not sure if everything said here is *technically* true. But they are all things which *should* be true.

So now compactness is easy to deal with: is obviously a quotient group of . So we immediately have half of one way of our implication: the latter’s compactness implies the the finiteness of class number.

One last thing on class number: can we relate this to our ad hoc Minkowski argument for number fields? Sort of; the surjection from a bounded subset of the Minkowski lattice is a bit like our isomorphism with the adelic-norm- version. Of course, the latter doesn’t actually bound the norms of the archimedean places, since the non-archimedean ones still can have arbitrarily small contribution, which is why there’s still work to do.

Before tackling Dirichlet’s unit theorem, we are finally forced to prove the technical lemma regarding ‘s inherited topology.

**Theorem.** inherits the same topology as a closed subgroup (resp. subspace) of the ideles and the adeles.

**Proof.** First we check that is a closed subgroup of the adeles – the topology of the ideles is strictly finer, so this will suffice for both closedness statements. Let . Take a finite set of places so . In particular, contains all the places of where has norm greater than in that place.

If the adelic norm of is less than , then take open balls in every place where with radius as close to the respective norm of in that place as possible. By definition of adelic norm, if we take larger and larger finite subsets of the remainder of the places where has norm less than , . The product of all these open balls with the unit balls in the remaining places (just all the rings of integers so it’s a genuine open subset of the adeles) is by construction an open neighborhood of disjoint from the unit ideles.

If the adelic norm of is greater than , it’s a little trickier. We use the convergence of the product to see that we can take a sufficiently large (with, minimally, the same stipulations as earlier) so that for any outside of this set, the exponent of the valuation (that is, the such that ) is greater than : this is clear because of the increasing exponents of number fields, and the fact that function fields only actually have finitely many places to begin with. We can similarly make large enough so that (defined in the obvious way) is in . Once again by taking sufficiently small neighborhoods in the places of of and allowing elsewhere, a simple computation shows that if any place in this neighborhood outside of has norm , the whole idele has norm less than , and otherwise of course the idele has norm greater than . So we again have an open bounding away from the unit ideles. Hence between our two cases, we have the desired closedness.

(There’s some philosophizing to be done about the dichotomy of the approaches in this last paragraph between the function field/number field case once again, but I’ll leave you to think about it, since I don’t have any particularly fresh points to make.)

Since we know that the idelic subspace topology on is *a priori* at least as fine as the adelic subspace topology, it remains to show. So let be an open subset of the unit ideles in the idelic topology; we need to show that it contains an open subset in the adelic topology. We can assume WLOG that contains by multiplicative translation (which is continuous even in the adelic topology), and by contraction that is a product of neighborhoods for for some finite set of places , with in every other coordinate.

is an open set in the adelic topology. If we intersect with , we don’t necessarily get something in since the coordinates outside of might not be units. But we can ensure that they are if we make the -coordinates have sufficiently small norm, so that the non--coordinates must have exactly norm (rather than just , as is *a priori* the case) to ensure it lies in the unit adeles. But this can be done by just shrinking the s suitably. The details of the computation are left for if you’re into that kind of thing.

Now Dirichlet’s unit theorem falls out easily. In place of the log embedding of just multiplicative Minkowski space, we have an “-log embedding” in general: , given by taking . We can then take the composite , where the first map is the obvious map in to the -places.

Obviously, the image of the first arrow lies in , which maps to the hyperplane with coordinates summing to zero. Then as before, we need to make a few claims to show this does what it’s supposed to. We only sketch the proofs, since they’re all straightforward:

1) The first arrow is a discrete embedding, and the composite image is discrete as well.

That it is an embedding is clear enough, since it’s an embedding in each coordinate. Hence it suffices to establish discreteness. By the previous theorem, it suffices to prove that the “additive version” is discrete (check out the subspace topologies yourself). In other words, we need a finite neighborhood of which contains finitely many elements of . It is easy to explicitly construct such a neighborhood, but also note that it follows from discreteness of in by passing to open subrings and then dropping the other factors. (Actually we also easily find that the analogous statement about cocompactness of in is true as well.)

To prove the composite image is discrete, we note that the map is proper: take a basic compact set; the product of bounded closed intervals bounded each above and below. The result is simply an annulus in (a compact closed ball minus an open ball), so the result follows. Then the image of a discrete group under a proper map of locally compact topological groups is discrete (easy result; prove this! it’s not true for general spaces), so we’re done.

2) The kernel is finite and torsion.

We saw in the number field case that it was precisely the torsion elements; the roots of unity in , which are finite in number. The kernel for the function field is the (multiplicative group of) functions in (which, as you should’ve intuited, are functions generically regular, with their only possible poles at points corresponding to places of ) which are also regular and nonzero on the places of . But then these functions can have no poles anywhere, hence no zeros by the product formula, and hence are nothing more than constants in the coefficient field . So again: finite, and torsion.

**Aside:** It feels a little weird to call the nonzero coefficients of the function field case “roots of unity,” since it’s obvious *a priori* that all nonzero coefficient field constants are roots of . As it turns out, you should get over that weirdness, because this link between the units of the coefficient field and the roots of unity turns up in plenty of places. The number of these roots of unity, with degree relatively prime to the characteristic associated to the place, in an archimedean completion is always one greater than the residue field (think about what this means for, e.g., a -adic extension, vs. the function fields – it’s almost vacuous in the latter case).

In class field theory, cyclotomic extensions and extensions of the coefficient field (for finite function fields) both give abelian extensions for their respective global fields. The celebrated Kronecker-Weber theorem tells us that *every* abelian extension is contained in a cyclotomic field, and Hilbert’s twelfth problem asks us to generalize this to arbitrary number field; essentially, we need to find nice descriptions of the maximal abelian extension . Besides special cases where nice abelian varieties (like the unit circle!) show up in the form of complex multiplication (read Kevin’s ongoing posts), the problem remains open. (Actually, there’s a nice adelic proof of Kronecker-Weber, using the idele class group machinery mentioned earlier. This even gives a description of in the general case, though the abstraction of the description is not particularly useful for computation.) But for function fields, the correct formulation has been found in the Kronecker-Weber-Hayes theorem: it’s not an exact analogue; the maximal extension of the “roots of unity” coefficient field is not the whole story, but it’s an important piece.

Finally, Pete Clark claims that counterexamples to one of Malle’s asymptotics conjectures in the number field/function field correspond to these extensions arising from “roots of unity,” but I can’t really find anything on the function field case.

3) The quotient by the composite is compact.

This is really the crux of it, since with this, we obtain that the lattice is full rank in the hyperplane, so we finally get that the image of the unit group, hence the unit group, has rank . Denote by the hyperplane in . Then it’s not hard to see we can pass to the restricted quotient map .

Dirichlet’s unit theorem is now that the target is compact. We want to prove that this is equivalent to the source being compact, to get closer to our desired equivalence into adelic language. It’s not hard to check that , like the map it’s descended from, is proper, so one direction is quick. In the other, the image of the target is compact by general topology, hence a closed subgroup in the Hausdorff target. It is a classic fact (and a good exercise) that if we have an exact sequence of locally compact topological groups, compactness is “additive in exact sequences”: what this means in this case is that is compact if and only if and the quotient

is compact. Now we really just need to get our hands dirty and show that contains linearly independent vectors. This is a bit of straightforward greasework, reminiscent of the classical proof, which I will leave to you.

Hence this last point (3) is equivalent to the statement “ is compact.” We take it just one more step and note there is an obvious quotient map with kernel the product of all those other unit groups not in , which is compact by Tychonoff’s. So we are reduced from Dirichlet’s unit theorem to compactness of .

Now, finally:

**Proof of equivalence.**

**V. Adelic Minkowski**

Let’s take a big-picture view of what we’ve accomplished.

The class group is seen to be precisely a quotient of the (discrete) -adic lattice, which has a natural adelic description as . We are able to find norm- representatives of every equivalence class, so the quotiented -adic lattice is also . Hence finiteness of class number is equivalent to compactness of this.

Geometrically, it is obvious that the unit group (once you remove torsion) embeds nicely as a lattice in a hyperplane of dimension , so the Dirichlet unit theorem is equivalent to the quotient of the hyperplane by the lattice – which can be pulled back to the quotient of the abelian variety . (Indeed, as in the classical case, everything can be done here, on the quotient of the “hypersurface,” with a little more care and theory, but the embedding provides a nice way to get rid of the torsion and put it in a linear algebraic context.) This is the “fundamental domain” of the adelic-norm-1 hypersurface under the lattice given by the unit group. Tacking on the (compact) product of the unit balls of all the places not in allows us to replace this with the more adelic (a “fattened fundamental domain”), and so we are reduced from Dirichlet to the compactness of this last group.

We are then able to glue these two groups together (in the sense of short exact sequence, -style gluing) so that their individual compactness was equivalent to the compactness of their gluing. On a technical level, it’s clear that we have the short exact sequence where the glued object is . But intuitively, why do the “fattened fundamental domain” and the “quotiented -adic lattice” fit together nicely like this? Well, the latter’s lattice structure is precisely on the places not in ; one can imagine it as a finite torsion grid-like structure along those coordinates. The former has the “fattening” – more or less extraneous product of unit balls – in the non- coordinates, to fill in the gaps of the grid, and then a simple fundamental domain along the dimensions/coordinates corresponding to the -places. It also as hypersurface of sorts in the full idelic space, quotiented by the lattice . The first arrow in the exact sequence is then just the immersion of a fundamental polygon which is extant along some coordinates (the places of ) but not along others (the complement).

At long last, it’s time to actually prove that any of these things are actually compact. And naturally, we will need a Minkowski lemma of sorts.

**Theorem. (Adelic Minkowski)** For a fixed element , consider the set of adeles with at every place . Then there exists a constant depending only on the global field so that for any with , contains a nonzero element.

**Proof.** This is basically an exercise in manipulating Haar measure, but the idea of the proof is very geometric (as one would expect, since it’s basically the same as the proof of classical Minkowski). It’s not essential to understand the technical details, but the main thread should be easy to follow.

Let be the nicely scaled Haar measure we had from earlier. Let be the product of the closed unit discs in the non-archimedean places, with the unit disc of radius (diameter ) in the archimedean places: the key is that any two elements in can’t differ by more than in valuation at any place. is compact and contains an open neighborhood, so is finite and nonzero.

Take , and with . It is fairly clear then that to prove our desired statement, it suffices to prove that contains two points with the same equivalence class modulo .

Indeed, we have that (by the lemma about scaling effect of multiplying by constants on the Haar measure from way back), and if we take the projection map , we can proceed with a simple volume argument:

where we implicitly exchange a sum with the integral, and abuse notation to let denote both the Haar measure on both the adeles and the induced one on the fundamental domain. Hence if all the are , ; contradiction.

Take a moment to note how the above proof is pretty much exactly like the proof of classical Minkowski. Really, all we’re doing is changing the lattice, if we just broaden our understanding of “lattice” to “discrete cocompact subgroup of a locally compact topological abelian group with a nice norm associated to its Haar measure”. Indeed, it is possible to generalize the statement to this context, which then includes all versions of Minkowski.

Let’s wrap things up.

**Theorem.** For a global field , is compact.

Proof. Recall that the unit ideles inherit the topology from the adeles. So we just need a compact set of which projects surjectively onto . But just choose to be the set of adeles bounded at each place by the norm at that place of , where as in the adelic Minkowski lemma. Then for any unit idele , there is some nonzero so that by the lemma, so projects onto .

Dirichlet’s unit theorem and the finiteness of class number are immediate consequences.

Let’s conclude with a sort of bird’s-eye view of what this all means. It will be helpful to have read the asides to understand this.

So hopefully we have answered Sameer’s question adequately: the importance of Minkowski’s theorem and lattice geometry is not an accident; a general notion of “lattice” is extremely important because this idea manifests itself in algebraic/arithmetic groups suited to analyzing global fields, like the adeles/ideles. The classical “geometry of numbers” and Minkowski’s theorem arise because of the special phenomenon of archimedean places of number fields, whose corresponding completions give us copies of and . When analyzing , which is the ring of integers corresponding to just the archimedean places, Minkowski space then naturally becomes an important object of study. and have special structure that can be exploited to obtain results in this manner.

On the other hand, the presence of these places is a difficulty in some ways, since unlike the function field case, these infinite places can’t be thought of “projective completion” of the spectrum of a number field, and behave very differently, which is why a lot of results are much easier for function fields than number fields. (I mean, the Riemann hypothesis for one.) The main working approach to this is the same attitude which inspired the geometry of numbers: to work in the archimedean places, take them as they are, and use their familiar analytic structure in conjunction with the nicer non-archimedean places to make progress. This is the foundation of Arakelov theory, a modern approach to number theory. We can define Arakelov divisors, which are like elements of our “-adic lattice” with archimedean real/complex components tacked on, and even the Arakelov class group, a kind of “fattened class group,” which very much resembles in spirit our “fattened fundamental domain” from the adelic discussion of Dirichlet’s unit theorem. (Remember “obviously not, stupid?” Sorry, that was a lie; that’s precisely what this is.) We can obtain analogues of a lot of algebraic geometry this way, including Riemann-Roch, sheaves, and intersection theory.

Relatedly, but with almost the opposite attitude there is a view that there should be a **field with one element**, a mythical object over which would be a curve, whose projective completion would somehow bypass, or perhaps reveal hidden depths of, the archimedean obstacle. The field with one element is a fascinating aspect of mathematical folklore which has many, many more interesting hypothetical connections and properties than the ones arising here; see the Wikipedia page and MathOverflow for more. It should be noted that quite a few pretenders exist, but none are fully satisfying.

Some of the terminology used here was nonstandard; to further get into adelic applications in class field theory and Arakelov theory, some of this should be clarified. An “-class group” is more typically described as a **ray class group** – though the latter concept is slightly more general. A set of places is a kind of **modulus**, which figures in Artin reciprocity, a fundamental result of global class field theory. Moduli can count places with multiplicity, however.

Huge credit to Brian Conrad’s notes, from which the bulk of this material was taken. Indeed, this post is basically a more fleshed out version of those notes, recast as a general philosophical overview/introduction to adeles.

]]>

**Definition.** A *vector bundle* over a topological space is a (continuous) map satisfying the following properties:

(1) For any , the pre-image is homeomorphic to . (The pre-image of a point is called a *fiber*)

(2) (*Local Triviality*) For any , there is an open neighborhood around that point such that is homeomorphic to

is called the base space and is called the total space.

Examples of vector bundles are fairly easy to come by. We have, for example, the tangent bundle of a manifold, which can easily shown to satisfy the two conditions above. We will re-visit tangent bundles several times in future blog posts. Another important example is the Möbius bundle over . Intuitively, this assigns a line to each point on the unit circle in such a way that it “twists” once before it comes around the circle (geometrically, this would look like a Möbius strip, hence the name). This example is important because it is one of the easiest example of a non-trivial line bundle (that is, it doesn’t look globally like ).

A mild technical note: in general, we will be assuming that our base spaces are compact and Hausdorff, though a weaker assumption (such as paracompactness) is generally sufficient to get all of the properties that we want.

* * *

While there is a lot more to be said about the theory of vector bundles (which will be the case in future blog posts), instead I will focus on two related notions. The first is the theory of fibrations. This generalizes the idea of a vector bundle in a way that yields useful results in homotopy theory.

**Definition. **A *fiber bundle* is defined in the same way as a vector bundle, but instead of requiring that the fibers be homeomorphic to , we allow them to be any topological space . (Of course this also has to satisfy the local triviality condition)

While fiber bundles are useful, we want a slightly more specific homotopy theoretic property:

**Definition. **A map is said to satisfy the *homotopy lifting property* if, for any space , a homotopy lifts (not necessarily uniquely) to a homotopy satisfying . Such a map is called a *fibration*.

A reader familiar with some of the basic ideas of homotopy theory might recognize something familiar in these definitions: the idea of a covering space satisfies these properties exactly. An -sheeted covering space if precisely a -fibration. The most important property of fibrations (from the viewpoint of homotopy theory) is the following:

**Claim.** Given a fibration , there is a long exact sequence of homotopy groups:

The first two homomorphisms are the obvious ones induced by the fibration maps. The third one can be obtained through some diagram chasing (in a way analogous to the Snake Lemma).

**Example.** The *Hopf fibration* is historically an important example of a fibration. We can think of as . In an analogous way we can define as equivalence classes of pairs of complex numbers with if and only if . We can think of this equivalence relation as sending a pair of complex numbers to their quotient with an additional point at infinity – the one point compactification of . This, then, naturally induces a map sending a pair of complex numbers to their equivalence class. The pre-image of a given point is the set of complex numbers with norm 1, which is precisely . This is the Hopf fibration, and it leads to the following result on homotopy groups:

**Claim.** For , we have . In particular, .

**Proof.** We can directly apply our long exact sequence, along with the fact that al higher homotopy groups of are trivial, to get the following short exact sequence:

Because this is exact, the groups must be isomorphic. Using the fact that for all yields the desired result.

This is just one example of the usefulness of fibrations. (Ironically, from the perspective of homotopy theory, vector bundles are one of the least interesting example of fibrations because the homotopy groups of are not of much interest.)

* * *

The final area I would like to discuss is the theory of classifying spaces. First, we will explore a bit more about the theory of vector bundles:

**Definition.** Define the *infinite Grassmanian* to be the set of all -dimensional subspaces of . We can also define it as a limit of with the weak limit topology. We will denote as . There is an analogous construction in the complex case, which we will denote .

For the sake of simplicity, we will be working with complex vector bundles. The reason that this case is simpler is because every manifold has a unique complex orientation, so we don’t need to worry about issues of orientability.

**Theorem.** There is a bijection between complex line bundles and maps .

The reasoning behind this statement should be apparent: we are assigning to each point in a point in , which is precisely an -dimensional vector space. We would like to generalize this construction to fiber bundles where the fiber is any topological group. (This is a generalization because we can think of a vector bundle as having fibers in the infinite unitary group ). Unfortunately, solving this problem for any topological group is extremely difficult. However, if we assume that we are working with a discrete topological group, then there is a solution. As it turns out, is just an example of what is known as a *classifying space*:

**Definition. **Let be a discrete (topological) group. Then define a space called the *classifying space* of to be a topological space such that and all higher homotopy groups are trivial.

Of course, the task of constructing such a topological space is non-trivial. One solution is to resort to the Eilenberg-Maclane space . this certainly satisfies the definition. However, an observant algebraic topologist will notice that this definition only defines a classifying space up to homotopy equivalence, so there are many different models of classifying spaces. Thus, we will give another construction that is in some ways a “better” model – this will have the property . As the functor doesn’t preserve products, this new model can be seen as “better”.

**Definition.** Given a small category , define the *nerve* of the category to be a simplicial set based on morphisms in the category. We will give the construction here:

Our 0-simplices will simply be the objects in the category. Our 1-simplices will be morphisms between the objects. 2-simplices will be the diagrams with edges given by two composable morphisms along with their composition (that is, given two composable morphisms , we take the commutative triangle with edges ). We can continue this construction to get a simplicial set, which we will call the nerve of the category. We still need to specify the face and degeneracy maps in order to have the entire structure of a simplicial set. Define the face map by taking the simplex to the simplex by composing the morphisms into one morphism. Define the degeneracy map by inserting an identity morphism at the object .

We will use this construction to very easily construct a classifying space. Given a discrete group , we can think of it as a single-element groupoid (that is, a category with one object in which all of the morphisms are isomorphisms). Then we take the nerve of this category, and finally take the geometric realization of the nerve. This is a topological space, which will be precisely . Note that the nerve functor is right adjoint, so it commutes with limits, including products. Additionally, it is a well-known fact that geometric realization commutes with products, even though it is a left adjoint functor (see, for example, this paper for a motivation of why this is true).

That the result of this construction is a classifying space is not hard to see. The fundamental groups of a CW complex (and, thus, a simplicial complex) is obtained from the 2-skeleton by taking formal products of the 1-cells and using the 2-cells to apply relations. In this case, the 1-cells are precisely the elements of the group and the 2-cells will tell us that , so this will give us the correct fundamental group. Verifying that the higher homotopy groups are trivial is much more difficult and beyond the scope of this post.

**Example.** Consider the group with two elements. Then, in the nerve of the category, there will be exactly one non-degenerate simplex in each dimension. This will give rise to a simplicial complex with one simplex in each dimension. This is precisely analogous to infinite real projective space, which can be realized as a CW complex with one cell in each dimension. By making the proper identifications we can see that this the same space. Because our construction of classifying spaces commutes with products, we have that is the classifying space for any product of these spaces as well.

]]>

First, a classic example which demonstrates how geometric vector bundles (equivalently locally free sheaves) on a scheme do not quite correspond to topological vector bundles. It is well-known that the only two topological real line bundles over are the trivial bundle and the Möbius bundle with one twist. Any other bundle, with a number of other twists, is homeomorphic to one of these two, since we can cancel out pairs of twists – even isotopically, if we embed in five(?) dimensions.

But if we take the scheme (as e.g. , which has underlying space homeomorphic to , it is clear we get many line bundles: specifically, the twisting sheaves for each . (Here we implicitly use the equivalence of categories between locally free sheaves and schemes over which are geometric vector bundles.) What gives?

It’s not that there’s something fundamentally geometrically different about the scheme structure: a geometric vector bundle over really does have underlying topological space a line bundle over the same space; that’s clear from typical definitions as given in, e.g. Hartshorne chapter 2. What’s different is the morphisms in the category of topological line bundles versus the category of geometric line bundles over a scheme. The latter morphisms are, of course, simply morphisms of schemes over , and so have to be given locally by regular functions. But the twist-canceling homeomorphisms can’t be given this way (intuitively, think about having to twist the lines out to infinity all the way around; these involve poles, algebraically), so they are not isomorphisms in the category of geometric vector bundles.

Intuitively, if we “allowed topological morphisms,” then the even would all be trivial whereas the odd would all be the Möbius bundle. This also explains the odd behavior of global sections (i.e. that all the nonnegative have global sections while all the negative ones don’t): the negative evens do have nonvanishing globals; it’s just that, again, they can’t be given by functions which formally satisfy the local regularity conditions of the twisting sheaves. On the other hand, the odd positive ones don’t really have nonvanishing global sections, since there are points at which the odd-degree polynomials corresponding to global sections of that twisting sheaf vanish.

**Exercise.** Someone should rigorize this line of thinking to create a gorgeous proof that every odd-degree polynomial has a real zero.

—

Second, a while back, I was trying to gain intuition for the Picard group by reading through different perspectives on the invertibility of line bundles. There’s of course the inverse transition maps, tensor-hom adjunction, and all that jazz, but I also stumbled on an amusing one: consider the classifying space for line bundles, the infinite Grassmannian . An isomorphism class of line bundles on a space is equivalently a homotopy class of maps into the classifying space, so we have reduced our problem to the much simpler and more intuitive one of putting a group structure on in the homotopy category!

A little Schubert calculus gives us the following: first, the cohomology ring is concentrated in even dimension, so it is a genuine commutative ring. Second, since the product is given by the cup product, which corresponds to intersection product, the closed points of correspond to maximal ideals generated by Schubert cycles associated to 0-dimensional subspaces, points. In fact, it turns out that is the formal affine line, , whose canonical additive structure on its closed points is induced by tensor product of the line bundles associated to the points of the Grassmannian (as a moduli space) associated to the closed points.

This structure can then be pulled back to , using the techniques in this paper. Theorem 1.1 implies that the well-known induced map in cohomology is injective. We can then compose it with the isomorphism . As a hom into an abelian scheme, this last object naturally inherits a group structure. This can then be pulled back to by naturality, giving us our group structure on isomorphism classes of line bundles.

The best part is that this doesn’t even work, since isn’t actually a strictly commutative ring if it doesn’t have vanishing odd cohomology, so it doesn’t have a spectrum. To really complete this elegant line of reasoning, I think it is necessary to develop the methods of noncommutative geometry. Alain Connes still dreams at night.

]]>

In Fulton’s

Intersection Theory, he develops the notation for the degree homomorphism from to , and I was wondering if there was a reason for the notation. Is this in any sense a kind of integration?

This resulted in a quite interesting discussion over Facebook chat (that perhaps lasted longer than it needed to), which I had screencapped and posted as an answer on math.stackexchange.

The folks at math.stackexchange were less than amused by this answer (which is reasonable on their part, given the rules of the site), as seen by their discussion here on meta.math.stackexchange. Not surprisingly, the answer was deleted.

]]>

**Oh, and Anja Schulz gets a shout out as well!**

* * *

When we discuss the construction of a meromorphic function on a Riemann surface with a given principal part at a given point , we have to find holomorphic functions on such that

on . One can formulate this in a general setting by introducing the concept of sheaf cohomology. Suppose is a topological space and ia sheaf of abelian groups over . Let

be a covering of by open subsets. For every nonnegative integer , we define as follows. An element of is

where is a continuous section of over and is skew-symmetric in . We call the group of alternating -cochains for the covering with coefficients in the sheaf . An element of is an alternating -cochain for the covering with coefficients in the sheaf or simply a -cochain when there is no confusion. For notational convenience, we define to be the zero group when .

We define a map

as follows. The image of

under is

where

on and means that the index is omitted. We call the map

the coboundary map. Denote by the kernel of

and denote by the image of

The group (respectively ) is respectively called the group of alternating -cocycles (respectively -coboundaries) for the covering with coefficients in the sheaf .

The composite of

is zero, because if the image of the element

of under

is

then

Hence, is contained in . Denote by the quotient . The group is called the cohomology group of dimension for the covering with coefficients in the sheaf . It is clear from the definition that is simply the set of all global continuous sections of the sheaf over and is usually denoted also by .

In our construction of a meromorphic function on a Riemann surface with a given principal part at a given point , the collection is a -cocycle with coefficients in the sheaf of germs of holomorphic functions on , and the existence of is equivalent to being a -cboundary. In this formulation, the obstruction to the solution of the problem is the cohomology group of dimension . When we try to piece together local meromorphic functions to form a global meromorphic function, we want to get rid of the discrepancies of the local meromorphic functions, and it would serve the same purpose if we can get rid of the discrepancies by going to a refinement of the covering. The problem is solved if the cohomology group of dimension vanishes when one goes to a refinement of the covering. This suggests that one should take all coverings of the space and consider the direct limit of the cohomology groups for all the coverings.

In the general case when we have a refinement of the covering , we have

We define as the direct limit of as runs through the directed set of all open coverings of . The group is called the cohomology group of dimension of with coefficients in the sheaf . We denote respectively by , , and the direct limits of , , and . Then we have an induced coboundary map

whose kernel is and whose image is . Moreover, is the quotient of by .

Among all cohomology groups of positive dimension, the most important cohomology group is the one of dimension . Its vanishing enables one to piece together local continuous sections of a sheaf to form a global continuous section. Why are the cohomology groups of higher dimensions introduced? They are introduced the help us compute the cohomology group of dimension , because when we have a short exact sequence of three sheaves, we have a long exact sequece of cohomology groups. We are going to discuss this long exact sequence of cohomology groups.

Suppose

is an exact sequence of sheaves and sheaf-homomorphisms. For every , we have the short exact sequence

This is clear except the surjectivity of . Suppose we have an element

of . We have to show that its restriction to some refinement of is the image of some element . First, let us make a trivial observation. Suppose we have a continuous section of over an open subset of . We give a metric . For every point in , there exists a maximum positive number

such that the ball of radius centered at is contained in , and for every , the restriction of to is the image of some continuous section over . The function is clearly a lower semi-continuous function of so that if

then

for all in some open neighborhood of . We can assume, without loss of generality, that the covering is locally finite. We choose relatively compact in so that

still covers . For every point in , we let be the minimum of for all containing . Clearly, every admits an open neighborhood on which the function has a positive lower bound. Now, for every point in , let be an open metric ball centered at of radius contained in for some

so that the function

on . Let

Then is a refinement of . Suppose have a common point . We can choose a number such that

for . Let be the metric ball centered at whose radius is . Then contains for , and is contained in . Moreover, be the restriction of to . We skew-symmetrize with respect ot . Then the restriction of to is the image of the element of .

From the commutative diagram with exact rows

we get a long exact sequence

The only map in the long exact sequence that needs some explanation is the so-called connecting homomorphism . It is defined as follows. Take an element of . We can find an element of whose image under is . Let be the image of under the coboundary map . From the above commutative diagram, it follows that is the image of some element under of the element of defined by . The exactness of the sequence is a consequence of straightforward diagram chasing.

]]>

Then

We have

Now, is perpendicular to because

The length of is simply the square of the length of , because and have the same length. The square of the length of equals . Hence, the holomorphic sectional curvature in the direction of is

Since in the case of Kähler manifold the curvature of the complex metric connection of the tangent bundle agrees with the Riemannian curvature, we have

Thus the holomorphic sectional curvature of a complex submanifold is more than the corresponding holomorphic sectional curvature of the ambient Kähler manifold. Note that this statement is not true for Riemannian sectional curvatures and Riemannian manifolds, because the Riemannian sectional curvature of the unit sphere in the real Euclidean space is clearly greater than the corresponding Riemannian sectional curvature of the Euclidean space.

The decrease in holomorphic sectional curvature for complex submanifolds holds also for a more general kind of curvature, because it comes from the inequality involving . So we want to see what curvature corresponds to. Let

We consider

where for the last equality the first Bianchi identity is used. One can easily check that a -bilinear form on satisfies

if and only if

when expressed in terms of complex basis of . Hence,

Hence,

We call the *holomorphic bisectional curvature* in the direction of and (or in the drection of and ). After suitable normalization, it is equal to the sum of two Riemannian sectional curvatures, one for the plane spanned by and and the other for the plane spanned by and . This is the reason for the name holomorphic bisectional curvature. The holomorphic bisectional curvature of a complex submanifold is no more than the corresponding holomorphic bisectional curvature of the ambient Kähler manifold.

]]>

We can choose a local unitary basis of such that belongs to . Let be the connection of induced by the complex metric connection of . In other words,

for . Another way of describing the connection is that the -covariant derivative of a local section of of is obtained by taking its -covariant derivative and then projecting onto by the orthogonal projection. This connection agrees with the complex metric connecion of with respect to the metric induced from that of . The reason is as follows. Firstly, it is easy to see that the -covariant derivative of a local holomorphic section of along any direction is zero from the above description of the -covaraint differentiation. Secondly, if is a section of above a local curve of with zero -covaraint derivative along the curve, then its -covariant derivative is a linear combination of and must be perpendicular to . As a consequence,

must vanish along the curve, and the length of is constant. Let be the orthogonal complement of in . We give the complex structure of the quotient bundle . The difference of the -covariant derivative of and the -covariant derivative of of is a 1-form with values in . This -valued -form is called the *second fundamental form* of in , and we denote it by . Let

be the representation of in terms of some local *holomoprhic* atlas . Then

and

Hence,

is a -valued -form. Thus, the second fundamental form must be a -valued -form, and we can write

for a section of .

Let us now consider the case of the quotient bundle. Take a local holomorphic basis of . We use the same notation to denote local sections of orthogonal to . The holomorphicity of basis means that for some sections of , the sections are holomorphic sections of . We take also a local holomorphic basis of . We have

When we write

in terms of the local holomorphic basis of , we see that

Hence, is an -valued -form. Since from the definition of one has

it follows that is a -valued -form. So we can define a connection on by

We claim that this connection is the complex metric connection of . It is a complex connection, because we have observed earlier that

and the -form has values in . It is also a metric connection, because

We write

for sections of . The operator is a -valued -form given by

We call the *second fundamental form* of the quotient bundle of .

Another more invariant way of representing the second fundamental form of the quotient bundle of is the following. We have

The local basis of the orthogonal complement of in simply describes the monomorphism from to which lifts to the orthogonal complement of in . let us call this monomorphism . Then is simply . The entity is *a priori* only a -valued -form which is exact. Our previous discussion shows that it is actually a -valued -form. However, as a -valued -form, in general it is only closed and is not exact. Suppose we have another Hermitian metric for . Then we would have a different monomorphism from to and a different second fundamental form . The difference of and is a homomorphism from to , because and are different liftings of to . So the difference of the two second fundamental forms of equals , which is a exact -valued -form.

We want to relate to the second fundamental form of the subbundle of . For sections and of and , respectively, we have

Hence, is simply the negative of the adjoint of with respect to the Hermitian metrics of and . So

We now compute the curvatures of , , and . We choose a local orthonormal frame so that belongs to . Write

Since the connection is compatible with the metric by differentiating

we conclude that

The second fundamental form of is given by

for . From

and

we have

Thus,

(no summation over ) for any vector of type , and equality holds if and only if

In invariant formulation, we have

for any in and any vector of of type .

Let us now look at the case of the quotient bundle. We can identify the quotient bundle as the subbundle of which is the orthogonal complement of in . We needed the complex structure of only to define the connection of as a complex metric connection. Once we get the connection of , we can ignore the complex structure of . The calculation of the curvature tensor depends only on the connection. So when it comes to comparing the curvature tensors, the computation of the quotient bundle case is the same as the subbundle case. There is however one difference. The second fundamental form of the subbundle is an endomoprhism valued -form. So when it comes to evaluating the exterior product of the second fundamental form and its complex conjugate transpose at for some -vector , there is a sign difference between the quotient bundle case and the subbundle case. So we have

for any in the orthogonal complement of in and for any vector of of type , where is the image of in .

]]>