Rational approximations to Platonic solids

It is straightforward to position a cube such that its vertices are all integer coordinates. For instance, we could choose its eight vertices to be (±1, ±1, ±1). Similarly, the vertices of a regular octahedron can be positioned at cyclic permutations of (0, 0, ±1). The tetrahedron, being a subset of the cube, can also be realised exactly with integer coordinates.

What about the dodecahedron and icosahedron? If we could position all the vertices at integer coordinates, then all distances must be square-roots of integers (by Pythagoras). However, since φ² is irrational, that precludes the possibility of two lengths in the ratio 1 : φ (necessary for regular pentagons to exist). Hence, it is impossible to give all vertices of a regular dodecahedron rational coordinates.

This suggests the problem of finding best rational approximations to the dodecahedron and icosahedron. For the dodecahedron, James Buddenhagen gave a set of coordinates for a pyritohedron which differs from regularity by a microscopic amount. Relaxing the condition that the faces must be exactly planar, a nice sequence of good approximations to the regular dodecahedron is obtained by taking all even permutations and sign changes of (x, z, 0) and (y, y, y), where x < y < z are three consecutive Fibonacci numbers.

pyritohedron

With the icosahedron, the faces are triangular so there is no need to worry about planarity. In this case, we only need to take two consecutive Fibonacci numbers; the icosahedron is the convex hull of all even permutations of (±x, ±y, 0). I conjecture that this gives the best rational approximations to the regular icosahedron, where ‘best’ is defined as ‘least variance (normalised by dividing by the squared mean) of edge length’. It is optimal amongst all icosahedra with pyritohedral symmetry, at least, since the best rational approximations to φ are ratios between consecutive Fibonacci numbers.

Etymology of ‘pyritohedron’

Pyritohedra are so named because they occur naturally in crystals of iron pyrite (a.k.a. fools’ gold). I happen to have a couple of iron pyrite crystals I photographed years ago:

VLUU L200  / Samsung L200

The larger sample clearly features partial examples of what appear to be regular dodecahedra, but are actually irregular as a consequence of the crystallographic restriction theorem. The smaller sample appears to feature truncated octahedra instead, and the crystallographic restriction theorem does not preclude their uniformity. Indeed, as an easy exercise for the reader, find integer coordinates for the 24 vertices of a truncated octahedron such that all edge lengths are equal. John Baez remarked that in addition to dodecahedra with pyritohedral symmetry, their geometric duals (irregular icosahedra) also occur in crystals of iron pyrite.

Now for a historical digression: It has been suggested that the Pythagoreans (who definitely knew of the dodecahedron, or ‘sphere of twelve pentagons’) may have discovered the regular dodecahedron by idealising the pyritohedra found in nature. On the other hand, when Thaetetus exhaustively proved that there are only five Platonic solids, he used purely theoretical methods similar to the idea of a Schläfli symbol. Tim Hutton independently repeated the same lines of reasoning, finding the planar and hyperbolic tilings in the process.

There is a subtle aspect to the crystallographic restriction theorem: it only applies to true crystals with translational symmetry. In the latter half of the twentieth century, quasicrystals were discovered; these are three-dimensional generalisations of Penrose tilings, usually found in metal alloys. One particular quasicrystal has perfect icosahedral symmetry. If we lived in four-dimensional space, we could also get quasicrystals with H4 symmetry (the largest polychoral point group), based on a tessellation by Viet Elser and Neil Sloane.

Posted in Uncategorized | 5 Comments

Desert Island Theorems

Suppose you were cast away on a desert island. You’re only allowed to take a maximum of eight known theorems with you, along with rudimentary results such as ZFC axioms together with ‘boring stuff’ such as mathematical induction over the naturals, commutativity of real multiplication and the least upper-bound property of \mathbb{R}. Everything else you have to prove yourself.

So, which eight theorems would you take with you on a desert island? It would be a waste of time to take the Baire category theorem, for instance; despite being extremely useful, it’s pretty trivial to prove. The same applies to the Dirichlet pigeonhole principle. On the other end of the spectrum, whilst Fermat’s Last Theorem is very difficult to prove, the end result is not nearly as useful as the machinery developed by Andrew Wiles in the course of solving this problem.

At the end of the post, there’s a ‘comments’ section for you to mention which theorems you would choose if marooned on a desert island. Here are my choices:

8. Density Hales-Jewett theorem

The ordinary Hales-Jewett theorem is quite straightforward to prove, using plenty of induction and the Dirichlet pigeonhole principle. On the other hand, the density analogue (c.f. Szemeredi’s theorem) is much deeper, requiring the entire Polymath project to establish a purely combinatorial proof (the original proof involved ergodic theory).

I’m not sure whether I’ve ever required the full force of Density Hales-Jewett, but Szemeredi’s theorem and ordinary Hales-Jewett have proved invaluable to me.

7. The abc theorem

Disclaimer: I don’t think that a second person is capable of understanding Mochizuki theory*, so this might still be classed as a conjecture until someone can peer-review the paper.

* Formerly known as inter-universal Teichmüller theory, although Doron Zeilberger makes a good argument that it should be renamed.

The abc conjecture is concerned with triples of coprime naturals, a < b < c, such that a + b = c. It states that the radical d (larger squarefree divisor) of abc cannot be much smaller than c. Specifically, for any ε > 0, we can only find finitely many examples where d < c^{1 - \epsilon}. This asserts that Beal’s conjecture holds in all but a finite number of cases, as does Fermat’s last theorem.

6. Max-flow min-cut theorem

I would have ranked this higher, except for the fact that it has a short elementary proof. The statement of the theorem is about networks, which are directed (multi-)graphs where each edge has a maximum capacity. A flow in a network is an assignment of nonnegative real values to each of the edges, such that they do not exceed the capacity, and that Kirchoff’s current law is obeyed at all vertices (with the exception of the source and sink vertices). The statement of the theorem is that the maximum flow attainable is equal to the minimum capacity of a cut (partitioning of the graph into two sets of vertices, so as to separate the source from the sink).

It can be used to prove Menger’s theorem in graph theory, Hall’s marriage theorem, Dilworth’s theorem on partially-ordered sets, and the Erdős-Szekeres theorem (although this also follows from Dilworth’s easier twin, Mirsky’s theorem).

5. Borsuk-Ulam theorem

This is informally stated as ‘two antipodal points on the Earth’s surface have the same temperature and pressure’. More generally, if we have a continuous function f from the n-sphere to \mathbb{R}^n, then we can find antipodal points x and y such that f(x) = f(y).

A corollary is the Brouwer fixed-point theorem, and all that that implies.

4. Gödel’s completeness theorem

This is one of the most beautiful and powerful theorems in mathematical logic. If a statement in first-order logic is consistent (i.e. we cannot prove a contradiction), then this asserts that there exists a finite or countable model. Consequently, we establish logical compactness: a set of first-order sentences has a model if and only if every finite subset has a model.

With the compactness theorem, the problem of computing the chromatic number of the plane is reduced to the more tractable problem of determining the maximum chromatic number of any finite unit-distance graph.

3. Every set can be well-ordered

This is equivalent to the axiom of choice, but much friendlier than the boring statement of AC involving a choice function. The main advantage is that it endows one with the power of transfinite induction over arbitrary sets, which is one of my favourite tools for proving theorems:

  • Constructing an infinite game where neither player has a winning strategy;
  • 2-colouring the plane such that there is no continuous monochromatic path;
  • Proving that the direct product of two infinite sets has cardinality equal to the larger of the two sets (by Cantor normal form);
  • Establishing Zorn’s lemma to show that any vector space has a basis…

2. Classification of finite simple groups

This is an impossible endeavour for a single individual to attempt, with the current proof being a 5000-page behemoth comprising many different papers. The Classification obviously reduces deep results such as the Feit-Thompson theorem (that there are no finite simple groups with odd composite order) and Burnside’s p^a q^b theorem to easy corollaries. (Of course, these theorems were almost certainly used in establishing the Classification, so this would be logically circular.)

It’s also a very beautiful theorem, with a vast wealth of exciting groups such as PSL(2,7), exceptional Lie groups over finite fields, and the Monster group. The Classification was also involved in proving theorems in areas outside group theory, such as establishing which graphs are 4- and 5-ultrahomogeneous.

1. Bezout’s theorem

This is a result in geometry, which says that if a degree-m and degree-n algebraic curve intersect in finitely many points, then they do so in at most mn points. Moreover, equality holds when the curves are on the complex projective plane, and we count intersections with the appropriate multiplicity.

An immediate corollary of this is the fundamental theorem of calculus, by taking one of the curves to be the line y = 0 and the other to be y = f(x) (where terms have been multiplied by appropriate powers of z to homogeneise them, and f is an arbitrary polynomial).

Also, it can be used to establish the Cayley-Bacharach theorem of cubic curves, which itself can prove the associativity of the elliptic curve group operation, Pascal’s theorem, Pappus’ theorem, and thus Desargues’ theorem.

Posted in Uncategorized | 17 Comments

W. T. Tutte

In the not-too-distant future, people are going to be celebrating the 100th birthday of Bill Tutte. He’s not quite as well-known as Alan Turing, which is a shame since they were both equally invaluable in the cryptanalysis at Bletchley Park. You may also know him from the problem of squaring the square, which he accomplished with three of his friends whilst they were students at Trinity. The story of how they met was rather interesting and extremely serendipitous, and is recounted at the beginning of this article on squaring.net.

The catalysis for this was a complete chance encounter in London between Trinity mathmos Arthur Stone and Cedric Smith, who happened to both reside in the area. The scope of the conversation was as vacuous as the empty set, and not even their names were exchanged at the time. Much more interesting was the next edge of the acquaintance graph, which was created in an equally improbable manner:

As first-year undergraduates, Cedric Smith met Rowland Brooks in a lecture on almost periodic functions. This was the equivalent of a Part III lecture, and their attendance was the result of an error in a timetable. This administrative mistake became apparent from the incomprehensibility of the lecture! Nevertheless, it was very fortunate that the timetables were erroneous, since it ultimately led to a very productive union of four mathematicians.

The fourth, of course, was Bill Tutte, who was reading chemistry (which has since been engulfed into the Natural Sciences Tripos) as well as mathematics. He knew Rowland Brooks, and later met Smith and Stone by extension. Tutte will be the primary focus of the majority of this article, so it seems appropriate to include a picture. Even better, here is a photograph of a bronze bust by the accomplished sculptor, artist, painter and former musician, singer and actress, Gabriella Bollobás, who happens to be married to the eminent mathematician Professor Bollobás:

Sculpture of Bill Tutte by Gabriella Bollobás

Squaring the square

The four of them met on a regular basis to discuss mathematical problems. One of these was the problem of finding a squared square, that is to say a dissection of a square into a finite number of smaller squares, no two of which are the same size. William Tutte wrote a detailed account of their effort to solve this mathematical problem, which spanned the years 1936 through to 1938.

The first major insight was that a squared rectangle could be represented by a network of one-ohm resistors, where the Thevenin equivalent resistance is equal to the aspect ratio of the squared rectangle. Specifically, imagine that the rectangle is made from a material of uniform resistivity, where a square of the material has a resistance of one ohm between opposite edges. Then, given a dissection of the rectangle into squares, we can replace each horizontal line with a perfect conductor, and each vertical line with a perfect insulator, and since all current flows vertically this has no effect on the total resistance.

A corollary of this is that the resulting squared rectangle must have sides in a ratio corresponding to the resistance of a circuit composed of one-ohm resistors; this forces it to be rational. Tutte et al also showed that the dimensions of the constituent squares must also be commensurate with the side lengths of the rectangle.

The next important discovery was by Smith’s mother, who assembled the squares into a perfect rectangle in a different arrangement from the original configuration. Eventually, it was noticed that this corresponded to a resistance-preserving transformation of the corresponding electrical circuit. They reasoned that it might be possible to tile the same rectangle with two completely different sets of squares, which would trivially result in a squared square:

Problematically, the squared rectangle produced by Smith’s mother featured precisely the same set of squares as the original, so the resulting dissection of the square would be replete with pairs of identical squares. Hence, further ingenuity was required to discover a perfect squared square.

The structure of Smith’s circuit was observed to naturally partition into a ‘rotor’ with threefold rotational symmetry and a ‘stator’. With a one-wire stator, it was realised that it could be possible to produce two rectangles sharing only one common square, which could be overlapped to yield a configuration trivially completable into a squared square. It transpires that Smith and Stone collaboratively found one, at almost precisely the same time that Brooks did so.

These order-69 squares were the first to be discovered, but far from being optimal. The Trinity Mathematical Society logo is an order-21 squared square, which has been proved to be minimal by exhaustive search.

Cryptanalysis of the Lorenz cipher

A year later, Britain and Germany plunged into the Second World War. (I’m not going to mention the war again, I promise!) This involved quite a lot of secret communication on both sides, and much of the Allied codebreaking efforts were concentrated in Bletchley Park.

Bletchley Park defeated two great ciphers. The first of these, which is probably more well known, was Arthur Scherbius’ Enigma cipher used by the Luftwaffe and Kriegsmarine. The effort began with the Polish mathematician Marian Rejewski. He knew roughly how the Enigma worked from models stolen by intelligence agencies, but had to embrace the immense challenge of determining both the internal wiring of the rotors and a method for cryptanalysing the variable key (which changes with each day). Rejewski built a machine, the bomba, for brute-forcing the key settings. Eventually, the Polish intelligence passed their discoveries to Bletchley Park, where people such as Alan Turing (King’s College, Cambridge) and Gordon Welchman (Trinity, yet again) found alternative and more flexible methods of cryptanalysing the Enigma based on Rejewski’s insights.

However, the cryptanalysis of the more difficult Lorenz cipher was an infinitely more impressive achievement, mainly because it was accomplished without anyone having ever seen one of the Lorenz encryption machines. I shall briefly describe its modus operandi.

The Lorenz cipher used the Baudot code, where each character is represented by five binary digits (either dots or crosses). The enciphering machine (or Schlüsselzusatz) produced a pseudorandom stream of information which was XOR’d with the plaintext to yield a ciphertext. The same stream of information, when XOR’d with the incoming ciphertext, would result in perfectly comprehensible plaintext. This Vernam cipher was a known form of encryption, but cryptanalysists at Bletchley Park had no idea how the pseudorandom bit generator operated.

What would be helpful is if the output of the pseudorandom bit generator could actually be isolated. This happened when a lazy operator was asked to repeat a message with the same key, and abbreviated it slightly, giving a huge vulnerability. Specifically, XORing the two encrypted messages would neutralise the contribution of the Schlüsselzusatz, giving the symmetric difference of the two messages. John Tiltman was able to carefully separate this into the two original messages, treating it as a basic autokey cipher. Moreover, he could XOR the encrypted and decrypted messages to isolate the output of the Schlüsselzusatz.

Enter Tutte. Tutte was given the output of the Schlüsselzusatz, and asked to deduce the internal workings of the machine. This, of course, was an incredibly difficult task. He started by looking for approximate periodicity (perhaps the lecture on almost periodic functions helped!) in the bitstream. He found a period-41 pattern, suggesting the presence of a rotor χ1 with 41 teeth. Moreover, he determined that another more complicated rotor (ψ1) driven by a motor wheel μ contributed to the encryption. Proceeding in this manner, cryptanalysts determined how each of the other four bits were generated (there were five independent mechanisms with coprime periods).

Armed with Tutte’s deduction, Alan Turing et al were able to work on algorithmically cracking this cipher. This began with the electromechanical Heath Robinson machines, which were replaced by the much more efficient electronic Colossus computers, designed and built by engineer Tommy Flowers. Gordon Welchman was somewhat disapproving of this shift from good old electromechanical relays to those new-fangled thermionic valves; fortunately, Turing was more forgiving. Nowadays, of course, relays and valves have both been rendered obsolete by microscopic transistors on silicon chips, and may eventually be further antiquated by advances in carbon nanotechnology.

The intercepted and decrypted communications gave the Allies a massive advantage. I seem to recall that they even sent a Lorenz-encrypted message to sabotage the Nazi military by issuing unhelpful orders, although I might be confusing this with the Sherlock Holmes mystery, The Adventure of the Dancing Men.

Graph theory

Squaring the square, whilst initially appearing to be combinatorial geometry, eventually succumbed to a graph-theoretic approach. Many of Bill Tutte’s mathematical accomplishments were in graph theory, including the disproof of Tait’s conjecture that every 3-regular polyhedral graph admits a Hamiltonian cycle.

He has a few graphs named after him, one of which is the Tutte 8-cage. This is a 3-regular graph with girth 8, and indeed is the unique smallest such graph. It possesses a large amount of symmetry, a small amount of which is apparent in the following picture:

tutte-8-cage

Its full symmetry group is PΛL(2,9), the automorphism group of S6. This is apparent from the standard construction of the 8-cage: take the 15 unordered pairs of letters {a,b,c,d,e,f}, together with the 15 unordered triples of unordered pairs (such as (ac,bf,de)), and connect a triple to a pair when they are incident. This has an obvious action of S6 (permuting the letters), but also features an outer automorphism interchanging the two halves of this bipartite graph. Indeed, this is one of the simplest ways to see the outer automorphism of S6.

Tutte was also responsible for the snark conjecture, that every snark contains the Petersen graph as a minor. When proved much later by Robertson, Seymour et al, this gave an indirect proof of the four colour theorem as an immediate corollary. Other theorems include the BEST theorem (where the ‘T’ in the acronym stands for ‘Tutte’), Tutte-Berge formula and Tutte’s homotopy theorem.

William Tutte passed away in 2002, after a long life (by contrast with Turing’s untimely cyanide-laced Rosacaean death) and a very fruitful and productive mathematical career. There is an obituary in the Guardian.

Posted in Uncategorized | Leave a comment

Cipher 45: Sisyphean

Occasionally, some of my ciphers have the frustrating property that whilst easy to realise what to do, they can still be incredibly long-winded to solve. I can imagine that this is one such cipher:

homophonic

I apologise for the profusion of Petersen graphs.

Posted in Ciphers | Leave a comment

Perrin sequence

The Perrin sequence is defined by a recurrence relation, and is qualitatively similar to the Lucas sequence. The initial terms are P(0) = 3, P(1) = 0, P(2) = 2 and subsequent terms are defined by P(n) = P(n-2) + P(n-3), and summarised in the following spiral of equilateral triangles:

perrin-spiral

As you can see, I’ve cheated slightly by tucking a corner of P(3) = 3 beneath P(5) = 5. This is because the spiral was designed for the Padovan sequence, which is to the Perrin sequence as the Fibonacci sequence is to the Lucas sequence. This analogy can be made precise by considering the closed form of the Perrin sequence:

P(n) = \alpha^n + \beta^n + \gamma^n

These Greek letters are the roots of the characteristic polynomial x³ − x − 1 = 0. They can be computed by an easy application of Tartaglia’s method for solving the cubic, but the resulting expression is messy and not particularly insightful. The closed form, however, can be used to deduce that if p is prime, then p divides P(p):

P(p) = \alpha^p + \beta^p + \gamma^p \equiv (\alpha + \beta + \gamma)^p = 0 modulo p.

The non-trivial step can be deduced by expanding the expression (\alpha + \beta + \gamma)^p - (\alpha^p + \beta^p + \gamma^p), which gives p multiplied by some symmetric polynomial (as all non-trivial multinomial coefficients are divisible by p), which in turn can be expressed as a polynomial in the elementary symmetric polynomials (thanks to Newton’s theorem), which are all integers (by Vieta’s formulae). This result is a massive generalisation of the freshman’s dream.

A nicer way to prove that p divides P(p) is by noticing that P(p) counts the number of ways to tile an order-p cycle graph with tiles of length 2 or 3 (or, equivalently, the number of maximal independent sets). Here is an example tiling for p = 17:

circular-partition

When acted upon by the cyclic group Cp of rotations, no tiling can have a non-trivial stabiliser. Hence, the tilings are partitioned into orbits of size p, so the total number of tilings must be divisible by p. This argument generalises to any depressed monic polynomial with integer coefficients, as does the proof involving multinomial coefficients.

This can be exploited to give a Fermat-style test for primality. If n does not divide P(n), then n must be composite. We can compute this residue in O(log² n log log n log log log n) by using a matrix form of the recurrence relation, repeated squaring and the Schönhage-Strassen algorithm for multiplication.

Perrin pseudoprimes appear to be rarer than Fermat pseudoprimes, with only 17 below 10^9 compared with 646 Carmichael numbers and 5597 base-2 pseudoprimes in this interval. The smallest is 521² = 271441, followed by 904631 and 16532714. It’s possible to make better primality tests by (for instance) combining the Perrin, Lucas and Fermat primality tests. I don’t know of any composite numbers that are simultaneously Perrin and Lucas pseudoprimes, but they most probably exist.

Posted in Uncategorized | Leave a comment

Leech lattice

I’ve been intending to write about the Leech lattice for a while now. I wanted to ensure that I could actually do it justice, and in particular make it strictly more comprehensive than the Wikipedia and Mathworld articles. The authoritative source on this topic is the book Sphere Packings, Lattices and Groups (Conway and Sloane), which is a compilation of papers by several authors.

A lattice, in this context, is a set of vectors in Euclidean space that form an Abelian group under addition. Equivalently, it is a set of points (including the origin), such that for any two points A and B, there exists a translation mapping A to B and leaving the lattice invariant. For example, the midpoints of the cells of the regular square and hexagonal tilings form lattices in two dimensions, familiar as the Gaussian and Eisenstein integers:

2D-lattices

By comparison, this construction applied to the equilateral triangular tiling does not yield a lattice, since there is no translation capable of mapping an up-pointing triangle to a down-pointing one. Nevertheless, the set of points can be expressed as the disjoint union of two hexagonal lattices.

The hexagonal lattice can be regarded as the best lattice in two dimensions, for the following reasons:

  • It offers a denser packing of unit discs than any other arrangement;
  • It gives a thinner covering of overlapping discs than any other lattice;
  • In the packing, each disc touches six others, which is trivially maximal.

The next time a single lattice wins all three awards is in 24 dimensions, with the spectacular Leech lattice. (The E8 lattice in eight dimensions comes close, but is slightly beaten by a lattice called A8* in the covering department.) The rest of this post is primarily concerned with the Leech lattice, although the E8 lattice will undoubtedly be mentioned a few times.

Characterisation and properties

So, what exactly is the Leech lattice? It turns out that it’s the only even integral unimodular lattice with no roots in fewer than 32 dimensions. Each of these terms will take some explaining:

  • A lattice, as mentioned at the beginning of this article, is a set of vectors in Euclidean space which is closed under addition.
  • A lattice is integral if the dot product of any two vectors is an integer.
  • An integral lattice is described as even if the squared norm of any vectors is an even integer. Otherwise, it is an odd integral lattice.
  • A lattice is unimodular if the fundamental parallelotope has unit volume, or the determinant of the generating matrix is ±1. Equivalently, the lattice has a density of one point per unit volume. Even integral unimodular lattices only occur in dimensions divisible by 8, with the unique example in 8 dimensions being the E8 lattice.
  • For an even unimodular lattice, a root is a vector of squared norm 2. The Leech lattice has none of these, so the shortest nonzero vectors have a squared norm of 4 (or, equivalently, a length of 2).

Because the shortest distance between two lattice points is equal to 2, that means that we can place a ball of unit radius centred on each lattice point. Each sphere touches precisely 196560 others, which is the maximum for 24 dimensions. (The kissing number problem has also been solved in 1, 2, 3, 4 and 8 dimensions, with values of 2, 6, 12, 24 and 240, respectively, attained by the integer lattice, hexagonal lattice, FCC lattice, D4 lattice and E8 lattice, respectively.)

If we enlarge the spheres to a radius of √2, they will overlap and cover all space. Establishing that the covering radius is equal to √2 is non-trivial, and involves analysing all 23 types of deep hole (points maximally distant from lattice points) encountered in the Leech lattice. Each type of deep hole corresponds to one of the 23 other even unimodular 24-dimensional lattices, known as Niemeier lattices.

Coordinate-based constructions

Whilst saying ‘the unique even unimodular lattice in 24 dimensions with no roots’ is sufficient to characterise the Leech lattice, it is appallingly non-constructive. It’s a very complicated lattice, and it’s quite non-trivial to construct the Leech lattice. By comparison, it’s very easy to construct the cubic lattice:

  • Points in \mathbb{R}^3 such that all coordinates are integers

and it’s not much harder to construct the D4 lattice:

  • Points in \mathbb{R}^4 such that all coordinates are integers and the sum of coordinates is even

or even the E8 lattice:

  • Points in \mathbb{R}^8 such that either all coordinates are integers or all coordinates are half-integers, and the sum of all coordinates is even.

Conversely, constructing the Leech lattice using coordinates is really complicated! Even though the following definition is elementary, it involves lots of intermediate steps and is quite complicated indeed:

  • Points in \mathbb{R}^{24} such that the coordinates are all integers congruent to m modulo 2, the sum of all coordinates is congruent to 4m modulo 8, and such that for each i in {0, 1, …, 11}, the (i + 12)th coordinate is congruent modulo 4 to the sum of the jth coordinates over all j in {0, 1, …, 11} such that (i, j) does not describe an edge of the icosahedron with vertices labelled {0, 1, …, 11}. Then scale the resulting lattice by 1/√8 so that it is unimodular.

This elementary construction was obtained by taking the simplest construction based on the binary Golay code, together with a relatively simple construction of the binary Golay code itself, and combining them. It doesn’t seem to be a particularly motivated construction, though, and there are some simpler ones.

Laminated lattices

This is quite a fun way to create dense lattice sphere packings without much effort. We start with the ordinary integer lattice:

lambda1

We now add an extra dimension (increasing it from 1 to 2), and place a sphere as low down as possible whilst resting on top of the lattice:

lambda1_plus1

Now take the additive closure (i.e. make it closed under all translations mapping spheres to spheres):

lambda2

Now imagine it horizontally embedded in three-dimensional space, and add an extra sphere as low down as possible whilst still resting on our hexagonal lattice of spheres. In the image below (plan view), I’ve added six of the spheres of the second layer, and two of the third layer:

lambda2_plus8

Taking the additive closure gives the FCC lattice:

The sequence of laminated lattices obtained in this manner begins Z, A2 (hexagonal), A3 (FCC), D4, D5, E6, E7, E8, and gives rather dense lattices. The laminated lattices in 9 and 10 dimensions are also unique, but there are two types of deep hole in Λ10, giving two different choices for the laminated lattice Λ11. There are three laminated lattices in each of 12 and 13 dimensions, but remarkably instead of getting an exponential explosion of possibilities, all routes converge to a unique laminated lattice in 14 dimensions.

This continues as a nice steady sequence of unique lattices, culminating in Λ24 being the Leech lattice itself! Note that this construction is much simpler than the direct coordinate-based approach using the binary Golay code. However, when one wants to investigate the geometry of the Leech lattice, it is easier to work with the coordinate-based parametrisation.

This is followed by an explosion, as each of the 23 deep holes of the Leech lattice yield different laminated lattices in 25 dimensions, and not much is known about the plethora of laminated lattices in 26 dimensions or above. However, there is a very remarkable periodicity observed in one particular sequence of laminated lattices: the centre density (number of lattice points per unit volume) oscillates with period 24 (and is an even function), where the lattice \Lambda_{s+24} is obtained by ‘gluing’ \Lambda_s to the Leech lattice.

Eisenstein integers, Golay codes and Mathieu groups

We have thought of the Leech lattice in its most natural state, which is a 24-dimensional real lattice. Since the complex numbers can be identified with the real plane, this suggests a 12-dimensional complex representation of the Leech lattice. The construction using Gaussian integers is identical to the ordinary 24-dimensional real construction (since the Gaussian integers are isometric to the Cartesian product \mathbb{Z}^2), whereas the Eisenstein construction is distinct.

Instead of using the 24-dimensional binary Golay code over F2, it exploits the 12-dimensional ternary Golay code over F3. The ternary Golay code, whilst named after Golay, was actually discovered a couple of years earlier by a football pool enthusiast (see here). Incidentally, the binary and ternary Golay code constructions of the Leech lattice are two of 23 constructions, one for each Niemeier lattice. Conway and Sloane discuss these ‘holy constructions’ in Sphere Packings, Lattices and Groups.

It’s worth discussing the symmetry groups of these codes. The permutations of the coordinates that fix the binary Golay code form the Mathieu group M24, which has two other sporadic groups (non-Abelian simple groups not of Lie type), M23 and M22, as one- and two-point stabilisers. Similarly, the symmetries of the ternary Golay code give the sporadic group M12, which has M11 as a one-point stabiliser. M21 is isomorphic to PSL(3,4) therefore non-sporadic, and M10 isn’t even simple.

M12 has a nice construction based on the icosahedron. We place twelve ball-bearings around a central sphere (above), corresponding to the vertices of an icosahedron. An inverted twist is an operation where you choose a ball-bearing and its five neighbours, rotate them by 72°, and reflect the arrangement of ball-bearings in the centre of the central sphere. M12 is the group generated by an even number of inverted twists.

M24 can be constructed by a gadget known as the Miracle Octad Generator, which I’ll be explaining in more detail in about a fortnight. For M12, there is a smaller gadget known as the miniMOG (or ‘kitten’, alluding to the fact that ‘mog’ is an informal term for a cat!).

Icosians and the E8 lattice

So far, we have seen that there are real and complex constructions of the Leech lattice. There are also two quaternionic constructions, one of which is boring and analogous to the real and complex constructions. Instead, we’ll describe the more exciting construction, involving the (non-commutative) ring of icosians.

The icosians are a beautiful ring of quaternions with a large degree of symmetry. They are dense in the quaternions, and share some similarities with the integral elements of the quadratic field \mathbb{Q}[\sqrt{5}] in the complex numbers. Whilst this quadratic field can be used to construct the Penrose tilings with fivefold symmetry, the icosians can be used to construct a four-dimensional quasicrystal with the H4 symmetry group possessed by the 120-cell.

120-cell

The icosians are finite sums of unit icosians, of which there are 120. These include the 24 Hurwitz integers together with the 96 even permutations of ½(0, ±1, ±ψ, ±φ), where φ and ψ are the golden ratio and its Galois conjugate. The unit icosians under multiplication form the binary icosahedral group of order 120, which is the double cover of the icosahedral group.

It is trivial to observe that every icosian is of the form [(a + b√5) + (c + d√5)i + (e + f√5)j + (g + h√5)k], where the variables are rational numbers. This is an icosian if and only if the vector (a, b, c, d, e, f, g, h) lies in a particular scaled and rotated copy of the E8 lattice. We can think of this as a bijection θ from the icosians to the points of the E8 lattice.

Then the Leech lattice can be described as (θ(x), θ(y), θ(z)), where x + y + z = 0 (mod h) and x = y = z (mod h*), where h = ½(−√5 + i + j + k) and h* is its conjugate. We also have a dense three-dimensional quaternionic lattice, L, which is formed by the corresponding points (x, y, z).

Observe that the point symmetry group of the icosians is H4, of order 14400, whereas the point symmetry group of the E8 lattice is the Weyl group of E8, of order 696729600. Hence, there is no reason to assume that L has the same symmetry group as the Leech lattice (described later), and indeed it doesn’t. Whereas the symmetries of the Leech lattice form the double-cover of Conway’s sporadic group Co1, the symmetries of L form the double-cover of the Hall-Janko sporadic group J2. Similarly, the symmetries of the complex Leech lattice yield a setuple cover of the Suzuki sporadic group.

Theta series and automorphism group

To motivate the discussion of theta series, consider Problem 354 from the Project Euler website. This features a scaling of a hexagonal lattice, where the shortest vectors have squared norm 3 (so the Voronoi hexagons have a side length of 1). They provide a nice picture of this lattice:

The theta series is (the generating function of) the sequence giving the number of points at a specified squared distance. So, in this case, we obtain the theta series 1 + 6q^3 + 6q^9 + 6q^{12} + 12q^{21} + \cdots. The Project Euler problem asks how many of these coefficients below some large finite order are equal to 450.

The theta series of the square lattice gives the number of ways to express an integer as the sum of two squares of (not necessarily positive) integers. For the Leech lattice, the number of vectors of squared norm 2m is equal to \dfrac{65520}{691}(\sigma_{11}(m) - \tau(m)), where τ is the Ramanujan tau function and σ11 gives the sum of eleventh powers of divisors.

The first few terms of the theta series of the Leech lattice (A008408) are 1, 0, 196560, 16773120, 398034000, and so on. John Conway observed that the 398034000 vectors of squared norm 8 naturally partition into 8292375 ‘frames’ of 48 vectors positioned at the vertices of an orthoplex (generalised octahedron). Before we divide all vectors by √8 in the coordinate-based construction of the Leech lattice, one of these frames comprises all permutations of (±8,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0).

The automorphism group is transitive on these coordinate frames, and fixing a coordinate frame gives a subgroup 2^12.M24, corresponding to permutations and sign changes fixing the binary Golay code. By the Orbit-Stabiliser theorem, we recover the size of the automorphism group:

8292375 × 244823040 × 4096 = 8315553613086720000

This group is equal to its own automorphism group. It has a normal subgroup of order 2 (namely its centre, {±I}); the quotient with respect to this is Conway’s sporadic group Co1. The sporadic groups Co2 and Co3 are obtained by fixing additional points. We can also obtain many other sporadic groups in this manner, two of which are described in the next section.

Higman-Sims graph and McLaughlin graph

Take three lattice points that form the vertices of a triangle with side lengths of {2, √6, √6}. There are 100 lattice points of minimal distance 2 from all three of these three vertices. Consider these 100 points, and join two points with an edge if the distance is √6. The result is the Higman-Sims graph on 100 vertices. Claudio Rocchini produced the following drawing:

The Higman-Sims graph contains other exceptional graphs embedded within it. There are 352 ways that the vertices can be partitioned into two copies of the Hoffman-Singleton graph. Removing a vertex of the Higman-Sims graph together with its 22 neighbours gives the M22 graph.

We can also obtain the 275-vertex McLaughlin graph, by applying the same construction to a {2, 2, √6} triangle instead of a {2, √6, √6} triangle. The automorphism groups of the Higman-Sims graph and McLaughlin graph are the double covers of two more sporadic groups, the McLaughlin group and the Higman-Sims group.

If we take the convex hull of the 240 shortest vectors in the E8 lattice, we obtain the beautiful semiregular Gosset E8 polytope. The convex hull of the 196560 shortest vectors in the Leech lattice is a much more complicated object, but recently its structure was analysed. For each facet of the polytope, the authors construct a graph based on the lengths of its edges. The connected components of these graphs include:

Lorentzian lattices, string theory and the Monster Lie algebra

I was going to include this with the other constructions of the Leech lattice, but I’ve postponed discussion until now due to its depth, abstractness and amount of conjecture.

An elegant construction of the Leech lattice embeds it in (25+1)-dimensional Minkowski spacetime. In this spacetime, we use 25 spatial coordinates and one temporal coordinate, separated by a vertical line. We endow the space with an ‘inner product’ with signature (+, +, …, +|−).

The Lorentzian lattice II(25,1) consists of points such that:

  • All coordinates are integers or all half-integers;
  • The sum of coordinates is even.

The Leech lattice can be obtained from this by taking the lattice points r such that r.r = 2 and r.w = −1, where w = (0, 1, 2, 3, 4, …, 18, 19, 20, 21, 22, 23, 24, −70) is the Weyl vector, and ‘flattening’ it into Euclidean space whilst preserving the squared distances between points. Remarkably, w is a lightlike vector, since 0² + 1² + … + 24² = 70², as discovered by Edouard Lucas.

The Lorentzian lattice II(9, 1) is also of interest, being intimately related to the 9-dimensional hyperbolic tessellation which completes the sequence {triangular prism, rectified* 4-simplex, demipenteract, E6 polytope, E7 polytope, E8 polytope, E8 lattice, …} of exceptional semiregular polytopes. However, II(9, 1) and II(17, 1) are not as exceptional as II(25, 1), since their Weyl vectors are not lightlike and the corresponding constructions give finite Coxeter-Dynkin diagrams instead of a copy of the Leech lattice.

Conway et al defined an infinite-dimensional Lie algebra based on the Leech roots (those vectors r such that r.r = 2 and r.w = −1), conjecturing that the Monster group occurs naturally as a subquotient of the automorphism group of a sub-algebra. An explicit Monster Lie algebra could be created by applying a particular string-theoretic theorem to a vertex algebra defined by Richard Borcherds.

People such as Stephen Wolfram, Konrad Zuse and Ed Fredkin proposed that our universe is a cellular automaton. One of the major criticisms is that, whilst cellular automata on hexagonal grids can exhibit large-scale SO(2) symmetry (lattice gases are examples), they lack Lorentz invariance. This problem could possibly be circumvented by cellular automata on Lorentzian lattices such as II(9,1) or II(25,1), which exhibit a discrete group of symmetries including Lorentz transformations. Indeed, a particularly well-chosen cellular automaton on II(9,1) or II(25,1) would be a discretised version of 10- or 26-dimensional string theory.

(This article apparently has precisely 3000 words, making it the longest cp4space post so far…)

Posted in Uncategorized | 25 Comments

Category of scones

(This post is completely unrelated to the category of cones. Apologies if you were mistakenly directed here by a fuzzy search engine.)

In May 2013, the category theorist Dr Eugenia Cheng (whom you may recognise from her quotations on the Imre Leader Appreciation Society) wrote a paper on the ideal cream tea. This received a lot of media attention, including from the Daily Mail, Telegraph and The Aperiodical (in increasing order of accuracy).

The paper discusses the optimal proportions of cream, strawberry preserve and scone for an ideal cream tea. Unlike most academic papers, this one has an accompanying video:

[youtube=http://www.youtube.com/watch?v=xUtNNCprm9w]

I was slightly disappointed that there was no category theory in the paper, despite the author being a category theorist. It did, however, explicitly mention that the preserve should be added before the cream, in the style of the Cornish. By comparison, residents of Devon advocate the reversed approach of adding the cream first. The fact that these are genuinely different outcomes can be neatly abstracted into the statement that this diagram in the category of scones does not commute:

scone-category

The other shortcoming of Eugenia Cheng’s otherwise brilliant paper is that it doesn’t mention that cream teas can be augmented by the addition of a bisected strawberry, as in the image below. The left-hand scone has been prepared using the Devonian order of morphism composition, whereas the right-hand scone has been prepared using the (strictly superior) Cornish method:

VLUU L200  / Samsung L200

Jacob Aron mentioned on Twitter that if the scone has a diameter of 2 centimetres, the clotted cream would need to be infinitely tall, distributed in the form of a Dirac delta function† V \delta(x) \delta(y), where V is the desired volume of cream.

† Disclaimer: This is not a function in the pure-mathematical sense.

Flexagons

There have been other recent examples of gastronomical mathematics in the last year. I presume that you’ve seen the integral banana I posted several months ago. Additionally, Evelyn Lamb has reviewed a multitude of videos involving both mathematics and food, including the culmination of Vi Hart’s flexagon series of videos.

[youtube=http://www.youtube.com/watch?v=VIVIegSt81k]

Evident from the construction of the simplest hexaflexagon is that we are dealing with a Möbius strip, which is folded to form a multiple cover of the hexagon. The automorphisms of the Möbius strip (C18) quotiented by the automorphisms of the resulting hexagon (C6) gives the group of flexations (C3).

With three of the strips used to build a hexaflexagon, one can also construct a Boerdijk-Coxeter helix. Twenty cyclic Boerdijk-Coxeter helices of 30 tetrahedra each are obtained by partitioning the 600-cell (a four-dimensional regular polytope) according to a discrete Hopf fibration.

Posted in Uncategorized | Leave a comment

Cipher 44: Dostin

This is classically used as a rudimentary compression algorithm, although it also makes for a nice cipher challenge.

000011010110111 0100001111000111 1110 11000011100111101101001
001011101010100111 110100101
01001110100111110001010110001110111101111011010,
001001110101101001101010 11100111 1110
01100100000101001010011111101010 0000101100100,
0011001101011010010110 01011001100010101100011110010100000111
1111010000000010001010111 0011010110000110
01001110111111110110100111 110100101
001000111001011011111011000011110100 1111010010101100110000110
111111001100011001010111111001010000 11011010 0000110100
01011001111011100001011001000100
00101010110000100100101000100101001001110010. 0000110100
110001111001110111001101101010111111 10110111
1111110010101111101011011000101101001.

Good luck.

Posted in Ciphers | Leave a comment

Ten things you (possibly) didn’t know about the Petersen graph

Some mathematical objects are completely boring. Others have a few interesting properties. The Petersen graph, by comparison, has a plethora of quite remarkable features that, taken alone, would each qualify the graph as being interesting. I’ve mentioned this graph a few times before, but this is the first post exclusively dedicated to it. Firstly, here’s a picture of the graph:

petersen

For completeness, I’ll mention that it’s a 3-regular graph with 10 vertices and 15 edges, and that it is both vertex- and edge-transitive. Now, let’s discuss why it’s interesting:

1. Linklessly embeddable graphs

Wagner’s theorem is well known, and states that a graph is non-planar if and only if it contains either the complete graph K5 or the bipartite graph K3,3 as a graph minor. The Petersen graph has both of these as minors, so it is really non-planar! It has the stricter property that if you draw the Petersen graph in three-dimensional space (with no edges crossing), you can necessarily find two linked cycles.

linkful-embedding

In fact, like planar graphs, the linklessly embeddable graphs are closed under the operation of taking graph minors. The Petersen graph, together with K6 and five other graphs, form a finite set of forbidden minors (the existence of which is guaranteed by the Robertson-Seymour theorem together with Zorn’s lemma). They all have 15 edges, and can be interconverted by delta-wye transformations.

2. Toroidal and projective-plane embeddings

In the previous section, I mentioned how the Petersen graph is linklessly embeddable, therefore far from being planar. Consequently, removing any vertex of the Petersen graph leaves a non-planar graph. Hence, it is somewhat surprising that it can be drawn on the surface of a torus without any edges crossing. Indeed, it can be shown that every generalised Petersen graph (formed by taking the skeleton of a regular n-gonal prism and replacing one of the terminal n-gons with a regular star n-gon) is toroidal.

Edit: The Desargues graph is not toroidal, so the previous statement is erroneous. Thanks go to Taylor Ball for noticing this fallacy!

The Petersen graph has a more natural and very symmetric embedding on the projective plane. It can be obtained by drawing a regular dodecahedron on the surface of a sphere, and identifying antipodal points, edges and faces. This results in an abstract polyhedron called the hemi-dodecahedron, the skeleton of which is the Petersen graph. The hemi-dodecahedron and hemi-icosahedron can then be used to construct even more pathological polytopes, the 11-cell and the 57-cell.

3. Construction as a Kneser graph

The hemi-dodecahedron has 60 symmetries, abstractly isomorphic to the alternating group A5. It follows that the automorphism group of the Petersen graph must contain A5 as a subgroup. You may intuitively think that it is precisely A5, but the subtle operation of ignoring the faces of the hemi-dodecahedron could result in additional automorphisms.

It transpires that this is indeed the case, and the Petersen graph has 120 symmetries, abstractly isomorphic to S5. The easiest way to see this is to construct the Petersen graph as a Kneser graph. Specifically, each vertex is associated with a subset of {a,b,c,d,e} of cardinality 2, and two vertices are joined with an edge if the corresponding subsets are disjoint.

kneser

To show that the only symmetries are the permutations of {a,b,c,d,e}, it suffices to demonstrate that we can recover the elements from the graph. We describe a pair of edges x, y as maximally distant if no vertex of x is connected to any vertex of y. It can easily be seen that if we select an edge x, we can find precisely two edges y, z maximally distant from x. Also, y and z are maximally distant from each other.

Hence, the 15 edges of the Petersen graph uniquely and naturally partition into 5 subsets of three edges. These are bijected with the elements of {a,b,c,d,e}, where the six vertices belonging to the union of the three edges are those not containing that element. This completes the proof that the symmetries of the Petersen graph are exactly the permutations in S5.

4. Not a Cayley graph

I’ve mentioned that it is vertex-transitive, that is to say that there are symmetries taking any one vertex to any other (or, equivalently, that every vertex is ‘essentially the same’). Graphs with this property are ten-a-penny, such as the following:

vertex-transitive

These are Cayley graphs of the groups C4, C7 and D10, respectively (N.B. a group can have multiple Cayley graphs, depending on which generators are used). Every Cayley graph is vertex-transitive (by construction), but not all vertex-transitive graphs are Cayley. The smallest counter-example is, unsurprisingly, the Petersen graph.

5. Hypohamiltonian

A graph is described as Hamiltonian if we can find a cyclic tour, which visits every vertex exactly once. Most vertex-transitive connected graphs are Hamiltonian, and it is conjectured that there are only five counter-examples. The smallest of these is really trivial, being the complete graph K2 (once the edge is traversed, you cannot return). The others are the Petersen graph, the Coxeter graph, and two larger graphs derived from the Petersen and Coxeter graphs.

However, removing any vertex leaves a Hamiltonian graph, so the Petersen graph is described as hypohamiltonian. We can see that this is the case by using vertex-transitivity (hence only one vertex needs to be considered) and observing the following drawing of the Petersen graph:

hypohamiltonian

Indeed, the Petersen graph is the smallest hypohamiltonian graph.

6. Snarks

In an earlier post, I mentioned how the four-colour theorem is equivalent to showing that no snark is planar. A snark, by the way, is a 3-regular connected graph with no isthmus* (edge whose removal would result in a disconnected graph), such that we cannot 3-colour the edges such that each vertex is incident with one edge of each colour. The dodecahedral graph, for example, is 3-regular, connected and bridgeless**, but not a snark as it admits the following edge tricolouring:

edge-colouring

The Trinity mathematician William Tutte, whom you may know from such things as the cryptanalysis of the Lorenz cipher in the Second World War and the discovery of the first perfect squared square, conjectured that every snark has the Petersen graph (the smallest snark) as a graph minor. This was proved recently, with the four-colour theorem following as a corollary.

* Some people use the alternative term ‘bridge‘. I am not one of those people…

** …but nevertheless, I do admit that ‘isthmusless’ sounds clumsy.

7. Desargues’ theorem and right-angled hexagon theorem

Even more recently, I wrote about Pappus’ theorem and Desargues’ theorem. The latter can be interpreted as a statement about the Desargues graph (bipartite double cover of the Petersen graph), or equivalently about the Petersen graph itself. Another theorem involving the Petersen graph is this one, mentioned in an e-mail from John Conway:

Suppose we have 10 lines in Euclidean 3-space, corresponding to the vertices of the Petersen graph. To each edge, we assign the statement ‘the lines corresponding to the two endpoints intersect at right angles’. If fourteen of the statements are true, then so is the fifteenth (thereby constituting a rather impressive example of (15,14)-TFAE).

Anyway, I suppose I ought to explain what a bipartite double cover is. Basically, we replace each vertex of the Petersen graph with a red vertex and a blue vertex, joining two vertices in the resulting graph if the vertices are different colours and the parent vertices were connected by an edge. What results is the 30-edge 20-vertex Desargues graph:

desargues

This is a different from the dodecahedral graph, which can also be regarded as a double cover of the Petersen graph.

8. The Hoffman-Singleton theorem

The Petersen graph has diameter 2 and girth 5. In other words, the shortest cycle has length 5, and any two vertices are either adjacent or share a common vertex. This is equivalent to a set of constraints posed in an AMS question. Ignoring the empty graph, the only possible graphs are:

  • The 5-vertex 2-regular pentagon;
  • The 10-vertex 3-regular Petersen graph;
  • The 50-vertex 7-regular Hoffman-Singleton graph;
  • A particular 3250-vertex 57-regular graph whose existence and uniqueness are unknown.

Intriguingly, the Hoffman-Singleton theorem has reduced the problem of finding all graphs with this property to a finite case-bash, but where ‘finite’ is still prohibitively large for an exhaustive search. This is a counter-example to the common misconception that all finite problems are trivial.

9. Induced subgraphs

The Petersen graph appears as an induced subgraph in many larger interesting graphs. The aforementioned Hoffman-Singleton graph is one such example, as is the Clebsch graph (a 16-vertex 5-regular graph, such that the removal of any vertex and its five neighbours leaves the Petersen graph).

The Clebsch graph can be constructed by taking a tesseract and joining pairs of opposite vertices. It is interesting in that the edges of K16 can be partitioned into three copies of the Clebsch graph, resulting in a 3-colouring of K16 with no monochromatic triangle. This, combined with a trivial upper bound, establishes the Ramsey number R(3,3,3) = 17.

Naturally, the Petersen graph appears infinitely often as an induced subgraph of the Rado graph, as noted by Gabriel Gendler in his cp4space guest post.

10. Cycle-continuous maps

A map from (the edges of) a graph G to a graph H is described as cycle-continuous if the pre-image of any Eulerian subgraph is also Eulerian. This definition is similar to that of continuity in a general topological space, but with Eulerian subgraphs instead of open sets. Jaeger conjectured that any bridgeless graph can be mapped cycle-continuously into the Petersen graph, and demonstrated that (if true) this would imply three more open conjectures for free:

Conclusion

In summary, there are at least as many interesting properties of the Petersen graph as there are vertices. I shall end on a quotation in true oratory style, mentioning that Donald Knuth (inventor of LaTeX, used by mathematicians and prophylactic manufacturers alike!) described the Petersen graph as:

“a remarkable configuration that serves as a counterexample to many optimistic predictions about what might be true for graphs in general”

Posted in Uncategorized | 11 Comments

Exotic continued fractions

Given a real number x, computing its continued fraction can reveal a lot of information. In the simplest case, the continued fraction terminates if and only if the number is rational. One particular example of this is the rational \dfrac{355}{113}, the continued fraction of which is shown below:

rational-cf

(The square-bracketed expression [3; 7, 16] is just a convenient notation for the continued fraction expansion.)

Rational numbers are simply the roots of linear polynomials with integer coefficients. Hence, the next simplest case is when x is the root of a quadratic equation with integer coefficients. It transpires that this is equivalent to the continued fraction being eventually periodic, such as \sqrt{19} = [4; 2, 1, 3, 1, 2, 8, 2, 1, 3, 1, 2, 8, \dots], where the sequence (2, 1, 3, 1, 2, 8) is repeated indefinitely.

Roots of higher-degree irreducible polynomials, along with transcendental numbers, all have aperiodic continued fractions. Some of these are simple and predictable, such as e = [2; 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, 1, \dots], whereas others seem to exhibit no obvious patterns, such as π. However, this article is concerned with algebraic numbers rather than transcendental ones, so the next cases to explore are roots of cubics.

Brillhart’s cubic

In 1965, Brillhart noticed that the positive real root of the depressed cubic x³ − 8x − 10 has an ‘exotic’ continued fraction. This algebraic number is equivalently the dominant eigenvalue of the following 3 × 3 matrix:

matrix

What makes the continued fraction exotic is the presence of large terms early in the expansion:

[3; 3, 7, 4, 2, 30, 1, 8, 3, 1, 1, 1, 9, 2, 2, 1, 3, 22986, 2, 1, 32, 8, 2, 1, 8, 55, 1, 5, 2, 28, 1, 5, 1, 1501790, 1, 2, 1, 7, 6, 1, 1, 5, 2, 1, 6, 2, 2, 1, 2, 1, 1, 3, 1, 3, 1, 2, 4, 3, 1, 35657, 1, 17, 2, 15, 1, 1, 2, 1, 1, 5, 3, 2, 1, 1, 7, 2, 1, 7, 1, 3, 25, 49405, 1, 1, 3, 1, 1, 4, 1, 2, 15, 1, 2, 83, 1, 162, 2, 1, 1, 1, 2, 2, 1, 53460, 1, 6, 4, 3, 4, 13, 5, 15, 6, 1, 4, 1, 4, 1, 1, 2, 1, 16467250, 1, 3, 1, 7, 2, 6, 1, 95, 20, 1, 2, 1, 6, 1, 1, 8, 1, 48120, 1, 2, 17, 2, 1, 2, 1, 4, 2, 3, 1, 2, 23, 3, 2, 1, 1, 1, 2, 1, 27, 325927, 1, 60, 1, 87, 1, 2, 1, 5, 1, 1, 1, 2, 2, 2, 2, 2, 17, 4, 9, 9, 1, 7, 11, 1, 2, 9, 1, 14, 4, 6, 1, 22, 11, 1, 1, 1, 1, 4, 1, 3, 2, 1, 2, 1, 1, 2, 4, 2, 1, 5, 1, 8, 2, 2, 5, 1, 2, 1, 1, 1, 1, 1, 3, …]

To understand why that occurs, it helps to have an exact closed form for the root. Fortunately, that’s not too hard, since we know how to solve a depressed cubic. In particular, we equate x³ − 8x − 10 with the ‘difference of three cubes’ x³ + a³ + b³ − 3abx. Equating coefficients, we get:

  • a³ + b³ = −10
  • ab = 8/3

We can easily solve the resulting quadratic equation to obtain a and b. Then, the real root is simply x = −(a + b), which is given by:

Note that the \sqrt{163} is significant, as 163 is a Heegner number (the ring of integers in the quadratic field \mathbb{Q}[\sqrt{-163}] is a unique factorisation domain). The reason for the large partial quotients in the continued fraction expansion was explained by Stark, who incidentally was the first person to correctly prove Gauss’s conjecture that the Heegner numbers are precisely {1, 2, 3, 7, 11, 19, 43, 67, 163}.

Posted in Uncategorized | Leave a comment