Previous Up Next
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Lecture 4 The Extended Fillmore–Springer–Cnops Construction

Cycles—circles, parabolas and hyperbolas—are invariant families under their respective Möbius transformations. We will now proceed with a study of the invariant properties of cycles according to the Erlangen programme. A crucial step is to treat cycles not as subsets of ℝn but rather as points of some projective space of higher dimensionality, see [33]*Ch. 3 [59] [276]. It is common in algebra to represent quadratic forms by square matrices, thus circles and their Möbius transformations are naturally described by 2× 2 matrices, see [306]*Ch. 9 [300]*Ch. 1. Yet one important component is still missing in those works: the invariant cycle product. This enhancement was independently introduced by several researchers in hypercomplex context [65, 100], see also [163]. This chapter introduces a further extension: the signature of the cycle project can vary even for the same type of point space.

4.1 Invariance of Cycles

K-orbits, shown in Fig. 1.2, are K-invariant in a trivial way. Moreover, since actions of both A and N for any σ are extremely ‘shape-preserving’, see Exercise 12, we meet natural invariant objects of the Möbius map:

Definition 1 The common name cycle [339] is used to denote circles, parabolas with horizontal directrices and equilateral hyperbolas with vertical axes of symmetry (as well as straight lines as the limiting cases of any from above) in the respective EPH case.

It is known, from analytic geometry, that a cycle is defined by the equation

k(u2−σ v2)−2lu−2nv+m=0,  (1)

where σ = ι2 and k, l, n, m are real parameters, such that not all of them are equal to zero. Using hypercomplex numbers, we can write the same equation as, cf. [339]*Suppl. C(42a),

KwwLw+Lw+M=0, (2)

where w= uv, Kk, L=nl, Mm and conjugation is defined by w= u−ι v.

Exercise 2 Check that such cycles result in straight lines for certain k, l, n, m and, depending from the case, one of the following:
  1. In the elliptic case: circles with centre (l/k,n/k) and squared radius (l2+n2mk)/k2.
  2. In the parabolic case: parabolas with horizontal directrices and focus at (l/k, m/2nl2/2nk+n/2k).
  3. In the hyperbolic case: rectangular hyperbolas with centre (l/k,−n/k) and vertical axes of symmetry.
Hereafter, the words parabola and hyperbola always assume only the above described types. Straight lines are also called flat cycles.

All three EPH types of cycles enjoy many common properties, sometimes even beyond those which we normally expect. For example, the following definition is quite intelligible even when extended from the above elliptic and hyperbolic cases to the parabolic one:

Definition 3 σr-Centre of the σ-cycle (1) for any EPH case is the point (l/k, −σrn/k)∈ℝσ. Notions of e-centre, p-centre, h-centre are used according the adopted EPH notations.

Centres of straight lines are at infinity, see Subsection 8.1.

Remark 4 Here, we use a signature σr=−1, 0 or 1 of a number system which does not coincide with the signature σ of the space σ. We will also need a third signature σc to describe the geometry of cycles in Definition 11.
Exercise 5 Note that some quadruples (k,l,n,m) may correspond through the Equation (1) to the empty set on a certain point space σ. Give the full description of parameters (k,l,n,m) and σ which leads to the empty set. Hint: Use the coordinates of the cycle’s centre and completion to full squares to show that the empty set may appear only in the elliptic point space. Such circles are usually called imaginary.

The usefulness of Defn. 3, even in the parabolic case, will be justified, for example, by

Here is one more example of the natural appearance of concentric parabolas:

Exercise 6 Show that, in all EPH cases, the locus of points having a fixed power with respect to a given cycle C is a cycle concentric with C.

This property is classical for circles [71]*§ 2.3 and is also known for parabolas [339]*§ 10. However, for parabolas, Yaglom used the word ‘concentric’ in quotes, since he did not define the centres of a parabola explicitly.

The family of all cycles from Definition 1 is invariant under Möbius transformations (4) in all EPH cases, which was already stated in Theorem 2. If a cycle does not intersect the real axis then the proof of Theorem 2 has the only gap: a demonstration that we can always transform the cycle to a K-orbit.

Exercise 7 Let C be a cycle in σ with its centre on the vertical axis. Show that there is the unique scaling wa2w which maps C into a K-orbit. Hint: Check that a cycle in σ with its centre belonging to the vertical axis is completely defined by the point of its intersection with the vertical axis and its curvature at this point. Then find the value of a for a scaling such that the image of C will satisfy the relation (7).

To finish the proof of Theorem 2 we need to consider two more cases which are also related to the structure of the group SL2(ℝ):

Exercise 7(a)
  1. An arbitrary cycle may have only two, one or none common point(s) with the real axis. This number is invariant under Möbius transformations.
  2. If a cycle has two, one or none common point(s) with the real axis and is fixed by a Möbius transformation, then the map is generated by a matrix which is conjugated to an element of subgroup A, N or K. Hint: Use the Exercise 3.

If all the above details are taking into consideration the proof of Theorem 2 may not appear as simple as we may initially thought. However, it still highlights important connections between the invariance of cycles and the structure of the group SL2(ℝ). Furthermore, we fully describe how cycles are transformed by Möbius transformations in Theorem 13.

4.2 Projective Spaces of Cycles

Figure 1.3 suggests that we may obtain a unified treatment of cycles in all EPH cases by consideration of higher-dimension spaces. The standard mathematical method is to declare objects under investigation to be simply points of some larger space.

Example 8 In functional analysis, sequences or functions are considered as points (vectors) of certain linear spaces. Linear operations (addition and multiplication by a scalar) on vectors (that is, functions) are defined point-wise.

If an object is considered as a point (in a new space) all information about its inner structure is lost, of course. Therefore, this space should be equipped with an appropriate structure to hold information externally which previously described the inner properties of our objects. In other words, the inner structure of an object is now revealed through its relations to its peers1.

Example 9 Take the linear space of continuous real-valued functions on the interval [0,1] and introduce the inner product of two functions by the formula:
    ⟨ f,g  ⟩=
1
0
f(t) g(t) dt.
It allows us to define the norm of a function and the orthogonality of two functions. These are the building blocks of Hilbert space theory, which recovers much of Euclidean geometry in terms of the spaces of functions.

In the geometry, the similar ideas are known as representational geometry, see [101, 99] and references there for a discussion. We will utilise the this fundamental approach for cycles. A generic cycle from Definition 1 is the set of points (u,v)∈ℝσ defined for the respective values of σ by the equation

k(u2−σ v2)−2lu−2nv+m=0. (3)

This equation (and the corresponding cycle) is completely determined by a point (k, l, n, m)∈ ℝ4. However, this is not a one-to-one correspondence. For a scaling factor λ ≠ 0, the point (λ k, λ l, λ n, λ m) defines an equivalent equation to (3). Thus, we prefer to consider the projective space3, that is, ℝ4 factorised by the equivalence relation (k, l, n, m)∼ (λ k, λ l, λ n, λ m) for any real λ ≠ 0. A good introductory read on projective spaces is [296]*Ch. 10, see also Ch. 13 in [311]. The chapter in the later book is titled “Projective Geometry Is All Geometry”, which is a quote from Arthur Cayley and our consideration below is a further illustration to this claim.

Definition 10 We call 3 the cycle space and refer to the initial σ as the point space. The correspondence which associates a point of the cycle space to a cycle equation (3) is called map Q.
Exercise 10(a) Which cycles correspond to the quadruples (1,0,0,0), (0,1,0,0), (0,0,1,0) and (0,0,0,1)?
Hint: for the last cycle see Defn. 1 and the following Ex. 2.

We also note that the Equation (2) of a cycle can be written as a quadratic form

Kw1w1Lw1w2+Lw1w2+Mw2w2=0 (4)

in the homogeneous coordinates (w1,w2), such that w=w1/w2. Since quadratic forms are related to square matrices, see Section 4.4, we define another map on the cycle space as follows.

Definition 11 We arrange numbers (k, l, n, m) into the cycle matrix
Cσcs=


      lcsnm
klcsn


, (5)
with a new hypercomplex unit ιc and an additional real parameter s, which is usually equal to ± 1. If we omit it in the cycle notation Cσc, then the value s=1 is assumed.

The values of σc:=ιc2 are −1, 0 or 1, independent of the value of σ. The parameter s=±1 often (but not always) is equal to σ. Matrices which differ by a real non-zero factor are considered to be equivalent.

We denote by M such a map from 3 to the projective space of 2× 2 matrices.

The matrix (5) is the cornerstone of the (extended) Fillmore–Springer–Cnops construction (FSCc) [65, 100] and is closely related to the technique recently used by A.A. Kirillov to study the Apollonian gasket [163]. A hint for the composition of this matrix is provided by the following exercise.

Exercise 12 Let the space A2 be equipped with a product of symplectic type
    [w,w′]=w1w2w2w1,
where w(′)=(w1(′),w2(′))∈A2. Letting C=(
LM
KL
), check that the equation of the cycle (4) is given by the expression [w, Cw]=0 .

Identifications of both Q and M are straightforward. Indeed, a point (k, l, n, m)∈ℙ3 equally well represents (as soon as σ , σc and s are already fixed) both the Equation (3) and the line of matrix (5). Thus, for fixed σ , σc and s, one can introduce the correspondence between quadrics and matrices shown by the horizontal arrow on the following diagram:

3 <−>[dl]Q <−>[dr]M    Quadrics on ℝσ  <−>[rr](.55)Q∘ MM2(A) (6)

which combines Q and M.

4.3 Covariance of FSCc

At first glance, the horizontal arrow in (6) seems to be of little practical interest since it depends on too many different parameters (σ, σc and s). However, the following result demonstrates that it is compatible with simple calculations of cycles’ images under the Möbius transformations.

Theorem 13The image S σcs of a cycle Cσcs under transformation (4) in σ with gSL2(ℝ) is given by similarity of the matrix (5):
S σcs= gCσcsg−1. (7)
In other word, FSCc (5) intertwines Möbius action (4) on cycles with linear map (7). Explicitly, it means:


      l′+ιcsnm
kl′+ιcsn


= 


      ab
cd




    lcsnm
klcsn




    db
ca


. (8)

Proof. There are several ways to prove (7), but for now we present a brute-­force calculation, which can fortunately be performed by a CAS [191]. See Appendix C for information on how to install (Section C.2) and start (Section C.3) the software, which will be sufficient for this proof. Further use of the software would be helped by learning which methods are available from the library (Section C.4) and which predefined objects exist upon initialisation (Section C.5). Assuming the above basics are known we proceed as follows.

First, we build a cycle passing a given point P=[u, v]. For this, a generic cycle C with parameters (k,l,n,m) is bounded by the corresponding condition:

In [2]: C2=C.subject_to(C.passing(P))

Then, we build the conjugated cycle with a generic g= (

    ab
cd

)∈SL2(ℝ) and a hypercomplex unit es and parameter s=±1:

In [3]: C3=C2.sl2_similarity(a, b, c, d, es, matrix([[1,0],[0,s]])

We also find the image of P under the Möbius transformation with the same element of gSL2(ℝ) but a different hypercomplex unit e:

In [4]: P1=clifford_moebius_map(sl2_clifford(a, b, c, d, e), P, e)

Finally, we check that the conjugated cycle C3 passes the Möbius transform P1. A simplification based on the determinant value 1 and s=±1 will be helpful:

In [5]: print  C3.val(P1).subs([a==(1+b*c)/d,pow(s,2)==1]) \
                .normal().is_zero() 
Out[5]: True

Thus, we have confirmation that the theorem is true in the stated generality. One may wish that every mathematical calculation can be done as simply.


Remark 14 There is a bit of cheating in the above proof. In fact, the library does not use the hypercomplex form (5) of FSCc matrices. Instead, it operates with non-commuting Clifford algebras, which makes it usable at any dimension—see Section B.5 and [191, 186].

The above proof cannot satisfy everyone’s aesthetic feeling. For this reason, an alternative route based on orthogonality of cycles [65] will be given later—see Exercise 7.

It is worth noticing that the image S σcs under similarity (7) is independent of the values of s and σc. This, in particular, follows from the following exercise.

Exercise 15Check that the image (k′,l′,n′,m′) of the cycle (k,l,n,m) under similarity with g= (
    ab
cd
)∈SL2(ℝ) is
    (k′,l′,n′,m′)= (kd2+2 lc  d+mc2, kbd+l( bc+ad)+mac, n, kb2+2 la  b+ma2).
This can also be presented through matrix multiplication:
    




    k
l
n
m




=




    d22  c  d0 c2
    bdbc+ad0 ac
    001 0
    b22 a  b0 a2








      k
l
n
m




.

Now we have an efficient tool for investigating the properties of some notable cycles, which have appeared before.

Exercise 16Use the similarity formula (7) for the following:
  1. Show that the real axis v=0 is represented by the line passing through (0,0,1,0) and a matrix (
          sιc0
          0sιc
    ). For any (
          ab
          cd
    )∈ SL2(ℝ), we have
          


            ab
            cd




            sιc0
          0sιc




          db
          −ca


    =


          sιc0
          0sιc


    ,
    i.e. the real line is SL2(ℝ)-invariant.
  2. Write matrices which describe cycles represented by A, N and K-orbits shown in Figs. 1.1 and 1.2. Verify that matrices representing these cycles are invariant under the similarity with elements of the respective subgroups A, N and K.
  3. Show that cycles (1,0,n,σ), which are orbits of isotropy groups as described in Exercise 2, are invariant under the respective matrix similarity for the respective values of σ=ι2 and any real n.
  4. Find the cycles which are transverse to orbits of the isotropy subgroups, i.e. are obtained from the vertical axis by the corresponding actions.

These easy examples also show that the software is working as expected.

4.4 Origins of FSCc

The Fillmore–Springer–Cnops construction, in its generalised form, will play a central rôle in our subsequent investigation. Thus, it is worth looking at its roots and its origins before we begin using it. So far, it has appeared from the thin air, but can we intentionally invent it? Are there further useful generalisations of FSCc? All these are important questions and we will make an attempt to approach them here.

As is implied by its name, FSCc was developed in stages. Moreover, it appeared independently in a different form in the recent work of Kirillov [163]. This indicates the naturalness and objectivity of the construction. We are interested, for now, in the flow of ideas rather than exact history or proper credits. For the latter, the reader may consult the original works [100] [64] [65] [163] [289]*Ch. 18 as well as references therein. Here, we treat the simplest two-dimensional case. In higher dimensions, non-commutative Clifford algebras are helpful with some specific adjustments.

4.4.1 Projective Coordiantes and Polynomials

An important old observation is that Möbius maps appear from linear transformations of homogeneous (projective) coordinates, see [269]*Ch. 1 for this in a context of invariant theory. This leads to the FSCc in several steps:

  1. Take a real projective space1 as a quotient of ℝ2 by the equivalence relation (x,y)∼ (tx,ty) for t≠ 0. Then, any line with y≠ 0 can be represented by (x/y,1). Thus, the invertible linear transformation


          ab
    cd




          x
    y


    =


          ax+by
    cx+dy


    ,   g=  


          ab
    cd


    SL2(ℝ),   


          x
    y


    ∈ℝ2 (9)
    will become the Möbius transformation (3) on the representatives (
        u
    1
    ). Similarly, we can consider Möbius actions on complex numbers w=u+i v.
  2. The next observation [64] is that, if we replace the vector (
        w
    1
    ), w∈ℂ by a 2× 2 matrix (
    ww
    11
    ), then the matrix multiplication with gSL2(ℝ) will transform it as two separate copies of the vector in (9). The “twisted square” of this matrix is
    Z= 


          www
    1w


    =
    1
    2


          ww
    11




          1w
    1w


    . (10)
    Then, the linear action (9) on vectors is equivalent to the similarity g Z g−1 for the respective matrix Z from (10).
  3. Finally, one can link matrices Z in (10) with zero-radius circles, see Exercise 1, which are also in one-to-one correspondences with their centres. Then, the above similarity Zg Z g−1 can be generalised to the action (7).

Another route was used in a later book of Cnops [65]: a predefined geometry of spheres, specifically their orthogonality, was encoded in the respective matrices of the type (5). Similar connections between geometry of cycles and matrices lead Kirillov, see [163]*§ 6.3 and the end of this section. He arrived at an identification of disks with Hermitian matrices (which is similar to FSCc) through the geometry of Minkowski space-time and the intertwining property of the actions of SL2(ℂ).

There is one more derivation of FSCc based on projective coordinates. We can observe that the homogeneous form (4) of cycle’s equation (2) can be written using matrices as follows:

Kw1w1Lw1w2+Lw1w2+Mw2w2=

    −w2w1



    LM
    KL




    w1
w2


=0. (11)

Then, the linear action (9) on vectors (

  w1
w2

) corresponds to conjugated action on 2× 2 matrices (

    LM
    KL

) by the intertwining identity:


    −w2w1



    LM
    KL




    w1
w2


=

    −w2w1



    LM
    KL




    w1
w2


, (12)

where the respective actions of SL2(ℝ) on vectors and matrices are

  


    w1
w2


=


      ab
cd




    w1
w2


,  and 


    LM
    KL


=


      ab
cd


−1


 


      LM
      KL




      ab
cd


.

In other words, we obtained the usual FSCc with the intertwining property of the type (7). However, the generalised form FSCc does not result from this consideration yet.

Alternatively, we can represent the Equation (4) as:

  Kw1w1Lw1w2+Lw1w2+Mw2w2=

    w1w2



    KL
    −LM




    w1
w2


=0.

Then, the intertwining relation similar to (12) holds if matrix similarity is replaced by the matrix congruence:

  


    KL
    −LM


=


      ab
cd


T


 


      KL
      −LM




      ab
cd


.

This identity provides a background to the Kirillov correspondence between circles and matrices, see [163]*§ 6.3 and [306]*§ 9.2. Clearly, it is essentially equivalent to FSCc and either of them may be depending on a convenience. It does not hint at the generalised form of FSCc either.

We will mainly work with the FSCc, occasionally stating equivalent forms of our results for the Kirillov correspondence.

4.4.2 Co-Adjoint Representation

In the above construction, the identity (11) requires the same imaginary unit to be used in the quadratic form (the left-hand side) and the FSCc matrix (the right-hand side). How can we arrive at the generalised FSCc directly without an intermediate step of the standard FSCc with the same type of EPH geometry used in the point cycle spaces? We will consider a route originating from the representation theory.

Any group G acts on itself by the conjugation g: xg x g−1, see 1. This map obviously fixes the group identity e. For a Lie group G, the tangent space at e can be identified with its Lie algebra g, see Subsection 2.3.1. Then, the derived map for the conjugation at e will be a linear map gg. This is an adjoint representation of a Lie group G on its Lie algebra. This is the departure point for Kirillov’s orbit method, which is closely connected to induced representations, see [159, 161].

Example 17 In the case of G=SL2(ℝ), the group operation is the multiplication of 2× 2 matrices. The Lie algebra g can be identified with the set of traceless matrices, which we can write in the form (
    lm
kl
) for k, l, m∈ℝ. Then, the co-adjoint action of SL2(ℝ) on sl2 is:


      ab
cd


: 


    lm
kl




    ab
cd


−1


 


    lm
kl




    ab
cd


. (13)

We can also note that the above transformation fixes matrices (

  n0
0n

) which are scalar multiples of the identity matrix. Thus, we can consider the conjugated action (13) of SL2(ℝ) on the pairs of matrices, or intervals in the matrix space of the type

  



      lm
kl


, 


      n0
0n




,  or, using hypercomplex numbers,


    lcnm
klcn


.

In other words, we obtained the generalised FSCc, cf. (5), and the respective action of SL2(ℝ). Furthermore, a connection of the above pairs of matrices with the cycles on a plane can be realised through the Poincaré extension, see Ch. 14.

Remark 18 Note that scalar multiplies of the identity matrix, which are invariant under similarity, correspond in FSCc to the real line, which is also Möbius invariant.

4.5 Projective Cross Ratio

An important invariant of Möbius transformations in complex numbers is the cross ratio of four distinct points, see [26]*§ 13.4:

[z1 , z2 , z3 , z4 ] =
(z1 − z3 )(z2 − z4 )
(z1 − z2 )(z3 − z4)
. (14)

Zero divisors make it difficult to handle a similar fraction in dual and double numbers. Projective coordinates are again helpful in this situation—see [52]. Define the symplectic form, cf. [11]*§ 41, of two vectors by

ω(z,z′)=xy′−xy,    where z=(x,y), z′=(x′,y′) ∈ ℝσ2.  (15)

The one-dimensional projective space ℙ1(ℝσ) is the quotient of ℝσ2 with respect to the equivalence relation: zz′ if their symplectic form vanishes. We also say [52] that two points in ℙ1(ℝσ) are essentially distinct if there exist their representatives z, z′∈ℝσ2 such that ω(z,z′) is not a zero divisor.

Definition 19[52] The projective cross ratio of four essentially distinct points zi∈ ℙ1(ℝσ) represented by (xi, yi)∈ℝσ2, i=1,…,4, respectively, is defined by:
[z1 , z2 , z3 , z4 ]=


      (x1y3 − x3y1 )(x2y4 − x4y2 )
      (x1y2 − x2y1 )(x3y4 − x4y3 )


∈  ℙ1(ℝσ). (16)
Exercise 20 Check that:
  1. Let the map S: ℝσ → ℙ1(ℝσ) be defined by z↦ (z,1). Then, it intertwines the Möbius transformations on σ with linear maps (9).
  2. The map S intertwines cross ratios in the following sense:
    S([z1,z2,z3,z4])=[S(z1),S(z2),S(z3),S(z4)], (17)
    where zi∈ℝe, and the left-hand side contains the classic cross ratio (14) while the right-hand side the projective (16).
  3. The symplectic form (15) on σ2 is invariant if vectors are multiplied by a 2× 2 real matrix with the unit determinant.
  4. The projective cross ratio (16) is invariant if points in σ2 are multiplied by a matrix from SL2(ℝ). Then, the classic cross ratio (14) is invariant under the Möbius transformations, cf. (17).
Exercise 21[52] Check further properties of the projective cross ratio.
  1. Find transformations of [z1,z2,z3,z4] under all permutations of points.
  2. Demonstrate the cancellation formula for cross ratio:
    [z1 , z2 , z3 , z4 ][z1 , z3 , z5 , z4 ] = [z1 , z2 , z5 , z4 ], (18)
    where, in the left-hand side, values in σ2 are multiplied component-wise. Such a multiplication is commutative but not associative on 1(ℝσ).

We say that a collection of points of ℙ1(ℝσ) is concyclic if all their representatives in ℝσ2 satisfy to Equation (11) for some FSCc matrix (

  LM
  KL

) .

Exercise 22[52] Show that:
  1. Any collection of points in σ belonging to some cycle is mapped by S from Exercise 1 to concyclic points in 1(ℝσ).
  2. For any three essentially distinct points z1, z2 and z3∈ ℙ1(ℝσ) there is a fixed 2× 2 matrix A such that [z1,z2,z,z3]=Az for any z∈ ℙ1(ℝσ). Moreover, the matrix A has a determinant which is not a zero divisor.
  3. Any four essentially distinct points in 1(ℝσ) are concyclic if and only if their projective cross ratio is S(r) for some real number r. Hint: Let [z1,z2,z,z3] correspond through S to a real number. We know that [z1,z2,z,z3]=Az for an invertible A, then Āz=Az or z−1Az. Multiply both sides of the last identity by row vector (−ȳ,x), where z= (
            x
    y
    ). The final step is to verify that Ā−1A has the FSCc structure, cf. [52]*§ 4.

We have seen another way to obtain FSCc—from the projective cross ratio.


1
The same is true for human beings.
site search by freefind advanced

Last modified: October 28, 2024.
Previous Up Next