Plücker coordinates

From Xenharmonic Wiki
Jump to navigation Jump to search
This is an expert page. It is written to allow experienced readers to learn more about the advanced elements of the topic.
Schematic illustration of the Plücker embedding. Linear subspaces of [math]\displaystyle{ \mathbb{R}^n }[/math] (here lines) get mapped to points on a quadric surface in projective space.
English Wikipedia has an article on:

In exterior algebra applied to regular temperament theory, Plücker coordinates (also known as the wedgie) are a way to assign coordinates to abstract temperaments, by viewing them as elements of some projective space.

The usual way to write down an abstract temperament is via its mapping matrix, but Plücker coordinates give us a unique description that is useful for some calculations. The definition here is given in terms of temperament matrices, but by duality, we can also embed interval spaces in the same way. More specifically, the interval subspace spanned by the commas of some temperament can also be used to give unique coordinates to that temperament. These two representations are related via the Hodge dual.

How to read a wedgie

Following intuitions from ploidacot, one way to characterize a temperament is how many generators it splits the perfect fifth (3/2) into. For an example, meantone doesn't split it at all, so we say it is monocot. We also say it is haploid, since it doesn't split the octave (2/1) at all.

A wedgie is essentially a way to generalize ploidacot information to all possible combinations of primes within a temperament, and format that information in a concise manner; as it turns out, this is enough to uniquely characterize the temperament.

For example, take the wedgie for meantone: ⟨⟨1 4 4]]. Each entry corresponds to a pair of primes: 2.3, 2.5, and 3.5. The first entry of the wedgie is the ploidacot signatures multiplied together, which in this case is 1, telling us that the octave finds 2 at one step, and the fifth finds 3 at one generator minus one octave (which, since there are no even splits, still counts as 1).

For 2.5, the procedure generalizes, with the entry, 4, being the number of steps 2 and 5 are divided into respectively multiplied together. But since we already know 2 is divided into only one octave, this must mean 5 is split into four parts. In fact, 5 is found at four fifths up.

For the final entry, which is for the 3.5 subgroup, we have another 4. But this time, we're thinking tritave-equivalently now, so we'll be reaching 5/3. This is the number of parts 3 and 5/3 are divided into respectively, multiplied together. 3 is reached by going up one 3/2 and one 2/1, but no splitting is happening, so the factor of 4 must come from 5/3, which is indeed reached by four 3/2s.

For another example, take father, which has the wedgie ⟨⟨1 -1 -4]].

Here, we again have a 1 as our entry for 2.3, meaning that the temperament is haploid monocot, or in other words that 2/1 is unsplit and 3/2 is one generator.

Thus, going into our second entry, -1 for 2.5. Again, since we already know our temperament is haploid, the negative sign must come from the mapping for 5. But what could a negative sign possibly mean in a wedgie? Well, to reach the 5th harmonic (specifically, 5/4) in father, you go up a perfect fourth. Since we're thinking octave-equivalently, this can be seen as going down a perfect fifth.

Finally, for 3.5, we have the entry -4. Again, we're tritave-equivalent and 3/1 is simply found by an octave and a fifth, so we'll be finding 5/3 by splitting it into four parts. 5/3 is equated to 16/9 in father, which is found by going up two octaves and down two fifths. This might seem like only a split into two, but keep in mind - we're in tritave-equivalent territory. Octaves are the tritave complement of fifths. So instead of going up two octaves, we can instead simply go down two more fifths to reach 5/3. And there we have it - 5/3 is split into four parts, which each contain a negative generator.

For our final example, we will consider blackwood ⟨⟨0 5 8]].

Blackwood's first entry is 0, which means that it reduces 2.3 to a rank-1 structure. This can be seen as 3 being found 0 generators from some ploid (since 3/2 in blackwood is 3\5), since 0 times anything is 0.

The next entry, 5, is simple: in 2.5, 5 (in this case, 5/4) is found by going up one generator, but remember that each entry is where the two primes are found multiplied together. Since 2 is found at 5 ploids, the entry is 1 * 5 = 5. (Technically, there's a hidden 5 in the 2.3 entry that gets multiplied by 0 and vanishes.)

And then the final entry, for 3.5, is 8. Again, 5 (i.e. 5/3) is found at one generator, but 3/1 is split into 8 parts by blackwood temperament. So, 1 * 8 = 8.

For wedgies of temperaments of larger prime subgroups, the number of entries is increased, so a rank-2 temperament of 7-limit JI would have 6 entries, for 2.3, 2.5, 2.7, 3.5, 3.7, and 5.7. Note that the new septimal entries are inserted between the entries for the 5-limit! If it helps, think of arranging all the entries in a grid, where rows represent the first prime, and columns represent the second, and reading them off one by one.

For wedgies of higher-rank temperaments, the number of primes per entry is increased, so that for a rank-3 temperament of the 7-limit, all possible combinations of 3 primes (2.3.5, 2.3.7, 2.5.7, and 3.5.7) would be covered.

Definition

A temperament can be viewed as a point in what is called a Grassmannian variety, written as [math]\displaystyle{ \mathrm{Gr} (k, n) }[/math]. This variety contains all possible k-dimensional subspaces of [math]\displaystyle{ \mathbb{R}^n }[/math]. In musical terms, k represents the rank of the temperament (how many independent generators it has), and n is the number of primes we're considering in our just intonation subgroup.

Let [math]\displaystyle{ M }[/math] be an element of [math]\displaystyle{ \mathrm{Gr} (k, n) }[/math], spanned by basis vectors [math]\displaystyle{ m_1, \ldots, m_k }[/math]. These basis vectors are the rows of the temperament mapping matrix.

The Plücker map takes a temperament and embeds it into a projective space by taking the wedge product of the basis vectors:

[math]\displaystyle{ \begin{align} \iota: \mathrm{Gr} (k, n) & \to \mathbf{P}\left(\Lambda^{k} \, \mathbb{R}^n \right) \\ \operatorname {span} (m_1, \ldots, m_k) & \mapsto \left[ m_1 \wedge \ldots \wedge m_k \right] \, . \end{align} }[/math]

Here, [math]\displaystyle{ \Lambda^{k} \, \mathbb{R}^n }[/math] is the k-th exterior power (the subspace containing all k-vectors). This construction is independent of the basis we choose. While the original space of temperaments has dimension [math]\displaystyle{ k(n-k) }[/math], the space of Plücker coordinates is typically larger, with dimension [math]\displaystyle{ \binom{n}{k} - 1 }[/math].

Examples

The space of lines through the origin is exactly projective space, so [math]\displaystyle{ \mathrm{Gr} (1, n) \cong \mathbf{P} (\mathbb{R}^n) }[/math]. In 3 dimensions, a plane through the origin is completely defined by its normal, so we get that [math]\displaystyle{ \mathrm{Gr} (2, 3) \cong \mathrm{Gr} (1, 3) \cong \mathbf{P} (\mathbb{R}^3) }[/math], the projective plane.

The simplest non-trivial case is [math]\displaystyle{ \mathrm{Gr} (2, 4) }[/math]. An element [math]\displaystyle{ M }[/math] spanned by two lines [math]\displaystyle{ x, y }[/math], can be represented as the matrix

[math]\displaystyle{ \begin{equation} \begin{bmatrix} x_{1} & x_{2} & x_{3} & x_{4} \\ y_{1} & y_{2} & y_{3} & y_{4} \end{bmatrix} \, . \end{equation} }[/math]

These are not 'proper' coordinates, as doing row operations on this matrix preserves the row-span.

The projective coordinates can be calculated by taking the determinants of all [math]\displaystyle{ 2 \times 2 }[/math] sub-matrices

[math]\displaystyle{ p_{ij} = \begin{vmatrix} x_i & x_j \\ y_i & y_j \end{vmatrix} \, , }[/math]

which finally gives us

[math]\displaystyle{ \begin{equation} \iota (M) = \left[ x \wedge y \right] = \left[ p_{12} : p_{13} : p_{14} : p_{23} : p_{24} : p_{34} \right] \, . \end{equation} }[/math]

Note the use of colons to signify that these coordinates are homogeneous.

Plücker relations

The coordinates must satisfy some algebraic relations called Plücker relations. Generally, the projective space is much 'larger' than the Grassmannian, and the image in the projective space is some quadric surface.

For the example above on [math]\displaystyle{ \mathrm{Gr} (2, 4) }[/math], the Plücker relation is

[math]\displaystyle{ p_{12} p_{34} - p_{13} p_{24} + p_{14} p_{23} = 0 \, . }[/math]

Note that in this case, there is only one such relation, but in higher dimensions there will be many.

Rational points

A rational point [math]\displaystyle{ P }[/math] on [math]\displaystyle{ \mathrm{Gr}(k, n) }[/math] is a k-dimensional subspace such that [math]\displaystyle{ \mathcal{L} = P \cap \mathbb{Z}^n }[/math] is a rank k sublattice of [math]\displaystyle{ \mathbb{Z}^n }[/math]. Abstract temperaments correspond exactly to these rational points, although most have no practical musical use.

The same relations as above can be derived, where we represent P as integer matrix [math]\displaystyle{ M \in \mathbb{Z} ^ {k \times n} }[/math], whose rows span [math]\displaystyle{ \mathcal{L} }[/math]. The projective coordinates similarly have integer entries. Because the Plücker coordinates are homogeneous, we can always put them in a canonical form by dividing all entries by their greatest common divisor (GCD) and ensuring the first element is non-negative.

An advantage of studying rational points is that we do not have to worry about torsion. The quotient group [math]\displaystyle{ \mathbb{Z}^n / \mathcal{L} }[/math] is a finitely generated abelian group. When the Plücker coordinates are normalized (GCD = 1), we ensure that [math]\displaystyle{ \mathbb{Z}^n / \mathcal{L} \cong \mathbb{Z}^{n-k}, }[/math] which is torsion-free.

Height

A height function is a way to measure the 'arithmetic complexity' of a rational point. For example, the rational numbers [math]\displaystyle{ \frac{3}{2} }[/math] and [math]\displaystyle{ \frac{3001}{2001} }[/math] are close to eachother, but intuitively the second is much more complicated.

We can define the height of a rational point simply as the Euclidean norm on its Plücker coordinates [math]\displaystyle{ X = \iota (P) }[/math].

[math]\displaystyle{ H(P) = \left\| X \right\| = \left\| m_1 \wedge \ldots \wedge m_n \right\| \\ }[/math]

In terms of the lattice defined by P, this definition is equivalent to the volume of the fundamental domain, also known as the lattice determinant. It is easy to show that this does not depend on the basis we choose.

The height can be easily computed using the Gram matrix:

[math]\displaystyle{ \begin{align} \mathrm{G}_{ij} &= \left\langle m_i, m_j \right\rangle \\ \sqrt{\det(\mathrm{G})} &= \left\| m_1 \wedge \ldots \wedge m_n \right\| = \left\| X \right\| \, . \end{align} }[/math]

In regular temperament theory, this height is usually known as simply the complexity.

Projective distance

Given a temperament, we want to have some notion of distance, so that we can measure how well the temperament approximates JI. Since we are talking about linear subspaces (which all intersect at the origin), the only thing that is sensible to measure is the angle between them.

In Euclidean space, one usually takes advantage of the dot product to measure angles. Given vectors [math]\displaystyle{ a, b \in \mathbb{R^n} }[/math], we famously have

[math]\displaystyle{ \frac{a \cdot b}{\left\| a \right\| \left\| b \right\| } = \cos (\theta) \, . }[/math]

In projective space, there is an analogous formula, using the wedge product instead. Given some real point [math]\displaystyle{ j \in \mathbb{R^n} }[/math] with homogeneous coordinates [math]\displaystyle{ y }[/math], and a linear subspace [math]\displaystyle{ P \in \mathrm{Gr} (k, n) }[/math] with Plücker coordinates [math]\displaystyle{ X }[/math], we define the projective distance as

[math]\displaystyle{ d(P, j) = \frac{ \left\| X \wedge y \right\| }{\left\| X \right\| \left\| y \right\| } = \sin (\theta) \, . }[/math]

Where we can take [math]\displaystyle{ j }[/math] to be the usual n-limit vector of log primes, so that [math]\displaystyle{ y = \left[ 1 : \log_2 (3) : \ldots : \log_2 (p_n) \right] }[/math]. Unlike the dot product formula, this works for subspaces of any dimension.

Since for any decent temperament this angle will be extremely small, we can take [math]\displaystyle{ \sin (\theta) \approx \theta }[/math].

See also