User:Grady/Harmonic similarity

This page serves to document a personal theory of mine that attempts to serve as a generalization of octave equivalence, as well as the various implications of the theory. I've seen other people express similar ideas, but I'm not sure if the concept in this exact form has been articulated before. If anyone knows if it has, I'd love to know more!

If you want to leave feedback about anything on this page, you can leave it on the Discussion page (see the tabs at the top) or contact me on Discord! Feel free to let me know whether you think these ideas hold up to scrutiny, or whether your own experience agrees or disagrees with what I've written here.

Quick aside: Naming

I wasn't exactly sure what term to use for this concept, but I settled on harmonic similarity for now, at least for the purpose of deciding what to name this page. Some other terms I've considered are harmonic relatedness and harmonic affinity. I'm sure all of these terms have been used before, and some of them have certainly been used to express a similar concept as I'm putting forth here. If you have any opinions on what the most suitable name would be, I'd be glad to hear them!

Motivation

I developed this theory to attempt to answer the question, "Why are two notes an octave apart perceived as equivalent?" The typical explanation given is that it has something to do with the fact that the upper note is prominently featured in the harmonic spectrum of the lower note, assuming a typical harmonic timbre. However, this completely fails to explain why the octave is special in this regard, and why a similar phenomenon seemingly never occurs with other harmonics (even if less commonly, under more specific circumstances, or in a less pronounced manner), most namely the tritave.

Some people have claimed to hear tritave equivalence, but so far no one has purported to hear notes any number of tritaves apart as being within the same pitch class, something many musicians simply take for granted with regard to octaves, and in my opinion, a necessary requisite for true equivalence perception.

Overview

Harmonic similarity is a measure of how similar in harmonic function two pitches are, or in other words, how closely related the two pitches are to each other. This is in contrast to consonance or concordance, which is generally defined as a measure of how harmonious or stable two or more pitches sound when played together, or perhaps less commonly a measure of how much timbral fusion occurs when playing them together.

The concept of harmonic similarity is a generalization of octave equivalence: two notes that are some number of octaves apart will be considered highly harmonically similar, but there are other pairs of notes that are harmonically similar to a lesser degree, such as two notes a tritave apart.

Often, harmonic similarity between two notes correlates very strongly with perceived consonance: for example, two notes an octave apart are both extremely consonant with each other and extremely similar to each other. However, this is not always the case. For instance, two notes a major second apart are more harmonically similar to each other than two notes a major third apart, but typically less consonant when played simultaneously. I'll be elaborating more on what I mean by this, but it aligns with how Western music theory would approach the question: two notes a major second apart are more closely related via the circle of fifths.

Definition

Pseudo-transitivity

It should be intuitive that any equivalence relation follows the transitive property. For example, if x = y and y = z, then x = z. Or in geometry, if A is congruent to B and B is congruent to C, then A is congruent to C. For a more practical example, if Alice's car is the same model as Bob's car, and Bob's car is the same model as Charlie's car, then Alice and Charlie also have the same model of car.

An extension of this is that any similarity or relatedness relation sort of follows a pseudo-transitive property. For example, if x ≈ y and y ≈ z, then you can probably say that x ≈ z, but the similarity in quantity between x and z might be less strong than that between x and y, or y and z. To extend the car example, if Alice's car is a similar color to Bob's car, and Bob's car is a similar color to Charlie's car, then Alice's car is probably a similar color to Charlie's car as well, but it depends on what your threshold is for defining two colors as "similar".

(Note that despite the choice of word, the concept of similarity in geometry is actually an equivalence relation by this definition, not a similarity relation: it means both objects have the exact same shape, not that they're close in shape.)

It follows from this that if we want to define a notion of two auditory pitches being similar or related in their harmonic function, said notion should also follow this pseudo-transitive property as well. This is the core principle that differentiates the notion of harmonic similarity from that of consonance, since it doesn't necessarily follow that if note X and note Y are consonant together, and note Y and note Z are consonant together, that notes X and Z ought to be consonant together to some degree as well.

Basic similarity relations

Of course, the knowledge that harmonic similarity should follow this pseudo-transitive property isn't very helpful in determining which notes are harmonically similar without establishing some basic similarity relations first. Essentially, two pitches have a basic similarity relation if the frequency of one is a low integer multiple of the other, and said relation is stronger the lower the integer multiple is. The reason for this might be that we're highly accustomed to hearing harmonic timbres, so usually when we hear a note, we also somewhat prominently hear low integer multiples of that frequency as well, and over time this causes our brains to associate those higher harmonics as "similar" to the fundamental. This would also explain why lower integer multiples correspond to stronger basic similarity relations, because the lower the integer multiple is, the more prominent the corresponding overtone typically is in those harmonic timbres.

With this knowledge, we can use the pseudo-transitive property to discover higher-order similarity relations. For example, two notes a perfect fifth apart are perceived as harmonically similar, but not because the upper note is present in the harmonics of the lower note (which it isn't, assuming a typical harmonic spectrum). Rather, they're similar because two notes an octave apart are similar for that reason, and so are two notes a tritave apart, and thus by the pseudo-transitive property, two notes a perfect fifth apart must be similar as well. For example, the note C4 is related to G4 (the note a perfect fifth above it) because both of those notes are related to G5 (or to C3) by a basic similarity relation.

Something to note about the basic similarity relations is that the falloff in similarity with increasing integer values seems to be extremely fast. It's not exactly clear why this might be the case, but it's one of the core underlying assumptions of the theory. For example, the octave is a much, much stronger similarity relation than the tritave to most listeners, hence the notion of octave equivalence.

A seeming caveat of all this is that most listeners would rate the double octave to be a stronger similarity relation than the tritave, despite being a higher integer multiple. However, this is only because the double octave can be decomposed into two octaves, meaning this strong similarity can be explained with the pseudo-transitive property. In other words, the note C4 is very strongly related to the note C5, and C5 is very strongly related to C6, therefore C4 and C6 are also very strongly related. However, C4 is only moderately related to G5 (the note a tritave above it), and the tritave can't be decomposed into any simpler relations.

Margin for error

Since our ears are imperfect (and perhaps even because the overtones we hear that may have trained our mental map of harmonic similarity aren't perfect integer harmonics either), it makes sense to assign some margin for error to the notion of harmonic similarity by adding the assertion that two notes are harmonically similar if they're very close in pitch. This allows us to treat two notes that are an interval such as a perfect fifth apart in a tempered system like 12edo to be harmonically similar, even if the ratios are inexact.

Lattice visualization

A very good way to visualize a map of harmonically similar pitches is using a 3-limit lattice, with octaves on one axis and tritaves on the other. In order to demonstrate that octaves are a significantly stronger harmonic similarity relation than tritaves, one might consider spacing the notes much farther apart on the tritave axis than the octave axis. (No diagram yet, but that might be something I can add to this page later!)

This essentially ignores the effects of any basic similarity relations beyond the third harmonic, which I believe is reasonable due to the extremely fast falloff mentioned earlier.

String analogy

There's a very useful analogy I came up with to help understand the mechanism of determining how harmonically similar two pitches are. Imagine you have two strings such that the ratio between their lengths equals the ratio between the two frequencies you want to compare. (This is especially on the nose since affixing the ends of those two strings and plucking them would produce the musical interval in question.) In order to determine how harmonically similar the two notes are, ask yourself how easy it is to fold both strings to be the same length.

For example, if one string is four times the length of the other, simply fold the longer string in half twice to make it the same length as the shorter string. This is extremely easy to do, since folding something in half is extremely easy. Therefore, two pitches related by a factor of four are highly harmonically similar.

If one string is 3/2 the length of the other, this requires folding the shorter string in half and folding the longer string into thirds. This is arguably more challenging, as folding something into thirds requires a fair bit more futzing than folding it in half. However, it's still massively easier than folding something into fifths or sevenths, so it isn't a huge challenge to fold these two strings to be the same length. Thus, two pitches a perfect fifth apart are also fairly harmonically similar.

This analogy may even be deeper than it appears, as it may in fact provide some insight into what the brain is doing when comparing two pitches. It may be very easy for the brain to subconsciously slide pitches around by octaves for a similar reason that it's easy to fold something in half repeatedly, and the same logic may extend to other harmonically similar pairs of pitches.

Explanation of octave equivalence

Most musicians think of two notes any number of octaves apart as being equivalent to some extent; that is, being within the same pitch class. This obviously doesn't mean musicians are entirely unable to distinguish between notes that are some number of octaves apart, but it does mean that it's viewed as a type of equivalence relation, not just a similarity relation. That is, there is some property, namely pitch class, which is considered to be entirely invariant under transposition by octaves. This seems in direct contradiction to the theory of harmonic similarity, which posits that two notes separated by one or more octaves merely share a similarity relation, not an equivalence relation. I have two different hypotheses that attempt to explain this, which I'll detail below.

Limited hearing range hypothesis

We humans have a finite hearing range of only around 10 octaves, and only around six or seven octaves of that range is actually musically useful. Because of this, it may be impossible for any number of stacked octaves that fits within our musical hearing range to have a lower similarity than the tritave for most listeners, since the dissimilarity induced by each successive octave doesn't accumulate quickly enough. (Again, the tritave is assumed to have significantly higher dissimilarity than the octave, due to the extremely fast falloff mentioned earlier.) This may mean that if we could experience sound perception with a wider hearing range, that it would be possible to hear two notes a very high number of octaves apart as no longer sounding very equivalent due to the accumulation of dissimilarity.

Octave reduction hypothesis

Because there's such a wide range of possible pitches to work with in music, it's often convenient to mentally reduce them down to the span of a single octave. In this sense, there actually is a property of notes that's truly invariant when transposing them by octaves, and that property is the note within any given one-octave span that they're most similar to. For example, the note within octave 3 (in other words, the notes C3 to B3, assuming a 12edo system) that the note E7 is most similar to is E3 (which is four octaves away). E7 might also be moderately similar to other notes within octave 3, such as A3 (which is two octaves and one tritave away), but it's most closely related to E3. As for why a span of one octave is chosen as opposed to any other size, that's because there isn't any other possible combination of window size and equivalence interval, besides one octave and one octave respectively, for which this property holds (but this remains to be rigorously proven).

Why this causes octave equivalence

Presumably, for one of these reasons or the other, or perhaps both, our brains create the abstraction that notes any number of octaves apart fall within the same pitch class. I believe this is essentially something our brains do just because they can, at least after receiving some musical training, since it creates a useful layer of abstraction without any real downsides.

It's only a useful layer of abstraction to do this with octaves only, because if we attempted to do it with both octaves and tritaves simultaneously, then every possible pitch would fall within the same pitch class, making the abstraction useless. This is because any two notes relate to each other by some combination of octaves and tritaves (within some arbitrarily small margin of error), but most pairs of notes cannot be related to each other by octaves only. In more technical terms, this means any possible interval is arbitrarily close to some 3-limit interval, but not arbitrarily close to some 2-limit interval.

Other implications

In addition to providing an attempted explanation for the perception of octave equivalence, the theory of harmonic similarity provides explanations for various other noteworthy phenomena.

Colorlessness of the major second

The major second is often regarded as a dissonant interval, but it seems rather unique in that it produces auditory roughness, but doesn't necessarily feel emotionally jarring and unstable in the way something like one of the various flavors of tritone does. This is easily explained by the fact that two notes a major second apart are somewhat harmonically similar, due to being related by two factors of 3 and three factors of 2, which gives the major second a more emotionally colorless flavor than other dissonant intervals.

Usage of the circle of fifths

In Western music theory, the circle of fifths is often touted as a tool for determining which chords and keys are related to each other. The circle or chain of fifths is essentially an abstraction of the idea of harmonic similarity: it first assumes octave equivalence by implying that notes within the same pitch class can be treated as equivalent, and then denotes which pitch classes are harmonically related to each other. Chord progressions often involve successive chords whose root notes are adjacent on the chain of fifths; this is viewed as a smooth harmonic movement, which can be explained by the fact that the two root notes are harmonically similar to each other. The full theory of harmonic similarity correctly demonstrates that an even smoother harmonic movement can be achieved by moving from one chord to another chord whose root note is an octave away from that of the first, although this of course results in a change of character that's so minimal that Western music theory wouldn't even consider it to be a different chord at all.

Interval quality

Although the circle or chain of fifths is commonly used to relate pitch classes to each other, I never see it used to relate interval classes to each other. What this would mean is that one of the pitch classes comprising the interval class would be fixed in place, while the other would be moved around the circle or chain of fifths in order to generate a mapping of harmonically similar interval classes. For example, ascending the chain of fifths in this manner starting at the unison would produce the unison, then the perfect fifth, then the major second, then the major sixth, and so on.

As an important note, the concept of interval class is usually defined to refer to undirected interval class: for example, a major third and minor sixth would be considered the same interval class due to being octave complements. However, this theory requires establishing the concept of directed interval class, in which octave complements are separate classes. In order for this to work, we need to define intervals as being directed in general; in other words, that an upward major third and a downward major third be considered two separate intervals. The downward major third falls into the same directed interval class as the upward minor sixth, but the upward major third does not. If the upward major third can be thought of as the ratio 5/4, then the downward major third can be thought of as 4/5. In the absence of a specifier for upward or downward direction, it should be assumed that an interval points in the upward direction.

The theory of harmonic similarity provides credence to the idea that various intervals with the same interval quality specifier (e.g. major, minor, perfect, neutral, etc.) should often sound similar in emotional character, since those interval classes are usually near each other on an interval-based circle or chain of fifths. In other words, just as the notes C4 and G4 sound related to each other since C4 relates to G5 by a tritave, and G5 relates to G4 by an octave, a major third and major seventh should be expected to sound similar in character to each other since the major third relates to the major fourteenth by a tritave, and the major fourteenth relates to the major seventh by an octave.

Something to note about this is that the interval quality terms in wide use are arbitrary categorical boundaries. For instance, this theory posits that out of all the interval classes labeled as major, the major second is the one that's closest to a "perfect" interval quality, while out of all the ones labeled as minor, the minor seventh is the closest in that regard.

Brightness and darkness

The definition of brightness in wide use within the xenharmonic community is that a scale is brighter if its scale degrees are farther above the root. Most Western listeners will agree that brighter modes of the diatonic scale sound brighter and darker modes sound darker; however, in my experience and probably the experience of most other Western listeners as well, "brighter" modes (as per the xenharmonic definition) of the pentatonic scale actually sound darker, and "darker" modes sound brighter. This seems to indicate that the perception of brightness isn't actually determined by distances of scale degrees above the root. The theory of harmonic similarity provides an explanation for this, asserting that the reason brighter diatonic modes sound bright is because they contain more scale degrees of major quality, while darker modes sound dark because they contain more scale degrees of minor quality, and that there exists an actual reason that various scale degrees of the same interval quality should sound similar in character. This logic is of course inverted with the pentatonic scale, where brighter modes contain more scale degrees of minor quality and darker modes contain more of major quality.

In fact, this correlation in scale degree quality between modes of similar brightness levels is effectively only due to the special case where the period of the MOS scale in question is an octave and the generator is a tritave, as is the case for the diatonic and pentatonic scales. If the generator is any other interval, the new "bright" scale degrees introduced as you ascend the modal brightness order have little harmonic similarity to each other, and the old "dark" scale degrees that get removed also have little harmonic similarity to each other, and furthermore, the scale degrees within the scale at any given time have little harmonic similarity to each other, which means that traversing the modal brightness order doesn't result in a cohesive shift in emotional character in the same way. This certainly aligns with my experiences comparing different modes of xenharmonic MOS scales, where the brightest and darkest modes don't really feel very distinct in character from each other compared to, say, Lydian and Locrian, but it's unclear how much of this is due to the implications of the harmonic similarity theory versus simply cultural conditioning.

Attitudes toward microtonal music

Microtonal music is often polarizing for the average Western listener, with many finding microtonal intervals jarring or strange to listen to. Although this is usually dismissed purely as cultural conditioning, there could also be another aspect at play here. I believe harmonic similarity could be the key behind why so many Western listeners have such an immediate reaction to hearing a microtonal interval.

Because 12edo has a perfectly tuned octave and near-perfectly tuned tritave, if any given interval is well approximated by the tuning system, then all the intervals that are harmonically similar to that interval will also be well approximated. Conversely, if an interval falls well outside of the tuning system, for example the subminor seventh, then all the intervals that are harmonically similar to that interval, such as the subminor third, will fall well outside of the tuning system as well. Because of this, not only do Western listeners not become acclimated to the subminor seventh itself, but they also don't become acclimated to other intervals that are similar in character, like the subminor third. If Western listeners regularly heard the subminor third but not the subminor seventh (which would be the case if 9edo was the dominant tuning system instead, for example), then they might not find the subminor seventh as jarring, since they'd already be very familiar with a similar interval.

12edo even has a reasonable tuning of the fifth harmonic, so it's arguably possible that this further contributes to the cohesion in harmonic similarity between the notes and intervals within the tuning system. However, it's debatable whether two frequencies separated by a factor of five have any non-negligible amount of harmonic similarity to each other, or whether 12edo's tuning of the fifth harmonic is accurate enough for this to take effect.

The tritave versus the perfect fifth

Perhaps a more contentious implication of this theory is that it implies the tritave is a slightly stronger similarity relation than the perfect fifth. In Western music theory, they'd be thought of as equal in similarity, due to the abstraction of octave equivalence. However, this implication seems to generally align with the experiences of xenharmonic musicians, many of whom claim the tritave works better as an equave than the perfect fifth.