Fraenkel word: Difference between revisions

From Xenharmonic Wiki
Jump to navigation Jump to search
Inthar (talk | contribs)
Inthar (talk | contribs)
Line 17: Line 17:
== Facts ==
== Facts ==
Below we denote the length of a word ''w'' by |''w''| and the number of occurrences of the letter '''i''' in ''w'' as {{!}}''w''{{!}}<sub>'''i'''</sub>, as is standard notation in combinatorics on words. The notation ''w''(''u''<sub>0</sub>, ..., ''u''<sub>''r''&minus;1</sub>) represents the word ''w'' in '''0''', '''1''', ..., '''r&minus;1''' but with '''i''' replaced by the word ''u''<sub>''i''</sub>.
Below we denote the length of a word ''w'' by |''w''| and the number of occurrences of the letter '''i''' in ''w'' as {{!}}''w''{{!}}<sub>'''i'''</sub>, as is standard notation in combinatorics on words. The notation ''w''(''u''<sub>0</sub>, ..., ''u''<sub>''r''&minus;1</sub>) represents the word ''w'' in '''0''', '''1''', ..., '''r&minus;1''' but with '''i''' replaced by the word ''u''<sub>''i''</sub>.
 
=== Fraenkel words are balanced ===
{{theorem|contents=As circular words, Fraenkel words are [[balanced]].}}
{{theorem|contents=As circular words, Fraenkel words are [[balanced]].}}


Line 43: Line 43:
{{proof|contents=We use the previous lemma. In the first case, ''w'' is guaranteed to have exactly ''k''-many '''i'''s where {{!}}''w''{{!}} = ''k''2<sup>''i''+1</sup>. In the second case, if ''k''2<sup>''i''+1</sup> < {{!}}''w''{{!}} < (''k'' + 1)2<sup>''i''+1</sup> and ''w'' = ''uv'' or ''vu'' where {{!}}''u''{{!}} ≡ 0 mod 2<sup>''i''+1</sup>,  then ''u'' satisfies {{!}}''u''{{!}}<sub>'''i'''</sub> = ''k''2<sup>''i''+1</sup>/2<sup>''i''+1</sup> = ''k'' by the previous case. Thus {{!}}''w''{{!}}<sub>'''i'''</sub>  is determined by {{!}}''v''{{!}}<sub>'''i'''</sub>, which is 1 if ''v'' contains the '''i''' in the middle of ''F''<sub>''i''</sub>, implying {{!}}''w''{{!}}<sub>'''i'''</sub> = ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>), and 0 otherwise, implying {{!}}''w''{{!}}<sub>'''i'''</sub> = floor({{!}}''w''{{!}}/2<sup>''i''+1</sup>).}}
{{proof|contents=We use the previous lemma. In the first case, ''w'' is guaranteed to have exactly ''k''-many '''i'''s where {{!}}''w''{{!}} = ''k''2<sup>''i''+1</sup>. In the second case, if ''k''2<sup>''i''+1</sup> < {{!}}''w''{{!}} < (''k'' + 1)2<sup>''i''+1</sup> and ''w'' = ''uv'' or ''vu'' where {{!}}''u''{{!}} ≡ 0 mod 2<sup>''i''+1</sup>,  then ''u'' satisfies {{!}}''u''{{!}}<sub>'''i'''</sub> = ''k''2<sup>''i''+1</sup>/2<sup>''i''+1</sup> = ''k'' by the previous case. Thus {{!}}''w''{{!}}<sub>'''i'''</sub>  is determined by {{!}}''v''{{!}}<sub>'''i'''</sub>, which is 1 if ''v'' contains the '''i''' in the middle of ''F''<sub>''i''</sub>, implying {{!}}''w''{{!}}<sub>'''i'''</sub> = ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>), and 0 otherwise, implying {{!}}''w''{{!}}<sub>'''i'''</sub> = floor({{!}}''w''{{!}}/2<sup>''i''+1</sup>).}}


TODO: A final lemma handling the case where the subword ''w'' of ''F''<sub>''n''</sub> is a concatenation of a suffix of ''F''<sub>''n''</sub> and a prefix of ''F''<sub>''n''</sub>.
{{theorem|name=Lemma|contents=Let ''G''<sub>''n''</sub> denote the circular Fraenkel word on ''n'' letters. Suppose ''w'' is a proper subword of ''G''<sub>''n''</sub> such that ''w'' = ''uv'' where ''u'' is a nonempty suffix of ''F''<sub>''n''</sub> and ''v'' is a nonempty prefix of ''F''<sub>''n''</sub>. For 1 &le; {{!}}''w''{{!}} &le; 2<sup>''n''/2</sup> &minus; 2, either {{!}}''w''{{!}}<sub>'''i'''</sub> = ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>) or ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>) &minus; 1.
}}
 
{{proof|contents=
There are 2 cases:
# Both {{!}}''u''{{!}} and {{!}}''v''{{!}} are 0 mod 2<sup>''i''+1</sup>.
# At least one of {{!}}''u''{{!}} and {{!}}''v''{{!}} is not 0 mod 2<sup>''i''+1</sup>.
In case 1, by the preceding lemma {{!}}''u''{{!}}<sub>'''i'''</sub> = {{!}}''u''{{!}}/2<sup>''i''+1</sup> and {{!}}''v''{{!}}<sub>'''i'''</sub> = {{!}}''v''{{!}}/2<sup>''i''+1</sup>, and hence {{!}}''w''{{!}}<sub>'''i'''</sub> = {{!}}''w''{{!}}/2<sup>''i''+1</sup> = ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>).
 
In case 2, suppose ''w'' = ''ustv'' where ''st'' is as in case 1 and {{!}}''u''{{!}} and {{!}}''v''{{!}} are less than 2<sup>''i''+1</sup>. Neither ''u'' nor ''v'' can contain an '''i''', as they are subwords of ''F''<sub>''i''</sub>; hence {{!}}''u''{{!}}<sub>'''i'''</sub> = {{!}}''st''{{!}}<sub>'''i'''</sub> = {{!}}''st''{{!}}/2<sup>''i''+1</sup>. As {{!}}''u''{{!}} + {{!}}''v''{{!}} &le; 2<sup>''i''+1</sup> &minus; 2, we have {{!}}''w''{{!}}<sub>'''i'''</sub> &ge; ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>) &minus; 1 = {{!}}''st''{{!}}<sub>'''i'''</sub>. On the other hand, {{!}}''w''{{!}}<sub>'''i'''</sub> < ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>), lest ''u'' or ''v'' have an '''i'''. Therefore {{!}}''w''{{!}}<sub>'''i'''</sub> = ceil({{!}}''w''{{!}}/2<sup>''i''+1</sup>) &minus; 1.
}}


== Open problems ==
== Open problems ==

Revision as of 14:20, 15 February 2024

In combinatorics on words, the Fraenkel word over n letters [math]\displaystyle{ \mathbf{0}, \mathbf{1}, ..., (\mathbf{n-1}) }[/math] is defined recursively by

[math]\displaystyle{ \displaystyle{ \begin{align*} F_0 &= \epsilon, \\ F_1 &= \mathbf{0}, \\ F_2 &= \mathbf{010}, \\ F_3 &= \mathbf{0102010}, \\ &\ \ \vdots \\ F_{n} &= F_{n-1}(\mathbf{n-1})F_{n-1}. \end{align*}} }[/math]

Here ε is the empty word.

Fraenkel words are named after mathematician Aviezri S. Fraenkel.

Facts

Below we denote the length of a word w by |w| and the number of occurrences of the letter i in w as |w|i, as is standard notation in combinatorics on words. The notation w(u0, ..., ur−1) represents the word w in 0, 1, ..., r−1 but with i replaced by the word ui.

Fraenkel words are balanced

Theorem — As circular words, Fraenkel words are balanced.

The theorem will be a consequence of the following lemmas:

Lemma — Let Fn denote the non-circular Fraenkel word on n letters. For n ≥ 1 and 0 ≤ in − 1, the letter i appears once every 2i+1 letters in Fn; i.e. in every subword of the form iwi, |w| = 2i+1 − 1.

Proof
We prove this by induction on n. The n = 1 case being trivial, for n > 1 we start by observing that i always occurs as the greatest letter of a subword that is the Fraenkel word Fi+1, and Fi+1 is always flanked by letters that are greater. This subword iFi is not a suffix of Fn, and we thus have

iFik...

where k > i, and F0 is the empty word. Since k occurs as the middle letter of Fk+1, there is a copy of Fk that follows k; Fk has Fi as a prefix. Thus Fn has a subword

iFikFii,

as desired, since |Fii| = 2i − 1. [math]\displaystyle{ \square }[/math]

Lemma — For all n ≥ 1, 0 ≤ in − 1, and 1 ≤ |w| ≤ 2n/2 − 2, the following holds for any subword w of Fn:

  1. If |w| ≡ 0 mod 2i+1, then |w|i = |w|/2i+1.
  2. If |w| ≢ 0 mod 2i+1, then |w|i = either floor(|w|/2i+1) or ceil(|w|/2i+1).
    • More precisely, if for a given i we have w = uv or vu where u is a possibly empty word whose length is 0 mod 2i+1, and v is a nonempty word intersecting the middle of an Fi+1, then |w|i = ceil(|w|/2i+1). Otherwise, |w|i = floor(|w|/2i+1).
Proof
We use the previous lemma. In the first case, w is guaranteed to have exactly k-many is where |w| = k2i+1. In the second case, if k2i+1 < |w| < (k + 1)2i+1 and w = uv or vu where |u| ≡ 0 mod 2i+1, then u satisfies |u|i = k2i+1/2i+1 = k by the previous case. Thus |w|i is determined by |v|i, which is 1 if v contains the i in the middle of Fi, implying |w|i = ceil(|w|/2i+1), and 0 otherwise, implying |w|i = floor(|w|/2i+1). [math]\displaystyle{ \square }[/math]

Lemma — Let Gn denote the circular Fraenkel word on n letters. Suppose w is a proper subword of Gn such that w = uv where u is a nonempty suffix of Fn and v is a nonempty prefix of Fn. For 1 ≤ |w| ≤ 2n/2 − 2, either |w|i = ceil(|w|/2i+1) or ceil(|w|/2i+1) − 1.

Proof
There are 2 cases:
  1. Both |u| and |v| are 0 mod 2i+1.
  2. At least one of |u| and |v| is not 0 mod 2i+1.

In case 1, by the preceding lemma |u|i = |u|/2i+1 and |v|i = |v|/2i+1, and hence |w|i = |w|/2i+1 = ceil(|w|/2i+1).

In case 2, suppose w = ustv where st is as in case 1 and |u| and |v| are less than 2i+1. Neither u nor v can contain an i, as they are subwords of Fi; hence |u|i = |st|i = |st|/2i+1. As |u| + |v| ≤ 2i+1 − 2, we have |w|i ≥ ceil(|w|/2i+1) − 1 = |st|i. On the other hand, |w|i < ceil(|w|/2i+1), lest u or v have an i. Therefore |w|i = ceil(|w|/2i+1) − 1. [math]\displaystyle{ \square }[/math]

Open problems

For circular words (equivalently, infinite periodic words), Fraenkel's conjecture asserts that the only balanced circular words over n ≥ 3 letters with letter occurrences pairwise distinct are (letter reassignments of) [math]\displaystyle{ F_n. }[/math][1] The conjecture is known to be true for 3 ≤ n ≤ 7.

References

  1. Bulgakova, D. V., Buzhinsky, N., & Goncharov, Y. O. (2023). On balanced and abelian properties of circular words over a ternary alphabet. Theoretical Computer Science, 939, 227-236.