Unavoidable pattern
In mathematics and theoretical computer science, a pattern(term) is a sequence of characters over some alphabet .
A word is an instance of pattern , if there exists a non-erasing semigroup morphism such that . Non-erasing means .
A word matches(encounters) pattern if there exists a factor(subword) of , which is also an instance of .
A word avoids pattern if does not match . If avoids , is also called -free.
A pattern is unavoidable on a finite alphabet if there exists , word matches for all . Otherwise, is avoidable on , which implies there exist infinitely many words over alphabet that avoid . This is equivalent to saying that is avoidable on if there exists an infinite word that avoids .
A pattern is an unavoidable pattern(blocking term) if is unavoidable on any finite alphabet.
When a pattern is said to be unavoidable and does not specify an alphabet, then is unavoidable for any finite alphabet. Conversely, a pattern is said to be avoidable if is avoidable on some finite alphabet.
Zimin words
Given alphabet , Zimin words(patterns) are defined recursively for and base case .
A word is unavoidable if and only if it's a factor of a Zimin word.[1]
Given a finite alphabet , let representing the smallest such that matches for all . We have following properties:[2]
Given a pattern , let representing the number of distinct characters of . is avoidable if . Hence is one of the longest unavoidable patterns constructed by alphabet .
Pattern reduction
Given a pattern over some alphabet , we say is free for if there exist subsets of such that the following hold:
- is a factor of and ↔ is a factor of and
e.g. let , then is free for since there exist satisfied conditions above.
A pattern can reduce to pattern if there exists a character such that is free for , and can be obtained by removing all occurrence of from . Denote as .
e.g. let , then can reduce to since is free for .
Given patterns , if can reduce to and can reduce to , then can reduce to . Denote as .
A pattern is unavoidable if and only if can reduce to the empty word, hence .[1][3]
Avoidability index
A pattern is -avoidable if is avoidable on an alphabet of size . Otherwise, is -unavoidable, which means is unavoidable on every alphabet of size .[4][5]
if pattern is -avoidable, then is -avoidable for all .
The avoidability index of a pattern is the smallest such that is -avoidable, if is unavoidable.[6]
- Many patterns are 2-avoidable.
- xx,xxy,xyy,xxyx,xxyy,xyxx,xyxy,xyyx,xxyxx,xxyxy,xyxyy, as well as many doubled words, are 3-avoidable, though this list is not complete.[7]
- abwbaxbcyaczca is 4-avoidable, as well as other locked words. (Baker, McNulty, Taylor 1989)
- abvbawbcxacycdazdcd is 5-avoidable. (Clark 2004)
- no words with index greater than 5 have been found.
Examples
- The Thue–Morse sequence is cube-free and overlap-free, hence it avoids the patterns and .[4][5]
- The patterns and are unavoidable on any alphabet, since they are factors of the Zimin words.[8][9]
- The power patterns for are 2-avoidable.[4]
- A square-free word is one avoiding the pattern . An example is the word over the alphabet obtained by taking the first difference of the Thue–Morse sequence.[10][11] See also Square-free word.
- all binary patterns can divided into three categories:[7]
- are unavoidable.
- are 3-avoidable
- others are 2-avoidable
References
- ^ a b Zimin, A.I. (1982). BLOCKING SETS OF TERMS.
- ^ Joshua, Cooper; Rorabaugh, Danny (2013). Bounds on Zimin Word Avoidance. arXiv:1409.3080. Bibcode:2014arXiv1409.3080C.
- ^ BEAN, DWIGHT RICHARD; EHRENFEUCHT, ANDRZEJ; MCNULTY, GEORGE FRANK (1979). Avoidable Patterns in Strings of Symbols (PDF).
- ^ a b c Lothaire (2011) p. 113
- ^ a b Berstel et al (2009) p.127
- ^ Lothaire (2011) p.124
- ^ a b Lothaire, M. (2002). Algebraic Combinatorics on Words.
- ^ Allouche & Shallit (2003) p.24
- ^ Lothaire (2011) p.115
- ^ Pytheas Fogg (2002) p.104
- ^ Berstel et al (2009) p.97
- Allouche, Jean-Paul; Shallit, Jeffrey (2003). Automatic Sequences: Theory, Applications, Generalizations. Cambridge University Press. ISBN 978-0-521-82332-6. Zbl 1086.11015.
- Berstel, Jean; Lauve, Aaron; Reutenauer, Christophe; Saliola, Franco V. (2009). Combinatorics on words. Christoffel words and repetitions in words. CRM Monograph Series. Vol. 27. Providence, RI: American Mathematical Society. ISBN 978-0-8218-4480-9. Zbl 1161.68043.
- Lothaire, M. (2011). Algebraic combinatorics on words. Encyclopedia of Mathematics and Its Applications. Vol. 90. With preface by Jean Berstel and Dominique Perrin (Reprint of the 2002 hardback ed.). Cambridge University Press. ISBN 978-0-521-18071-9. Zbl 1221.68183.
- Pytheas Fogg, N. (2002). Berthé, Valérie; Ferenczi, Sébastien; Mauduit, Christian; Siegel, A. (eds.). Substitutions in dynamics, arithmetics and combinatorics. Lecture Notes in Mathematics. Vol. 1794. Berlin: Springer-Verlag. ISBN 3-540-44141-7. Zbl 1014.11015.