Unavoidable pattern
In mathematics and theoretical computer science, a pattern(term) is a sequence of characters over some alphabet .
A word is an instance of pattern , if there exists a non-erasing semigroup morphism such that . Non-erasing means .
A word matches(encounters) pattern if there exists a factor of , which is also an instance of . Otherwise, avoids . If avoids , is also called -free.
A pattern is unavoidable on a finite alphabet if there exists , word matches for all . Otherwise, is avoidable on , which implies there exist infinitely many words over alphabet that avoid . This is equivalent to saying that is avoidable on if there exists an infinite word that avoids .
A pattern is an unavoidable pattern(blocking term) if is unavoidable on any finite alphabet.
When a pattern is said to be unavoidable and does not specify an alphabet, then is unavoidable for any finite alphabet. Conversely, a pattern is said to be avoidable if is avoidable on some finite alphabet.
Zimin words
Given alphabet , Zimin words(patterns) are defined recursively for and base case .
A word is unavoidable if and only if it's a factor of a Zimin word.[1]
Given a finite alphabet , let representing the smallest such that matches for all . We have following properties:[2]
Given a pattern , let representing the number of distinct characters of . is avoidable if . Hence is one of the longest unavoidable patterns constructed by alphabet .
Pattern reduction
Given a pattern over some alphabet , we say is free for if there exist subsets of such that the following hold:
- is a factor of and ↔ is a factor of and
e.g. let , then is free for since there exist satisfied conditions above.
A pattern can reduce to pattern if there exists a character such that is free for , and can be obtained by removing all occurrence of from . Denote as .
e.g. let , then can reduce to since is free for .
Given patterns , if can reduce to and can reduce to , then can reduce to . Denote as .
A pattern is unavoidable if and only if can reduce to the empty word, hence .[3][1]
Avoidability index
A pattern is -avoidable if is avoidable on an alphabet of size . Otherwise, is -unavoidable, which means is unavoidable on every alphabet of size .[4][5]
if pattern is -avoidable, then is -avoidable for all .
The avoidability index of a pattern is the smallest such that is -avoidable, if is unavoidable.[6]
- xx,xxy,xyy,xxyx,xxyy,xyxx,xyxy,xyyx,xxyxx,xxyxy,xyxyy, as well as many doubled words, have avoidability index of 3, though this list is not complete.[7]
- abwbaxbcyaczca has avoidability index of 4, as well as other locked words. (Baker, McNulty, Taylor 1989)
- abvbawbcxacycdazdcd has avoidability index of 5. (Clark 2004)
- no words with avoidability index greater than 5 have been found.
Graph pattern avoidance[8]
Given a graph , a coloring is said to match the pattern if there exists a simple path in such that the sequence of colors matches . Otherwise, is said to avoid or -free.
The pattern chromatic number is the minimal number of distinct colors needed for a -free coloring over the graph .
let representing all the graphs with maximum degree no more than . let
A pattern is avoidable on graphs if is bounded by a constant .
An avoidable pattern on words can be expressed as a specific case of avoidance on graphs, hence for all where is a graph of vertexs concatenated.
Examples
- The Thue–Morse sequence is cube-free and overlap-free, hence it avoids the patterns and .[4][5]
- The patterns and are unavoidable on any alphabet, since they are factors of the Zimin words.[9][10]
- The power patterns for are 2-avoidable.[4]
- A square-free word is one avoiding the pattern . An example is the word over the alphabet obtained by taking the first difference of the Thue–Morse sequence.[11][12] See also Square-free word.
- all binary patterns can divided into three categories:[7]
- are unavoidable.
- have avoidability index of 3.
- others have avoidability index of 2.
Open problems
- Is there an avoidable pattern such that the avoidability index of is 6?
References
- ^ a b Zimin, A. I. (1984). "BLOCKING SETS OF TERMS". Mathematics of the USSR-Sbornik. 47 (2): 353. doi:10.1070/SM1984v047n02ABEH002647. ISSN 0025-5734.
- ^ Joshua, Cooper; Rorabaugh, Danny (2013). Bounds on Zimin Word Avoidance. arXiv:1409.3080. Bibcode:2014arXiv1409.3080C.
- ^ BEAN, DWIGHT RICHARD; EHRENFEUCHT, ANDRZEJ; MCNULTY, GEORGE FRANK (1979). Avoidable Patterns in Strings of Symbols (PDF).
- ^ a b c Lothaire (2011) p. 113
- ^ a b Berstel et al (2009) p.127
- ^ Lothaire (2011) p.124
- ^ a b Lothaire, M. (2002). Algebraic Combinatorics on Words.
- ^ Grytczuk, Jarosław (2007-05-28). "Pattern avoidance on graphs". Discrete Mathematics. The Fourth Caracow Conference on Graph Theory. 307 (11): 1341–1346. doi:10.1016/j.disc.2005.11.071. ISSN 0012-365X.
- ^ Allouche & Shallit (2003) p.24
- ^ Lothaire (2011) p.115
- ^ Pytheas Fogg (2002) p.104
- ^ Berstel et al (2009) p.97
- Allouche, Jean-Paul; Shallit, Jeffrey (2003). Automatic Sequences: Theory, Applications, Generalizations. Cambridge University Press. ISBN 978-0-521-82332-6. Zbl 1086.11015.
- Berstel, Jean; Lauve, Aaron; Reutenauer, Christophe; Saliola, Franco V. (2009). Combinatorics on words. Christoffel words and repetitions in words. CRM Monograph Series. Vol. 27. Providence, RI: American Mathematical Society. ISBN 978-0-8218-4480-9. Zbl 1161.68043.
- Lothaire, M. (2011). Algebraic combinatorics on words. Encyclopedia of Mathematics and Its Applications. Vol. 90. With preface by Jean Berstel and Dominique Perrin (Reprint of the 2002 hardback ed.). Cambridge University Press. ISBN 978-0-521-18071-9. Zbl 1221.68183.
- Pytheas Fogg, N. (2002). Berthé, Valérie; Ferenczi, Sébastien; Mauduit, Christian; Siegel, A. (eds.). Substitutions in dynamics, arithmetics and combinatorics. Lecture Notes in Mathematics. Vol. 1794. Berlin: Springer-Verlag. ISBN 3-540-44141-7. Zbl 1014.11015.