Unavoidable pattern
In mathematics and theoretical computer science, a pattern is an unavoidable pattern if it's unavoidable on any finite alphabet.
Definitions
Pattern
Like a word, a pattern (also referred to as term) is a sequence of characters over some alphabet.
The minimum multiplicity of pattern is where is the number of occurrence of character in pattern . In other words, it is the number of occurrences in of the least frequently occurring character in .
Instance
Given finite alphabets and , a word is an instance of pattern , if there exists a non-erasing semigroup morphism such that , where denotes the Kleene star of . Non-erasing means that for all , where denotes the empty string.
Avoid / Match
A word is said to match, or encounter, a pattern if a substring of is an instance of . Otherwise, is said to avoid , or to be -free. This definition can be generalized to the case of an infinite , based on a generalized definition of "substring".
Avoidability / Unavoidability on a specific alphabet
A pattern is unavoidable on a finite alphabet if each sufficiently long word must match ; formally: if . Otherwise, is avoidable on , which implies there exist infinitely many words over alphabet that avoid .
By Kőnig's lemma, pattern is avoidable on if, and only if, there exists an infinite word that avoids .[1]
Avoidable / Unavoidable pattern
A pattern is an unavoidable pattern(also reference as: blocking term) if is unavoidable on any finite alphabet.
If a pattern is said to be unavoidable and not limited to a specific alphabet, then it's unavoidable for any finite alphabet by default. Conversely, if a pattern is said to be avoidable and not limited to a specific alphabet, then it's avoidable on some finite alphabet by default.
-avoidable / -unavoidable
A pattern is -avoidable if is avoidable on an alphabet of size . Otherwise, is -unavoidable, which means is unavoidable on every alphabet of size .[2][3]
If pattern is -avoidable, then is -avoidable for all .
Given a finite set of avoidable patterns , there exist an infinte word such that avoids all patterns of .[1] Let denote the size of minimal alphabet such that avoids all patterns of .
Avoidability index
The avoidability index of a pattern is the smallest such that is -avoidable, if is unavoidable.[4]
Properties
- Let pattern be an instance of avoidable pattern , then is also avoidable.[5]
- Let avoidable pattern be a factor of pattern , then is also avoidable.[5]
- A pattern is unavoidable if and only if is a factor of some unavoidable pattern .
- Given an unavoidable pattern and a character not in , then is unavoidable.[5]
- Given an unavoidable pattern , then the reflection is unavoidable.
- Given an unavoidable pattern , there exist a character such that occurs excatly once in .[5]
- Given a pattern , let represents the number of distinct characters of . if , then is avoidable.[5]
Zimin words
Given alphabet , Zimin words (patterns) are defined recursively for and base case .
Unavoidability
All Zimin words are unavoidable.[6]
A word is unavoidable if and only if it's a factor of a Zimin word.[6]
Given a finite alphabet , let represents the smallest such that matches for all . We have following properties:[7]
is the longest unavoidable patterns constructed by alphabet since
Pattern reduction
Free letter
Given a pattern over some alphabet , we say is free for if there exist subsets of such that the following hold:
- is a factor of and ↔ is a factor of and
e.g. let , then is free for since there exist satisfied conditions above.
Reduce
A pattern can reduce to pattern if there exists a character such that is free for , and can be obtained by removing all occurrence of from . Denote as .
e.g. let , then can reduce to since is free for .
Transitivity
Given patterns , if can reduce to and can reduce to , then can reduce to . Denote as .
Unavoidability
A pattern is unavoidable if and only if can reduce to a word of length one, hence where .[8][6]
Graph pattern avoidance[9]
Avoid / Match on a specific graph
Given a simple graph , a edge coloring matches pattern if there exists a simple path in such that the sequence matches . Otherwise, is said to avoid or -free.
Similarly, a vertex coloring matches pattern if there exists a simple path in such that the sequence matches .
Pattern chromatic number
The pattern chromatic number is the minimal number of distinct colors needed for a -free vertex coloring over the graph .
Let where is the set of all simple graphs with a maximum degree no more than .
Similarly, and are defined on edge coloring.
Avoidability / Unavoidability on graphs
A pattern is avoidable on graphs if is bounded by , where only depend on .
- Avoidance on words can be expressed as a specific case of avoidance on graphs, hence a pattern is avoidable on any finite alphabet if and only if for all where is a graph of vertexs concatenated.
Probabilistic bound on
There exist an absolute constant , such that for all pattern with .
Given a pattern , let represents the number of distinct characters of . if , then is avoidable on graphs.
Explicit colorings
Given a pattern such that is even for all , then for all where is the complete graph of vertices.
Given a pattern such that , and an arbitary tree . Let be the set of all avoidable subpatterns and their reflections of , then .
Given a pattern such that , and a tree with degree . Let be the set of all avoidable subpatterns and their reflections of , then .
Examples
- The Thue–Morse sequence is cube-free and overlap-free, hence it avoids the patterns and .[2][3]
- A square-free word is one avoiding the pattern . The word over the alphabet obtained by taking the first difference of the Thue–Morse sequence is an example of infinite square-free words.[10][11]
- The patterns and are unavoidable on any alphabet, since they are factors of the Zimin words.[12][13]
- The power patterns for are 2-avoidable.[2]
- All binary patterns can divided into three categories:[1]
- are unavoidable.
- have avoidability index of 3.
- others have avoidability index of 2.
- has avoidability index of 4, as well as other locked words.[14]
- has avoidability index of 5.[15]
- The repetitive threshold is the infimum of exponent such that is avoidable on the alphabet of size . See also Dejean's theorem.
Open problems
- Is there an avoidable pattern such that the avoidability index of is 6?
- Given an arbitary pattern , is there an algorithm to determine the avoidability index of ?[1]
- Are all avoidable patterns also avoidable on graphs?[9]
References
- ^ a b c d Lothaire, M. (2002). Algebraic Combinatorics on Words. Cambridge University Press.
- ^ a b c Lothaire (2011) p. 113
- ^ a b Berstel et al (2009) p.127
- ^ Lothaire (2011) p.124
- ^ a b c d e Schmidt, Ursula (1987-08-01). "Long unavoidable patterns". Acta Informatica. 24 (4): 433–445. doi:10.1007/BF00292112. ISSN 1432-0525.
- ^ a b c Zimin, A. I. (1984). "BLOCKING SETS OF TERMS". Mathematics of the USSR-Sbornik. 47 (2): 353. doi:10.1070/SM1984v047n02ABEH002647. ISSN 0025-5734.
- ^ Joshua, Cooper; Rorabaugh, Danny (2013). Bounds on Zimin Word Avoidance. arXiv:1409.3080. Bibcode:2014arXiv1409.3080C.
- ^ BEAN, DWIGHT RICHARD; EHRENFEUCHT, ANDRZEJ; MCNULTY, GEORGE FRANK (1979). Avoidable Patterns in Strings of Symbols (PDF).
- ^ a b Grytczuk, Jarosław (2007-05-28). "Pattern avoidance on graphs". Discrete Mathematics. The Fourth Caracow Conference on Graph Theory. 307 (11): 1341–1346. doi:10.1016/j.disc.2005.11.071. ISSN 0012-365X.
- ^ Pytheas Fogg (2002) p.104
- ^ Berstel et al (2009) p.97
- ^ Allouche & Shallit (2003) p.24
- ^ Lothaire (2011) p.115
- ^ Baker, Kirby A.; McNulty, George F.; Taylor, Walter (1989-12-18). "Growth problems for avoidable words". Theoretical Computer Science. 69 (3): 319–345. doi:10.1016/0304-3975(89)90071-6. ISSN 0304-3975.
- ^ Clark, Ronald J. (2006-04-01). "The existence of a pattern which is 5-avoidable but 4-unavoidable". International Journal of Algebra and Computation. 16 (02): 351–367. doi:10.1142/S0218196706002950. ISSN 0218-1967.
- Allouche, Jean-Paul; Shallit, Jeffrey (2003). Automatic Sequences: Theory, Applications, Generalizations. Cambridge University Press. ISBN 978-0-521-82332-6. Zbl 1086.11015.
- Berstel, Jean; Lauve, Aaron; Reutenauer, Christophe; Saliola, Franco V. (2009). Combinatorics on words. Christoffel words and repetitions in words. CRM Monograph Series. Vol. 27. Providence, RI: American Mathematical Society. ISBN 978-0-8218-4480-9. Zbl 1161.68043.
- Lothaire, M. (2011). Algebraic combinatorics on words. Encyclopedia of Mathematics and Its Applications. Vol. 90. With preface by Jean Berstel and Dominique Perrin (Reprint of the 2002 hardback ed.). Cambridge University Press. ISBN 978-0-521-18071-9. Zbl 1221.68183.
- Pytheas Fogg, N. (2002). Berthé, Valérie; Ferenczi, Sébastien; Mauduit, Christian; Siegel, A. (eds.). Substitutions in dynamics, arithmetics and combinatorics. Lecture Notes in Mathematics. Vol. 1794. Berlin: Springer-Verlag. ISBN 3-540-44141-7. Zbl 1014.11015.