Jump to content

Unavoidable pattern

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Jochen Burghardt (talk | contribs) at 10:13, 18 February 2020 (Zimin words: use template; use redirecting namethat is immediately motivated by section head). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In mathematics and theoretical computer science, a pattern(term) is a sequence of characters over some alphabet .

A word  is an instance of pattern , if there exists a non-erasing semigroup morphism  such that . Non-erasing means .

A word matches(encounters) pattern if there exists a factor(subword) of , which is also an instance of .

A word avoids pattern if does not match . If avoids , is also called -free.

A pattern is unavoidable on a finite alphabet if there exists , word matches for all . Otherwise, is avoidable on , which implies there exist infinitely many words over alphabet that avoid . This is equivalent to saying that is avoidable on if there exists an infinite word that avoids .

A pattern is an unavoidable pattern(blocking term) if is unavoidable on any finite alphabet.

When a pattern is said to be unavoidable and does not specify an alphabet, then is unavoidable for any finite alphabet. Conversely, a pattern is said to be avoidable if is avoidable on some finite alphabet.

Zimin words

Given alphabet , Zimin words(patterns) are defined recursively  for and base case .

A word  is unavoidable if and only if it's a factor of a Zimin word.[1]


Given a finite alphabet , let representing the smallest such that matches for all . We have following properties:[2]

Given a pattern , let representing the number of distinct characters of . is avoidable if . Hence is one of the longest unavoidable patterns constructed by alphabet .

Pattern reduction

Given a pattern  over some alphabet , we say  is free for  if there exist subsets  of  such that the following hold:

  1.  is a factor of  and  ↔   is a factor of  and

e.g. let , then  is free for  since there exist  satisfied conditions above.

A pattern  can reduce to pattern  if there exists a character such that is free for , and  can be obtained by removing all occurrence of from . Denote as .

e.g. let , then can reduce to  since  is free for .

Given patterns , if  can reduce to  and  can reduce to , then  can reduce to . Denote as .

A pattern  is unavoidable if and only if can reduce to the empty word, hence .[1][3]

Avoidability index

A pattern is -avoidable if is avoidable on an alphabet of size . Otherwise, is -unavoidable, which means is unavoidable on every alphabet of size .[4][5]

if pattern is -avoidable, then is -avoidable for all .

The avoidability index of a pattern is the smallest such that is -avoidable, if is unavoidable.[6]

  • Many patterns are 2-avoidable.
  • xx,xxy,xyy,xxyx,xxyy,xyxx,xyxy,xyyx,xxyxx,xxyxy,xyxyy, as well as many doubled words, are 3-avoidable, though this list is not complete.[7]
  • abwbaxbcyaczca is 4-avoidable, as well as other locked words. (Baker, McNulty, Taylor 1989)
  • abvbawbcxacycdazdcd is 5-avoidable. (Clark 2004)
  • no words with index greater than 5 have been found.

Examples

  • The Thue–Morse sequence is cube-free and overlap-free, hence it avoids the patterns and .[4][5]
  • The patterns and are unavoidable on any alphabet, since they are factors of the Zimin words.[8][9]
  • The power patterns for are 2-avoidable.[4]
  • A square-free word is one avoiding the pattern . An example is the word over the alphabet obtained by taking the first difference of the Thue–Morse sequence.[10][11] See also Square-free word.
  • all binary patterns can divided into three categories:[7]
    • are unavoidable.
    • are 3-avoidable
    • others are 2-avoidable

References

  1. ^ a b Zimin, A.I. (1982). BLOCKING SETS OF TERMS.
  2. ^ Joshua, Cooper; Rorabaugh, Danny (2013). Bounds on Zimin Word Avoidance. arXiv:1409.3080. Bibcode:2014arXiv1409.3080C.
  3. ^ BEAN, DWIGHT RICHARD; EHRENFEUCHT, ANDRZEJ; MCNULTY, GEORGE FRANK (1979). Avoidable Patterns in Strings of Symbols (PDF).
  4. ^ a b c Lothaire (2011) p. 113
  5. ^ a b Berstel et al (2009) p.127
  6. ^ Lothaire (2011) p.124
  7. ^ a b Lothaire, M. (2002). Algebraic Combinatorics on Words.
  8. ^ Allouche & Shallit (2003) p.24
  9. ^ Lothaire (2011) p.115
  10. ^ Pytheas Fogg (2002) p.104
  11. ^ Berstel et al (2009) p.97