Jump to content

Longest repeated substring problem

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Ruud Koot (talk | contribs) at 03:22, 5 December 2011 (added Category:Formal languages using HotCat). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In computer science, the longest repeated substring problem is the problem of finding the longest substring of a string that occurs at least twice. This problem can be solved in linear time and space by building a suffix tree for the string, and finding the deepest internal node in the tree. The string spelled by the edges from the root to such a node is a longest repeated substring. The problem of finding the longest substring with at least occurrences can be found by first preprocessing the tree to count the number of leaf descendants for each internal node, and then finding the deepest node with at least descendants.

  • Allison, L. "Suffix Trees". Retrieved 2008-10-14.