Jump to content

Talk:Sukhotin's algorithm

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 83.134.178.100 (talk) at 16:26, 26 February 2012 (The algorithm itself: new section). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
WikiProject iconRobotics Redirect‑class Low‑importance
WikiProject iconThis redirect is within the scope of WikiProject Robotics, a collaborative effort to improve the coverage of Robotics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
RedirectThis redirect does not require a rating on Wikipedia's content assessment scale.
LowThis redirect has been rated as Low-importance on the project's importance scale.
Note icon
This redirect has been marked as needing immediate attention.
WikiProject iconLinguistics: Applied Linguistics Redirect‑class
WikiProject iconThis redirect is within the scope of WikiProject Linguistics, a collaborative effort to improve the coverage of linguistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
RedirectThis redirect does not require a rating on Wikipedia's content assessment scale.
Taskforce icon
This redirect is supported by Applied Linguistics Task Force.
Note icon
This redirect has been automatically rated by a bot or other tool because one or more other projects use this class. Please ensure the assessment is correct before removing the |auto= parameter.

Attention needed

  • Expand - the article does not explain what it is, merely whaere it may be used and what category it falls into. Expand to explain what the algorithm is
  • References needed
Chaosdruid (talk) 23:47, 9 January 2011 (UTC)[reply]

The algorithm itself

Hi I just read the sukhotin paper, the idea behind the algorithm is:

Hi I just read the sukhotin paper, the idea behind the algorithm is:

it is hard to pronounce a prolonged string of consonants CCCCCC... but few variation seems possible with vowels VVVVV... so most languages alternate vowels and consonants. Consider some function of co-occurence, i.e. being adjacent in a word.

Call D a possible division of letters into disjoint sets of vowels V and consonants C.

Call f(l_i,l_j) the frequency of co-occurence in some corpus.

Then we expect Q(D)=sum over all vowels according to D of (sum over all consonants according to D of (frequency of co-occurence)) should be a maximum.

He alludes to some optimizations but does not give them in the paper...