Talk:Statistical model

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Illia Connell (talk | contribs) at 22:10, 28 February 2013 (stats rating using AWB). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
WikiProject Statistics: Start-class, High-importance
This article is within the scope of WikiProject Statistics, a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article has been rated as Start-class on Wikipedia's content assessment scale.
This article has been rated as High-importance on the importance scale.

Could this be explained for the layman?

Statistical modelling

As we discussed above, the problem of main interest for us is to obtain a measure of both the complexity and the (useful) information in a data set. As in the algorithmic theory, the complexity is the primary notion, which then allows us to define the more intricate notion of information. Our plan is to define the complexity in terms of the shortest code length when the data is encoded with a class of models as codes. In the previous section we saw that this leads into the noncomputability problem if we let the class of models include the set of all computer programs, a 'model' identified with a computer program (code) that generates the given data. However, if we select a smaller class, the noncomputability problem can be avoided, but we have to overcome another difficulty: how are we to define the shortest code length? It seems that in order not to fall back to the Kolmogorov complexity we must spell out exactly how the distributions as models are to be used to restrict the coding operations. In universal coding we did just that by doing the coding in a predictive way: in the Lempel-Ziv code by the index of the leaf in the tree of the segments determined by the past data, and in Context Coding by applying an arithmetic code to each 'next' symbol, conditioned on a context defined by the algorithm as a function of the past data. Here we adopt a different strategy: we define the idea of shortest code length in a probabilistic sense, which turns out to satisfy practical requirements. To do that we must be more formal about models.

—Preceding unsigned comment added by 220.227.55.53 (talk) 05:20, 6 March 2010 (UTC)
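
For lay readers, the idea of code length "in a probabilistic sense" from the excerpt above can be illustrated with a small sketch. It assumes a very simple model class (Bernoulli distributions over binary data), which is not specified in the quoted text, and the function names are hypothetical. A model that assigns probability P to the data gives an ideal code length of -log2 P bits; the predictive variant estimates each symbol's probability from the past data only, here with Laplace's rule of succession as a stand-in estimator.

```python
import math


def bernoulli_code_length(bits, p):
    """Ideal code length in bits, -log2 P(bits), under a fixed Bernoulli(p) model."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    ones = sum(bits)
    zeros = len(bits) - ones
    return -(ones * math.log2(p) + zeros * math.log2(1.0 - p))


def predictive_code_length(bits):
    """Code length when each symbol is coded using only the past data.

    The probability of the next symbol being 1 is estimated with
    Laplace's rule of succession (an assumption made for illustration).
    """
    total = 0.0
    ones = 0
    for t, b in enumerate(bits):
        p_one = (ones + 1) / (t + 2)  # estimate from the first t symbols
        total += -math.log2(p_one if b == 1 else 1.0 - p_one)
        ones += b
    return total


# Hypothetical data: 16 binary symbols, 13 of them equal to 1.
data = [1, 1, 0, 1, 1, 1, 0, 1, 1, 1, 1, 0, 1, 1, 1, 1]

uniform_bits = bernoulli_code_length(data, 0.5)
best_fixed_bits = bernoulli_code_length(data, sum(data) / len(data))
predictive_bits = predictive_code_length(data)

print(uniform_bits, best_fixed_bits, predictive_bits)
```

In this sketch the best fixed model in the class gives the shortest length, but a decoder could not use it without first being told the parameter. The predictive code needs no such side information and still beats the uniform 1-bit-per-symbol code on this data.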