User:Fularp/sandbox
SEMMA is an acronym that stands for Sample, Explore, Modify, Model and Assess. It is a list of sequential steps, developed by SAS Institute Inc., one of the largest producers of business intelligence software, that is intended to guide the implementation of data mining applications.[1] Although SEMMA is often regarded as a general data mining methodology, SAS claims that it is rather a logical organisation of the functional tool set of one of its products, SAS Enterprise Miner, for carrying out the core tasks of data mining.[2]
Background
In the expanding field of data mining, there has been a call for a standard, a methodology or simply a list of best practices for the diversified and iterative process of data mining that users can apply regardless of industry. While the Cross Industry Standard Process for Data Mining (CRISP-DM), developed under the European Strategic Program on Research in Information Technology (ESPRIT) initiative, aimed to create a neutral methodology, SAS also offered a pattern to follow in its data mining tools.
Phases of SEMMA
Sample
ba
Explore
bal
Modify
bla
Model
bla
Assess
bla
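Taken together, the five phases can be read as a simple sequential pipeline. The following is a minimal Python sketch of that sequence under stated assumptions, not SAS Enterprise Miner code: the function names, the toy data set and the choice of a straight-line least-squares model are all hypothetical and chosen only for illustration, and each phase is given the meaning it is commonly assigned (draw a subset, summarise it, transform variables, fit a model, measure error on held-out data).

```python
import random
import statistics


def make_toy_data(n=200, seed=1):
    """Hypothetical data set: y is roughly 3 + 2x with a little noise."""
    rng = random.Random(seed)
    records = []
    for _ in range(n):
        x = rng.uniform(0, 10)
        y = 3.0 + 2.0 * x + rng.gauss(0, 0.1)
        records.append((x, y))
    return records


def sample(records, fraction=0.5, seed=0):
    """Sample: shuffle and split into a working subset and a holdout set."""
    rng = random.Random(seed)
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * fraction)
    return shuffled[:cut], shuffled[cut:]


def explore(records):
    """Explore: summarise the input variable to understand its scale."""
    xs = [x for x, _ in records]
    return {"x_mean": statistics.mean(xs), "x_stdev": statistics.stdev(xs)}


def modify(records, summary):
    """Modify: standardise x using the statistics found while exploring."""
    mean, stdev = summary["x_mean"], summary["x_stdev"]
    return [((x - mean) / stdev, y) for x, y in records]


def model(records):
    """Model: fit y = intercept + slope * x by ordinary least squares."""
    xs = [x for x, _ in records]
    ys = [y for _, y in records]
    x_bar, y_bar = statistics.mean(xs), statistics.mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in records)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


def assess(fitted, records):
    """Assess: mean squared error of the fitted line on the given records."""
    intercept, slope = fitted
    errors = [(y - (intercept + slope * x)) ** 2 for x, y in records]
    return statistics.mean(errors)


def run_semma(records):
    """Run the five phases in order and return the holdout error."""
    train, holdout = sample(records)
    summary = explore(train)
    fitted = model(modify(train, summary))
    # The holdout set gets the same transformation that was learned on train.
    return assess(fitted, modify(holdout, summary))


if __name__ == "__main__":
    print("holdout mean squared error:", run_semma(make_toy_data()))
```

In this sketch the statistics computed in the Explore step are reused to transform the holdout set, so the Assess step measures the model on data it did not see during fitting.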
Criticism
See also
References
- Azevedo, A.; Santos, M. F. "KDD, SEMMA and CRISP-DM: a parallel overview". In Proceedings of the IADIS European Conference on Data Mining 2008, pp. 182-185.
- SAS Enterprise Miner website