From Wikipedia, the free encyclopedia
The following tables compare some of the datasets that can be used in machine learning for training and testing.
Image datasets
Facial image datasets
Sound datasets
Dataset
|
Creator
|
Free
|
License[a]
|
Description
|
Number of examples (training + test)
|
Size (MB)
|
Web page
|
TIMIT
|
John Garofolo, Lori Lamel, William Fisher, Jonathan Fiscus, David Pallett, Nancy Dahlgren, Victor Zue
|
No
|
LDC User Agreement for Non-Members
|
Recordings of 630 speakers of eight major dialects of American English, each reading ten phonetically rich sentences
|
6,300
|
?
|
[6]
|
- ^ a b Licenses here are a summary, and are not taken to be complete statements of the licenses.
See also
References