User:Datakeeper/DatasetsOnDeck
Appearance
The purpose of this page is to curate datasets before putting them on pages like List of datasets for machine learning research.
All of the following datasets have been added to List of datasets for machine learning research
[edit]Dataset Name | Brief description | Preprocessing | Instances | Format | Default Task | Created (updated) | Reference | Creator |
---|---|---|---|---|---|---|---|---|
PASCAL VOC Dataset | Large number of images for classification tasks. | Labeling, bounding box included | 500,000 | Images, text | Classification, object detection | 2010 | [1][2] | M. Everingham et al. |
UCF 101 Dataset | Self described as "a dataset of 101 human actions classes from videos in the wild." Dataset is large with over 27 hours of video. | Actions classified and labeled. | 13,000 | Video, images, text | Classification, action detection | 2012 | [3][4] | K. Soomro et al. |
THUMOS Dataset | Large video dataset for action classification. | Actions classified and labeled. | 45M frames of video | Video, images, text | Classification, action detection | 2013 | [5][6] | Y. Jiang et al. |
German Traffic Sign Detection Benchmark Dataset | Images from vehicles of traffic signs on German roads. These signs comply with UN standards and therefore are the same as in other countries. | Signs manually labeled | 900 | Images | Classification | 2013 | [7][8] | S Houben et al. |
CIFAR-10 Dataset | Many small, low-resolution, images of 10 classes of objects. | Classes labelled, training set splits created. | 60,000 | Images | Classification | 2009 | [9][10] | A. Krizhevsky et al. |
CIFAR-100 Dataset | Like CIFAR-10, above, but 100 classes of objects are given. | Classes labelled, training set splits created. | 60,000 | Images | Classification | 2009 | [9][10] | A. Krizhevsky et al. |
Caltech-UCSD Birds-200-2011 Dataset | Large dataset of images of birds. | Part locations for birds, bounding boxes, 312 binary attributes given | 11,788 | Images, text | Classification | 2011 | [11][12] | C. Wah et al. |
Oxford Flower Dataset | 17 category dataset of flowers. | Train/test splits, labeled images, | 1360 | Images, text | Classification | 2006 | [13][14] | M-E Nilsback et al. |
KITTI Vision Benchmark Dataset | Autonomous vehicles driving through a mid-size city captured images of various areas using cameras and laser scanners. | Many benchmarks extracted from data. | >100 GB of data | Images, text | Classification, object detection | 2012 | [15][16] | A Geiger et al. |
References
[edit]- ^ Everingham, Mark, et al. "The pascal visual object classes (voc) challenge."International journal of computer vision 88.2 (2010): 303-338.
- ^ Felzenszwalb, Pedro F., et al. "Object detection with discriminatively trained part-based models." Pattern Analysis and Machine Intelligence, IEEE Transactions on 32.9 (2010): 1627-1645.
- ^ Soomro, Khurram, Amir Roshan Zamir, and Mubarak Shah. "UCF101: A dataset of 101 human actions classes from videos in the wild." arXiv preprint arXiv:1212.0402 (2012).
- ^ Karpathy, Andrej, et al. "Large-scale video classification with convolutional neural networks." Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2014.
- ^ Jiang, Y. G., et al. "THUMOS challenge: Action recognition with a large number of classes." ICCV Workshop on Action Recognition with a Large Number of Classes, http://crcv. ucf. edu/ICCV13-Action-Workshop. 2013.
- ^ Simonyan, Karen, and Andrew Zisserman. "Two-stream convolutional networks for action recognition in videos." Advances in Neural Information Processing Systems. 2014.
- ^ Houben, Sebastian, et al. "Detection of traffic signs in real-world images: The German Traffic Sign Detection Benchmark." Neural Networks (IJCNN), The 2013 International Joint Conference on. IEEE, 2013.
- ^ Mathias, Mayeul, et al. "Traffic sign recognition—How far are we from the solution?." Neural Networks (IJCNN), The 2013 International Joint Conference on. IEEE, 2013.
- ^ a b Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks." Advances in neural information processing systems. 2012.
- ^ a b Gong, Yunchao, and Svetlana Lazebnik. "Iterative quantization: A procrustean approach to learning binary codes." Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE, 2011.
- ^ Wah, Catherine, et al. "The caltech-ucsd birds-200-2011 dataset." (2011).
- ^ Duan, Kun, et al. "Discovering localized attributes for fine-grained recognition." Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012.
- ^ Nilsback, Maria-Elena, and Andrew Zisserman. "A visual vocabulary for flower classification."Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on. Vol. 2. IEEE, 2006.
- ^ Razavian, Ali, et al. "CNN features off-the-shelf: an astounding baseline for recognition." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2014.
- ^ Geiger, Andreas, Philip Lenz, and Raquel Urtasun. "Are we ready for autonomous driving? the kitti vision benchmark suite." Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012.
- ^ Sturm, Jürgen, et al. "A benchmark for the evaluation of RGB-D SLAM systems." Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on. IEEE, 2012.