Datasets
Have trouble finding data? I have compiled a list of potentially useful sites to help you find free datasets.
Resources
Kaggle is a popular online data science community to share datasets, build and compete with machine learning models, and learn from others.
data.world provides some open data datasets that can be searched here.
Google Dataset Search is Google's tool for finding online datasets.
ImageNet contains millions of images good for classification or object-detection in computer vision.
COCO is an annual computer vision competition that also shares its image dataset with the community.
Open Images Dataset consists of millions of images including segmentation, classification and annotation.
Papers with Code matches research papers with datasets used.
Youtube-8M is a large-scale labeled video dataset containing millions of annotated YouTube videos.
Web Robots crawls all Kickstarter projects and neatly compiles them in a JSON or CSV format every month.
World Bank Open Data compiles information and trends about countries such as GDP, population.
This blog provides a list of 25 open datasets for computer vision, natural language processing, audio, etc.
The datasets subreddit community may be helpful if all else fails.
Last updated
Was this helpful?