✏️
wiki
  • Introduction
  • Career
    • Data Science
      • Machine Learning
      • Natural Language Processing
      • Datasets
      • Computer Vision
      • Data Engineering
      • Web Scraping
      • Data Visualisation
    • Robotics
      • Localisation
        • Kalman Filtering
    • Web Development
      • React
  • Tech
    • Apps
      • Mac OS
        • Magnet
        • PDF Expert
        • PixelSnap
        • Digital Colour Meter
        • Daisy Disk
        • Alfred
        • Bartender
        • iStat Menus
      • Linux
        • Playerctl
      • Terminal Emulators
      • Text Editors
      • Shell
    • CLI
      • Git & Github
      • Monitoring
    • Desktop Customisation
  • Lifestyle
    • In progress... :)
  • Blog
    • Why I Migrated from Windows to Linux
Powered by GitBook
On this page

Was this helpful?

  1. Career
  2. Data Science

Datasets

Have trouble finding data? I have compiled a list of potentially useful sites to help you find free datasets.

PreviousNatural Language ProcessingNextComputer Vision

Last updated 4 years ago

Was this helpful?

Resources

  • is a popular online data science community to share datasets, build and compete with machine learning models, and learn from others.

  • provides some open data datasets that can be searched .

  • is Google's tool for finding online datasets.

  • contains millions of images good for classification or object-detection in computer vision.

  • is an annual computer vision competition that also shares its image dataset with the community.

  • consists of millions of images including segmentation, classification and annotation.

  • matches research papers with datasets used.

  • is a large-scale labeled video dataset containing millions of annotated YouTube videos.

  • crawls all projects and neatly compiles them in a JSON or CSV format every month.

  • compiled millions of reviews from .

  • compiles information and trends about countries such as GDP, population.

  • provides a list of 25 open datasets for computer vision, natural language processing, audio, etc.

  • The community may be helpful if all else fails.

Kaggle
data.world
here
Google Dataset Search
AWS Open Data Registry
UC Irvine Machine Learning Repository
ImageNet
COCO
Open Images Dataset
Papers with Code
Youtube-8M
Web Robots
Kickstarter
SNAP
amazon
World Bank Open Data
data.gov.in
This blog
datasets subreddit