Datasets
Datasets Collection
- ATLAS of economic complexity – Center for International Development at Harvard University
- Data – National Bureau of Economic Research
- EU Open Data Portal – Access to European Union open data
- EU Open Data Portal – 2017 Techno-economics for larger heating and cooling technologies
- Federal Reserve Bank of St. Louis
- Google Datasets
- Google Tools
- ICB Project International Crisis Behavior
- IMF DataMapper
- IMF DataMapper – Real GDP growth Annual percent change
- International Banking Library – Data
- International Banking Library – Systemic Risk Indicators and Financial Crises
- OECD Data
- OECD Statistics
- OPEN DATA DK
- Machine Learning Repository
- Quandl
- Socrata Opendata
- Socrata Opendata – Airplane Crashes and Fatalities Since 1908
- The Humanitarian Data Exchange
- Trading Economics
- Trading Economics – API
- Trading Economics – Country-list Interest-rate
- Worldbank Data Catalog
- Worldbank Data Catalog – Quarterly External Debt Statistics SDDS
- Aabne data – Offentlige datasaet
Image Datasets
Natural Language Processing
Audio/Speech Datasets
Analytics Vidhya Practice Problems
The key to getting better at deep learning (or most fields in life) is practice. Practice on a variety of problems – from image processing to speech recognition. Each of these problem has it’s own unique nuance and approach. But where can you get this data? A lot of research papers you see these … www.analyticsvidhya.com |
Images Datasets
Open Images Dataset V5 + Extensions. 15,851,536 boxes on 600 categories. 2,785,498 instance segmentations on 350 categories. 36,464,560 image-level labels on 19,959 …
storage.googleapis.com
THE MNIST DATABASE of handwritten digits Yann LeCun, Courant Institute, NYU Corinna Cortes, Google Labs, New York Christopher J.C. Burges, Microsoft Research, Redmond The MNIST database of handwritten digits, available from this page, has a training set of 60,000 examples, and a test set of 10,000 examples.
yann.lecun.com
ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images. Currently we have an average of over five hundred images per node. We hope ImageNet will become a useful resource for researchers, educators, students and all of you who share our passion for pictures.
www.image-net.org
The Open Images dataset. Contribute to openimages/dataset development by creating an account on GitHub. github.com |
On VQA v2.0 dataset. Challenge deadline: May 20, 2018. VQA is a new dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer. visualqa.org |
10 classes, 1 for each digit. Digit ‘1’ has label 1, ‘9’ has label 9 and ‘0’ has label 10.; 73257 digits for training, 26032 digits for testing, and 531131 additional, somewhat less difficult samples, to use as extra training data ufldl.stanford.edu |
Back to Alex Krizhevsky’s home page. The CIFAR-10 and CIFAR-100 are labeled subsets of the 80 million tiny images dataset. They were collected by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton.
www.cs.toronto.edu
Also, an official Tensorflow tutorial of using tf.keras, a high-level API to train Fashion-MNIST can be found here.. Loading data with other machine learning libraries. To date, the following libraries have included Fashion-MNIST as a built-in dataset. Therefore, you don’t need to download Fashion-MNIST by yourself. Just follow their API and you are ready to go. github.com |
Datasets Articles
In this part of our series of articles on open datasets for machine learning, we’ll feature 17 best finance and economic datasets. gengo.ai |
The key to getting better at deep learning (or most fields in life) is practice. Practice on a variety of problems – from image processing to speech recognition. Each of these problem has it’s own unique nuance and approach. But where can you get this data? A lot of research papers you see these … www.analyticsvidhya.com |
Whether you’re doing a science project, creating a cool infographic, or giving a presentation, data makes everything more interesting. But gathering interesting data makes you want to pull your hair out and not everyone has the resources to gather data on a large scale. Luckily, there are enough … piktochart.com |
Looking for public data sets could be a challenge. Therefore, we’ve created a comprehensive list of the best machine learning datasets in one place, grouped into sections according to dataset sources, types, and a number of topics. Choose the one for you out of these publicly available datasets. www.altexsoft.com |
Government, State, City, Local, public data sites and portals Data APIs, Hubs, Marketplaces, Platforms, and Search Engines. Data Mining and Data Science Competitions Google Dataset Search …
www.kdnuggets.com
Introduction. We wish that data sets from India are readily available to practitioners across the world for research and development purposes. We have hosts some data sets below.
ml-india.org
Curated list of free, high-quality datasets for data science and machine learning. Organized into 11 of the most popular use cases. elitedatascience.com |
If you’ve ever worked on a personal data science project, you’ve probably spent a lot of time browsing the internet looking for interesting data sets to analyze. It can be fun to sift through dozens of data sets to find the perfect one. But it can also be frustrating to download and import … www.dataquest.io |
These datasets are used for machine-learning research and have been cited in peer-reviewed academic journals. Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such as deep learning), computer hardware, and, less-intuitively, the availability of high-quality training datasets. en.wikipedia.org |
Below is a wealth of links pointing out to free and open datasets that can be used to build predictive models. We hope that our readers will make the best use of these by gaining insights into the … blog.bigml.com |
Welcome to the data repository for the Machine Learning course by Kirill Eremenko and Hadelin de Ponteves. The datasets and other supplementary materials are below. Enjoy!
www.superdatascience.com
Open Datasets. Machine learning starts by getting the right data. Below you will find a list of links to publicly available datasets for a variety of domains.
skymind.ai
Public Data Sets. Below are links to publicly available data sets and resources. Datasets are such an integral part of data science and algorithms that it’s almost impossible to talk about our space without talking about data.
h2o-release.s3.amazonaws.com
What are some open datasets for machine learning? We at Lionbridge decided to create the ultimate cheat sheet for high quality datasets. gengo.ai |
Top 15 Datasets for Machine Learning and Statistics Projects -[Infographic] – datasciencelearner.com Specially the beginner who just started with data science waste lot of time in searching the best Datasets for machine learning projects . To help them out and save their valuable time , We have designed this article which include chain of data source links for Datasets for machine learning projects. www.datasciencelearner.com |
Use the sample datasets in Azure Machine Learning Studio. 01/19/2018; 14 minutes to read +7; In this article. When you create a new workspace in Azure Machine Learning Studio, a number of sample datasets and experiments are included by default. docs.microsoft.com |