Public Datasets =============== .. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png Agriculture ----------- * |OK_ICON| `The global dataset of historical yields for major crops 1981–2016 - The Global Dataset of [...] `_ [`Meta `_] * |OK_ICON| `Hyperspectral benchmark dataset on soil moisture - This dataset was measured in a five-day [...] `_ [`Meta `_] * |OK_ICON| `Lemons quality control dataset - Lemon dataset has been prepared to investigate the [...] `_ [`Meta `_] * |OK_ICON| `Optimized Soil Adjusted Vegetation Index - The IDB is a tool for working with remote sensing [...] `_ [`Meta `_] * |OK_ICON| `U.S. Department of Agriculture's PLANTS Database - The Complete PLANTS Checklist is nearly 7 [...] `_ [`Meta `_] Architecture ------------ * |OK_ICON| `Swiss Apartment Models - This dataset contains detailed data on 42,207 apartments (242,257 [...] `_ [`Meta `_] Biology ------- * |OK_ICON| `1000 Genomes - The 1000 Genomes Project ran between 2008 and 2015, creating the largest [...] `_ [`Meta `_] * |OK_ICON| `ANHIR - Automatic Non-rigid Histological Image Registration (ANHIR) consists of 2D [...] `_ [`Meta `_] * |OK_ICON| `American Gut (Microbiome Project) - The American Gut project is the largest crowdsourced [...] `_ [`Meta `_] * |OK_ICON| `BCNB - There are WSIs of 1058 patients, part of tumor regions are annotated in WSIs. Except [...] `_ [`Meta `_] * |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) - The Broad Bioimage Benchmark Collection (BBBC) [...] `_ [`Meta `_] * |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ [`Meta `_] * |OK_ICON| `CIMA - CIMA dataset includes images of 2D histological microscopy tissue slices. `_ [`Meta `_] * |OK_ICON| `Cell Image Library - This library is a public and easily accessible resource database of [...] `_ [`Meta `_] * |OK_ICON| `CytoImageNet - A large-scale dataset of microscopy images. Contains 890,737 total grayscale [...] `_ [`Meta `_] * |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data stores data from high- [...] `_ [`Meta `_] * |OK_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank (EMDB) is a public [...] `_ [`Meta `_] * |OK_ICON| `ENCODE project - The Encyclopedia of DNA Elements (ENCODE) Consortium is an ongoing [...] `_ [`Meta `_] * |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron Microscopy Public [...] `_ [`Meta `_] * |OK_ICON| `Ensembl Genomes `_ [`Meta `_] * |OK_ICON| `Gene Expression Omnibus (GEO) - GEO is a public functional genomics data repository [...] `_ [`Meta `_] * |OK_ICON| `Gene Ontology (GO) - GO annotation files `_ [`Meta `_] * |OK_ICON| `Global Biotic Interactions (GloBI) `_ [`Meta `_] * |OK_ICON| `Harvard Medical School (HMS) LINCS Project - The Harvard Medical School (HMS) LINCS Center is [...] `_ [`Meta `_] * |OK_ICON| `Human Microbiome Project (HMP) - The HMP sequenced over 2000 reference genomes isolated from [...] `_ [`Meta `_] * |OK_ICON| `ICOS PSP Benchmark - The ICOS PSP benchmarks repository contains an adjustable real-world [...] `_ [`Meta `_] * |OK_ICON| `International HapMap Project `_ [`Meta `_] * |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of [...] `_ [`Meta `_] * |OK_ICON| `NCBI Proteins `_ [`Meta `_] * |OK_ICON| `NCBI Taxonomy - The NCBI Taxonomy database is a curated set of names and classifications for [...] `_ [`Meta `_] * |OK_ICON| `NCI Genomic Data Commons - The GDC Data Portal is a robust data-driven platform that allows [...] `_ [`Meta `_] * |OK_ICON| `NIH Microarray data `_ [`Meta `_] * |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer genetic tests to [...] `_ [`Meta `_] * |OK_ICON| `Palmer Penguins - The goal of palmerpenguins is to provide a great dataset for data [...] `_ [`Meta `_] * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ [`Meta `_] * |OK_ICON| `Protein Data Bank - This resource is powered by the Protein Data Bank archive-information [...] `_ [`Meta `_] * |OK_ICON| `Psychiatric Genomics Consortium - The purpose of the Psychiatric Genomics Consortium (PGC) is [...] `_ [`Meta `_] * |OK_ICON| `PubChem Project - PubChem is the world's largest collection of freely accessible chemical [...] `_ [`Meta `_] * |OK_ICON| `PubGene (now Coremine Medical) - COREMINE™ is a family of tools developed by the Norwegian [...] `_ [`Meta `_] * |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) - COSMIC, the Catalogue Of Somatic [...] `_ [`Meta `_] * |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ [`Meta `_] * |OK_ICON| `Sequence Read Archive(SRA) - The Sequence Read Archive (SRA) stores raw sequence data from [...] `_ [`Meta `_] * |OK_ICON| `Serratus - Analysis of 7.1 million RNA/DNA sequencing datasets to discover the total [...] `_ [`Meta `_] * |OK_ICON| `Stanford Microarray Data (Retired NOW) `_ [`Meta `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ [`Meta `_] * |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database - Systems Science of Biological [...] `_ [`Meta `_] * |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ [`Meta `_] * |OK_ICON| `The Catalogue of Life - The Catalogue of Life is a quality-assured checklist of more than 1.8 [...] `_ [`Meta `_] * |OK_ICON| `The Personal Genome Project - The Personal Genome Project, initiated in 2005, is a vision and [...] `_ [`Meta `_] * |OK_ICON| `UCSC Public Data `_ [`Meta `_] * |OK_ICON| `UniGene `_ [`Meta `_] * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource (UniProt) is a [...] `_ [`Meta `_] * |OK_ICON| `Rfam - The Rfam database is a collection of RNA families, each represented by multiple [...] `_ [`Meta `_] Chemistry --------- * |OK_ICON| `Ionic Liquids Database - ILThermo `_ [`Meta `_] Climate+Weather --------------- * |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) - Data related to climate and weather [...] `_ [`Meta `_] * |OK_ICON| `Caravan - a dataset for large-sample hydrology - Caravan is an open community dataset of [...] `_ [`Meta `_] * |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`Meta `_] * |OK_ICON| `Dutch Weather - The KNMI Data Center (KDC) portal provides access to KNMI data on weather, [...] `_ [`Meta `_] * |OK_ICON| `European Climate Assessment & Dataset `_ [`Meta `_] * |OK_ICON| `German Climate Data Center `_ [`Meta `_] * |OK_ICON| `Global Climate Data Since 1929 `_ [`Meta `_] * |OK_ICON| `Charting The Global Climate Change News Narrative 2009-2020 - These four datasets represent [...] `_ [`Meta `_] * |OK_ICON| `NASA Global Imagery Browse Services `_ [`Meta `_] * |OK_ICON| `NOAA Bering Sea Climate `_ [`Meta `_] * |OK_ICON| `NOAA Climate Datasets `_ [`Meta `_] * |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ [`Meta `_] * |OK_ICON| `Open-Meteo - Open-Source Weather API - Open-source weather API with free access for non- [...] `_ [`Meta `_] * |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ [`Meta `_] * |OK_ICON| `UEA Climatic Research Unit `_ [`Meta `_] * |OK_ICON| `WU Historical Weather Worldwide `_ [`Meta `_] * |OK_ICON| `Wahington Post Climate Change - To analyze warming temperatures in the United States, The [...] `_ [`Meta `_] * |OK_ICON| `WorldClim - Global Climate Data `_ [`Meta `_] ComplexNetworks --------------- * |OK_ICON| `AMiner Citation Network Dataset `_ [`Meta `_] * |OK_ICON| `CrossRef DOI URLs `_ [`Meta `_] * |OK_ICON| `DBLP Citation dataset `_ [`Meta `_] * |OK_ICON| `DIMACS Road Networks Collection `_ [`Meta `_] * |OK_ICON| `NBER Patent Citations `_ [`Meta `_] * |OK_ICON| `NIST complex networks data collection `_ [`Meta `_] * |OK_ICON| `Protein-protein interaction network `_ [`Meta `_] * |OK_ICON| `PyPI and Maven Dependency Network `_ [`Meta `_] * |OK_ICON| `Scopus Citation Database `_ [`Meta `_] * |OK_ICON| `Small Network Data `_ [`Meta `_] * |OK_ICON| `Stanford GraphBase `_ [`Meta `_] * |OK_ICON| `Stanford Large Network Dataset Collection `_ [`Meta `_] * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`Meta `_] * |OK_ICON| `UCI Network Data Repository `_ [`Meta `_] * |OK_ICON| `UFL sparse matrix collection `_ [`Meta `_] ComputerNetworks ---------------- * |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ [`Meta `_] * |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ [`Meta `_] * |OK_ICON| `CAIDA Internet Datasets `_ [`Meta `_] * |OK_ICON| `ClueWeb09 - 1B web pages `_ [`Meta `_] * |OK_ICON| `ClueWeb12 - 733M web pages `_ [`Meta `_] * |OK_ICON| `CommonCrawl Web Data over 7 years `_ [`Meta `_] * |OK_ICON| `Shopper Intent Prediction from Clickstream E‑Commerce Data with Minimal Browsing Information `_ [`Meta `_] * |OK_ICON| `Criteo click-through data `_ [`Meta `_] * |OK_ICON| `Internet-Wide Scan Data Repository `_ [`Meta `_] * |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic analysis with [...] `_ [`Meta `_] * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ [`Meta `_] * |OK_ICON| `Open Mobile Data by MobiPerf `_ [`Meta `_] * |OK_ICON| `The Peer-to-Peer Trace Archive - Real-world measurements play a key role in studying the [...] `_ [`Meta `_] * |OK_ICON| `Rapid7 Sonar Internet Scans `_ [`Meta `_] * |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ [`Meta `_] CyberSecurity ------------- * |OK_ICON| `CCCS-CIC-AndMal-2020 - The dataset includes 200K benign and 200K malware samples totalling to [...] `_ [`Meta `_] * |OK_ICON| `Traffic and Log Data Captured During a Cyber Defense Exercise - This dataset was acquired [...] `_ [`Meta `_] DataChallenges -------------- * |OK_ICON| `AIcrowd Competitions `_ [`Meta `_] * |OK_ICON| `Bruteforce Database `_ [`Meta `_] * |OK_ICON| `Challenges in Machine Learning `_ [`Meta `_] * |OK_ICON| `DrivenData Competitions for Social Good `_ [`Meta `_] * |OK_ICON| `ICWSM Data Challenge (since 2009) `_ [`Meta `_] * |OK_ICON| `KDD Cup by Tencent 2012 `_ [`Meta `_] * |OK_ICON| `Kaggle Competition Data `_ [`Meta `_] * |OK_ICON| `Localytics Data Visualization Challenge `_ [`Meta `_] * |OK_ICON| `Netflix Prize `_ [`Meta `_] * |OK_ICON| `Space Apps Challenge `_ [`Meta `_] * |OK_ICON| `Yelp Dataset Challenge - The Yelp dataset is a subset of our businesses, reviews, and user [...] `_ [`Meta `_] EarthScience ------------ * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their manually extracted [...] `_ [`Meta `_] * |OK_ICON| `AQUASTAT - Global water resources and uses `_ [`Meta `_] * |OK_ICON| `BODC - marine data of ~22K vars `_ [`Meta `_] * |OK_ICON| `EOSDIS - NASA's earth observing system data `_ [`Meta `_] * |OK_ICON| `Earth Models `_ [`Meta `_] * |OK_ICON| `Global Wind Atlas - The Global Wind Atlas is a free, web-based application developed to help [...] `_ [`Meta `_] * |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ [`Meta `_] * |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - long-term estuarine [...] `_ [`Meta `_] * |OK_ICON| `Oil and Gas Authority Open Data - The dataset covers 12,500 offshore wellbores, 5,000 seismic [...] `_ [`Meta `_] * |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ [`Meta `_] * |OK_ICON| `USGS Earthquake Archives `_ [`Meta `_] * |OK_ICON| `Wellhead Protection Area (protection zone) prediction using breakthrough curves - This [...] `_ [`Meta `_] Economics --------- * |OK_ICON| `Asian Productivity Organization (APO) - The AEPM provides a graphic dashboard view of [...] `_ [`Meta `_] * |OK_ICON| `ASEAN Stats - The ASEANstatsDataPortal was first launched in June 2018. The Portal is [...] `_ [`Meta `_] * |OK_ICON| `American Economic Association (AEA) `_ [`Meta `_] * |OK_ICON| `Asian KLEMS - Asia KLEMS is an Asian regional research consortium to promote building [...] `_ [`Meta `_] * |OK_ICON| `Harvard Atlas of Economic Complexity - A database for people to explore global trade flows [...] `_ [`Meta `_] * |OK_ICON| `BIS Financial Database - The files contain the same data as in the BIS Statistics Explorer [...] `_ [`Meta `_] * |OK_ICON| `Barro-Lee Education Attainment - Barro-Lee Educational Attainment Data from 1950 to 2010. [...] `_ [`Meta `_] * |OK_ICON| `CEPII Database - A database of the world economy, through its country and region profiles, in [...] `_ [`Meta `_] * |OK_ICON| `EUKLEMS - EU KLEMS is an industry level, growth and productivity research project. EU KLEMS [...] `_ [`Meta `_] * |OK_ICON| `Economic Freedom of the World Data `_ [`Meta `_] * |OK_ICON| `Historical National Accounts - The datahub on Comparative Historical National Accounts [...] `_ [`Meta `_] * |OK_ICON| `Historical MacroEconomic Statistics `_ [`Meta `_] * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of millions of time series [...] `_ [`Meta `_] * |OK_ICON| `International Trade Statistics `_ [`Meta `_] * |OK_ICON| `Internet Product Code Database `_ [`Meta `_] * |OK_ICON| `Joint External Debt Data Hub `_ [`Meta `_] * |OK_ICON| `Latin America KLEMS - LAKLEMS is a technical cooperation project financed by the Inter- [...] `_ [`Meta `_] * |OK_ICON| `Long-Term Productivity Database - The Long-Term Productivity database was created as a [...] `_ [`Meta `_] * |OK_ICON| `Maddison Project Database - The Maddison Project Database provides information on comparative [...] `_ [`Meta `_] * |OK_ICON| `National Transfer Accounts - The goal of the National Transfer Accounts (NTA) project is to [...] `_ [`Meta `_] * |OK_ICON| `OpenCorporates Database of Companies in the World `_ [`Meta `_] * |OK_ICON| `Our World in Data `_ [`Meta `_] * |OK_ICON| `Penn World Table - PWT version 10.0 is a database with information on relative levels of [...] `_ [`Meta `_] * |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ [`Meta `_] * |OK_ICON| `The Atlas of Economic Complexity `_ [`Meta `_] * |OK_ICON| `The Center for International Data `_ [`Meta `_] * |OK_ICON| `UN Human Development Reports `_ [`Meta `_] * |OK_ICON| `World Input-Output Database - World Input-Output Tables and underlying data, covering 43 [...] `_ [`Meta `_] * |OK_ICON| `World KLEMS - Analytical KLEMS-type data sets for a broad set of countries around the world. [...] `_ [`Meta `_] Education --------- * |OK_ICON| `College Scorecard Data `_ [`Meta `_] * |OK_ICON| `New York State Education Department Data - The New York State Education Department (NYSED) is [...] `_ [`Meta `_] * |OK_ICON| `Student Data from Free Code Camp `_ [`Meta `_] Energy ------ * |OK_ICON| `AMPds - The Almanac of Minutely Power dataset `_ [`Meta `_] * |OK_ICON| `COMBED `_ [`Meta `_] * |OK_ICON| `DBFC - Direct Borohydride Fuel Cell (DBFC) Dataset `_ [`Meta `_] * |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ [`Meta `_] * |OK_ICON| `ECO - The ECO data set is a comprehensive data set for non-intrusive load monitoring and [...] `_ [`Meta `_] * |OK_ICON| `EIA `_ [`Meta `_] * |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a comprehensive, open source [...] `_ [`Meta `_] * |OK_ICON| `HES - Household Electricity Study, UK `_ [`Meta `_] * |OK_ICON| `HFED `_ [`Meta `_] * |OK_ICON| `MORED: a Moroccan Buildings’ Electricity Consumption Dataset - Since spring of 2019, a data [...] `_ [`Meta `_] * |OK_ICON| `Marktstammdatenregister - The German Marktstammdatenregister (MaStR) is a database of all [...] `_ [`Meta `_] * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ [`Meta `_] * |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`Meta `_] * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy data easier to [...] `_ [`Meta `_] * |OK_ICON| `SYND - A synthetic energy dataset for non-intrusive load monitoring - With SynD, we present a [...] `_ [`Meta `_] * |OK_ICON| `Tracebase `_ [`Meta `_] * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`Meta `_] * |OK_ICON| `WHITED `_ [`Meta `_] * |OK_ICON| `iAWE `_ [`Meta `_] Entertainment ------------- * |OK_ICON| `Top Streamers on Twitch - This contains data of Top 1000 Streamers from past year. `_ [`Meta `_] Finance ------- * |OK_ICON| `BIS Statistics - BIS statistics, compiled in cooperation with central banks and other [...] `_ [`Meta `_] * |OK_ICON| `Blockmodo Coin Registry - A registry of JSON formatted information files that is primarily [...] `_ [`Meta `_] * |OK_ICON| `Complete FAANG Stock data - This data set contains all the stock data of FAANG companies from [...] `_ [`Meta `_] * |OK_ICON| `Google Finance `_ [`Meta `_] * |OK_ICON| `Google Trends `_ [`Meta `_] * |OK_ICON| `NASDAQ `_ [`Meta `_] * |OK_ICON| `NYSE Market Data `_ [`Meta `_] * |OK_ICON| `Quandl `_ [`Meta `_] * |OK_ICON| `St Louis Federal `_ [`Meta `_] * |OK_ICON| `Yahoo Finance `_ [`Meta `_] GIS --- * |OK_ICON| `Awesome 3D Semantic City Models - Collection of open 3D semantic city and region models. `_ [`Meta `_] * |OK_ICON| `ArcGIS Open Data portal `_ [`Meta `_] * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`Meta `_] * |OK_ICON| `Database of all continents, countries, States/Subdivisions/Provinces and Cities - Database [...] `_ [`Meta `_] * |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ [`Meta `_] * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`Meta `_] * |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ [`Meta `_] * |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ [`Meta `_] * |OK_ICON| `GeoNames Worldwide `_ [`Meta `_] * |OK_ICON| `Global Administrative Areas Database (GADM) - Geospatial data organized by country. Includes [...] `_ [`Meta `_] * |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`Meta `_] * |OK_ICON| `Landsat 8 on AWS `_ [`Meta `_] * |OK_ICON| `List of all countries in all languages `_ [`Meta `_] * |OK_ICON| `National Weather Service GIS Data Portal `_ [`Meta `_] * |OK_ICON| `OpenAddresses `_ [`Meta `_] * |OK_ICON| `OpenStreetMap (OSM) `_ [`Meta `_] * |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ [`Meta `_] * |OK_ICON| `Reverse Geocoder using OSM data `_ [`Meta `_] * |OK_ICON| `Robin Wilson - Free GIS Datasets `_ [`Meta `_] * |OK_ICON| `Shadow Accrual Maps - The repository contains the accumulated shadow information for New York [...] `_ [`Meta `_] * |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`Meta `_] * |OK_ICON| `TZ Timezones shapefile `_ [`Meta `_] * |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ [`Meta `_] * |OK_ICON| `UN Environmental Data `_ [`Meta `_] * |OK_ICON| `World boundaries from the U.S. Department of State `_ [`Meta `_] * |OK_ICON| `World countries in multiple formats `_ [`Meta `_] Government ---------- * |OK_ICON| `Alberta, Province of Canada `_ [`Meta `_] * |OK_ICON| `Austin, TX, US `_ [`Meta `_] * |OK_ICON| `Australia (abs.gov.au) `_ [`Meta `_] * |OK_ICON| `Australia (data.gov.au) `_ [`Meta `_] * |OK_ICON| `Austria (data.gv.at) `_ [`Meta `_] * |OK_ICON| `Baton Rouge, LA, US `_ [`Meta `_] * |OK_ICON| `Belgium `_ [`Meta `_] * |OK_ICON| `City of Berkeley Open Data `_ [`Meta `_] * |OK_ICON| `Brazil `_ [`Meta `_] * |OK_ICON| `Buenos Aires, Argentina `_ [`Meta `_] * |OK_ICON| `Calgary, AB, Canada `_ [`Meta `_] * |OK_ICON| `Cambridge, MA, US `_ [`Meta `_] * |OK_ICON| `Canada `_ [`Meta `_] * |OK_ICON| `Chicago `_ [`Meta `_] * |OK_ICON| `Dallas Open Data `_ [`Meta `_] * |OK_ICON| `DataBC - data from the Province of British Columbia `_ [`Meta `_] * |OK_ICON| `Debt to the Penny - The Debt to the Penny dataset provides information about the total [...] `_ [`Meta `_] * |OK_ICON| `Denver Open Data `_ [`Meta `_] * |OK_ICON| `Durham, NC Open Data `_ [`Meta `_] * |OK_ICON| `Edmonton, AB, Canada `_ [`Meta `_] * |OK_ICON| `England LGInform `_ [`Meta `_] * |OK_ICON| `EuroStat `_ [`Meta `_] * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ [`Meta `_] * |OK_ICON| `Federal Committee on Statistical Methodology (FCSM) (formerly FedStats) `_ [`Meta `_] * |OK_ICON| `Finland `_ [`Meta `_] * |OK_ICON| `France `_ [`Meta `_] * |OK_ICON| `Gatineau, QC, Canada `_ [`Meta `_] * |OK_ICON| `Germany `_ [`Meta `_] * |OK_ICON| `Ghent, Belgium `_ [`Meta `_] * |OK_ICON| `Glasgow, Scotland, UK `_ [`Meta `_] * |OK_ICON| `Greece `_ [`Meta `_] * |OK_ICON| `Guardian world governments `_ [`Meta `_] * |OK_ICON| `Helsinki Region, Finland `_ [`Meta `_] * |OK_ICON| `Hong Kong, China `_ [`Meta `_] * |OK_ICON| `Houston, TX, US `_ [`Meta `_] * |OK_ICON| `Indian Government Data `_ [`Meta `_] * |OK_ICON| `Indonesian Data Portal `_ [`Meta `_] * |OK_ICON| `Iowa - Welcome to the State of Iowa's data portal. Please explore data about Iowa and your [...] `_ [`Meta `_] * |OK_ICON| `Ireland's Open Data Portal `_ [`Meta `_] * |OK_ICON| `Israel's Open Data Portal `_ [`Meta `_] * |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati relativi ai dati [...] `_ [`Meta `_] * |OK_ICON| `Japan `_ [`Meta `_] * |OK_ICON| `Laval, QC, Canada `_ [`Meta `_] * |OK_ICON| `Lexington, KY `_ [`Meta `_] * |OK_ICON| `London Datastore, UK `_ [`Meta `_] * |OK_ICON| `Los Angeles Open Data `_ [`Meta `_] * |OK_ICON| `Luxembourg - Luxembourgish Open Data Portal `_ [`Meta `_] * |OK_ICON| `MassGIS, Massachusetts, U.S. `_ [`Meta `_] * |OK_ICON| `Metropolitan Transportation Commission (MTC), California, US `_ [`Meta `_] * |OK_ICON| `Mexico `_ [`Meta `_] * |OK_ICON| `Mississauga, ON, Canada `_ [`Meta `_] * |OK_ICON| `Moldova `_ [`Meta `_] * |OK_ICON| `Moncton, NB, Canada `_ [`Meta `_] * |OK_ICON| `Montreal, QC, Canada `_ [`Meta `_] * |OK_ICON| `Mountain View, California, US (GIS) `_ [`Meta `_] * |OK_ICON| `NYC betanyc `_ [`Meta `_] * |OK_ICON| `Netherlands `_ [`Meta `_] * |OK_ICON| `New York Department of Sanitation Monthly Tonnage - DSNY Monthly Tonnage Data provides [...] `_ [`Meta `_] * |OK_ICON| `New Zealand `_ [`Meta `_] * |OK_ICON| `OECD `_ [`Meta `_] * |OK_ICON| `Oklahoma `_ [`Meta `_] * |OK_ICON| `Open Government Data (OGD) Platform India `_ [`Meta `_] * |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ [`Meta `_] * |OK_ICON| `Oregon `_ [`Meta `_] * |OK_ICON| `Ottawa, ON, Canada `_ [`Meta `_] * |OK_ICON| `Palo Alto, California, US `_ [`Meta `_] * |OK_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open data in the Philadelphia region. In [...] `_ [`Meta `_] * |OK_ICON| `Portland, Oregon `_ [`Meta `_] * |OK_ICON| `Portugal - Pordata organization `_ [`Meta `_] * |OK_ICON| `Quebec Province of Canada `_ [`Meta `_] * |OK_ICON| `Regina SK, Canada `_ [`Meta `_] * |OK_ICON| `Rio de Janeiro, Brazil `_ [`Meta `_] * |OK_ICON| `Romania `_ [`Meta `_] * |OK_ICON| `San Diego, CA `_ [`Meta `_] * |OK_ICON| `San Antonio, TX - Community Information Now - CI:Now is a nonprofit serving Bexar (San [...] `_ [`Meta `_] * |OK_ICON| `San Francisco Data sets `_ [`Meta `_] * |OK_ICON| `San Jose, California, US `_ [`Meta `_] * |OK_ICON| `San Mateo County, California, US `_ [`Meta `_] * |OK_ICON| `Seattle `_ [`Meta `_] * |OK_ICON| `Singapore Government Data `_ [`Meta `_] * |OK_ICON| `South Africa Trade Statistics `_ [`Meta `_] * |OK_ICON| `South Africa `_ [`Meta `_] * |OK_ICON| `State of Utah, US `_ [`Meta `_] * |OK_ICON| `Switzerland `_ [`Meta `_] * |OK_ICON| `Taiwan gov `_ [`Meta `_] * |OK_ICON| `Taiwan `_ [`Meta `_] * |OK_ICON| `Texas Open Data `_ [`Meta `_] * |OK_ICON| `The World Bank `_ [`Meta `_] * |OK_ICON| `Toronto, ON, Canada `_ [`Meta `_] * |OK_ICON| `Tunisia `_ [`Meta `_] * |OK_ICON| `U.K. Government Data `_ [`Meta `_] * |OK_ICON| `U.S. American Community Survey `_ [`Meta `_] * |OK_ICON| `U.S. CDC Public Health datasets `_ [`Meta `_] * |OK_ICON| `U.S. Census Bureau `_ [`Meta `_] * |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ [`Meta `_] * |OK_ICON| `U.S. Federal Government Data Catalog `_ [`Meta `_] * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ [`Meta `_] * |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`Meta `_] * |OK_ICON| `U.S. Open Government `_ [`Meta `_] * |OK_ICON| `UK 2011 Census Open Atlas Project `_ [`Meta `_] * |OK_ICON| `US Counties - This is a repository of various data, broken down by US county. While most of [...] `_ [`Meta `_] * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ [`Meta `_] * |OK_ICON| `Ukraine `_ [`Meta `_] * |OK_ICON| `United Nations `_ [`Meta `_] * |OK_ICON| `Uruguay `_ [`Meta `_] * |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ [`Meta `_] * |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`Meta `_] * |OK_ICON| `Victoria, BC, Canada `_ [`Meta `_] * |OK_ICON| `Vienna, Austria `_ [`Meta `_] * |OK_ICON| `U.S. Congressional Research Service (CRS) Reports `_ [`Meta `_] Healthcare ---------- * |OK_ICON| `AWS COVID-19 Datasets - We're working with organizations who make COVID-19-related data [...] `_ [`Meta `_] * |OK_ICON| `COVID-19 Case Surveillance Public Use Data - The COVID-19 case surveillance system database [...] `_ [`Meta `_] * |OK_ICON| `Covid-19 non-processed data of Ecuador - It's a project which provides non-processed datasets [...] `_ [`Meta `_] * |OK_ICON| `2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins CSSE - This is the data [...] `_ [`Meta `_] * |OK_ICON| `Coronavirus (Covid-19) Data in the United States - The New York Times is releasing a series [...] `_ [`Meta `_] * |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ [`Meta `_] * |OK_ICON| `The COVID Tracking Project - The COVID Tracking Project collects and publishes the most [...] `_ [`Meta `_] * |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ [`Meta `_] * |OK_ICON| `Gapminder World demographic databases `_ [`Meta `_] * |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`Meta `_] * |OK_ICON| `MeDAL - A large medical text dataset curated for abbreviation disambiguation - Medical [...] `_ [`Meta `_] * |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ [`Meta `_] * |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ [`Meta `_] * |OK_ICON| `Medicare Data File `_ [`Meta `_] * |OK_ICON| `Nightingale Open Science `_ [`Meta `_] * |OK_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`Meta `_] * |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`Meta `_] * |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ [`Meta `_] * |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ [`Meta `_] * |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ [`Meta `_] * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ [`Meta `_] * |OK_ICON| `World Health Organization Global Health Observatory `_ [`Meta `_] * |OK_ICON| `Yahoo Knowledge Graph COVID-19 Datasets - The Yahoo Knowledge Graph team at Verizon Media is [...] `_ [`Meta `_] * |OK_ICON| `Informatics for Integrating Biology and the Bedside `_ [`Meta `_] ImageProcessing --------------- * |OK_ICON| `10k US Adult Faces Database `_ [`Meta `_] * |OK_ICON| `Audience Unfiltered faces for gender and age classification `_ [`Meta `_] * |OK_ICON| `Affective Image Classification `_ [`Meta `_] * |OK_ICON| `Airborne Object Detection and Tracking - The Airborne Object Tracking (AOT) dataset is a [...] `_ [`Meta `_] * |OK_ICON| `Animals with attributes `_ [`Meta `_] * |OK_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - Contains 10K stereo pair [...] `_ [`Meta `_] * |OK_ICON| `Cytology Dataset – CCAgT: Images of Cervical Cells with AgNOR Stain Technique - Contains 9339 [...] `_ [`Meta `_] * |OK_ICON| `Cube++ - 4890 raw 18-megapixel images, each containing a SpyderCube color target in their [...] `_ [`Meta `_] * |OK_ICON| `Densely Annotated Video Driving Data Set - This data set consists of 28 video sequences of [...] `_ [`Meta `_] * |OK_ICON| `Danbooru Tagged Anime Illustration Dataset - A large-scale anime image database with 3.33m+ [...] `_ [`Meta `_] * |OK_ICON| `Face Recognition Benchmark `_ [`Meta `_] * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`Meta `_] * |OK_ICON| `HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video sequences (4 grayscale [...] `_ [`Meta `_] * |OK_ICON| `ImageNet (in WordNet hierarchy) `_ [`Meta `_] * |OK_ICON| `Indoor Scene Recognition `_ [`Meta `_] * |OK_ICON| `International Affective Picture System, UFL `_ [`Meta `_] * |OK_ICON| `KITTI Vision Benchmark Suite `_ [`Meta `_] * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - Contains over 10 [...] `_ [`Meta `_] * |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ [`Meta `_] * |OK_ICON| `Multi-View Region of Interest Prediction Dataset for Autonomous Driving - Contains 16 driving [...] `_ [`Meta `_] * |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ [`Meta `_] * |OK_ICON| `Newspaper Navigator - This dataset consists of extracted visual content for 16,358,041 [...] `_ [`Meta `_] * |OK_ICON| `Open Images From Google - Pictures with segmentation masks for 2.8 million object instances [...] `_ [`Meta `_] * |OK_ICON| `RuFa - Contains images of text written in one of two Arabic fonts (Ruqaa and Nastaliq [...] `_ [`Meta `_] * |OK_ICON| `SUN database, MIT `_ [`Meta `_] * |OK_ICON| `SVIRO Synthetic Vehicle Interior Rear Seat Occupancy - 25.000 synthetic scenery's across ten [...] `_ [`Meta `_] * |OK_ICON| `Stanford Dogs Dataset `_ [`Meta `_] * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ [`Meta `_] * |OK_ICON| `The Oxford-IIIT Pet Dataset `_ [`Meta `_] * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ [`Meta `_] * |OK_ICON| `YouTube Faces Database `_ [`Meta `_] MachineLearning --------------- * |OK_ICON| `All-Age-Faces Dataset - Contains 13'322 Asian face images distributed across all ages (from 2 [...] `_ [`Meta `_] * |OK_ICON| `Audi Autonomous Driving Dataset - We have published the Audi Autonomous Driving Dataset [...] `_ [`Meta `_] * |OK_ICON| `B3FD - Facial age (and gender) estimation dataset with 375k images - The B3FD dataset is a [...] `_ [`Meta `_] * |OK_ICON| `Context-aware data sets from five domains `_ [`Meta `_] * |OK_ICON| `Delve Datasets for classification and regression `_ [`Meta `_] * |OK_ICON| `Discogs Monthly Data `_ [`Meta `_] * |OK_ICON| `Fluorescent Neuronal Cells - By releasing this dataset, we aim at providing a new testbed for [...] `_ [`Meta `_] * |OK_ICON| `Free Music Archive `_ [`Meta `_] * |OK_ICON| `IMDb Database `_ [`Meta `_] * |OK_ICON| `Iranis - A Large-scale Dataset of Farsi/Arabic License Plate Characters `_ [`Meta `_] * |OK_ICON| `Keel Repository for classification, regression and time series `_ [`Meta `_] * |OK_ICON| `LLVIP - This dataset contains 30976 images, or 15488 pairs, most of which were taken at very [...] `_ [`Meta `_] * |OK_ICON| `Labeled Faces in the Wild (LFW) `_ [`Meta `_] * |OK_ICON| `Lending Club Loan Data `_ [`Meta `_] * |OK_ICON| `Million Song Dataset `_ [`Meta `_] * |OK_ICON| `More Song Datasets `_ [`Meta `_] * |OK_ICON| `MovieLens Data Sets `_ [`Meta `_] * |OK_ICON| `New Yorker caption contest ratings `_ [`Meta `_] * |OK_ICON| `Restaurants Health Score Data in San Francisco `_ [`Meta `_] * |OK_ICON| `TikTok Dataset - More than 300 dance videos that capture a single person performing dance [...] `_ [`Meta `_] * |OK_ICON| `UCI Machine Learning Repository `_ [`Meta `_] * |OK_ICON| `Yahoo! Ratings and Classification Data `_ [`Meta `_] * |OK_ICON| `YouTube-BoundingBoxes `_ [`Meta `_] * |OK_ICON| `Youtube 8m `_ [`Meta `_] * |OK_ICON| `eBay Online Auctions (2012) `_ [`Meta `_] Museums ------- * |OK_ICON| `Cooper-Hewitt's Collection Database `_ [`Meta `_] * |OK_ICON| `Metropolitan Museum of Art Collection API `_ [`Meta `_] * |OK_ICON| `Minneapolis Institute of Arts metadata `_ [`Meta `_] * |OK_ICON| `Natural History Museum (London) Data Portal `_ [`Meta `_] * |OK_ICON| `Rijksmuseum Historical Art Collection `_ [`Meta `_] * |OK_ICON| `Tate Collection metadata `_ [`Meta `_] * |OK_ICON| `The Getty vocabularies `_ [`Meta `_] NaturalLanguage --------------- * |OK_ICON| `Automatic Keyphrase Extraction `_ [`Meta `_] * |OK_ICON| `The Big Bad NLP Database `_ [`Meta `_] * |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from professional audiobooks [...] `_ [`Meta `_] * |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ [`Meta `_] * |OK_ICON| `ClueWeb09 FACC `_ [`Meta `_] * |OK_ICON| `ClueWeb12 FACC `_ [`Meta `_] * |OK_ICON| `DBpedia - Structured data from Wikipedia `_ [`Meta `_] * |OK_ICON| `Dirty Words - With millions of images in our library and billions of user-submitted keywords, [...] `_ [`Meta `_] * |OK_ICON| `German Political Speeches Corpus - Collection of political speeches from the German [...] `_ [`Meta `_] * |OK_ICON| `Google Books Ngrams (2.2TB) `_ [`Meta `_] * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ [`Meta `_] * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ [`Meta `_] * |OK_ICON| `LJ Speech - Speech dataset consisting of 13,100 short audio clips of a single speaker reading [...] `_ [`Meta `_] * |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`Meta `_] * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ [`Meta `_] * |OK_ICON| `Machine Translation of European languages `_ [`Meta `_] * |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`Meta `_] * |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ [`Meta `_] * |OK_ICON| `No Language Left Behind (NLLB - 200vo) - Dataset based on Meta's metadata for mined bitext. [...] `_ [`Meta `_] * |OK_ICON| `Noisy speech database for training speech enhancement algorithms and TTS models - Clean and [...] `_ [`Meta `_] * |OK_ICON| `POS/NER/Chunk annotated data `_ [`Meta `_] * |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ [`Meta `_] * |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ [`Meta `_] * |OK_ICON| `USENET postings corpus of 2005~2011 `_ [`Meta `_] * |OK_ICON| `Universal Dependencies `_ [`Meta `_] * |OK_ICON| `Wikidata - Wikipedia databases `_ [`Meta `_] * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`Meta `_] * |OK_ICON| `WordNet databases and tools `_ [`Meta `_] * |OK_ICON| `WorldTree Corpus of Explanation Graphs for Elementary Science Questions - a corpus of [...] `_ [`Meta `_] Neuroscience ------------ * |OK_ICON| `Allen Institute Datasets `_ [`Meta `_] * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`Meta `_] * |OK_ICON| `FCP-INDI `_ [`Meta `_] * |OK_ICON| `Human Connectome Project `_ [`Meta `_] * |OK_ICON| `NDAR `_ [`Meta `_] * |OK_ICON| `NIMH Data Archive `_ [`Meta `_] * |OK_ICON| `NeuroData `_ [`Meta `_] * |OK_ICON| `Neuroelectro `_ [`Meta `_] * |OK_ICON| `OASIS `_ [`Meta `_] * |OK_ICON| `OpenNEURO `_ [`Meta `_] * |OK_ICON| `OpenfMRI `_ [`Meta `_] * |OK_ICON| `Study Forrest `_ [`Meta `_] * |OK_ICON| `The Nencki-Symfonia EEG/ERP dataset - A high-density electroencephalography (EEG) dataset [...] `_ [`Meta `_] Physics ------- * |OK_ICON| `CERN Open Data Portal `_ [`Meta `_] * |OK_ICON| `Crystallography Open Database `_ [`Meta `_] * |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ [`Meta `_] * |OK_ICON| `Ligo Open Science Center (LOSC) - Gravitational wave data from the LIGO Hanford and [...] `_ [`Meta `_] * |OK_ICON| `NASA Exoplanet Archive `_ [`Meta `_] * |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ [`Meta `_] * |OK_ICON| `Quantum simulations of an electron in a two dimensional potential well - The data was [...] `_ [`Meta `_] * |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ [`Meta `_] ProstateCancer -------------- * |OK_ICON| `EOPC-DE-Early-Onset-Prostate-Cancer-Germany - Early Onset Prostate Cancer - Germany. [...] `_ [`Meta `_] * |OK_ICON| `GENIE - Data from the Genomics Evidence Neoplasia Information Exchange (GENIE) project of the [...] `_ [`Meta `_] * |OK_ICON| `Genomic-Hallmarks-Prostate-Adenocarcinoma-CPC-GENE - Comprehensive genomic profiling of 477 [...] `_ [`Meta `_] * |OK_ICON| `MSK-IMPACT-Clinical-Sequencing-Cohort-MSKCC-Prostate-Cancer - Targeted sequencing of clinical [...] `_ [`Meta `_] * |OK_ICON| `Metastatic-Prostate-Adenocarcinoma-MCTP - Comprehensive profiling of 61 prostate cancer [...] `_ [`Meta `_] * |OK_ICON| `Metastatic-Prostate-Cancer-SU2CPCF-Dream-Team - Comprehensive analysis of 150 metastatic [...] `_ [`Meta `_] * |OK_ICON| `NPCR-2001-2015 - Database from CDC's National Program of Cancer Registries (NPCR). The [...] `_ [`Meta `_] * |OK_ICON| `NPCR-2005-2015 - Database from CDC's National Program of Cancer Registries (NPCR). The [...] `_ [`Meta `_] * |OK_ICON| `NaF-Prostate - NaF Prostate is a collection of F-18 NaF positron emission tomography/computed [...] `_ [`Meta `_] * |OK_ICON| `Neuroendocrine-Prostate-Cancer - Whole exome and RNA Seq data of castration resistant [...] `_ [`Meta `_] * |OK_ICON| `PLCO-Prostate-Diagnostic-Procedures - The Prostate Diagnostic Procedures dataset (95,837 [...] `_ [`Meta `_] * |OK_ICON| `PLCO-Prostate-Medical-Complications - The Prostate Medical Complications dataset (3,350 [...] `_ [`Meta `_] * |OK_ICON| `PLCO-Prostate-Screening-Abnormalities - The Prostate Screening Abnormalities dataset (10,527 [...] `_ [`Meta `_] * |OK_ICON| `PLCO-Prostate-Screening - The Prostate Screening dataset (177,315 records, 35,875 subjects, [...] `_ [`Meta `_] * |OK_ICON| `PLCO-Prostate-Treatments - The Prostate Treatments dataset (13,409 records, 7,614 subjects, [...] `_ [`Meta `_] * |OK_ICON| `PLCO-Prostate - The Prostate dataset is a comprehensive dataset that contains nearly all the [...] `_ [`Meta `_] * |OK_ICON| `PRAD-CA-Prostate-Adenocarcinoma-Canada - Prostate Adenocarcinoma - Canada. Collected by the [...] `_ [`Meta `_] * |OK_ICON| `PRAD-FR-Prostate-Adenocarcinoma-France - Prostate Adenocarcinoma - France. Collected by ten [...] `_ [`Meta `_] * |OK_ICON| `PRAD-UK-Prostate-Adenocarcinoma-United-Kingdom - Prostate Adenocarcinoma - United Kingdom. [...] `_ [`Meta `_] * |OK_ICON| `Prostate-3T - The Prostate-3T project provided imaging data to TCIA as part of an ISBI [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-Broad-Cornell-2012 - Comprehensive profiling of 112 prostate cancer [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-Broad-Cornell-2013 - Comprehensive profiling of 57 prostate cancer [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-CNA-study-MSKCC - Copy-number profiling of 103 primary prostate [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-Fred-Hutchinson-CRC - Comprehensive profiling of prostate cancer [...] `_ [`Meta `_] * |OK_ICON| `Prostate Adenocarcinoma (MSKCC/DFCI) - Whole Exome Sequencing of 1013 prostate cancer samples. `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-MSKCC - MSKCC Prostate Oncogenome Project. 181 primary, 37 metastatic [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-Organoids-MSKCC - Exome profiling of prostate cancer samples and [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-Sun-Lab - Whole-genome and Transcriptome Sequencing of 65 Prostate [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-TCGA-PanCancer-Atlas - Comprehensive TCGA PanCanAtlas data from 11k [...] `_ [`Meta `_] * |OK_ICON| `Prostate-Adenocarcinoma-TCGA - Integrated profiling of 333 primary prostate adenocarcinoma samples. `_ [`Meta `_] * |OK_ICON| `Prostate-Diagnosis - PCa T1- and T2-weighted magnetic resonance images (MRIs) were acquired [...] `_ [`Meta `_] * |OK_ICON| `Prostate-MRI - The Prostate-MRI collection of prostate Magnetic Resonance Images (MRIs) was [...] `_ [`Meta `_] * |OK_ICON| `Prostate-R - The R package 'ElemStatLearn' contains a prostate cancer dataset from Stamey et [...] `_ [`Meta `_] * |OK_ICON| `QIN-PROSTATE-Repeatability - The QIN-PROSTATE-Repeatability dataset is a dataset with [...] `_ [`Meta `_] * |OK_ICON| `QIN-PROSTATE - The QIN PROSTATE collection of the Quantitative Imaging Network (QIN) contains [...] `_ [`Meta `_] * |OK_ICON| `SEER-YR1973_2015.SEER9 - The SEER November 2017 Research Data files from nine SEER registries [...] `_ [`Meta `_] * |OK_ICON| `SEER-YR1992_2015.SJ_LA_RG_AK - The SEER November 2017 Research Data files from the San Jose- [...] `_ [`Meta `_] * |OK_ICON| `SEER-YR2000_2015.CA_KY_LO_NJ_GA - The SEER November 2017 Research Data files from the Greater [...] `_ [`Meta `_] * |OK_ICON| `SEER-YR2000_2015.CA_KY_LO_NJ_GA - The July - December 2005 diagnoses for Louisiana from their [...] `_ [`Meta `_] * |OK_ICON| `TCGA-PRAD-US - TCGA Prostate Adenocarcinoma (499 samples). `_ [`Meta `_] Psychology+Cognition -------------------- * |OK_ICON| `Open Cognitive Science Data - Publicly available behavioral datasets from across cognitive [...] `_ [`Meta `_] PublicDomains ------------- * |OK_ICON| `Ably Open Realtime Data `_ [`Meta `_] * |OK_ICON| `Amazon `_ [`Meta `_] * |OK_ICON| `Archive.org Datasets `_ [`Meta `_] * |OK_ICON| `Archive-it from Internet Archive `_ [`Meta `_] * |OK_ICON| `CMU JASA data archive `_ [`Meta `_] * |OK_ICON| `CMU StatLab collections `_ [`Meta `_] * |OK_ICON| `Data.World `_ [`Meta `_] * |OK_ICON| `Enigma Public `_ [`Meta `_] * |OK_ICON| `Google `_ [`Meta `_] * |OK_ICON| `KDNuggets Data Collections `_ [`Meta `_] * |OK_ICON| `Microsoft Azure Data Market Free DataSets `_ [`Meta `_] * |OK_ICON| `Microsoft Data Science for Research `_ [`Meta `_] * |OK_ICON| `Microsoft Research Open Data `_ [`Meta `_] * |OK_ICON| `Open Library Data Dumps `_ [`Meta `_] * |OK_ICON| `Reddit Datasets `_ [`Meta `_] * |OK_ICON| `Sample R data sets `_ [`Meta `_] * |OK_ICON| `Stack Overflow Annual Developer Survey - Annual developer surverys full data sets from 2011 [...] `_ [`Meta `_] * |OK_ICON| `StatSci.org `_ [`Meta `_] * |OK_ICON| `Stats4Stem R data sets (archived) `_ [`Meta `_] * |OK_ICON| `UCLA SOCR data collection `_ [`Meta `_] * |OK_ICON| `UFO Reports `_ [`Meta `_] * |OK_ICON| `Wikileaks 911 pager intercepts `_ [`Meta `_] * |OK_ICON| `Yahoo Webscope `_ [`Meta `_] SearchEngines ------------- * |OK_ICON| `Academic Torrents of data sharing from UMB `_ [`Meta `_] * |OK_ICON| `Datahub.io `_ [`Meta `_] * |OK_ICON| `Domains Project - Sorted list of Internet domains `_ [`Meta `_] * |OK_ICON| `Harvard Dataverse Network of scientific data `_ [`Meta `_] * |OK_ICON| `ICPSR (UMICH) `_ [`Meta `_] * |OK_ICON| `Institute of Education Sciences `_ [`Meta `_] * |OK_ICON| `National Technical Reports Library `_ [`Meta `_] * |OK_ICON| `Open Data Certificates (beta) `_ [`Meta `_] * |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ [`Meta `_] * |OK_ICON| `Statista.com - statistics and Studies `_ [`Meta `_] * |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`Meta `_] SocialNetworks -------------- * |OK_ICON| `2021 Portuguese Elections Twitter Dataset - 57M+ tweets, 1M+ users - This dataset contains [...] `_ [`Meta `_] * |OK_ICON| `72 hours #gamergate Twitter Scrape `_ [`Meta `_] * |OK_ICON| `CMU Enron Email of 150 users `_ [`Meta `_] * |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`Meta `_] * |OK_ICON| `China Biographical Database - The China Biographical Database is a freely accessible [...] `_ [`Meta `_] * |OK_ICON| `Clubhouse Dataset `_ [`Meta `_] * |OK_ICON| `A Twitter Dataset of 40+ million tweets related to COVID-19 - Due to the relevance of the [...] `_ [`Meta `_] * |OK_ICON| `43k+ Donald Trump Twitter Screenshots - This archive contains screenshots of 43,475 Donald [...] `_ [`Meta `_] * |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ [`Meta `_] * |OK_ICON| `Facebook Data Scrape (2005) `_ [`Meta `_] * |OK_ICON| `Facebook Social Connectedness Index - We use an anonymized snapshot of all active Facebook [...] `_ [`Meta `_] * |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ [`Meta `_] * |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`Meta `_] * |OK_ICON| `GitHub Collaboration Archive `_ [`Meta `_] * |OK_ICON| `Google Scholar citation relations `_ [`Meta `_] * |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`Meta `_] * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`Meta `_] * |OK_ICON| `Mobile Social Networks from UMASS `_ [`Meta `_] * |OK_ICON| `Network Twitter Data `_ [`Meta `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ [`Meta `_] * |OK_ICON| `Social Twitter Data `_ [`Meta `_] * |OK_ICON| `The Reddit COVID dataset - This dataset attempts to capture the full extent of COVID-19 [...] `_ [`Meta `_] * |OK_ICON| `Twitch Top Streamer's Data `_ [`Meta `_] * |OK_ICON| `Twitter Data for Online Reputation Management `_ [`Meta `_] * |OK_ICON| `Twitter Data for Sentiment Analysis `_ [`Meta `_] * |OK_ICON| `Twitter Graph of entire Twitter site `_ [`Meta `_] * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`Meta `_] * |OK_ICON| `United States Congress Twitter Data - Daily datasets with tweets of 1100+ accounts associated [...] `_ [`Meta `_] * |OK_ICON| `Yahoo! Graph and Social Data `_ [`Meta `_] * |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`Meta `_] SocialSciences -------------- * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ [`Meta `_] * |OK_ICON| `Authoritarian Ruling Elites Database - The Authoritarian Ruling Elites Database (ARED) is a [...] `_ [`Meta `_] * |OK_ICON| `Canadian Legal Information Institute `_ [`Meta `_] * |OK_ICON| `Correlates of War Project `_ [`Meta `_] * |OK_ICON| `Cryptome Conspiracy Theory Items `_ [`Meta `_] * |OK_ICON| `European Social Survey `_ [`Meta `_] * |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`Meta `_] * |OK_ICON| `GDELT Global Events Database `_ [`Meta `_] * |OK_ICON| `General Social Survey (GSS) since 1972 `_ [`Meta `_] * |OK_ICON| `German Social Survey `_ [`Meta `_] * |OK_ICON| `Global Religious Futures Project `_ [`Meta `_] * |OK_ICON| `Gun Violence Data - A comprehensive, accessible database that contains records of over 260k [...] `_ [`Meta `_] * |OK_ICON| `Humanitarian Data Exchange `_ [`Meta `_] * |OK_ICON| `INFORM Index for Risk Management `_ [`Meta `_] * |OK_ICON| `Institute for Demographic Studies `_ [`Meta `_] * |OK_ICON| `International Networks Archive `_ [`Meta `_] * |OK_ICON| `International Social Survey Program ISSP `_ [`Meta `_] * |OK_ICON| `International Studies Compendium Project `_ [`Meta `_] * |OK_ICON| `James McGuire Cross National Data `_ [`Meta `_] * |OK_ICON| `MIT Reality Mining Dataset `_ [`Meta `_] * |OK_ICON| `Mass Mobilization Data Project - The Mass Mobilization (MM) data are an effort to understand [...] `_ [`Meta `_] * |OK_ICON| `Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge Graph is a large RDF [...] `_ [`Meta `_] * |OK_ICON| `Minnesota Population Center `_ [`Meta `_] * |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ [`Meta `_] * |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ [`Meta `_] * |OK_ICON| `OpenSanctions - A global database of persons and companies of political, criminal, or [...] `_ [`Meta `_] * |OK_ICON| `Paul Hensel General International Data Page `_ [`Meta `_] * |OK_ICON| `PewResearch Internet Survey Project `_ [`Meta `_] * |OK_ICON| `PewResearch Society Data Collection `_ [`Meta `_] * |OK_ICON| `StackExchange Data Explorer `_ [`Meta `_] * |OK_ICON| `Titanic Survival Data Set `_ [`Meta `_] * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`Meta `_] * |OK_ICON| `UCLA Social Sciences Data Archive `_ [`Meta `_] * |OK_ICON| `UPJOHN for Labor Employment Research `_ [`Meta `_] * |OK_ICON| `Universities Worldwide `_ [`Meta `_] * |OK_ICON| `World Bank Open Data `_ [`Meta `_] * |OK_ICON| `World Inequality Database - The World Inequality Database (WID.world) aims to provide open [...] `_ [`Meta `_] Software -------- * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ [`Meta `_] * |OK_ICON| `GHTorrent - Scalable, queryable, offline mirror of data offered through the GitHub REST API. `_ [`Meta `_] * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`Meta `_] * |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top-bookmarked Git [...] `_ [`Meta `_] * |OK_ICON| `Code duplicates - 2k Java file and 600 Java function pairs labeled as similar or different by [...] `_ [`Meta `_] * |OK_ICON| `Commit messages - 1.3 billion GitHub commit messages till March 2019 `_ [`Meta `_] * |OK_ICON| `Pull Request review comments - 25.3 million GitHub PR review comments since January 2015 till [...] `_ [`Meta `_] * |OK_ICON| `Source Code Identifiers - 41.7 million distinct splittable identifiers collected from 182,014 [...] `_ [`Meta `_] Sports ------ * |OK_ICON| `American Ninja Warrior Obstacles - Contains every obstacle in the history of American Ninja [...] `_ [`Meta `_] * |OK_ICON| `Cricsheet Matches (cricket) `_ [`Meta `_] * |OK_ICON| `Equity in Athletics - The Equity in Athletics Data Analysis Cutting Tool is brought to you by [...] `_ [`Meta `_] * |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`Meta `_] * |OK_ICON| `Football/Soccer resources (data and APIs) `_ [`Meta `_] * |OK_ICON| `NFL play-by-play data - NFL play-by-play data sourced from: [...] `_ [`Meta `_] * |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ [`Meta `_] * |OK_ICON| `Pro Kabadi season 1 to 7 - Pro Kabadi League is a professional-level Kabaddi league in India. [...] `_ [`Meta `_] * |OK_ICON| `Retrosheet Baseball Statistics `_ [`Meta `_] * |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ [`Meta `_] * |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ [`Meta `_] * |OK_ICON| `Transfermarkt Datasets - Clean, structured and automatically updated football (soccer) data [...] `_ [`Meta `_] * |OK_ICON| `USA Soccer Teams and Locations - USA soccer teams and locations. MLS, NWSL, and USL [...] `_ [`Meta `_] TimeSeries ---------- * |OK_ICON| `3W dataset - To the best of its authors' knowledge, this is the first realistic and public [...] `_ [`Meta `_] * |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ [`Meta `_] * |OK_ICON| `Hard Drive Failure Rates `_ [`Meta `_] * |OK_ICON| `Heart Rate Time Series from MIT `_ [`Meta `_] * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ [`Meta `_] * |OK_ICON| `Turing Change Point Dataset - Contains 42 annotated time series collected for the development [...] `_ [`Meta `_] * |OK_ICON| `UC Riverside Time Series Dataset `_ [`Meta `_] Transportation -------------- * |OK_ICON| `Airlines OD Data 1987-2008 `_ [`Meta `_] * |OK_ICON| `Ford GoBike Data (formerly Bay Area Bike Share Data) `_ [`Meta `_] * |OK_ICON| `Bike Share Systems (BSS) collection `_ [`Meta `_] * |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`Meta `_] * |OK_ICON| `German train system by Deutsche Bahn `_ [`Meta `_] * |OK_ICON| `Hubway Million Rides in MA `_ [`Meta `_] * |OK_ICON| `Melbourne Pedestrian Counting - This dataset contains hourly pedestrian counts since 2009 [...] `_ [`Meta `_] * |OK_ICON| `Montreal BIXI Bike Share `_ [`Meta `_] * |OK_ICON| `NYC Taxi Trip Data 2009- `_ [`Meta `_] * |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`Meta `_] * |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ [`Meta `_] * |OK_ICON| `Open Traffic collection `_ [`Meta `_] * |OK_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`Meta `_] * |OK_ICON| `Plane Crash Database, since 1920 `_ [`Meta `_] * |OK_ICON| `Renfe (Spanish National Railway Network) dataset `_ [`Meta `_] * |OK_ICON| `Toronto Bike Share Stations (JSON and GBFS files) `_ [`Meta `_] * |OK_ICON| `Transport for London (TFL) `_ [`Meta `_] * |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ [`Meta `_] * |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ [`Meta `_] * |OK_ICON| `U.S. National Highway Traffic Safety Administration - Fatalities since 1975 - Contains CSV [...] `_ [`Meta `_] eSports ------- * |OK_ICON| `CS:GO Competitive Matchmaking Data - In this data set we have data about the CSGO matchmaking [...] `_ [`Meta `_] * |OK_ICON| `FIFA-2021 Complete Player Dataset `_ [`Meta `_] * |OK_ICON| `OpenDota data dump `_ [`Meta `_]