René's URL Explorer Experiment


Title: GitHub - MagicPixel/awesome-public-datasets: An awesome list of high-quality open datasets in public domains (on-going).

Open Graph Title: GitHub - MagicPixel/awesome-public-datasets: An awesome list of high-quality open datasets in public domains (on-going).

X Title: GitHub - MagicPixel/awesome-public-datasets: An awesome list of high-quality open datasets in public domains (on-going).

Description: An awesome list of high-quality open datasets in public domains (on-going). - MagicPixel/awesome-public-datasets

Open Graph Description: An awesome list of high-quality open datasets in public domains (on-going). - MagicPixel/awesome-public-datasets

X Description: An awesome list of high-quality open datasets in public domains (on-going). - MagicPixel/awesome-public-datasets

Opengraph URL: https://github.com/MagicPixel/awesome-public-datasets

X: @github

direct link

Domain: github.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:dbd21ef3-af5b-2446-f2c5-8e4b28e7dbd0
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idB306:181326:32F7758:41FDB7F:6992BE4A
html-safe-nonce7b418a03f98685f11685ce5d246e939a5211d816a8f765d1ed19ef43837e4a6d
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJCMzA2OjE4MTMyNjozMkY3NzU4OjQxRkRCN0Y6Njk5MkJFNEEiLCJ2aXNpdG9yX2lkIjoiNjU1MzgxMjg5NzkzMTM3ODI1MCIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac0dffcf97e5ef75693de2b2cfd1dc7eb0e7cd5e78e33807abdbf7f23d4b80294e
hovercard-subject-tagrepository:70967881
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/MagicPixel/awesome-public-datasets
twitter:imagehttps://opengraph.githubassets.com/6de77fcda91a5d0fd921009bacf035cf57fa0b03336a5ceb23892bfd9f8d7c9c/MagicPixel/awesome-public-datasets
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/6de77fcda91a5d0fd921009bacf035cf57fa0b03336a5ceb23892bfd9f8d7c9c/MagicPixel/awesome-public-datasets
og:image:altAn awesome list of high-quality open datasets in public domains (on-going). - MagicPixel/awesome-public-datasets
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None42c603b9d642c4a9065a51770f75e5e27132fef0e858607f5c9cb7e422831a7b
turbo-cache-controlno-preview
go-importgithub.com/MagicPixel/awesome-public-datasets git https://github.com/MagicPixel/awesome-public-datasets.git
octolytics-dimension-user_id3965295
octolytics-dimension-user_loginMagicPixel
octolytics-dimension-repository_id70967881
octolytics-dimension-repository_nwoMagicPixel/awesome-public-datasets
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id26898879
octolytics-dimension-repository_parent_nwoawesomedata/awesome-public-datasets
octolytics-dimension-repository_network_root_id26898879
octolytics-dimension-repository_network_root_nwoawesomedata/awesome-public-datasets
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release84dcb133269e3cfe6e0296cc85fbacb92cae92bb
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/MagicPixel/awesome-public-datasets#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FMagicPixel%2Fawesome-public-datasets
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FMagicPixel%2Fawesome-public-datasets
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=MagicPixel%2Fawesome-public-datasets
Reloadhttps://github.com/MagicPixel/awesome-public-datasets
Reloadhttps://github.com/MagicPixel/awesome-public-datasets
Reloadhttps://github.com/MagicPixel/awesome-public-datasets
MagicPixel https://github.com/MagicPixel
awesome-public-datasetshttps://github.com/MagicPixel/awesome-public-datasets
awesomedata/awesome-public-datasetshttps://github.com/awesomedata/awesome-public-datasets
Notifications https://github.com/login?return_to=%2FMagicPixel%2Fawesome-public-datasets
Fork 0 https://github.com/login?return_to=%2FMagicPixel%2Fawesome-public-datasets
Star 1 https://github.com/login?return_to=%2FMagicPixel%2Fawesome-public-datasets
goo.gl/WZ8XAJhttps://goo.gl/WZ8XAJ
MIT license https://github.com/MagicPixel/awesome-public-datasets/blob/master/LICENSE
1 star https://github.com/MagicPixel/awesome-public-datasets/stargazers
11.1k forks https://github.com/MagicPixel/awesome-public-datasets/forks
Branches https://github.com/MagicPixel/awesome-public-datasets/branches
Tags https://github.com/MagicPixel/awesome-public-datasets/tags
Activity https://github.com/MagicPixel/awesome-public-datasets/activity
Star https://github.com/login?return_to=%2FMagicPixel%2Fawesome-public-datasets
Notifications https://github.com/login?return_to=%2FMagicPixel%2Fawesome-public-datasets
Code https://github.com/MagicPixel/awesome-public-datasets
Pull requests 0 https://github.com/MagicPixel/awesome-public-datasets/pulls
Actions https://github.com/MagicPixel/awesome-public-datasets/actions
Projects 0 https://github.com/MagicPixel/awesome-public-datasets/projects
Security 0 https://github.com/MagicPixel/awesome-public-datasets/security
Insights https://github.com/MagicPixel/awesome-public-datasets/pulse
Code https://github.com/MagicPixel/awesome-public-datasets
Pull requests https://github.com/MagicPixel/awesome-public-datasets/pulls
Actions https://github.com/MagicPixel/awesome-public-datasets/actions
Projects https://github.com/MagicPixel/awesome-public-datasets/projects
Security https://github.com/MagicPixel/awesome-public-datasets/security
Insights https://github.com/MagicPixel/awesome-public-datasets/pulse
Brancheshttps://github.com/MagicPixel/awesome-public-datasets/branches
Tagshttps://github.com/MagicPixel/awesome-public-datasets/tags
https://github.com/MagicPixel/awesome-public-datasets/branches
https://github.com/MagicPixel/awesome-public-datasets/tags
442 Commitshttps://github.com/MagicPixel/awesome-public-datasets/commits/master/
https://github.com/MagicPixel/awesome-public-datasets/commits/master/
Datasetshttps://github.com/MagicPixel/awesome-public-datasets/tree/master/Datasets
Datasetshttps://github.com/MagicPixel/awesome-public-datasets/tree/master/Datasets
.travis.ymlhttps://github.com/MagicPixel/awesome-public-datasets/blob/master/.travis.yml
.travis.ymlhttps://github.com/MagicPixel/awesome-public-datasets/blob/master/.travis.yml
Government.rsthttps://github.com/MagicPixel/awesome-public-datasets/blob/master/Government.rst
Government.rsthttps://github.com/MagicPixel/awesome-public-datasets/blob/master/Government.rst
LICENSEhttps://github.com/MagicPixel/awesome-public-datasets/blob/master/LICENSE
LICENSEhttps://github.com/MagicPixel/awesome-public-datasets/blob/master/LICENSE
README.rsthttps://github.com/MagicPixel/awesome-public-datasets/blob/master/README.rst
README.rsthttps://github.com/MagicPixel/awesome-public-datasets/blob/master/README.rst
READMEhttps://github.com/MagicPixel/awesome-public-datasets
MIT licensehttps://github.com/MagicPixel/awesome-public-datasets
https://github.com/MagicPixel/awesome-public-datasets#awesome-public-datasets
https://github.com/sindresorhus/awesome
This list of public data sourceshttps://github.com/caesar0301/awesome-public-datasets
awesome-awesomenesshttps://github.com/bayandin/awesome-awesomeness
sindresorhus's awesomehttps://github.com/sindresorhus/awesome
Agriculturehttps://github.com/MagicPixel/awesome-public-datasets#agriculture
Biologyhttps://github.com/MagicPixel/awesome-public-datasets#biology
Climate/Weatherhttps://github.com/MagicPixel/awesome-public-datasets#climate-weather
Complex Networkshttps://github.com/MagicPixel/awesome-public-datasets#complex-networks
Computer Networkshttps://github.com/MagicPixel/awesome-public-datasets#computer-networks
Contextual Datahttps://github.com/MagicPixel/awesome-public-datasets#contextual-data
Data Challengeshttps://github.com/MagicPixel/awesome-public-datasets#data-challenges
Earth Sciencehttps://github.com/MagicPixel/awesome-public-datasets#earth-science
Economicshttps://github.com/MagicPixel/awesome-public-datasets#economics
Educationhttps://github.com/MagicPixel/awesome-public-datasets#education
Energyhttps://github.com/MagicPixel/awesome-public-datasets#energy
Financehttps://github.com/MagicPixel/awesome-public-datasets#finance
GIShttps://github.com/MagicPixel/awesome-public-datasets#gis
Governmenthttps://github.com/MagicPixel/awesome-public-datasets#government
Healthcarehttps://github.com/MagicPixel/awesome-public-datasets#healthcare
Image Processinghttps://github.com/MagicPixel/awesome-public-datasets#image-processing
Machine Learninghttps://github.com/MagicPixel/awesome-public-datasets#machine-learning
Museumshttps://github.com/MagicPixel/awesome-public-datasets#museums
Natural Languagehttps://github.com/MagicPixel/awesome-public-datasets#natural-language
Neurosciencehttps://github.com/MagicPixel/awesome-public-datasets#neuroscience
Physicshttps://github.com/MagicPixel/awesome-public-datasets#physics
Psychology/Cognitionhttps://github.com/MagicPixel/awesome-public-datasets#psychology-cognition
Public Domainshttps://github.com/MagicPixel/awesome-public-datasets#public-domains
Search Engineshttps://github.com/MagicPixel/awesome-public-datasets#search-engines
Social Networkshttps://github.com/MagicPixel/awesome-public-datasets#social-networks
Social Scienceshttps://github.com/MagicPixel/awesome-public-datasets#social-sciences
Softwarehttps://github.com/MagicPixel/awesome-public-datasets#software
Sportshttps://github.com/MagicPixel/awesome-public-datasets#sports
Time Serieshttps://github.com/MagicPixel/awesome-public-datasets#time-series
Transportationhttps://github.com/MagicPixel/awesome-public-datasets#transportation
Complementary Collectionshttps://github.com/MagicPixel/awesome-public-datasets#complementary-collections
Agriculturehttps://github.com/MagicPixel/awesome-public-datasets#id2
https://github.com/MagicPixel/awesome-public-datasets#agriculture
U.S. Department of Agriculture's PLANTS Databasehttp://www.plants.usda.gov/dl_all.html
Biologyhttps://github.com/MagicPixel/awesome-public-datasets#id3
https://github.com/MagicPixel/awesome-public-datasets#biology
1000 Genomeshttp://www.1000genomes.org/data
American Gut (Microbiome Project)https://github.com/biocore/American-Gut
Broad Cancer Cell Line Encyclopedia (CCLE)http://www.broadinstitute.org/ccle/home
Broad Bioimage Benchmark Collection (BBBC)https://www.broadinstitute.org/bbbc
Cell Image Libraryhttp://www.cellimagelibrary.org
Complete Genomics Public Datahttp://www.completegenomics.com/public-data/69-genomes/
EBI ArrayExpresshttp://www.ebi.ac.uk/arrayexpress/
EBI Protein Data Bank in Europehttp://www.ebi.ac.uk/pdbe/emdb/index.html/
Electron Microscopy Pilot Image Archive (EMPIAR)http://www.ebi.ac.uk/pdbe/emdb/empiar/
ENCODE projecthttps://www.encodeproject.org
Ensembl Genomeshttp://ensemblgenomes.org/info/genomes
Gene Expression Omnibus (GEO)http://www.ncbi.nlm.nih.gov/geo/
Gene Ontology (GO)http://geneontology.org/page/download-annotations
Global Biotic Interactions (GloBI)https://github.com/jhpoelen/eol-globi-data/wiki#accessing-species-interaction-data
Harvard Medical School (HMS) LINCS Projecthttp://lincs.hms.harvard.edu
Human Genome Diversity Projecthttp://www.hagsc.org/hgdp/files.html
Human Microbiome Project (HMP)http://www.hmpdacc.org/reference_genomes/reference_genomes.php
ICOS PSP Benchmarkhttp://ico2s.org/datasets/psp_benchmark.html
International HapMap Projecthttp://hapmap.ncbi.nlm.nih.gov/downloads/index.html.en
Journal of Cell Biology DataViewerhttp://jcb-dataviewer.rupress.org
MIT Cancer Genomics Datahttp://www.broadinstitute.org/cgi-bin/cancer/datasets.cgi
NCBI Proteinshttp://www.ncbi.nlm.nih.gov/guide/proteins/#databases
NCBI Taxonomyhttp://www.ncbi.nlm.nih.gov/taxonomy
NIH Microarray datahttp://bit.do/VVW6
RAWhttps://raw.githubusercontent.com/caesar0301/awesome-public-datasets/master/README.rst
OpenSNP genotypes datahttps://opensnp.org/
Pathguid - Protein-Protein Interactions Cataloghttp://www.pathguide.org/
Protein Data Bankhttp://www.rcsb.org/
Psychiatric Genomics Consortiumhttps://www.med.unc.edu/pgc/downloads
PubChem Projecthttps://pubchem.ncbi.nlm.nih.gov/
PubGene (now Coremine Medical)http://www.pubgene.org/
Sanger Catalogue of Somatic Mutations in Cancer (COSMIC)http://cancer.sanger.ac.uk/cosmic
Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC)http://www.cancerrxgene.org/
Sequence Read Archive(SRA)http://www.ncbi.nlm.nih.gov/Traces/sra/
Stanford Microarray Datahttp://smd.stanford.edu/
Stowers Institute Original Data Repositoryhttp://www.stowers.org/research/publications/odr
Systems Science of Biological Dynamics (SSBD) Databasehttp://ssbd.qbic.riken.jp
The Cancer Genome Atlas (TCGA), available via Broad GDAChttps://gdac.broadinstitute.org/
The Catalogue of Lifehttp://www.catalogueoflife.org/content/annual-checklist-archive
The Personal Genome Projecthttp://www.personalgenomes.org/
PGPhttps://my.pgp-hms.org/public_genetic_data
UCSC Public Datahttp://hgdownload.soe.ucsc.edu/downloads.html
Universal Protein Resource (UnitProt)http://www.uniprot.org/downloads
UniGenehttp://www.ncbi.nlm.nih.gov/unigene
Climate/Weatherhttps://github.com/MagicPixel/awesome-public-datasets#id4
https://github.com/MagicPixel/awesome-public-datasets#climateweather
Australian Weatherhttp://www.bom.gov.au/climate/dwo/
Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace systemhttps://aviationweather.gov/adds/dataserver
Brazilian Weather - Historical data (In Portuguese)http://sinda.crn2.inpe.br/PCD/SITE/novo/site/
Canadian Meteorological Centrehttp://weather.gc.ca/grib/index_e.html
Climate Data from UEA (updated monthly)https://crudata.uea.ac.uk/cru/data/temperature/#datterandftp://ftp.cmdl.noaa.gov/
European Climate Assessment & Datasethttp://eca.knmi.nl/
Global Climate Data Since 1929http://en.tutiempo.net/climate
NASA Global Imagery Browse Serviceshttps://wiki.earthdata.nasa.gov/display/GIBS
NOAA Bering Sea Climatehttp://www.beringclimate.noaa.gov/
NOAA Climate Datasetshttp://www.ncdc.noaa.gov/data-access/quick-links
NOAA Realtime Weather Modelshttp://www.ncdc.noaa.gov/data-access/model-data/model-datasets/numerical-weather-prediction
The World Bank Open Data Resources for Climate Changehttp://data.worldbank.org/developers/climate-data-api
UEA Climatic Research Unithttp://www.cru.uea.ac.uk/data
WorldClim - Global Climate Datahttp://www.worldclim.org
WU Historical Weather Worldwidehttps://www.wunderground.com/history/index.html
Complex Networkshttps://github.com/MagicPixel/awesome-public-datasets#id5
https://github.com/MagicPixel/awesome-public-datasets#complex-networks
AMiner Citation Network Datasethttp://aminer.org/citation
CrossRef DOI URLshttps://archive.org/details/doi-urls
DBLP Citation datasethttps://kdl.cs.umass.edu/display/public/DBLP
NBER Patent Citationshttp://nber.org/patents/
Network Repository with Interactive Exploratory Analysis Toolshttp://networkrepository.com/
NIST complex networks data collectionhttp://math.nist.gov/~RPozo/complex_datasets.html
Protein-protein interaction networkhttp://vlado.fmf.uni-lj.si/pub/networks/data/bio/Yeast/Yeast.htm
PyPI and Maven Dependency Networkhttps://ogirardot.wordpress.com/2013/01/31/sharing-pypimaven-dependency-data/
Scopus Citation Databasehttps://www.elsevier.com/solutions/scopus
Small Network Datahttp://www-personal.umich.edu/~mejn/netdata/
Stanford GraphBase (Steven Skiena)http://www3.cs.stonybrook.edu/~algorith/implement/graphbase/implement.shtml
Stanford Large Network Dataset Collectionhttp://snap.stanford.edu/data/
Stanford Longitudinal Network Data Sourceshttp://stanford.edu/group/sonia/dataSources/index.html
The Koblenz Network Collectionhttp://konect.uni-koblenz.de/
The Laboratory for Web Algorithmics (UNIMI)http://law.di.unimi.it/datasets.php
The Nexus Network Repositoryhttp://nexus.igraph.org/
UCI Network Data Repositoryhttps://networkdata.ics.uci.edu/resources.php
UFL sparse matrix collectionhttp://www.cise.ufl.edu/research/sparse/matrices/
WSU Graph Databasehttp://www.eecs.wsu.edu/mgd/gdb.html
DIMACS Road Networks Collectionhttp://www.dis.uniroma1.it/challenge9/download.shtml
Computer Networkshttps://github.com/MagicPixel/awesome-public-datasets#id6
https://github.com/MagicPixel/awesome-public-datasets#computer-networks
3.5B Web Pages from CommonCraw 2012http://www.bigdatanews.com/profiles/blogs/big-data-set-3-5-billion-web-pages-made-available-for-all-of-us
53.5B Web clicks of 100K users in Indiana Univ.http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset/
CAIDA Internet Datasetshttp://www.caida.org/data/overview/
ClueWeb09 - 1B web pageshttp://lemurproject.org/clueweb09/
ClueWeb12 - 733M web pageshttp://lemurproject.org/clueweb12/
CommonCrawl Web Data over 7 yearshttp://commoncrawl.org/the-data/get-started/
CRAWDAD Wireless datasets from Dartmouth Univ.https://crawdad.cs.dartmouth.edu/
Criteo click-through datahttp://labs.criteo.com/2015/03/criteo-releases-its-new-dataset/
Open Mobile Data by MobiPerfhttps://console.developers.google.com/storage/openmobiledata_public/
Rapid7 Sonar Internet Scanshttps://sonar.labs.rapid7.com/
UCSD Network Telescope, IPv4 /8 nethttp://www.caida.org/projects/network_telescope/
Contextual Datahttps://github.com/MagicPixel/awesome-public-datasets#id7
https://github.com/MagicPixel/awesome-public-datasets#contextual-data
Context-aware data sets from five domainshttp://students.depaul.edu/~yzheng8/DataSets.html#Data
GitHubhttps://github.com/irecsys/CARSKit/tree/master/context-aware_data_sets
Data Challengeshttps://github.com/MagicPixel/awesome-public-datasets#id8
https://github.com/MagicPixel/awesome-public-datasets#data-challenges
Challenges in Machine Learninghttp://www.chalearn.org/
CrowdANALYTIX dataXhttp://data.crowdanalytix.com
D4D Challenge of Orangehttp://www.d4d.orange.com/en/home
DrivenData Competitions for Social Goodhttp://www.drivendata.org/
ICWSM Data Challenge (since 2009)http://icwsm.cs.umbc.edu/
Kaggle Competition Datahttps://www.kaggle.com/
KDD Cup by Tencent 2012http://www.kddcup2012.org/
Localytics Data Visualization Challengehttps://github.com/localytics/data-viz-challenge
Netflix Prizehttp://netflixprize.com/leaderboard.html
Space Apps Challengehttps://2015.spaceappschallenge.org
Telecom Italia Big Data Challengehttps://dandelion.eu/datamine/open-big-data/
Yelp Dataset Challengehttp://www.yelp.com/dataset_challenge
Bruteforce Databasehttps://github.com/duyetdev/bruteforce-database
Earth Sciencehttps://github.com/MagicPixel/awesome-public-datasets#id9
https://github.com/MagicPixel/awesome-public-datasets#earth-science
AQUASTAT - Global water resources and useshttp://www.fao.org/nr/water/aquastat/data/query/index.html?lang=en
BODC - marine data of ~22K varshttp://www.bodc.ac.uk/data/where_to_find_data/
Earth Modelshttp://www.earthmodels.org/
EOSDIS - NASA's earth observing system datahttp://sedac.ciesin.columbia.edu/data/sets/browse
Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurementshttps://imos.aodn.org.au
on S3http://imos-data.s3-website-ap-southeast-2.amazonaws.com/
Marinexplore - Open Oceanographic Datahttp://marinexplore.org/
Smithsonian Institution Global Volcano and Eruption Databasehttp://volcano.si.edu/
USGS Earthquake Archiveshttp://earthquake.usgs.gov/earthquakes/search/
Economicshttps://github.com/MagicPixel/awesome-public-datasets#id10
https://github.com/MagicPixel/awesome-public-datasets#economics
American Economic Association (AEA)https://www.aeaweb.org/resources/data
EconData from UMDhttp://inforumweb.umd.edu/econdata/econdata.html
Economic Freedom of the World Datahttp://www.freetheworld.com/datasets_efw.html
Historical MacroEconomc Statisticshttp://www.historicalstatistics.org/
International Economics Databasehttp://widukind.cepremap.org/
various data toolshttps://github.com/Widukind
International Trade Statisticshttp://www.econostatistics.co.za/
Internet Product Code Databasehttp://www.upcdatabase.com/
Joint External Debt Data Hubhttp://www.jedh.org/
Jon Haveman International Trade Data Linkshttp://www.macalester.edu/research/economics/PAGE/HAVEMAN/Trade.Resources/TradeData.html
OpenCorporates Database of Companies in the Worldhttps://opencorporates.com/
Our World in Datahttp://ourworldindata.org/
SciencesPo World Trade Gravity Datasetshttp://econ.sciences-po.fr/thierry-mayer/data
The Atlas of Economic Complexityhttp://atlas.cid.harvard.edu
The Center for International Datahttp://cid.econ.ucdavis.edu
The Observatory of Economic Complexityhttp://atlas.media.mit.edu/en/
UN Commodity Trade Statisticshttp://comtrade.un.org/db/
UN Human Development Reportshttp://hdr.undp.org/en
Educationhttps://github.com/MagicPixel/awesome-public-datasets#id11
https://github.com/MagicPixel/awesome-public-datasets#education
Student Data from Free Code Camphttp://academictorrents.com/details/030b10dad0846b5aecc3905692890fb02404adbf
Energyhttps://github.com/MagicPixel/awesome-public-datasets#id12
https://github.com/MagicPixel/awesome-public-datasets#energy
AMPdshttp://ampds.org/
BLUEdhttp://nilm.cmubi.org/
COMBEDhttp://combed.github.io/
Dataporthttps://dataport.pecanstreet.org/
DREDhttp://www.st.ewi.tudelft.nl/~akshay/dred/
ECOhttp://www.vs.inf.ethz.ch/res/show.html?what=eco-data
EIAhttp://www.eia.gov/electricity/data/eia923/
HEShttp://randd.defra.gov.uk/Default.aspx?Menu=Menu&Module=More&Location=None&ProjectID=17359&FromSearch=Y&Publisher=1&SearchText=EV0702&SortString=ProjectCode&SortOrder=Asc&Paging=10#Description
HFEDhttp://hfed.github.io/
iAWEhttp://iawe.github.io/
PLAIDhttp://plaidplug.com/
REDDhttp://redd.csail.mit.edu/
Tracebasehttps://www.tracebase.org
UK-DALEhttp://www.doc.ic.ac.uk/~dk3810/data/
WHITEDhttp://nilmworkshop.org/2016/proceedings/Poster_ID18.pdf
Financehttps://github.com/MagicPixel/awesome-public-datasets#id13
https://github.com/MagicPixel/awesome-public-datasets#finance
CBOE Futures Exchangehttp://cfe.cboe.com/Data/
Google Financehttps://www.google.com/finance
Google Trendshttp://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0
NASDAQhttps://data.nasdaq.com/
OANDAhttp://www.oanda.com/
OSU Financial datahttp://fisher.osu.edu/fin/fdf/osudata.htm
Quandlhttps://www.quandl.com/
St Louis Federalhttps://research.stlouisfed.org/fred2/
Yahoo Financehttp://finance.yahoo.com/
RAWhttps://raw.githubusercontent.com/caesar0301/awesome-public-datasets/master/README.rst
GIShttps://github.com/MagicPixel/awesome-public-datasets#id14
https://github.com/MagicPixel/awesome-public-datasets#gis
Cambridge, MA, US, GIS data on GitHubhttp://cambridgegis.github.io/gisdata.html
Factual Global Location Datahttps://www.factual.com/
Geo Spatial Data from ASUhttp://geodacenter.asu.edu/datalist/
Geo Wiki Project - Citizen-driven Environmental Monitoringhttp://geo-wiki.org/
GeoFabrik - OSM data extracted to a variety of formats and areashttp://download.geofabrik.de/
GeoNames Worldwidehttp://www.geonames.org/
Global Administrative Areas Database (GADM)http://www.gadm.org/
Homeland Infrastructure Foundation-Level Datahttps://hifld-dhs-gii.opendata.arcgis.com/
Landsat 8 on AWShttps://aws.amazon.com/public-data-sets/landsat/
List of all countries in all languageshttps://github.com/umpirsky/country-list
National Weather Service GIS Data Portalhttp://www.nws.noaa.gov/gis/
Natural Earth - vectors and rasters of the worldhttp://www.naturalearthdata.com/
OpenAddresseshttp://openaddresses.io/
OpenStreetMap (OSM)http://wiki.openstreetmap.org/wiki/Downloading_data
Pleiades - Gazetteer and graph of ancient placeshttp://pleiades.stoa.org/
Reverse Geocoder using OSM datahttps://github.com/kno10/reversegeocode
additional high-resolution data fileshttp://data.ub.uni-muenchen.de/61/
TIGER/Line - U.S. boundaries and roadshttp://www.census.gov/geo/maps-data/data/tiger-line.html
TwoFishes - Foursquare's coarse geocoderhttps://github.com/foursquare/twofishes
TZ Timezones shapfileshttp://efele.net/maps/tz/world/
UN Environmental Datahttp://geodata.grid.unep.ch/
World boundaries from the U.S. Department of Statehttps://hiu.state.gov/data/data.aspx
World countries in multiple formatshttps://github.com/mledoze/countries
Governmenthttps://github.com/MagicPixel/awesome-public-datasets#id15
https://github.com/MagicPixel/awesome-public-datasets#government
OpenDataSoft's list of 1,600 open data portalshttps://www.opendatasoft.com/a-comprehensive-list-of-all-open-data-portals-around-the-world/
A list of cities and countries contributed by communityhttps://github.com/caesar0301/awesome-public-datasets/blob/master/Government.rst
Healthcarehttps://github.com/MagicPixel/awesome-public-datasets#id16
https://github.com/MagicPixel/awesome-public-datasets#healthcare
EHDP Large Health Data Setshttp://www.ehdp.com/vitalnet/datasets.htm
Gapminder World demographic databaseshttp://www.gapminder.org/data/
Medicare Coverage Database (MCD), U.S.https://www.cms.gov/medicare-coverage-database/
Medicare Data Engine of medicare.gov Datahttps://data.medicare.gov/
Medicare Data Filehttp://go.cms.gov/19xxPN4
MeSH, the vocabulary thesaurus used for indexing articles for PubMedhttps://www.nlm.nih.gov/mesh/filelist.html
Number of Ebola Cases and Deaths in Affected Countries (2014)https://data.hdx.rwlabs.org/dataset/ebola-cases-2014
Open-ODS (structure of the UK NHS)http://www.openods.co.uk
OpenPaymentsData, Healthcare financial relationship datahttps://openpaymentsdata.cms.gov
The Cancer Genome Atlas project (TCGA)https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp
BigQuery tablehttp://google-genomics.readthedocs.org/en/latest/use_cases/discover_public_data/isb_cgc_data.html
World Health Organization Global Health Observatoryhttp://www.who.int/gho/en/
Image Processinghttps://github.com/MagicPixel/awesome-public-datasets#id17
https://github.com/MagicPixel/awesome-public-datasets#image-processing
10k US Adult Faces Databasehttp://wilmabainbridge.com/facememorability2.html
2GB of Photos of Catshttp://137.189.35.203/WebUI/CatDatabase/catData.html
Archive versionhttps://web.archive.org/web/20150520175645/http://137.189.35.203/WebUI/CatDatabase/catData.html
Affective Image Classificationhttp://www.imageemotion.org/
Animals with attributeshttp://attributes.kyb.tuebingen.mpg.de/
Face Recognition Benchmarkhttp://www.face-rec.org/databases/
ImageNet (in WordNet hierarchy)http://www.image-net.org/
Indoor Scene Recognitionhttp://web.mit.edu/torralba/www/indoor.html
International Affective Picture System, UFLhttp://csea.phhp.ufl.edu/media/iapsmessage.html
Massive Visual Memory Stimuli, MIThttp://cvcl.mit.edu/MM/stimuli.html
Several Shape-from-Silhouette Datasetshttp://kaiwolf.no-ip.org/3d-model-repository.html
Stanford Dogs Datasethttp://vision.stanford.edu/aditya86/ImageNetDogs/
SUN database, MIThttp://groups.csail.mit.edu/vision/SUN/hierarchy.html
The Oxford-IIIT Pet Datasethttp://www.robots.ox.ac.uk/~vgg/data/pets/
YouTube Faces Databasehttp://www.cs.tau.ac.il/~wolf/ytfaces/
Adience Unfiltered faces for gender and age classificationhttp://www.openu.ac.il/home/hassner/Adience/data.html
The Action Similarity Labeling (ASLAN) Challengehttp://www.openu.ac.il/home/hassner/data/ASLAN/ASLAN.html
Violent-Flows - Crowd Violence Non-violence Database and benchmarkhttp://www.openu.ac.il/home/hassner/data/violentflows/
Machine Learninghttps://github.com/MagicPixel/awesome-public-datasets#id18
https://github.com/MagicPixel/awesome-public-datasets#machine-learning
Delve Datasets for classification and regression (Univ. of Toronto)http://www.cs.toronto.edu/~delve/data/datasets.html
Discogs Monthly Datahttp://data.discogs.com/
eBay Online Auctions (2012)http://www.modelingonlineauctions.com/datasets
IMDb Databasehttp://www.imdb.com/interfaces
Keel Repository for classification, regression and time serieshttp://sci2s.ugr.es/keel/datasets.php
Labeled Faces in the Wild (LFW)http://vis-www.cs.umass.edu/lfw/
Lending Club Loan Datahttps://www.lendingclub.com/info/download-data.action
Machine Learning Data Set Repositoryhttp://mldata.org/
Million Song Datasethttp://labrosa.ee.columbia.edu/millionsong/
More Song Datasetshttp://labrosa.ee.columbia.edu/millionsong/pages/additional-datasets
New Yorker caption contest ratingshttps://github.com/nextml/caption-contest-data
MovieLens Data Setshttp://grouplens.org/datasets/movielens/
RDataMining - "R and Data Mining" ebook datahttp://www.rdatamining.com/data
Registered Meteorites on Earthhttp://healthintelligence.drupalgardens.com/content/registered-meteorites-has-impacted-earth-visualized
Restaurants Health Score Data in San Franciscohttp://missionlocal.org/san-francisco-restaurant-health-inspections/
UCI Machine Learning Repositoryhttp://archive.ics.uci.edu/ml/
Yahoo! Ratings and Classification Datahttp://webscope.sandbox.yahoo.com/catalog.php?datatype=r
Museumshttps://github.com/MagicPixel/awesome-public-datasets#id19
https://github.com/MagicPixel/awesome-public-datasets#museums
Canada Science and Technology Museums Corporation's Open Datahttp://techno-science.ca/en/data.php
Cooper-Hewitt's Collection Databasehttps://github.com/cooperhewitt/collection
Minneapolis Institute of Arts metadatahttps://github.com/artsmia/collection
Natural History Museum (London) Data Portalhttp://data.nhm.ac.uk/
Rijksmuseum Historical Art Collectionhttps://www.rijksmuseum.nl/en/api
Tate Collection metadatahttps://github.com/tategallery/collection
The Getty vocabularieshttp://vocab.getty.edu
Natural Languagehttps://github.com/MagicPixel/awesome-public-datasets#id20
https://github.com/MagicPixel/awesome-public-datasets#natural-language
Blogger Corpushttp://u.cs.biu.ac.il/~koppel/BlogCorpus.htm
CLiPS Stylometry Investigation Corpushttp://www.clips.uantwerpen.be/datasets/csi-corpus
ClueWeb09 FACChttp://lemurproject.org/clueweb09/FACC1/
ClueWeb12 FACChttp://lemurproject.org/clueweb12/FACC1/
DBpedia - 4.58M things with 583M factshttp://wiki.dbpedia.org/Datasets
Flickr Personal Taxonomieshttp://www.isi.edu/~lerman/downloads/flickr/flickr_taxonomies.html
Freebase.com of people, places, and thingshttp://www.freebase.com/
Google Books Ngrams (2.2TB)https://aws.amazon.com/datasets/google-books-ngrams/
Google Web 5gram (1TB, 2006)https://catalog.ldc.upenn.edu/LDC2006T13
Gutenberg eBooks Listhttp://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
Hansards text chunks of Canadian Parliamenthttp://www.isi.edu/natural-language/download/hansard/
Machine Comprehension Test (MCTest) of text from Microsoft Researchhttp://research.microsoft.com/en-us/um/redmond/projects/mctest/index.html
Machine Translation of European languageshttp://statmt.org/wmt11/translation-task.html#download
Personae Corpushttp://www.clips.uantwerpen.be/datasets/personae-corpus
SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles)https://github.com/ParallelMazen/SaudiNewsNet
SMS Spam Collection in Englishhttp://www.dt.fee.unicamp.br/~tiago/smsspamcollection/
USENET postings corpus of 2005~2011http://www.psych.ualberta.ca/~westburylab/downloads/usenetcorpus.download.html
Wikidata - Wikipedia databaseshttps://www.wikidata.org/wiki/Wikidata:Database_download
Wikipedia Links data - 40 Million Entities in Contexthttps://code.google.com/p/wiki-links/downloads/list
Universal Dependencieshttp://universaldependencies.org
WordNet databases and toolshttp://wordnet.princeton.edu/wordnet/download/
Open Multilingual Wordnethttp://compling.hss.ntu.edu.sg/omw/
Neurosciencehttps://github.com/MagicPixel/awesome-public-datasets#id21
https://github.com/MagicPixel/awesome-public-datasets#neuroscience
Allen Institute Datasetshttp://www.brain-map.org/
Brain Cataloguehttp://braincatalogue.org/
Brainomicshttp://brainomics.cea.fr/localizer
CodeNeuro Datasetshttp://datasets.codeneuro.org/
Collaborative Research in Computational Neuroscience (CRCNS)http://crcns.org/data-sets
FCP-INDIhttp://fcon_1000.projects.nitrc.org/index.html
Human Connectome Projecthttp://www.humanconnectome.org/data/
NDARhttps://ndar.nih.gov/
NIMH Data Archivehttp://data-archive.nimh.nih.gov/
NeuroDatahttp://neurodata.io
OASIShttp://www.oasis-brains.org/
OpenfMRIhttps://openfmri.org/
Neuroelectrohttp://neuroelectro.org/
Study Forresthttp://studyforrest.org
Physicshttps://github.com/MagicPixel/awesome-public-datasets#id22
https://github.com/MagicPixel/awesome-public-datasets#physics
CERN Open Data Portalhttp://opendata.cern.ch/
Crystallography Open Databasehttp://www.crystallography.net/
NASA Exoplanet Archivehttp://exoplanetarchive.ipac.caltech.edu/
NSSDC (NASA) data of 550 space spacecrafthttp://nssdc.gsfc.nasa.gov/nssdc/obtaining_data.html
Sloan Digital Sky Survey (SDSS) - Mapping the Universehttp://www.sdss.org/
Psychology/Cognitionhttps://github.com/MagicPixel/awesome-public-datasets#id23
https://github.com/MagicPixel/awesome-public-datasets#psychologycognition
OSU Cognitive Modeling Repository Datasetshttp://www.cmr.osu.edu/browse/datasets
Public Domainshttps://github.com/MagicPixel/awesome-public-datasets#id24
https://github.com/MagicPixel/awesome-public-datasets#public-domains
Amazonhttp://aws.amazon.com/datasets/
Archive-it from Internet Archivehttps://www.archive-it.org/explore?show=Collections
Archive.org Datasetshttps://archive.org/details/datasets
CMU JASA data archivehttp://lib.stat.cmu.edu/jasadata/
CMU StatLab collectionshttp://lib.stat.cmu.edu/datasets/
Data360http://www.data360.org/index.aspx
Datamob.orghttp://datamob.org/datasets
Googlehttp://www.google.com/publicdata/directory
Infochimpshttp://www.infochimps.com/
KDNuggets Data Collectionshttp://www.kdnuggets.com/datasets/index.html
Microsoft Azure Data Market Free DataSetshttp://datamarket.azure.com/browse/data?price=free
Numbrayhttp://numbrary.com/
Open Library Data Dumpshttps://openlibrary.org/developers/dumps
Reddit Datasetshttps://www.reddit.com/r/datasets
RevolutionAnalytics Collectionhttp://packages.revolutionanalytics.com/datasets/
Sample R data setshttp://stat.ethz.ch/R-manual/R-patched/library/datasets/html/00Index.html
Stats4Stem R data setshttp://www.stats4stem.org/data-sets.html
StatSci.orghttp://www.statsci.org/datasets.html
The Washington Post Listhttp://www.washingtonpost.com/wp-srv/metro/data/datapost.html
UCLA SOCR data collectionhttp://wiki.stat.ucla.edu/socr/index.php/SOCR_Data
UFO Reportshttp://www.nuforc.org/webreports.html
Wikileaks 911 pager interceptshttps://911.wikileaks.org/files/index.html
Yahoo Webscopehttp://webscope.sandbox.yahoo.com/catalog.php
Search Engineshttps://github.com/MagicPixel/awesome-public-datasets#id25
https://github.com/MagicPixel/awesome-public-datasets#search-engines
Academic Torrents of data sharing from UMBhttp://academictorrents.com/
Datahub.iohttps://datahub.io/dataset
DataMarket (Qlik)https://datamarket.com/data/list/?q=all
Harvard Dataverse Network of scientific datahttps://dataverse.harvard.edu/
ICPSR (UMICH)http://www.icpsr.umich.edu/icpsrweb/ICPSR/index.jsp
Institute of Education Scienceshttp://eric.ed.gov
National Technical Reports Libraryhttp://www.ntis.gov/products/ntrl/
Open Data Certificates (beta)https://certificates.theodi.org/en/datasets
OpenDataNetwork - A search engine of all Socrata powered data portalshttp://www.opendatanetwork.com/
Statista.com - statistics and Studieshttp://www.statista.com/
Zenodo - An open dependable home for the long-tail of sciencehttps://zenodo.org/collection/datasets
Social Networkshttps://github.com/MagicPixel/awesome-public-datasets#id26
https://github.com/MagicPixel/awesome-public-datasets#social-networks
72 hours #gamergate Twitter Scrapehttp://waxy.org/random/misc/gamergate_tweets.csv
Ancestry.com Forum Dataset over 10 yearshttp://www.cs.cmu.edu/~jelsas/data/ancestry.com/
Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrapehttps://archive.org/details/twitter_cikm_2010
CMU Enron Email of 150 usershttp://www.cs.cmu.edu/~enron/
EDRM Enron EMail of 151 users, hosted on S3https://aws.amazon.com/datasets/enron-email-data/
Facebook Data Scrape (2005)https://archive.org/details/oxford-2005-facebook-matrix
Facebook Social Networks from LAW (since 2007)http://law.di.unimi.it/datasets.php
Foursquare from UMN/Sarwat (2013)https://archive.org/details/201309_foursquare_dataset_umn
GetGlue - users rating TV showshttp://getglue-data.s3.amazonaws.com/getglue_sample.tar.gz
GitHub Collaboration Archivehttps://www.githubarchive.org/
Google Scholar citation relationshttp://www3.cs.stonybrook.edu/~leman/data/gscholar.db
High-Resolution Contact Networks from Wearable Sensorshttp://www.sociopatterns.org/datasets/
Mobile Social Networks from UMASShttps://kdl.cs.umass.edu/display/public/Mobile+Social+Networks
Network Twitter Datahttp://snap.stanford.edu/data/higgs-twitter.html
Reddit Commentshttps://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/
Skytrax' Air Travel Reviews Datasethttps://github.com/quankiquanki/skytrax-reviews-dataset
Social Twitter Datahttp://snap.stanford.edu/data/egonets-Twitter.html
SourceForge.net Research Datahttp://www3.nd.edu/~oss/Data/data.html
Twitter Data for Sentiment Analysishttp://help.sentiment140.com/for-students/
Twitter Data for Online Reputation Managementhttp://nlp.uned.es/replab2013/
Twitter Graph of entire Twitter sitehttp://an.kaist.ac.kr/traces/WWW2010.html
Twitter Scrape Calufa May 2011http://archive.org/details/2011-05-calufa-twitter-sql
UNIMI/LAW Social Network Datasetshttp://law.di.unimi.it/datasets.php
Yahoo! Graph and Social Datahttp://webscope.sandbox.yahoo.com/catalog.php?datatype=g
Youtube Video Social Graph in 2007,2008http://netsg.cs.sfu.ca/youtubedata/
Social Scienceshttps://github.com/MagicPixel/awesome-public-datasets#id27
https://github.com/MagicPixel/awesome-public-datasets#social-sciences
ACLED (Armed Conflict Location & Event Data Project)http://www.acleddata.com/
Canadian Legal Information Institutehttps://www.canlii.org/en/index.php
Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etchttp://www.systemicpeace.org/
Correlates of War Projecthttp://www.correlatesofwar.org/
Cryptome Conspiracy Theory Itemshttp://cryptome.org
Datacardshttp://datacards.org
European Social Surveyhttp://www.europeansocialsurvey.org/data/
FBI Hate Crime 2013 - aggregated datahttps://github.com/emorisse/FBI-Hate-Crime-Statistics/tree/master/2013
GDELT Global Events Databasehttp://gdeltproject.org/data.html
General Social Survey (GSS) since 1972http://gss.norc.org
German Social Surveyhttp://www.gesis.org/en/home/
Global Religious Futures Projecthttp://www.globalreligiousfutures.org/
Humanitarian Data Exchangehttps://data.hdx.rwlabs.org/
Institute for Demographic Studieshttp://www.ined.fr/en/
International Networks Archivehttp://www.princeton.edu/~ina/
International Social Survey Program ISSPhttp://www.issp.org
International Studies Compendium Projecthttp://www.isacompendium.com/public/
James McGuire Cross National Datahttp://jmcguire.faculty.wesleyan.edu/welcome/cross-national-data/
MacroData Guide by Norsk samfunnsvitenskapelig datatjenestehttp://nsd.uib.no
Minnesota Population Centerhttps://www.ipums.org/
MIT Reality Mining Datasethttp://realitycommons.media.mit.edu/realitymining.html
Open Crime and Policing Data in England, Wales and Northern Irelandhttps://data.police.uk/data/
Paul Hensel General International Data Pagehttp://www.paulhensel.org/dataintl.html
PewResearch Internet Survey Projecthttp://www.pewinternet.org/datasets/pages/2/
PewResearch Society Data Collectionhttp://www.pewresearch.org/data/download-datasets/
Political Polarity Datahttp://www3.cs.stonybrook.edu/~leman/data/14-icwsm-political-polarity-data.zip
StackExchange Data Explorerhttp://data.stackexchange.com/help
Terrorism Research and Analysis Consortiumhttp://www.trackingterrorism.org/
Texas Inmates Executed Since 1984http://www.tdcj.state.tx.us/death_row/dr_executed_offenders.html
Titanic Survival Data Sethttps://github.com/caesar0301/awesome-public-datasets/tree/master/Datasets
UCB's Archive of Social Science Data (D-Lab)http://ucdata.berkeley.edu/
Uppsala Conflict Data Programhttp://ucdp.uu.se/
UCLA Social Sciences Data Archivehttp://dataarchives.ss.ucla.edu/Home.DataPortals.htm
UN Civil Society Databasehttp://esango.un.org/civilsociety/
Universities Worldwidehttp://univ.cc/
UPJOHN for Labor Employment Researchhttp://www.upjohn.org/services/resources/employment-research-data-center
World Bank Datahttp://data.worldbank.org/
WorldPop project - Worldwide human population distributionshttp://www.worldpop.org.uk/data/get_data/
Softwarehttps://github.com/MagicPixel/awesome-public-datasets#id28
https://github.com/MagicPixel/awesome-public-datasets#software
FLOSSmole data about free, libre, and open source software developmenthttp://flossdata.syr.edu/data/
Sportshttps://github.com/MagicPixel/awesome-public-datasets#id29
https://github.com/MagicPixel/awesome-public-datasets#sports
Basketball (NBA/NCAA/Euro) Player Database and Statisticshttp://www.draftexpress.com/stats.php
Betfair Historical Exchange Datahttp://data.betfair.com/
Cricsheet Matches (cricket)http://cricsheet.org/
Ergast Formula 1, from 1950 up to date (API)http://ergast.com/mrd/db
Football/Soccer resources (data and APIs)http://www.jokecamp.com/blog/guide-to-football-and-soccer-data-and-apis/
Lahman's Baseball Databasehttp://www.seanlahman.com/baseball-archive/statistics/
Pinhooker: Thoroughbred Bloodstock Sale Datahttps://github.com/phillc73/pinhooker
Retrosheet Baseball Statisticshttp://www.retrosheet.org/game.htm
Time Serieshttps://github.com/MagicPixel/awesome-public-datasets#id30
https://github.com/MagicPixel/awesome-public-datasets#time-series
Databanks International Cross National Time Series Data Archivehttp://www.cntsdata.com
Hard Drive Failure Rateshttps://www.backblaze.com/hard-drive-test-data.html
Heart Rate Time Series from MIThttp://ecg.mit.edu/time-series/
Time Series Data Library (TSDL) from MUhttps://datamarket.com/data/list/?q=provider:tsdl
UC Riverside Time Series Datasethttp://www.cs.ucr.edu/~eamonn/time_series_data/
Transportationhttps://github.com/MagicPixel/awesome-public-datasets#id31
https://github.com/MagicPixel/awesome-public-datasets#transportation
Airlines OD Data 1987-2008http://stat-computing.org/dataexpo/2009/the-data.html
Bay Area Bike Share Datahttp://www.bayareabikeshare.com/open-data
Bike Share Systems (BSS) collectionhttps://github.com/BetaNYC/Bike-Share-Data-Best-Practices/wiki/Bike-Share-Data-Systems
GeoLife GPS Trajectory from Microsoft Researchhttp://research.microsoft.com/en-us/downloads/b16d359d-d164-469e-9fd4-daa38f2b2e13/
German train system by Deutsche Bahnhttp://data.deutschebahn.com/datasets/
Hubway Million Rides in MAhttp://hubwaydatachallenge.org/trip-history-data/
Marine Traffic - ship tracks, port calls and morehttp://www.marinetraffic.com/de/ais-api-services
Montreal BIXI Bike Sharehttps://montreal.bixi.com/donn%C3%A9es-libre-service
NYC Taxi Trip Data 2009-http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml
NYC Taxi Trip Data 2013 (FOIA/FOILed)https://archive.org/details/nycTaxiTripData2013
NYC Uber trip data April 2014 to September 2014https://github.com/fivethirtyeight/uber-tlc-foil-response
Open Traffic collectionhttps://github.com/graphhopper/open-traffic-collection
OpenFlights - airport, airline and route datahttp://openflights.org/data.html
Philadelphia Bike Share Stations (JSON)https://www.rideindego.com/stations/json/
Plane Crash Database, since 1920http://www.planecrashinfo.com/database.htm
RITA Airline On-Time Performance datahttp://www.transtats.bts.gov/Tables.asp?DB_ID=120
RITA/BTS transport data collection (TranStat)http://www.transtats.bts.gov/DataIndex.asp
Toronto Bike Share Stations (XML file)http://www.bikesharetoronto.com/data/stations/bikeStations.xml
Transport for London (TFL)https://tfl.gov.uk/info-for/open-data-users/data-feeds
Travel Tracker Survey (TTS) for Chicagohttp://www.cmap.illinois.gov/data/transportation/travel-tracker-survey
U.S. Bureau of Transportation Statistics (BTS)http://www.rita.dot.gov/bts/
U.S. Domestic Flights 1990 to 2009http://academictorrents.com/details/a2ccf94bbb4af222bf8e69dad60a68a29f310d9a
U.S. Freight Analysis Framework since 2007http://ops.fhwa.dot.gov/freight/freight_analysis/faf/index.htm
Complementary Collectionshttps://github.com/MagicPixel/awesome-public-datasets#id32
https://github.com/MagicPixel/awesome-public-datasets#complementary-collections
Data Packaged Core Datasetshttps://github.com/datasets/
Database of Scientific Code Contributionshttps://mozillascience.org/collaborate
Some Datasets Available on the Webhttp://www.datawrangling.com/some-datasets-available-on-the-web
Finding Data on the Internethttp://www.inside-r.org/howto/finding-data-internet
An overview of available open data resources in Europehttp://opendatamonitor.eu
Where can I find large datasets open to the public?http://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
100+ Interesting Data Sets for Statisticshttp://rs.io/100-interesting-data-sets-for-statistics/
Leveraging open data to understand urban liveshttp://xiaming.me/posts/2014/10/23/leveraging-open-data-to-understand-urban-lives/
goo.gl/WZ8XAJhttps://goo.gl/WZ8XAJ
Readme https://github.com/MagicPixel/awesome-public-datasets#readme-ov-file
MIT license https://github.com/MagicPixel/awesome-public-datasets#MIT-1-ov-file
Please reload this pagehttps://github.com/MagicPixel/awesome-public-datasets
Activityhttps://github.com/MagicPixel/awesome-public-datasets/activity
1 starhttps://github.com/MagicPixel/awesome-public-datasets/stargazers
0 watchinghttps://github.com/MagicPixel/awesome-public-datasets/watchers
0 forkshttps://github.com/MagicPixel/awesome-public-datasets/forks
Report repository https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2FMagicPixel%2Fawesome-public-datasets&report=MagicPixel+%28user%29
Releaseshttps://github.com/MagicPixel/awesome-public-datasets/releases
1 tags https://github.com/MagicPixel/awesome-public-datasets/tags
Packages 0https://github.com/users/MagicPixel/packages?repo_name=awesome-public-datasets
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.