|
MagicPixel
| https://github.com/MagicPixel |
| awesome-public-datasets | https://github.com/MagicPixel/awesome-public-datasets |
| awesomedata/awesome-public-datasets | https://github.com/awesomedata/awesome-public-datasets |
| goo.gl/WZ8XAJ | https://goo.gl/WZ8XAJ |
|
MIT license
| https://github.com/MagicPixel/awesome-public-datasets/blob/master/LICENSE |
| Datasets | https://github.com/MagicPixel/awesome-public-datasets/tree/master/Datasets |
| .travis.yml | https://github.com/MagicPixel/awesome-public-datasets/blob/master/.travis.yml |
| Government.rst | https://github.com/MagicPixel/awesome-public-datasets/blob/master/Government.rst |
| LICENSE | https://github.com/MagicPixel/awesome-public-datasets/blob/master/LICENSE |
| README.rst | https://github.com/MagicPixel/awesome-public-datasets/blob/master/README.rst |
| This list of public data sources | https://github.com/caesar0301/awesome-public-datasets |
| awesome-awesomeness | https://github.com/bayandin/awesome-awesomeness |
| sindresorhus's awesome | https://github.com/sindresorhus/awesome |
| Agriculture | https://github.com/MagicPixel/awesome-public-datasets#agriculture |
| Biology | https://github.com/MagicPixel/awesome-public-datasets#biology |
| Climate/Weather | https://github.com/MagicPixel/awesome-public-datasets#climate-weather |
| Complex Networks | https://github.com/MagicPixel/awesome-public-datasets#complex-networks |
| Computer Networks | https://github.com/MagicPixel/awesome-public-datasets#computer-networks |
| Contextual Data | https://github.com/MagicPixel/awesome-public-datasets#contextual-data |
| Data Challenges | https://github.com/MagicPixel/awesome-public-datasets#data-challenges |
| Earth Science | https://github.com/MagicPixel/awesome-public-datasets#earth-science |
| Economics | https://github.com/MagicPixel/awesome-public-datasets#economics |
| Education | https://github.com/MagicPixel/awesome-public-datasets#education |
| Energy | https://github.com/MagicPixel/awesome-public-datasets#energy |
| Finance | https://github.com/MagicPixel/awesome-public-datasets#finance |
| GIS | https://github.com/MagicPixel/awesome-public-datasets#gis |
| Government | https://github.com/MagicPixel/awesome-public-datasets#government |
| Healthcare | https://github.com/MagicPixel/awesome-public-datasets#healthcare |
| Image Processing | https://github.com/MagicPixel/awesome-public-datasets#image-processing |
| Machine Learning | https://github.com/MagicPixel/awesome-public-datasets#machine-learning |
| Museums | https://github.com/MagicPixel/awesome-public-datasets#museums |
| Natural Language | https://github.com/MagicPixel/awesome-public-datasets#natural-language |
| Neuroscience | https://github.com/MagicPixel/awesome-public-datasets#neuroscience |
| Physics | https://github.com/MagicPixel/awesome-public-datasets#physics |
| Psychology/Cognition | https://github.com/MagicPixel/awesome-public-datasets#psychology-cognition |
| Public Domains | https://github.com/MagicPixel/awesome-public-datasets#public-domains |
| Search Engines | https://github.com/MagicPixel/awesome-public-datasets#search-engines |
| Social Networks | https://github.com/MagicPixel/awesome-public-datasets#social-networks |
| Social Sciences | https://github.com/MagicPixel/awesome-public-datasets#social-sciences |
| Software | https://github.com/MagicPixel/awesome-public-datasets#software |
| Sports | https://github.com/MagicPixel/awesome-public-datasets#sports |
| Time Series | https://github.com/MagicPixel/awesome-public-datasets#time-series |
| Transportation | https://github.com/MagicPixel/awesome-public-datasets#transportation |
| Complementary Collections | https://github.com/MagicPixel/awesome-public-datasets#complementary-collections |
| Agriculture | https://github.com/MagicPixel/awesome-public-datasets#id2 |
| U.S. Department of Agriculture's PLANTS Database | http://www.plants.usda.gov/dl_all.html |
| Biology | https://github.com/MagicPixel/awesome-public-datasets#id3 |
| 1000 Genomes | http://www.1000genomes.org/data |
| American Gut (Microbiome Project) | https://github.com/biocore/American-Gut |
| Broad Cancer Cell Line Encyclopedia (CCLE) | http://www.broadinstitute.org/ccle/home |
| Broad Bioimage Benchmark Collection (BBBC) | https://www.broadinstitute.org/bbbc |
| Cell Image Library | http://www.cellimagelibrary.org |
| Complete Genomics Public Data | http://www.completegenomics.com/public-data/69-genomes/ |
| EBI ArrayExpress | http://www.ebi.ac.uk/arrayexpress/ |
| EBI Protein Data Bank in Europe | http://www.ebi.ac.uk/pdbe/emdb/index.html/ |
| Electron Microscopy Pilot Image Archive (EMPIAR) | http://www.ebi.ac.uk/pdbe/emdb/empiar/ |
| ENCODE project | https://www.encodeproject.org |
| Ensembl Genomes | http://ensemblgenomes.org/info/genomes |
| Gene Expression Omnibus (GEO) | http://www.ncbi.nlm.nih.gov/geo/ |
| Gene Ontology (GO) | http://geneontology.org/page/download-annotations |
| Global Biotic Interactions (GloBI) | https://github.com/jhpoelen/eol-globi-data/wiki#accessing-species-interaction-data |
| Harvard Medical School (HMS) LINCS Project | http://lincs.hms.harvard.edu |
| Human Genome Diversity Project | http://www.hagsc.org/hgdp/files.html |
| Human Microbiome Project (HMP) | http://www.hmpdacc.org/reference_genomes/reference_genomes.php |
| ICOS PSP Benchmark | http://ico2s.org/datasets/psp_benchmark.html |
| International HapMap Project | http://hapmap.ncbi.nlm.nih.gov/downloads/index.html.en |
| Journal of Cell Biology DataViewer | http://jcb-dataviewer.rupress.org |
| MIT Cancer Genomics Data | http://www.broadinstitute.org/cgi-bin/cancer/datasets.cgi |
| NCBI Proteins | http://www.ncbi.nlm.nih.gov/guide/proteins/#databases |
| NCBI Taxonomy | http://www.ncbi.nlm.nih.gov/taxonomy |
| NIH Microarray data | http://bit.do/VVW6 |
| RAW | https://raw.githubusercontent.com/caesar0301/awesome-public-datasets/master/README.rst |
| OpenSNP genotypes data | https://opensnp.org/ |
| Pathguide - Protein-Protein Interactions Catalog | http://www.pathguide.org/ |
| Protein Data Bank | http://www.rcsb.org/ |
| Psychiatric Genomics Consortium | https://www.med.unc.edu/pgc/downloads |
| PubChem Project | https://pubchem.ncbi.nlm.nih.gov/ |
| PubGene (now Coremine Medical) | http://www.pubgene.org/ |
| Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) | http://cancer.sanger.ac.uk/cosmic |
| Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) | http://www.cancerrxgene.org/ |
| Sequence Read Archive (SRA) | http://www.ncbi.nlm.nih.gov/Traces/sra/ |
| Stanford Microarray Data | http://smd.stanford.edu/ |
| Stowers Institute Original Data Repository | http://www.stowers.org/research/publications/odr |
| Systems Science of Biological Dynamics (SSBD) Database | http://ssbd.qbic.riken.jp |
| The Cancer Genome Atlas (TCGA), available via Broad GDAC | https://gdac.broadinstitute.org/ |
| The Catalogue of Life | http://www.catalogueoflife.org/content/annual-checklist-archive |
| The Personal Genome Project | http://www.personalgenomes.org/ |
| PGP | https://my.pgp-hms.org/public_genetic_data |
| UCSC Public Data | http://hgdownload.soe.ucsc.edu/downloads.html |
| Universal Protein Resource (UniProt) | http://www.uniprot.org/downloads |
| UniGene | http://www.ncbi.nlm.nih.gov/unigene |
| Climate/Weather | https://github.com/MagicPixel/awesome-public-datasets#id4 |
| Australian Weather | http://www.bom.gov.au/climate/dwo/ |
| Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system | https://aviationweather.gov/adds/dataserver |
| Brazilian Weather - Historical data (In Portuguese) | http://sinda.crn2.inpe.br/PCD/SITE/novo/site/ |
| Canadian Meteorological Centre | http://weather.gc.ca/grib/index_e.html |
| Climate Data from UEA (updated monthly) | https://crudata.uea.ac.uk/cru/data/temperature/#datterandftp://ftp.cmdl.noaa.gov/ |
| European Climate Assessment & Dataset | http://eca.knmi.nl/ |
| Global Climate Data Since 1929 | http://en.tutiempo.net/climate |
| NASA Global Imagery Browse Services | https://wiki.earthdata.nasa.gov/display/GIBS |
| NOAA Bering Sea Climate | http://www.beringclimate.noaa.gov/ |
| NOAA Climate Datasets | http://www.ncdc.noaa.gov/data-access/quick-links |
| NOAA Realtime Weather Models | http://www.ncdc.noaa.gov/data-access/model-data/model-datasets/numerical-weather-prediction |
| The World Bank Open Data Resources for Climate Change | http://data.worldbank.org/developers/climate-data-api |
| UEA Climatic Research Unit | http://www.cru.uea.ac.uk/data |
| WorldClim - Global Climate Data | http://www.worldclim.org |
| WU Historical Weather Worldwide | https://www.wunderground.com/history/index.html |
| Complex Networks | https://github.com/MagicPixel/awesome-public-datasets#id5 |
| AMiner Citation Network Dataset | http://aminer.org/citation |
| CrossRef DOI URLs | https://archive.org/details/doi-urls |
| DBLP Citation dataset | https://kdl.cs.umass.edu/display/public/DBLP |
| NBER Patent Citations | http://nber.org/patents/ |
| Network Repository with Interactive Exploratory Analysis Tools | http://networkrepository.com/ |
| NIST complex networks data collection | http://math.nist.gov/~RPozo/complex_datasets.html |
| Protein-protein interaction network | http://vlado.fmf.uni-lj.si/pub/networks/data/bio/Yeast/Yeast.htm |
| PyPI and Maven Dependency Network | https://ogirardot.wordpress.com/2013/01/31/sharing-pypimaven-dependency-data/ |
| Scopus Citation Database | https://www.elsevier.com/solutions/scopus |
| Small Network Data | http://www-personal.umich.edu/~mejn/netdata/ |
| Stanford GraphBase (Steven Skiena) | http://www3.cs.stonybrook.edu/~algorith/implement/graphbase/implement.shtml |
| Stanford Large Network Dataset Collection | http://snap.stanford.edu/data/ |
| Stanford Longitudinal Network Data Sources | http://stanford.edu/group/sonia/dataSources/index.html |
| The Koblenz Network Collection | http://konect.uni-koblenz.de/ |
| The Laboratory for Web Algorithmics (UNIMI) | http://law.di.unimi.it/datasets.php |
| The Nexus Network Repository | http://nexus.igraph.org/ |
| UCI Network Data Repository | https://networkdata.ics.uci.edu/resources.php |
| UFL sparse matrix collection | http://www.cise.ufl.edu/research/sparse/matrices/ |
| WSU Graph Database | http://www.eecs.wsu.edu/mgd/gdb.html |
| DIMACS Road Networks Collection | http://www.dis.uniroma1.it/challenge9/download.shtml |
| Computer Networks | https://github.com/MagicPixel/awesome-public-datasets#id6 |
| 3.5B Web Pages from CommonCrawl 2012 | http://www.bigdatanews.com/profiles/blogs/big-data-set-3-5-billion-web-pages-made-available-for-all-of-us |
| 53.5B Web clicks of 100K users in Indiana Univ. | http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset/ |
| CAIDA Internet Datasets | http://www.caida.org/data/overview/ |
| ClueWeb09 - 1B web pages | http://lemurproject.org/clueweb09/ |
| ClueWeb12 - 733M web pages | http://lemurproject.org/clueweb12/ |
| CommonCrawl Web Data over 7 years | http://commoncrawl.org/the-data/get-started/ |
| CRAWDAD Wireless datasets from Dartmouth Univ. | https://crawdad.cs.dartmouth.edu/ |
| Criteo click-through data | http://labs.criteo.com/2015/03/criteo-releases-its-new-dataset/ |
| Open Mobile Data by MobiPerf | https://console.developers.google.com/storage/openmobiledata_public/ |
| Rapid7 Sonar Internet Scans | https://sonar.labs.rapid7.com/ |
| UCSD Network Telescope, IPv4 /8 net | http://www.caida.org/projects/network_telescope/ |
| Contextual Data | https://github.com/MagicPixel/awesome-public-datasets#id7 |
| Context-aware data sets from five domains | http://students.depaul.edu/~yzheng8/DataSets.html#Data |
| GitHub | https://github.com/irecsys/CARSKit/tree/master/context-aware_data_sets |
| Data Challenges | https://github.com/MagicPixel/awesome-public-datasets#id8 |
| Challenges in Machine Learning | http://www.chalearn.org/ |
| CrowdANALYTIX dataX | http://data.crowdanalytix.com |
| D4D Challenge of Orange | http://www.d4d.orange.com/en/home |
| DrivenData Competitions for Social Good | http://www.drivendata.org/ |
| ICWSM Data Challenge (since 2009) | http://icwsm.cs.umbc.edu/ |
| Kaggle Competition Data | https://www.kaggle.com/ |
| KDD Cup by Tencent 2012 | http://www.kddcup2012.org/ |
| Localytics Data Visualization Challenge | https://github.com/localytics/data-viz-challenge |
| Netflix Prize | http://netflixprize.com/leaderboard.html |
| Space Apps Challenge | https://2015.spaceappschallenge.org |
| Telecom Italia Big Data Challenge | https://dandelion.eu/datamine/open-big-data/ |
| Yelp Dataset Challenge | http://www.yelp.com/dataset_challenge |
| Bruteforce Database | https://github.com/duyetdev/bruteforce-database |
| Earth Science | https://github.com/MagicPixel/awesome-public-datasets#id9 |
| AQUASTAT - Global water resources and uses | http://www.fao.org/nr/water/aquastat/data/query/index.html?lang=en |
| BODC - marine data of ~22K vars | http://www.bodc.ac.uk/data/where_to_find_data/ |
| Earth Models | http://www.earthmodels.org/ |
| EOSDIS - NASA's earth observing system data | http://sedac.ciesin.columbia.edu/data/sets/browse |
| Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements | https://imos.aodn.org.au |
| on S3 | http://imos-data.s3-website-ap-southeast-2.amazonaws.com/ |
| Marinexplore - Open Oceanographic Data | http://marinexplore.org/ |
| Smithsonian Institution Global Volcano and Eruption Database | http://volcano.si.edu/ |
| USGS Earthquake Archives | http://earthquake.usgs.gov/earthquakes/search/ |
| Economics | https://github.com/MagicPixel/awesome-public-datasets#id10 |
| American Economic Association (AEA) | https://www.aeaweb.org/resources/data |
| EconData from UMD | http://inforumweb.umd.edu/econdata/econdata.html |
| Economic Freedom of the World Data | http://www.freetheworld.com/datasets_efw.html |
| Historical MacroEconomic Statistics | http://www.historicalstatistics.org/ |
| International Economics Database | http://widukind.cepremap.org/ |
| various data tools | https://github.com/Widukind |
| International Trade Statistics | http://www.econostatistics.co.za/ |
| Internet Product Code Database | http://www.upcdatabase.com/ |
| Joint External Debt Data Hub | http://www.jedh.org/ |
| Jon Haveman International Trade Data Links | http://www.macalester.edu/research/economics/PAGE/HAVEMAN/Trade.Resources/TradeData.html |
| OpenCorporates Database of Companies in the World | https://opencorporates.com/ |
| Our World in Data | http://ourworldindata.org/ |
| SciencesPo World Trade Gravity Datasets | http://econ.sciences-po.fr/thierry-mayer/data |
| The Atlas of Economic Complexity | http://atlas.cid.harvard.edu |
| The Center for International Data | http://cid.econ.ucdavis.edu |
| The Observatory of Economic Complexity | http://atlas.media.mit.edu/en/ |
| UN Commodity Trade Statistics | http://comtrade.un.org/db/ |
| UN Human Development Reports | http://hdr.undp.org/en |
| Education | https://github.com/MagicPixel/awesome-public-datasets#id11 |
| Student Data from Free Code Camp | http://academictorrents.com/details/030b10dad0846b5aecc3905692890fb02404adbf |
| Energy | https://github.com/MagicPixel/awesome-public-datasets#id12 |
| AMPds | http://ampds.org/ |
| BLUED | http://nilm.cmubi.org/ |
| COMBED | http://combed.github.io/ |
| Dataport | https://dataport.pecanstreet.org/ |
| DRED | http://www.st.ewi.tudelft.nl/~akshay/dred/ |
| ECO | http://www.vs.inf.ethz.ch/res/show.html?what=eco-data |
| EIA | http://www.eia.gov/electricity/data/eia923/ |
| HES | http://randd.defra.gov.uk/Default.aspx?Menu=Menu&Module=More&Location=None&ProjectID=17359&FromSearch=Y&Publisher=1&SearchText=EV0702&SortString=ProjectCode&SortOrder=Asc&Paging=10#Description |
| HFED | http://hfed.github.io/ |
| iAWE | http://iawe.github.io/ |
| PLAID | http://plaidplug.com/ |
| REDD | http://redd.csail.mit.edu/ |
| Tracebase | https://www.tracebase.org |
| UK-DALE | http://www.doc.ic.ac.uk/~dk3810/data/ |
| WHITED | http://nilmworkshop.org/2016/proceedings/Poster_ID18.pdf |
| Finance | https://github.com/MagicPixel/awesome-public-datasets#id13 |
| CBOE Futures Exchange | http://cfe.cboe.com/Data/ |
| Google Finance | https://www.google.com/finance |
| Google Trends | http://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0 |
| NASDAQ | https://data.nasdaq.com/ |
| OANDA | http://www.oanda.com/ |
| OSU Financial data | http://fisher.osu.edu/fin/fdf/osudata.htm |
| Quandl | https://www.quandl.com/ |
| St. Louis Federal Reserve (FRED) | https://research.stlouisfed.org/fred2/ |
| Yahoo Finance | http://finance.yahoo.com/ |
| RAW | https://raw.githubusercontent.com/caesar0301/awesome-public-datasets/master/README.rst |
| GIS | https://github.com/MagicPixel/awesome-public-datasets#id14 |
| Cambridge, MA, US, GIS data on GitHub | http://cambridgegis.github.io/gisdata.html |
| Factual Global Location Data | https://www.factual.com/ |
| Geo Spatial Data from ASU | http://geodacenter.asu.edu/datalist/ |
| Geo Wiki Project - Citizen-driven Environmental Monitoring | http://geo-wiki.org/ |
| GeoFabrik - OSM data extracted to a variety of formats and areas | http://download.geofabrik.de/ |
| GeoNames Worldwide | http://www.geonames.org/ |
| Global Administrative Areas Database (GADM) | http://www.gadm.org/ |
| Homeland Infrastructure Foundation-Level Data | https://hifld-dhs-gii.opendata.arcgis.com/ |
| Landsat 8 on AWS | https://aws.amazon.com/public-data-sets/landsat/ |
| List of all countries in all languages | https://github.com/umpirsky/country-list |
| National Weather Service GIS Data Portal | http://www.nws.noaa.gov/gis/ |
| Natural Earth - vectors and rasters of the world | http://www.naturalearthdata.com/ |
| OpenAddresses | http://openaddresses.io/ |
| OpenStreetMap (OSM) | http://wiki.openstreetmap.org/wiki/Downloading_data |
| Pleiades - Gazetteer and graph of ancient places | http://pleiades.stoa.org/ |
| Reverse Geocoder using OSM data | https://github.com/kno10/reversegeocode |
| additional high-resolution data files | http://data.ub.uni-muenchen.de/61/ |
| TIGER/Line - U.S. boundaries and roads | http://www.census.gov/geo/maps-data/data/tiger-line.html |
| TwoFishes - Foursquare's coarse geocoder | https://github.com/foursquare/twofishes |
| TZ Timezones shapefiles | http://efele.net/maps/tz/world/ |
| UN Environmental Data | http://geodata.grid.unep.ch/ |
| World boundaries from the U.S. Department of State | https://hiu.state.gov/data/data.aspx |
| World countries in multiple formats | https://github.com/mledoze/countries |
| Government | https://github.com/MagicPixel/awesome-public-datasets#id15 |
| OpenDataSoft's list of 1,600 open data portals | https://www.opendatasoft.com/a-comprehensive-list-of-all-open-data-portals-around-the-world/ |
| A list of cities and countries contributed by community | https://github.com/caesar0301/awesome-public-datasets/blob/master/Government.rst |
| Healthcare | https://github.com/MagicPixel/awesome-public-datasets#id16 |
| EHDP Large Health Data Sets | http://www.ehdp.com/vitalnet/datasets.htm |
| Gapminder World demographic databases | http://www.gapminder.org/data/ |
| Medicare Coverage Database (MCD), U.S. | https://www.cms.gov/medicare-coverage-database/ |
| Medicare Data Engine of medicare.gov Data | https://data.medicare.gov/ |
| Medicare Data File | http://go.cms.gov/19xxPN4 |
| MeSH, the vocabulary thesaurus used for indexing articles for PubMed | https://www.nlm.nih.gov/mesh/filelist.html |
| Number of Ebola Cases and Deaths in Affected Countries (2014) | https://data.hdx.rwlabs.org/dataset/ebola-cases-2014 |
| Open-ODS (structure of the UK NHS) | http://www.openods.co.uk |
| OpenPaymentsData, Healthcare financial relationship data | https://openpaymentsdata.cms.gov |
| The Cancer Genome Atlas project (TCGA) | https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp |
| BigQuery table | http://google-genomics.readthedocs.org/en/latest/use_cases/discover_public_data/isb_cgc_data.html |
| World Health Organization Global Health Observatory | http://www.who.int/gho/en/ |
| Image Processing | https://github.com/MagicPixel/awesome-public-datasets#id17 |
| 10k US Adult Faces Database | http://wilmabainbridge.com/facememorability2.html |
| 2GB of Photos of Cats | http://137.189.35.203/WebUI/CatDatabase/catData.html |
| Archive version | https://web.archive.org/web/20150520175645/http://137.189.35.203/WebUI/CatDatabase/catData.html |
| Affective Image Classification | http://www.imageemotion.org/ |
| Animals with attributes | http://attributes.kyb.tuebingen.mpg.de/ |
| Face Recognition Benchmark | http://www.face-rec.org/databases/ |
| ImageNet (in WordNet hierarchy) | http://www.image-net.org/ |
| Indoor Scene Recognition | http://web.mit.edu/torralba/www/indoor.html |
| International Affective Picture System, UFL | http://csea.phhp.ufl.edu/media/iapsmessage.html |
| Massive Visual Memory Stimuli, MIT | http://cvcl.mit.edu/MM/stimuli.html |
| Several Shape-from-Silhouette Datasets | http://kaiwolf.no-ip.org/3d-model-repository.html |
| Stanford Dogs Dataset | http://vision.stanford.edu/aditya86/ImageNetDogs/ |
| SUN database, MIT | http://groups.csail.mit.edu/vision/SUN/hierarchy.html |
| The Oxford-IIIT Pet Dataset | http://www.robots.ox.ac.uk/~vgg/data/pets/ |
| YouTube Faces Database | http://www.cs.tau.ac.il/~wolf/ytfaces/ |
| Adience Unfiltered faces for gender and age classification | http://www.openu.ac.il/home/hassner/Adience/data.html |
| The Action Similarity Labeling (ASLAN) Challenge | http://www.openu.ac.il/home/hassner/data/ASLAN/ASLAN.html |
| Violent-Flows - Crowd Violence Non-violence Database and benchmark | http://www.openu.ac.il/home/hassner/data/violentflows/ |
| Machine Learning | https://github.com/MagicPixel/awesome-public-datasets#id18 |
| Delve Datasets for classification and regression (Univ. of Toronto) | http://www.cs.toronto.edu/~delve/data/datasets.html |
| Discogs Monthly Data | http://data.discogs.com/ |
| eBay Online Auctions (2012) | http://www.modelingonlineauctions.com/datasets |
| IMDb Database | http://www.imdb.com/interfaces |
| Keel Repository for classification, regression and time series | http://sci2s.ugr.es/keel/datasets.php |
| Labeled Faces in the Wild (LFW) | http://vis-www.cs.umass.edu/lfw/ |
| Lending Club Loan Data | https://www.lendingclub.com/info/download-data.action |
| Machine Learning Data Set Repository | http://mldata.org/ |
| Million Song Dataset | http://labrosa.ee.columbia.edu/millionsong/ |
| More Song Datasets | http://labrosa.ee.columbia.edu/millionsong/pages/additional-datasets |
| New Yorker caption contest ratings | https://github.com/nextml/caption-contest-data |
| MovieLens Data Sets | http://grouplens.org/datasets/movielens/ |
| RDataMining - "R and Data Mining" ebook data | http://www.rdatamining.com/data |
| Registered Meteorites on Earth | http://healthintelligence.drupalgardens.com/content/registered-meteorites-has-impacted-earth-visualized |
| Restaurants Health Score Data in San Francisco | http://missionlocal.org/san-francisco-restaurant-health-inspections/ |
| UCI Machine Learning Repository | http://archive.ics.uci.edu/ml/ |
| Yahoo! Ratings and Classification Data | http://webscope.sandbox.yahoo.com/catalog.php?datatype=r |
| Museums | https://github.com/MagicPixel/awesome-public-datasets#id19 |
| Canada Science and Technology Museums Corporation's Open Data | http://techno-science.ca/en/data.php |
| Cooper-Hewitt's Collection Database | https://github.com/cooperhewitt/collection |
| Minneapolis Institute of Arts metadata | https://github.com/artsmia/collection |
| Natural History Museum (London) Data Portal | http://data.nhm.ac.uk/ |
| Rijksmuseum Historical Art Collection | https://www.rijksmuseum.nl/en/api |
| Tate Collection metadata | https://github.com/tategallery/collection |
| The Getty vocabularies | http://vocab.getty.edu |
| Natural Language | https://github.com/MagicPixel/awesome-public-datasets#id20 |
| Blogger Corpus | http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm |
| CLiPS Stylometry Investigation Corpus | http://www.clips.uantwerpen.be/datasets/csi-corpus |
| ClueWeb09 FACC | http://lemurproject.org/clueweb09/FACC1/ |
| ClueWeb12 FACC | http://lemurproject.org/clueweb12/FACC1/ |
| DBpedia - 4.58M things with 583M facts | http://wiki.dbpedia.org/Datasets |
| Flickr Personal Taxonomies | http://www.isi.edu/~lerman/downloads/flickr/flickr_taxonomies.html |
| Freebase.com of people, places, and things | http://www.freebase.com/ |
| Google Books Ngrams (2.2TB) | https://aws.amazon.com/datasets/google-books-ngrams/ |
| Google Web 5gram (1TB, 2006) | https://catalog.ldc.upenn.edu/LDC2006T13 |
| Gutenberg eBooks List | http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs |
| Hansards text chunks of Canadian Parliament | http://www.isi.edu/natural-language/download/hansard/ |
| Machine Comprehension Test (MCTest) of text from Microsoft Research | http://research.microsoft.com/en-us/um/redmond/projects/mctest/index.html |
| Machine Translation of European languages | http://statmt.org/wmt11/translation-task.html#download |
| Personae Corpus | http://www.clips.uantwerpen.be/datasets/personae-corpus |
| SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) | https://github.com/ParallelMazen/SaudiNewsNet |
| SMS Spam Collection in English | http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/ |
| USENET postings corpus, 2005-2011 | http://www.psych.ualberta.ca/~westburylab/downloads/usenetcorpus.download.html |
| Wikidata - Wikipedia databases | https://www.wikidata.org/wiki/Wikidata:Database_download |
| Wikipedia Links data - 40 Million Entities in Context | https://code.google.com/p/wiki-links/downloads/list |
| Universal Dependencies | http://universaldependencies.org |
| WordNet databases and tools | http://wordnet.princeton.edu/wordnet/download/ |
| Open Multilingual Wordnet | http://compling.hss.ntu.edu.sg/omw/ |
| Neuroscience | https://github.com/MagicPixel/awesome-public-datasets#id21 |
| Allen Institute Datasets | http://www.brain-map.org/ |
| Brain Catalogue | http://braincatalogue.org/ |
| Brainomics | http://brainomics.cea.fr/localizer |
| CodeNeuro Datasets | http://datasets.codeneuro.org/ |
| Collaborative Research in Computational Neuroscience (CRCNS) | http://crcns.org/data-sets |
| FCP-INDI | http://fcon_1000.projects.nitrc.org/index.html |
| Human Connectome Project | http://www.humanconnectome.org/data/ |
| NDAR | https://ndar.nih.gov/ |
| NIMH Data Archive | http://data-archive.nimh.nih.gov/ |
| NeuroData | http://neurodata.io |
| OASIS | http://www.oasis-brains.org/ |
| OpenfMRI | https://openfmri.org/ |
| Neuroelectro | http://neuroelectro.org/ |
| Study Forrest | http://studyforrest.org |
| Physics | https://github.com/MagicPixel/awesome-public-datasets#id22 |
| CERN Open Data Portal | http://opendata.cern.ch/ |
| Crystallography Open Database | http://www.crystallography.net/ |
| NASA Exoplanet Archive | http://exoplanetarchive.ipac.caltech.edu/ |
| NSSDC (NASA) data of 550 spacecraft | http://nssdc.gsfc.nasa.gov/nssdc/obtaining_data.html |
| Sloan Digital Sky Survey (SDSS) - Mapping the Universe | http://www.sdss.org/ |
| Psychology/Cognition | https://github.com/MagicPixel/awesome-public-datasets#id23 |
| OSU Cognitive Modeling Repository Datasets | http://www.cmr.osu.edu/browse/datasets |
| Public Domains | https://github.com/MagicPixel/awesome-public-datasets#id24 |
| Amazon | http://aws.amazon.com/datasets/ |
| Archive-it from Internet Archive | https://www.archive-it.org/explore?show=Collections |
| Archive.org Datasets | https://archive.org/details/datasets |
| CMU JASA data archive | http://lib.stat.cmu.edu/jasadata/ |
| CMU StatLab collections | http://lib.stat.cmu.edu/datasets/ |
| Data360 | http://www.data360.org/index.aspx |
| Datamob.org | http://datamob.org/datasets |
| Google | http://www.google.com/publicdata/directory |
| Infochimps | http://www.infochimps.com/ |
| KDNuggets Data Collections | http://www.kdnuggets.com/datasets/index.html |
| Microsoft Azure Data Market Free DataSets | http://datamarket.azure.com/browse/data?price=free |
| Numbrary | http://numbrary.com/ |
| Open Library Data Dumps | https://openlibrary.org/developers/dumps |
| Reddit Datasets | https://www.reddit.com/r/datasets |
| RevolutionAnalytics Collection | http://packages.revolutionanalytics.com/datasets/ |
| Sample R data sets | http://stat.ethz.ch/R-manual/R-patched/library/datasets/html/00Index.html |
| Stats4Stem R data sets | http://www.stats4stem.org/data-sets.html |
| StatSci.org | http://www.statsci.org/datasets.html |
| The Washington Post List | http://www.washingtonpost.com/wp-srv/metro/data/datapost.html |
| UCLA SOCR data collection | http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data |
| UFO Reports | http://www.nuforc.org/webreports.html |
| Wikileaks 911 pager intercepts | https://911.wikileaks.org/files/index.html |
| Yahoo Webscope | http://webscope.sandbox.yahoo.com/catalog.php |
| Search Engines | https://github.com/MagicPixel/awesome-public-datasets#id25 |
| Academic Torrents of data sharing from UMB | http://academictorrents.com/ |
| Datahub.io | https://datahub.io/dataset |
| DataMarket (Qlik) | https://datamarket.com/data/list/?q=all |
| Harvard Dataverse Network of scientific data | https://dataverse.harvard.edu/ |
| ICPSR (UMICH) | http://www.icpsr.umich.edu/icpsrweb/ICPSR/index.jsp |
| Institute of Education Sciences | http://eric.ed.gov |
| National Technical Reports Library | http://www.ntis.gov/products/ntrl/ |
| Open Data Certificates (beta) | https://certificates.theodi.org/en/datasets |
| OpenDataNetwork - A search engine of all Socrata powered data portals | http://www.opendatanetwork.com/ |
| Statista.com - Statistics and Studies | http://www.statista.com/ |
| Zenodo - An open dependable home for the long-tail of science | https://zenodo.org/collection/datasets |
| Social Networks | https://github.com/MagicPixel/awesome-public-datasets#id26 |
| 72 hours #gamergate Twitter Scrape | http://waxy.org/random/misc/gamergate_tweets.csv |
| Ancestry.com Forum Dataset over 10 years | http://www.cs.cmu.edu/~jelsas/data/ancestry.com/ |
| Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape | https://archive.org/details/twitter_cikm_2010 |
| CMU Enron Email of 150 users | http://www.cs.cmu.edu/~enron/ |
| EDRM Enron Email of 151 users, hosted on S3 | https://aws.amazon.com/datasets/enron-email-data/ |
| Facebook Data Scrape (2005) | https://archive.org/details/oxford-2005-facebook-matrix |
| Facebook Social Networks from LAW (since 2007) | http://law.di.unimi.it/datasets.php |
| Foursquare from UMN/Sarwat (2013) | https://archive.org/details/201309_foursquare_dataset_umn |
| GetGlue - users rating TV shows | http://getglue-data.s3.amazonaws.com/getglue_sample.tar.gz |
| GitHub Collaboration Archive | https://www.githubarchive.org/ |
| Google Scholar citation relations | http://www3.cs.stonybrook.edu/~leman/data/gscholar.db |
| High-Resolution Contact Networks from Wearable Sensors | http://www.sociopatterns.org/datasets/ |
| Mobile Social Networks from UMASS | https://kdl.cs.umass.edu/display/public/Mobile+Social+Networks |
| Network Twitter Data | http://snap.stanford.edu/data/higgs-twitter.html |
| Reddit Comments | https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/ |
| Skytrax' Air Travel Reviews Dataset | https://github.com/quankiquanki/skytrax-reviews-dataset |
| Social Twitter Data | http://snap.stanford.edu/data/egonets-Twitter.html |
| SourceForge.net Research Data | http://www3.nd.edu/~oss/Data/data.html |
| Twitter Data for Sentiment Analysis | http://help.sentiment140.com/for-students/ |
| Twitter Data for Online Reputation Management | http://nlp.uned.es/replab2013/ |
| Twitter Graph of entire Twitter site | http://an.kaist.ac.kr/traces/WWW2010.html |
| Twitter Scrape Calufa May 2011 | http://archive.org/details/2011-05-calufa-twitter-sql |
| UNIMI/LAW Social Network Datasets | http://law.di.unimi.it/datasets.php |
| Yahoo! Graph and Social Data | http://webscope.sandbox.yahoo.com/catalog.php?datatype=g |
| YouTube Video Social Graph in 2007, 2008 | http://netsg.cs.sfu.ca/youtubedata/ |
| Social Sciences | https://github.com/MagicPixel/awesome-public-datasets#id27 |
| ACLED (Armed Conflict Location & Event Data Project) | http://www.acleddata.com/ |
| Canadian Legal Information Institute | https://www.canlii.org/en/index.php |
| Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc. | http://www.systemicpeace.org/ |
| Correlates of War Project | http://www.correlatesofwar.org/ |
| Cryptome Conspiracy Theory Items | http://cryptome.org |
| Datacards | http://datacards.org |
| European Social Survey | http://www.europeansocialsurvey.org/data/ |
| FBI Hate Crime 2013 - aggregated data | https://github.com/emorisse/FBI-Hate-Crime-Statistics/tree/master/2013 |
| GDELT Global Events Database | http://gdeltproject.org/data.html |
| General Social Survey (GSS) since 1972 | http://gss.norc.org |
| German Social Survey | http://www.gesis.org/en/home/ |
| Global Religious Futures Project | http://www.globalreligiousfutures.org/ |
| Humanitarian Data Exchange | https://data.hdx.rwlabs.org/ |
| Institute for Demographic Studies | http://www.ined.fr/en/ |
| International Networks Archive | http://www.princeton.edu/~ina/ |
| International Social Survey Program ISSP | http://www.issp.org |
| International Studies Compendium Project | http://www.isacompendium.com/public/ |
| James McGuire Cross National Data | http://jmcguire.faculty.wesleyan.edu/welcome/cross-national-data/ |
| MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste | http://nsd.uib.no |
| Minnesota Population Center | https://www.ipums.org/ |
| MIT Reality Mining Dataset | http://realitycommons.media.mit.edu/realitymining.html |
| Open Crime and Policing Data in England, Wales and Northern Ireland | https://data.police.uk/data/ |
| Paul Hensel General International Data Page | http://www.paulhensel.org/dataintl.html |
| PewResearch Internet Survey Project | http://www.pewinternet.org/datasets/pages/2/ |
| PewResearch Society Data Collection | http://www.pewresearch.org/data/download-datasets/ |
| Political Polarity Data | http://www3.cs.stonybrook.edu/~leman/data/14-icwsm-political-polarity-data.zip |
| StackExchange Data Explorer | http://data.stackexchange.com/help |
| Terrorism Research and Analysis Consortium | http://www.trackingterrorism.org/ |
| Texas Inmates Executed Since 1984 | http://www.tdcj.state.tx.us/death_row/dr_executed_offenders.html |
| Titanic Survival Data Set | https://github.com/caesar0301/awesome-public-datasets/tree/master/Datasets |
| UCB's Archive of Social Science Data (D-Lab) | http://ucdata.berkeley.edu/ |
| Uppsala Conflict Data Program | http://ucdp.uu.se/ |
| UCLA Social Sciences Data Archive | http://dataarchives.ss.ucla.edu/Home.DataPortals.htm |
| UN Civil Society Database | http://esango.un.org/civilsociety/ |
| Universities Worldwide | http://univ.cc/ |
| UPJOHN for Labor Employment Research | http://www.upjohn.org/services/resources/employment-research-data-center |
| World Bank Data | http://data.worldbank.org/ |
| WorldPop project - Worldwide human population distributions | http://www.worldpop.org.uk/data/get_data/ |
| Software | https://github.com/MagicPixel/awesome-public-datasets#id28 |
| FLOSSmole data about free, libre, and open source software development | http://flossdata.syr.edu/data/ |
| Sports | https://github.com/MagicPixel/awesome-public-datasets#id29 |
| Basketball (NBA/NCAA/Euro) Player Database and Statistics | http://www.draftexpress.com/stats.php |
| Betfair Historical Exchange Data | http://data.betfair.com/ |
| Cricsheet Matches (cricket) | http://cricsheet.org/ |
| Ergast Formula 1, from 1950 up to date (API) | http://ergast.com/mrd/db |
| Football/Soccer resources (data and APIs) | http://www.jokecamp.com/blog/guide-to-football-and-soccer-data-and-apis/ |
| Lahman's Baseball Database | http://www.seanlahman.com/baseball-archive/statistics/ |
| Pinhooker: Thoroughbred Bloodstock Sale Data | https://github.com/phillc73/pinhooker |
| Retrosheet Baseball Statistics | http://www.retrosheet.org/game.htm |
| Time Series | https://github.com/MagicPixel/awesome-public-datasets#id30 |
| Databanks International Cross National Time Series Data Archive | http://www.cntsdata.com |
| Hard Drive Failure Rates | https://www.backblaze.com/hard-drive-test-data.html |
| Heart Rate Time Series from MIT | http://ecg.mit.edu/time-series/ |
| Time Series Data Library (TSDL) from MU | https://datamarket.com/data/list/?q=provider:tsdl |
| UC Riverside Time Series Dataset | http://www.cs.ucr.edu/~eamonn/time_series_data/ |
| Transportation | https://github.com/MagicPixel/awesome-public-datasets#id31 |
| Airlines OD Data 1987-2008 | http://stat-computing.org/dataexpo/2009/the-data.html |
| Bay Area Bike Share Data | http://www.bayareabikeshare.com/open-data |
| Bike Share Systems (BSS) collection | https://github.com/BetaNYC/Bike-Share-Data-Best-Practices/wiki/Bike-Share-Data-Systems |
| GeoLife GPS Trajectory from Microsoft Research | http://research.microsoft.com/en-us/downloads/b16d359d-d164-469e-9fd4-daa38f2b2e13/ |
| German train system by Deutsche Bahn | http://data.deutschebahn.com/datasets/ |
| Hubway Million Rides in MA | http://hubwaydatachallenge.org/trip-history-data/ |
| Marine Traffic - ship tracks, port calls and more | http://www.marinetraffic.com/de/ais-api-services |
| Montreal BIXI Bike Share | https://montreal.bixi.com/donn%C3%A9es-libre-service |
| NYC Taxi Trip Data 2009- | http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml |
| NYC Taxi Trip Data 2013 (FOIA/FOILed) | https://archive.org/details/nycTaxiTripData2013 |
| NYC Uber trip data April 2014 to September 2014 | https://github.com/fivethirtyeight/uber-tlc-foil-response |
| Open Traffic collection | https://github.com/graphhopper/open-traffic-collection |
| OpenFlights - airport, airline and route data | http://openflights.org/data.html |
| Philadelphia Bike Share Stations (JSON) | https://www.rideindego.com/stations/json/ |
| Plane Crash Database, since 1920 | http://www.planecrashinfo.com/database.htm |
| RITA Airline On-Time Performance data | http://www.transtats.bts.gov/Tables.asp?DB_ID=120 |
| RITA/BTS transport data collection (TranStat) | http://www.transtats.bts.gov/DataIndex.asp |
| Toronto Bike Share Stations (XML file) | http://www.bikesharetoronto.com/data/stations/bikeStations.xml |
| Transport for London (TFL) | https://tfl.gov.uk/info-for/open-data-users/data-feeds |
| Travel Tracker Survey (TTS) for Chicago | http://www.cmap.illinois.gov/data/transportation/travel-tracker-survey |
| U.S. Bureau of Transportation Statistics (BTS) | http://www.rita.dot.gov/bts/ |
| U.S. Domestic Flights 1990 to 2009 | http://academictorrents.com/details/a2ccf94bbb4af222bf8e69dad60a68a29f310d9a |
| U.S. Freight Analysis Framework since 2007 | http://ops.fhwa.dot.gov/freight/freight_analysis/faf/index.htm |
| Complementary Collections | https://github.com/MagicPixel/awesome-public-datasets#id32 |
| Data Packaged Core Datasets | https://github.com/datasets/ |
| Database of Scientific Code Contributions | https://mozillascience.org/collaborate |
| Some Datasets Available on the Web | http://www.datawrangling.com/some-datasets-available-on-the-web |
| Finding Data on the Internet | http://www.inside-r.org/howto/finding-data-internet |
| An overview of available open data resources in Europe | http://opendatamonitor.eu |
| Where can I find large datasets open to the public? | http://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public |
| 100+ Interesting Data Sets for Statistics | http://rs.io/100-interesting-data-sets-for-statistics/ |
| Leveraging open data to understand urban lives | http://xiaming.me/posts/2014/10/23/leveraging-open-data-to-understand-urban-lives/ |
| goo.gl/WZ8XAJ | https://goo.gl/WZ8XAJ |
| MIT license | https://github.com/MagicPixel/awesome-public-datasets#MIT-1-ov-file |