| route-pattern | /:user_id/:repository |
| route-controller | files |
| route-action | disambiguate |
| fetch-nonce | v2:6459ae82-96e8-0b33-f424-0d139d050c63 |
| current-catalog-service-hash | f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb |
| request-id | DECE:2E8F47:A3BAB6:E1A3A9:69695B4B |
| html-safe-nonce | 83edb41c2a4c5b14c4afc0716e9fa709a67a6e96bf18e482dd86f240a1e63ca0 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJERUNFOjJFOEY0NzpBM0JBQjY6RTFBM0E5OjY5Njk1QjRCIiwidmlzaXRvcl9pZCI6IjE5Nzk3MDMyMTc2NjE1MDAyMzUiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ== |
| visitor-hmac | 459c830582b9d33ff56589abe3f384b568d8cd1105b3785dcf76c0f9ccf9f77c |
| hovercard-subject-tag | repository:104411917 |
| github-keyboard-shortcuts | repository,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | // |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/hack-hyc/DAT4 |
| twitter:image | https://opengraph.githubassets.com/1ac0f35fc42fdb719a7a1391962413b3c215b578f638355a3fdc865081f33a05/hack-hyc/DAT4 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/1ac0f35fc42fdb719a7a1391962413b3c215b578f638355a3fdc865081f33a05/hack-hyc/DAT4 |
| og:image:alt | General Assembly's Data Science course in Washington, DC - hack-hyc/DAT4 |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| hostname | github.com |
| expected-hostname | github.com |
| None | 9db5f28da7e24035385d7f349f17890cbe016a939ddd7952be0f07b862094f5a |
| turbo-cache-control | no-preview |
| go-import | github.com/hack-hyc/DAT4 git https://github.com/hack-hyc/DAT4.git |
| octolytics-dimension-user_id | 20637790 |
| octolytics-dimension-user_login | hack-hyc |
| octolytics-dimension-repository_id | 104411917 |
| octolytics-dimension-repository_nwo | hack-hyc/DAT4 |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | true |
| octolytics-dimension-repository_parent_id | 27836310 |
| octolytics-dimension-repository_parent_nwo | justmarkham/DAT4 |
| octolytics-dimension-repository_network_root_id | 27836310 |
| octolytics-dimension-repository_network_root_nwo | justmarkham/DAT4 |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 4e59fe66217d3c72925af2a341ae3a8f2b5b5b2a |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
| Skip to content | https://github.com/hack-hyc/DAT4#start-of-content |
|
| https://github.com/ |
|
Sign in
| https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fhack-hyc%2FDAT4 |
| GitHub CopilotWrite better code with AI | https://github.com/features/copilot |
| GitHub SparkBuild and deploy intelligent apps | https://github.com/features/spark |
| GitHub ModelsManage and compare prompts | https://github.com/features/models |
| MCP RegistryNewIntegrate external tools | https://github.com/mcp |
| ActionsAutomate any workflow | https://github.com/features/actions |
| CodespacesInstant dev environments | https://github.com/features/codespaces |
| IssuesPlan and track work | https://github.com/features/issues |
| Code ReviewManage code changes | https://github.com/features/code-review |
| GitHub Advanced SecurityFind and fix vulnerabilities | https://github.com/security/advanced-security |
| Code securitySecure your code as you build | https://github.com/security/advanced-security/code-security |
| Secret protectionStop leaks before they start | https://github.com/security/advanced-security/secret-protection |
| Why GitHub | https://github.com/why-github |
| Documentation | https://docs.github.com |
| Blog | https://github.blog |
| Changelog | https://github.blog/changelog |
| Marketplace | https://github.com/marketplace |
| View all features | https://github.com/features |
| Enterprises | https://github.com/enterprise |
| Small and medium teams | https://github.com/team |
| Startups | https://github.com/enterprise/startups |
| Nonprofits | https://github.com/solutions/industry/nonprofits |
| App Modernization | https://github.com/solutions/use-case/app-modernization |
| DevSecOps | https://github.com/solutions/use-case/devsecops |
| DevOps | https://github.com/solutions/use-case/devops |
| CI/CD | https://github.com/solutions/use-case/ci-cd |
| View all use cases | https://github.com/solutions/use-case |
| Healthcare | https://github.com/solutions/industry/healthcare |
| Financial services | https://github.com/solutions/industry/financial-services |
| Manufacturing | https://github.com/solutions/industry/manufacturing |
| Government | https://github.com/solutions/industry/government |
| View all industries | https://github.com/solutions/industry |
| View all solutions | https://github.com/solutions |
| AI | https://github.com/resources/articles?topic=ai |
| Software Development | https://github.com/resources/articles?topic=software-development |
| DevOps | https://github.com/resources/articles?topic=devops |
| Security | https://github.com/resources/articles?topic=security |
| View all topics | https://github.com/resources/articles |
| Customer stories | https://github.com/customer-stories |
| Events & webinars | https://github.com/resources/events |
| Ebooks & reports | https://github.com/resources/whitepapers |
| Business insights | https://github.com/solutions/executive-insights |
| GitHub Skills | https://skills.github.com |
| Documentation | https://docs.github.com |
| Customer support | https://support.github.com |
| Community forum | https://github.com/orgs/community/discussions |
| Trust center | https://github.com/trust-center |
| Partners | https://github.com/partners |
| GitHub SponsorsFund open source developers | https://github.com/sponsors |
| Security Lab | https://securitylab.github.com |
| Maintainer Community | https://maintainers.github.com |
| Accelerator | https://github.com/accelerator |
| Archive Program | https://archiveprogram.github.com |
| Topics | https://github.com/topics |
| Trending | https://github.com/trending |
| Collections | https://github.com/collections |
| Enterprise platformAI-powered developer platform | https://github.com/enterprise |
| GitHub Advanced SecurityEnterprise-grade security features | https://github.com/security/advanced-security |
| Copilot for BusinessEnterprise-grade AI features | https://github.com/features/copilot/copilot-business |
| Premium SupportEnterprise-grade 24/7 support | https://github.com/premium-support |
| Pricing | https://github.com/pricing |
| Search syntax tips | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
| documentation | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
|
Sign in
| https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fhack-hyc%2FDAT4 |
|
Sign up
| https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=hack-hyc%2FDAT4 |
| Reload | https://github.com/hack-hyc/DAT4 |
| Reload | https://github.com/hack-hyc/DAT4 |
| Reload | https://github.com/hack-hyc/DAT4 |
|
hack-hyc
| https://github.com/hack-hyc |
| DAT4 | https://github.com/hack-hyc/DAT4 |
| justmarkham/DAT4 | https://github.com/justmarkham/DAT4 |
|
Notifications
| https://github.com/login?return_to=%2Fhack-hyc%2FDAT4 |
|
Fork
0
| https://github.com/login?return_to=%2Fhack-hyc%2FDAT4 |
|
Star
0
| https://github.com/login?return_to=%2Fhack-hyc%2FDAT4 |
|
0
stars
| https://github.com/hack-hyc/DAT4/stargazers |
|
668
forks
| https://github.com/hack-hyc/DAT4/forks |
|
Branches
| https://github.com/hack-hyc/DAT4/branches |
|
Tags
| https://github.com/hack-hyc/DAT4/tags |
|
Activity
| https://github.com/hack-hyc/DAT4/activity |
|
Star
| https://github.com/login?return_to=%2Fhack-hyc%2FDAT4 |
|
Notifications
| https://github.com/login?return_to=%2Fhack-hyc%2FDAT4 |
|
Code
| https://github.com/hack-hyc/DAT4 |
|
Pull requests
0
| https://github.com/hack-hyc/DAT4/pulls |
|
Actions
| https://github.com/hack-hyc/DAT4/actions |
|
Projects
0
| https://github.com/hack-hyc/DAT4/projects |
|
Security
Uh oh!
There was an error while loading. Please reload this page.
| https://github.com/hack-hyc/DAT4/security |
| Please reload this page | https://github.com/hack-hyc/DAT4 |
|
Insights
| https://github.com/hack-hyc/DAT4/pulse |
|
Code
| https://github.com/hack-hyc/DAT4 |
|
Pull requests
| https://github.com/hack-hyc/DAT4/pulls |
|
Actions
| https://github.com/hack-hyc/DAT4/actions |
|
Projects
| https://github.com/hack-hyc/DAT4/projects |
|
Security
| https://github.com/hack-hyc/DAT4/security |
|
Insights
| https://github.com/hack-hyc/DAT4/pulse |
| Branches | https://github.com/hack-hyc/DAT4/branches |
| Tags | https://github.com/hack-hyc/DAT4/tags |
| https://github.com/hack-hyc/DAT4/branches |
| https://github.com/hack-hyc/DAT4/tags |
| 140 Commits | https://github.com/hack-hyc/DAT4/commits/master/ |
| https://github.com/hack-hyc/DAT4/commits/master/ |
| code | https://github.com/hack-hyc/DAT4/tree/master/code |
| code | https://github.com/hack-hyc/DAT4/tree/master/code |
| data | https://github.com/hack-hyc/DAT4/tree/master/data |
| data | https://github.com/hack-hyc/DAT4/tree/master/data |
| homework | https://github.com/hack-hyc/DAT4/tree/master/homework |
| homework | https://github.com/hack-hyc/DAT4/tree/master/homework |
| notebooks | https://github.com/hack-hyc/DAT4/tree/master/notebooks |
| notebooks | https://github.com/hack-hyc/DAT4/tree/master/notebooks |
| slides | https://github.com/hack-hyc/DAT4/tree/master/slides |
| slides | https://github.com/hack-hyc/DAT4/tree/master/slides |
| .gitignore | https://github.com/hack-hyc/DAT4/blob/master/.gitignore |
| .gitignore | https://github.com/hack-hyc/DAT4/blob/master/.gitignore |
| README.md | https://github.com/hack-hyc/DAT4/blob/master/README.md |
| README.md | https://github.com/hack-hyc/DAT4/blob/master/README.md |
| peer_review.md | https://github.com/hack-hyc/DAT4/blob/master/peer_review.md |
| peer_review.md | https://github.com/hack-hyc/DAT4/blob/master/peer_review.md |
| project.md | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| project.md | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| public_data.md | https://github.com/hack-hyc/DAT4/blob/master/public_data.md |
| public_data.md | https://github.com/hack-hyc/DAT4/blob/master/public_data.md |
| resources.md | https://github.com/hack-hyc/DAT4/blob/master/resources.md |
| resources.md | https://github.com/hack-hyc/DAT4/blob/master/resources.md |
| README | https://github.com/hack-hyc/DAT4 |
| https://github.com/hack-hyc/DAT4#dat4-course-repository |
| General Assembly's Data Science course | https://generalassemb.ly/education/data-science/washington-dc/ |
| Data School blog | http://www.dataschool.io/ |
| email newsletter | http://www.dataschool.io/subscribe/ |
| YouTube channel | https://www.youtube.com/user/dataschool |
| Starbucks at 15th & K | http://www.yelp.com/biz/starbucks-washington-15 |
| Course Project information | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| Introduction | https://github.com/hack-hyc/DAT4#class-1-introduction |
| Python | https://github.com/hack-hyc/DAT4#class-2-python |
| Getting Data | https://github.com/hack-hyc/DAT4#class-3-getting-data |
| Git and GitHub | https://github.com/hack-hyc/DAT4#class-4-git-and-github |
| Pandas | https://github.com/hack-hyc/DAT4#class-5-pandas |
| Numpy, Machine Learning, KNN | https://github.com/hack-hyc/DAT4#class-6-numpy-machine-learning-knn |
| scikit-learn, Model Evaluation Procedures | https://github.com/hack-hyc/DAT4#class-7-scikit-learn-model-evaluation-procedures |
| Linear Regression | https://github.com/hack-hyc/DAT4#class-8-linear-regression |
| Logistic Regression,Preview of Other Models | https://github.com/hack-hyc/DAT4#class-9-logistic-regression-preview-of-other-models |
| Model Evaluation Metrics | https://github.com/hack-hyc/DAT4#class-10-model-evaluation-metrics |
| Working a Data Problem | https://github.com/hack-hyc/DAT4#class-11-working-a-data-problem |
| Clustering and Visualization | https://github.com/hack-hyc/DAT4#class-12-clustering-and-visualization |
| Naive Bayes | https://github.com/hack-hyc/DAT4#class-13-naive-bayes |
| Natural Language Processing | https://github.com/hack-hyc/DAT4#class-14-natural-language-processing |
| Decision Trees | https://github.com/hack-hyc/DAT4#class-15-decision-trees |
| Ensembling | https://github.com/hack-hyc/DAT4#class-16-ensembling |
| Databases and MapReduce | https://github.com/hack-hyc/DAT4#class-17-databases-and-mapreduce |
| Recommenders | https://github.com/hack-hyc/DAT4#class-18-recommenders |
| Advanced scikit-learn | https://github.com/hack-hyc/DAT4#class-19-advanced-scikit-learn |
| Course Review | https://github.com/hack-hyc/DAT4#class-20-course-review |
| Project Presentations | https://github.com/hack-hyc/DAT4#class-21-project-presentations |
| Project Presentations | https://github.com/hack-hyc/DAT4#class-22-project-presentations |
| https://github.com/hack-hyc/DAT4#installation-and-setup |
| Anaconda distribution | http://continuum.io/downloads |
| Git | http://git-scm.com/book/en/v2/Getting-Started-Installing-Git |
| GitHub | https://github.com/ |
| Slack | https://slack.com/ |
| https://github.com/hack-hyc/DAT4#class-1-introduction |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/01_course_overview.pdf |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/01_intro_to_data_science.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/00_python_refresher.py |
| Analyzing the Analyzers | http://cdn.oreillystatic.com/oreilly/radarreport/0636920029014/Analyzing_the_Analyzers.pdf |
| Data Community DC newsletter | http://www.datacommunitydc.org/thenewsletter/ |
| event calendar | http://www.datacommunitydc.org/calendar |
| https://github.com/hack-hyc/DAT4#class-2-python |
| solution | https://github.com/hack-hyc/DAT4/blob/master/code/02_python_quiz_solution.py |
| public data source | https://github.com/hack-hyc/DAT4/blob/master/public_data.md |
| FiveThirtyEight alcohol data | https://github.com/fivethirtyeight/data/tree/master/alcohol-consumption |
| revised data | https://github.com/hack-hyc/DAT4/blob/master/data/drinks.csv |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/02_file_io.py |
| Python exercise | https://github.com/hack-hyc/DAT4/blob/master/code/02_file_io_homework.py |
| solution | https://github.com/hack-hyc/DAT4/blob/master/code/02_file_io_homework_solution.py |
| project page | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| projects from past Data Science courses | https://github.com/justmarkham/DAT-project-examples |
| A Crash Course in Python | http://nbviewer.ipython.org/gist/rpmuller/5920182 |
| Codecademy's Python course | http://www.codecademy.com/en/tracks/python |
| Google's Python Class | https://developers.google.com/edu/python/ |
| student projects | http://cs229.stanford.edu/projects2013.html |
| Machine Learning course | http://cs229.stanford.edu/ |
| Online Python Tutor | http://pythontutor.com/ |
| https://github.com/hack-hyc/DAT4#class-3-getting-data |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/03_getting_data.pdf |
| regex code | https://github.com/hack-hyc/DAT4/blob/master/code/03_re_example.py |
| web scraping and API code | https://github.com/hack-hyc/DAT4/blob/master/code/03_getting_data.py |
| command line tutorial | http://generalassembly.github.io/prework/command-line/#/ |
| quiz | https://gahub.typeform.com/to/J6xirf |
| install Git | http://git-scm.com/book/en/v2/Getting-Started-Installing-Git |
| GitHub account | https://github.com/ |
| regex101 | https://regex101.com/#python |
| excellent regex lesson | https://developers.google.com/edu/python/regular-expressions |
| video | http://www.youtube.com/watch?v=kWyoYtvJpe4 |
| Mashape | https://www.mashape.com/explore |
| Apigee | https://apigee.com/providers |
| Python API wrapper | http://www.pythonforbeginners.com/api/list-of-python-apis |
| https://github.com/hack-hyc/DAT4#class-4-git-and-github |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/04_git_github.pdf |
| question and data set | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| DAT4-students | https://github.com/justmarkham/DAT4-students |
| Pro Git | http://git-scm.com/book/en/v2 |
| GitRef | http://gitref.org/ |
| Git quick reference for beginners | http://www.dataschool.io/git-quick-reference-for-beginners/ |
| Markdown Cheatsheet | https://github.com/adam-p/markdown-here/wiki/Markdown-Cheatsheet |
| GitHub Flavored Markdown | https://help.github.com/articles/github-flavored-markdown/ |
| https://github.com/hack-hyc/DAT4#class-5-pandas |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/05_pandas.py |
| Split-Apply-Combine | http://i.imgur.com/yjNkiwL.png |
| joins in Pandas | http://www.gregreda.com/2013/10/26/working-with-pandas-dataframes/#joining |
| Pandas homework | https://github.com/hack-hyc/DAT4/blob/master/homework/05_pandas.md |
| three-part tutorial | http://www.gregreda.com/2013/10/26/intro-to-pandas-data-structures/ |
| introduction | http://nbviewer.ipython.org/urls/raw.github.com/fonnesbeck/Bios366/master/notebooks/Section2_5-Introduction-to-Pandas.ipynb |
| data wrangling | http://nbviewer.ipython.org/urls/raw.github.com/fonnesbeck/Bios366/master/notebooks/Section2_6-Data-Wrangling-with-Pandas.ipynb |
| plotting | http://nbviewer.ipython.org/urls/raw.github.com/fonnesbeck/Bios366/master/notebooks/Section2_7-Plotting-with-Pandas.ipynb |
| visualization page | http://pandas.pydata.org/pandas-docs/stable/visualization.html |
| notebook on matplotlib | http://nbviewer.ipython.org/github/fonnesbeck/Bios366/blob/master/notebooks/Section2_4-Matplotlib.ipynb |
| Choosing a Good Chart | http://www.extremepresentation.com/uploads/documents/choosing_a_good_chart.pdf |
| slide deck | http://www2.research.att.com/~volinsky/DataMining/Columbia2011/Slides/Topic2-EDAViz.ppt |
| https://github.com/hack-hyc/DAT4#class-6-numpy-machine-learning-knn |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/06_numpy.py |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/06_iris_prework.py |
| solution | https://github.com/hack-hyc/DAT4/blob/master/code/06_iris_solution.py |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/06_ml_knn.pdf |
| Understanding the Bias-Variance Tradeoff | http://scott.fortmann-roe.com/docs/BiasVariance.html |
| An Introduction to Statistical Learning | http://www-bcf.usc.edu/~gareth/ISL/ |
| https://github.com/hack-hyc/DAT4#class-7-scikit-learn-model-evaluation-procedures |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/07_sklearn_knn.py |
| user guide | http://scikit-learn.org/stable/modules/neighbors.html |
| module reference | http://scikit-learn.org/stable/modules/classes.html#module-sklearn.neighbors |
| class documentation | http://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassifier.html |
| article | http://scott.fortmann-roe.com/docs/BiasVariance.html |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/07_model_evaluation_procedures.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/07_model_evaluation_procedures.py |
| data exploration and analysis plan | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| UCI Machine Learning Repository | http://archive.ics.uci.edu/ml/datasets.html |
| Glass Identification Data Set | http://archive.ics.uci.edu/ml/datasets/Glass+Identification |
| 30-second explanation of overfitting | http://www.quora.com/What-is-an-intuitive-explanation-of-overfitting/answer/Jessica-Su |
| overfitting and train/test split | https://www.youtube.com/watch?v=_2ij6eaaSl0 |
| cross-validation | https://www.youtube.com/watch?v=nZAM5OXrktY |
| An Introduction to Statistical Learning | http://www-bcf.usc.edu/~gareth/ISL/ |
| excellent, simple example of the bias-variance tradeoff | http://work.caltech.edu/library/081.html |
| https://github.com/hack-hyc/DAT4#class-8-linear-regression |
| IPython notebook | http://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/08_linear_regression.ipynb |
| data exploration and analysis plan | https://github.com/hack-hyc/DAT4/blob/master/project.md |
| An Introduction to Statistical Learning | http://www-bcf.usc.edu/~gareth/ISL/ |
| related videos | http://www.dataschool.io/15-hours-of-expert-machine-learning-videos/ |
| quick reference guide | http://www.dataschool.io/applying-and-interpreting-linear-regression/ |
| simple linear regression | http://www.datarobot.com/blog/ordinary-least-squares-in-python/ |
| multiple linear regression | http://www.datarobot.com/blog/multiple-regression-using-statsmodels/ |
| introduction to linear regression | http://people.duke.edu/~rnau/regintro.htm |
| assumptions of linear regression | http://pareonline.net/getvn.asp?n=2&v=8 |
| https://github.com/hack-hyc/DAT4#class-9-logistic-regression-preview-of-other-models |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/09_logistic_regression.pdf |
| exercise | https://github.com/hack-hyc/DAT4/blob/master/code/09_logistic_regression_exercise.py |
| solution | https://github.com/hack-hyc/DAT4/blob/master/code/09_logistic_regression_class.py |
| first three videos | https://www.youtube.com/playlist?list=PL5-da3qGB5IC4vaDba5ClatUmFppXLAhE |
| relationship between probability, odds, and log-odds | http://www.ats.ucla.edu/stat/mult_pkg/faq/general/odds_ratio.htm |
| intuition behind "e" | http://betterexplained.com/articles/an-intuitive-guide-to-exponential-functions-e/ |
| interpreting logistic regression coefficients | http://www.unm.edu/~schrader/biostat/bio2/Spr06/lec11.pdf |
| https://github.com/hack-hyc/DAT4#class-10-model-evaluation-metrics |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/07_model_evaluation_procedures.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/07_model_evaluation_procedures.py |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/10_model_evaluation_metrics.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/10_rmse.py |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/10_confusion_roc.py |
| video | https://www.youtube.com/watch?v=OAl6eAyP-yo |
| Model evaluation homework | https://github.com/hack-hyc/DAT4/blob/master/homework/10_model_evaluation.md |
| Sample solution code | https://github.com/hack-hyc/DAT4/blob/master/code/10_glass_id_homework_solution.py |
| Kaggle project presentation video | https://www.youtube.com/watch?v=HGr1yQV3Um0 |
| Smart Autofill | http://googleresearch.blogspot.com/2014/10/smart-autofill-harnessing-predictive.html |
| Kaggle Transforms Data Science Into Competitive Sport | https://www.youtube.com/watch?v=8w4UY66GKcM |
| model evaluation | http://scikit-learn.org/stable/modules/model_evaluation.html |
| model evaluation metrics | https://www.kaggle.com/wiki/Metrics |
| simple guide to confusion matrix terminology | http://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/ |
| blog post about the ROC video | http://www.dataschool.io/roc-curves-and-auc-explained/ |
| Sensitivity and Specificity | https://www.youtube.com/watch?v=U4_3fditnWg&list=PL41ckbAGB5S2PavLIXUETzAmi5reIod23 |
| ROC Curves | https://www.youtube.com/watch?v=21Igj5Pr6u4&list=PL41ckbAGB5S2PavLIXUETzAmi5reIod23 |
| https://github.com/hack-hyc/DAT4#class-11-working-a-data-problem |
| data | https://github.com/hack-hyc/DAT4/blob/master/data/ZYX_prices.csv |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/11_GA_Stocks.pdf |
| https://github.com/hack-hyc/DAT4#class-12-clustering-and-visualization |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/12_clustering.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code |
| data | https://github.com/hack-hyc/DAT4/blob/master/data/songs.csv |
| A Plan for Spam | http://www.paulgraham.com/spam.html |
| Kevin's guide | http://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/ |
| excellent video | https://www.youtube.com/watch?v=U4_3fditnWg&list=PL41ckbAGB5S2PavLIXUETzAmi5reIod23 |
| introductory slides | https://docs.google.com/presentation/d/1cM2dVbJgTWMkHoVNmYlB9df6P2H8BrjaqAcZTaLe9dA/edit#slide=id.gfc3caad2_00 |
| OpenIntro Statistics textbook | https://www.openintro.org/stat/textbook.php |
| Introduction to Data Mining | http://www-users.cs.umn.edu/~kumar/dmbook/index.php |
| chapter on cluster analysis | http://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf |
| section on clustering | http://scikit-learn.org/stable/modules/clustering.html |
| https://github.com/hack-hyc/DAT4#class-13-naive-bayes |
| A Plan for Spam | http://www.paulgraham.com/spam.html |
| Slides | https://github.com/hack-hyc/DAT4/blob/master/slides/13_naive_bayes.pdf |
| Visualization of conditional probability | http://setosa.io/conditional/ |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/13_bayes_iris.py |
| Slides | https://github.com/hack-hyc/DAT4/blob/master/slides/13_naive_bayes.pdf |
| Airport security example | http://www.quora.com/In-laymans-terms-how-does-Naive-Bayes-work/answer/Konstantin-Tt |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/13_naive_bayes.py |
| SMS Spam Collection | https://archive.ics.uci.edu/ml/datasets/SMS+Spam+Collection |
| CountVectorizer | http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html |
| Naive Bayes | http://scikit-learn.org/stable/modules/naive_bayes.html |
| Visualizing Bayes' theorem | http://oscarbonilla.com/2009/05/visualizing-bayes-theorem/ |
| Bayes' Rule for Ducks | https://planspacedotorg.wordpress.com/2014/02/23/bayes-rule-for-ducks/ |
| 5-minute video on conditional probability | https://www.youtube.com/watch?v=Zxm4Xxvzohk |
| slides on conditional probability | https://docs.google.com/presentation/d/1psUIyig6OxHQngGEHr3TMkCvhdLInnKnclQoNUr4G4U/edit#slide=id.gfc69f484_00 |
| Naive Bayes classifier | http://en.wikipedia.org/wiki/Naive_Bayes_classifier |
| Naive Bayes spam filtering | http://en.wikipedia.org/wiki/Naive_Bayes_spam_filtering |
| Q&A | http://stats.stackexchange.com/questions/21822/understanding-naive-bayes |
| his follow-up article | http://www.paulgraham.com/better.html |
| related paper | http://www.merl.com/publications/docs/TR2004-091.pdf |
| https://github.com/hack-hyc/DAT4#class-14-natural-language-processing |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/14_natural_language_processing.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/14_nlp_class.py |
| Natural Language Processing with Python | http://www.nltk.org/book/ |
| NLP online course | https://www.coursera.org/course/nlp |
| video lectures | https://class.coursera.org/nlp/lecture |
| slides | http://web.stanford.edu/~jurafsky/NLPCourseraSlides.html |
| Brief slides | http://files.meetup.com/7616132/DC-NLP-2013-09%20Charlie%20Greenbacker.pdf |
| Detailed slides | https://github.com/ga-students/DAT_SF_9/blob/master/16_Text_Mining/DAT9_lec16_Text_Mining.pdf |
| A visual survey of text visualization techniques | http://textvis.lnu.se/ |
| DC Natural Language Processing | http://www.meetup.com/DC-NLP/ |
| Stanford CoreNLP | http://nlp.stanford.edu/software/corenlp.shtml |
| Python introductory lesson | https://developers.google.com/edu/python/regular-expressions |
| reference guide | https://github.com/justmarkham/DAT3/blob/master/code/99_regex_reference.py |
| real-time regex tester | https://regex101.com/#python |
| in-depth tutorials | http://www.rexegg.com/ |
| SpaCy | http://honnibal.github.io/spaCy/ |
| https://github.com/hack-hyc/DAT4#class-15-decision-trees |
| IPython notebook | http://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/15_decision_trees.ipynb |
| these guidelines | https://github.com/hack-hyc/DAT4/blob/master/peer_review.md |
| Decision Trees | http://scikit-learn.org/stable/modules/tree.html |
| Download and install PKG file | http://www.graphviz.org/Download_macos.php |
| Download and install MSI file | http://www.graphviz.org/Download_windows.php |
| https://github.com/hack-hyc/DAT4#class-16-ensembling |
| IPython notebook | http://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/16_ensembling.ipynb |
| Ensemble Methods | http://scikit-learn.org/stable/modules/ensemble.html |
| How do random forests work in layman's terms? | http://www.quora.com/How-do-random-forests-work-in-laymans-terms/answer/Edwin-Chen-1 |
| https://github.com/hack-hyc/DAT4#class-17-databases-and-mapreduce |
| database code | https://github.com/hack-hyc/DAT4/blob/master/code/17_sql.py |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/17_db_mr.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/17_map_reduce.py |
| Forbes: Is it Time for Hadoop Alternatives? | http://www.forbes.com/sites/johnwebster/2014/12/08/is-it-time-for-hadoop-alternatives/ |
| IBM: What is MapReduce? | http://www-01.ibm.com/software/data/infosphere/hadoop/mapreduce/ |
| Wakari MapReduce IPython notebook | https://www.wakari.io/sharing/bundle/nkorf/MapReduce%20Example |
| What Every Data Scientist Needs to Know about SQL | http://joshualande.com/data-science-sql/ |
| Brandon's SQL Bootcamp | https://github.com/brandonmburroughs/sql_bootcamp |
| SQLZOO | http://sqlzoo.net/wiki/Main_Page |
| Mode Analytics | http://sqlschool.modeanalytics.com/ |
| https://github.com/hack-hyc/DAT4#class-18-recommenders |
| slides | https://github.com/hack-hyc/DAT4/blob/master/slides/18_recommendation_engines.pdf |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/18_recommenders_class.py |
| The Netflix Prize | http://www.netflixprize.com/ |
| Why Netflix never implemented the winning solution | https://www.techdirt.com/blog/innovation/articles/20120409/03412518422/why-netflix-never-implemented-algorithm-that-won-netflix-1-million-challenge.shtml |
| Visualization of the Music Genome Project | http://www.music-map.com/ |
| The People Inside Your Machine | http://www.npr.org/blogs/money/2015/01/30/382657657/episode-600-the-people-inside-your-machine |
| https://github.com/hack-hyc/DAT4#class-19-advanced-scikit-learn |
| code | https://github.com/hack-hyc/DAT4/blob/master/code/19_advanced_sklearn.py |
| GridSearchCV | http://scikit-learn.org/stable/modules/grid_search.html |
| StandardScaler | http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html |
| Pipeline | http://scikit-learn.org/stable/modules/pipeline.html |
| notebook | http://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/19_regularization.ipynb |
| Ridge, RidgeCV, Lasso, LassoCV | http://scikit-learn.org/stable/modules/linear_model.html |
| LogisticRegression | http://scikit-learn.org/stable/modules/linear_model.html |
| RFE, RFECV | http://scikit-learn.org/stable/modules/feature_selection.html |
| A Few Useful Things to Know about Machine Learning | http://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf |
| feature scaling | http://nbviewer.ipython.org/github/rasbt/pattern_classification/blob/master/preprocessing/about_standardization_normalization.ipynb |
| Clever Methods of Overfitting | http://hunch.net/?p=22 |
| Common Pitfalls in Machine Learning | http://danielnee.com/?p=155 |
| https://github.com/hack-hyc/DAT4#class-20-course-review |
| Data science review | https://docs.google.com/document/d/1XCdyrsQwU5OC5os7RHdVTEtS-tpHBbsoKKWLpYI6Svo/edit?usp=sharing |
| Comparing supervised learning algorithms | https://docs.google.com/spreadsheets/d/15_QJXm6urctsbIXO-C_eXrsSffbHedio8z0E5ozxO-M/edit?usp=sharing |
| Choosing a Machine Learning Classifier | http://blog.echen.me/2011/04/27/choosing-a-machine-learning-classifier/ |
| scikit-learn "machine learning map" | http://scikit-learn.org/stable/tutorial/machine_learning_map/ |
| Machine Learning Done Wrong | http://ml.posthaven.com/machine-learning-done-wrong |
| Practical machine learning tricks from the KDD 2011 best industry paper | http://blog.david-andrzejewski.com/machine-learning/practical-machine-learning-tricks-from-the-kdd-2011-best-industry-paper/ |
| An Empirical Comparison of Supervised Learning Algorithms | http://www.cs.cornell.edu/~caruana/ctp/ct.papers/caruana.icml06.pdf |
| Getting in Shape for the Sport of Data Science | https://www.youtube.com/watch?v=kwt6XEh7U3g |
| Resources for continued learning! | https://github.com/hack-hyc/DAT4/blob/master/resources.md |
| https://github.com/hack-hyc/DAT4#class-21-project-presentations |
| https://github.com/hack-hyc/DAT4#class-22-project-presentations |
|
Readme
| https://github.com/hack-hyc/DAT4#readme-ov-file |
| Please reload this page | https://github.com/hack-hyc/DAT4 |
|
Activity | https://github.com/hack-hyc/DAT4/activity |
|
0
stars | https://github.com/hack-hyc/DAT4/stargazers |
|
1
watching | https://github.com/hack-hyc/DAT4/watchers |
|
0
forks | https://github.com/hack-hyc/DAT4/forks |
|
Report repository
| https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fhack-hyc%2FDAT4&report=hack-hyc+%28user%29 |
| Releases | https://github.com/hack-hyc/DAT4/releases |
| Packages
0 | https://github.com/users/hack-hyc/packages?repo_name=DAT4 |
|
| https://github.com |
| Terms | https://docs.github.com/site-policy/github-terms/github-terms-of-service |
| Privacy | https://docs.github.com/site-policy/privacy-policies/github-privacy-statement |
| Security | https://github.com/security |
| Status | https://www.githubstatus.com/ |
| Community | https://github.community/ |
| Docs | https://docs.github.com/ |
| Contact | https://support.github.com?tags=dotcom-footer |