René's URL Explorer Experiment


Title: GitHub - hack-hyc/DAT4: General Assembly's Data Science course in Washington, DC

Open Graph Title: GitHub - hack-hyc/DAT4: General Assembly's Data Science course in Washington, DC

X Title: GitHub - hack-hyc/DAT4: General Assembly's Data Science course in Washington, DC

Description: General Assembly's Data Science course in Washington, DC - hack-hyc/DAT4

Open Graph Description: General Assembly's Data Science course in Washington, DC - hack-hyc/DAT4

X Description: General Assembly's Data Science course in Washington, DC - hack-hyc/DAT4

Opengraph URL: https://github.com/hack-hyc/DAT4

X: @github

direct link

Domain: github.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:6459ae82-96e8-0b33-f424-0d139d050c63
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idDECE:2E8F47:A3BAB6:E1A3A9:69695B4B
html-safe-nonce83edb41c2a4c5b14c4afc0716e9fa709a67a6e96bf18e482dd86f240a1e63ca0
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJERUNFOjJFOEY0NzpBM0JBQjY6RTFBM0E5OjY5Njk1QjRCIiwidmlzaXRvcl9pZCI6IjE5Nzk3MDMyMTc2NjE1MDAyMzUiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmac459c830582b9d33ff56589abe3f384b568d8cd1105b3785dcf76c0f9ccf9f77c
hovercard-subject-tagrepository:104411917
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/hack-hyc/DAT4
twitter:imagehttps://opengraph.githubassets.com/1ac0f35fc42fdb719a7a1391962413b3c215b578f638355a3fdc865081f33a05/hack-hyc/DAT4
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/1ac0f35fc42fdb719a7a1391962413b3c215b578f638355a3fdc865081f33a05/hack-hyc/DAT4
og:image:altGeneral Assembly's Data Science course in Washington, DC - hack-hyc/DAT4
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None9db5f28da7e24035385d7f349f17890cbe016a939ddd7952be0f07b862094f5a
turbo-cache-controlno-preview
go-importgithub.com/hack-hyc/DAT4 git https://github.com/hack-hyc/DAT4.git
octolytics-dimension-user_id20637790
octolytics-dimension-user_loginhack-hyc
octolytics-dimension-repository_id104411917
octolytics-dimension-repository_nwohack-hyc/DAT4
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id27836310
octolytics-dimension-repository_parent_nwojustmarkham/DAT4
octolytics-dimension-repository_network_root_id27836310
octolytics-dimension-repository_network_root_nwojustmarkham/DAT4
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release4e59fe66217d3c72925af2a341ae3a8f2b5b5b2a
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/hack-hyc/DAT4#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fhack-hyc%2FDAT4
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fhack-hyc%2FDAT4
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=hack-hyc%2FDAT4
Reloadhttps://github.com/hack-hyc/DAT4
Reloadhttps://github.com/hack-hyc/DAT4
Reloadhttps://github.com/hack-hyc/DAT4
hack-hyc https://github.com/hack-hyc
DAT4https://github.com/hack-hyc/DAT4
justmarkham/DAT4https://github.com/justmarkham/DAT4
Notifications https://github.com/login?return_to=%2Fhack-hyc%2FDAT4
Fork 0 https://github.com/login?return_to=%2Fhack-hyc%2FDAT4
Star 0 https://github.com/login?return_to=%2Fhack-hyc%2FDAT4
0 stars https://github.com/hack-hyc/DAT4/stargazers
668 forks https://github.com/hack-hyc/DAT4/forks
Branches https://github.com/hack-hyc/DAT4/branches
Tags https://github.com/hack-hyc/DAT4/tags
Activity https://github.com/hack-hyc/DAT4/activity
Star https://github.com/login?return_to=%2Fhack-hyc%2FDAT4
Notifications https://github.com/login?return_to=%2Fhack-hyc%2FDAT4
Code https://github.com/hack-hyc/DAT4
Pull requests 0 https://github.com/hack-hyc/DAT4/pulls
Actions https://github.com/hack-hyc/DAT4/actions
Projects 0 https://github.com/hack-hyc/DAT4/projects
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/hack-hyc/DAT4/security
Please reload this pagehttps://github.com/hack-hyc/DAT4
Insights https://github.com/hack-hyc/DAT4/pulse
Code https://github.com/hack-hyc/DAT4
Pull requests https://github.com/hack-hyc/DAT4/pulls
Actions https://github.com/hack-hyc/DAT4/actions
Projects https://github.com/hack-hyc/DAT4/projects
Security https://github.com/hack-hyc/DAT4/security
Insights https://github.com/hack-hyc/DAT4/pulse
Brancheshttps://github.com/hack-hyc/DAT4/branches
Tagshttps://github.com/hack-hyc/DAT4/tags
https://github.com/hack-hyc/DAT4/branches
https://github.com/hack-hyc/DAT4/tags
140 Commitshttps://github.com/hack-hyc/DAT4/commits/master/
https://github.com/hack-hyc/DAT4/commits/master/
codehttps://github.com/hack-hyc/DAT4/tree/master/code
codehttps://github.com/hack-hyc/DAT4/tree/master/code
datahttps://github.com/hack-hyc/DAT4/tree/master/data
datahttps://github.com/hack-hyc/DAT4/tree/master/data
homeworkhttps://github.com/hack-hyc/DAT4/tree/master/homework
homeworkhttps://github.com/hack-hyc/DAT4/tree/master/homework
notebookshttps://github.com/hack-hyc/DAT4/tree/master/notebooks
notebookshttps://github.com/hack-hyc/DAT4/tree/master/notebooks
slideshttps://github.com/hack-hyc/DAT4/tree/master/slides
slideshttps://github.com/hack-hyc/DAT4/tree/master/slides
.gitignorehttps://github.com/hack-hyc/DAT4/blob/master/.gitignore
.gitignorehttps://github.com/hack-hyc/DAT4/blob/master/.gitignore
README.mdhttps://github.com/hack-hyc/DAT4/blob/master/README.md
README.mdhttps://github.com/hack-hyc/DAT4/blob/master/README.md
peer_review.mdhttps://github.com/hack-hyc/DAT4/blob/master/peer_review.md
peer_review.mdhttps://github.com/hack-hyc/DAT4/blob/master/peer_review.md
project.mdhttps://github.com/hack-hyc/DAT4/blob/master/project.md
project.mdhttps://github.com/hack-hyc/DAT4/blob/master/project.md
public_data.mdhttps://github.com/hack-hyc/DAT4/blob/master/public_data.md
public_data.mdhttps://github.com/hack-hyc/DAT4/blob/master/public_data.md
resources.mdhttps://github.com/hack-hyc/DAT4/blob/master/resources.md
resources.mdhttps://github.com/hack-hyc/DAT4/blob/master/resources.md
READMEhttps://github.com/hack-hyc/DAT4
https://github.com/hack-hyc/DAT4#dat4-course-repository
General Assembly's Data Science coursehttps://generalassemb.ly/education/data-science/washington-dc/
Data School bloghttp://www.dataschool.io/
email newsletterhttp://www.dataschool.io/subscribe/
YouTube channelhttps://www.youtube.com/user/dataschool
Starbucks at 15th & Khttp://www.yelp.com/biz/starbucks-washington-15
Course Project informationhttps://github.com/hack-hyc/DAT4/blob/master/project.md
Introductionhttps://github.com/hack-hyc/DAT4#class-1-introduction
Pythonhttps://github.com/hack-hyc/DAT4#class-2-python
Getting Datahttps://github.com/hack-hyc/DAT4#class-3-getting-data
Git and GitHubhttps://github.com/hack-hyc/DAT4#class-4-git-and-github
Pandashttps://github.com/hack-hyc/DAT4#class-5-pandas
Numpy, Machine Learning, KNNhttps://github.com/hack-hyc/DAT4#class-6-numpy-machine-learning-knn
scikit-learn, Model Evaluation Procedureshttps://github.com/hack-hyc/DAT4#class-7-scikit-learn-model-evaluation-procedures
Linear Regressionhttps://github.com/hack-hyc/DAT4#class-8-linear-regression
Logistic Regression,Preview of Other Modelshttps://github.com/hack-hyc/DAT4#class-9-logistic-regression-preview-of-other-models
Model Evaluation Metricshttps://github.com/hack-hyc/DAT4#class-10-model-evaluation-metrics
Working a Data Problemhttps://github.com/hack-hyc/DAT4#class-11-working-a-data-problem
Clustering and Visualizationhttps://github.com/hack-hyc/DAT4#class-12-clustering-and-visualization
Naive Bayeshttps://github.com/hack-hyc/DAT4#class-13-naive-bayes
Natural Language Processinghttps://github.com/hack-hyc/DAT4#class-14-natural-language-processing
Decision Treeshttps://github.com/hack-hyc/DAT4#class-15-decision-trees
Ensemblinghttps://github.com/hack-hyc/DAT4#class-16-ensembling
Databases and MapReducehttps://github.com/hack-hyc/DAT4#class-17-databases-and-mapreduce
Recommendershttps://github.com/hack-hyc/DAT4#class-18-recommenders
Advanced scikit-learnhttps://github.com/hack-hyc/DAT4#class-19-advanced-scikit-learn
Course Reviewhttps://github.com/hack-hyc/DAT4#class-20-course-review
Project Presentationshttps://github.com/hack-hyc/DAT4#class-21-project-presentations
Project Presentationshttps://github.com/hack-hyc/DAT4#class-22-project-presentations
https://github.com/hack-hyc/DAT4#installation-and-setup
Anaconda distributionhttp://continuum.io/downloads
Githttp://git-scm.com/book/en/v2/Getting-Started-Installing-Git
GitHubhttps://github.com/
Slackhttps://slack.com/
https://github.com/hack-hyc/DAT4#class-1-introduction
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/01_course_overview.pdf
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/01_intro_to_data_science.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/00_python_refresher.py
Analyzing the Analyzershttp://cdn.oreillystatic.com/oreilly/radarreport/0636920029014/Analyzing_the_Analyzers.pdf
Data Community DC newsletterhttp://www.datacommunitydc.org/thenewsletter/
event calendarhttp://www.datacommunitydc.org/calendar
https://github.com/hack-hyc/DAT4#class-2-python
solutionhttps://github.com/hack-hyc/DAT4/blob/master/code/02_python_quiz_solution.py
public data sourcehttps://github.com/hack-hyc/DAT4/blob/master/public_data.md
FiveThirtyEight alcohol datahttps://github.com/fivethirtyeight/data/tree/master/alcohol-consumption
revised datahttps://github.com/hack-hyc/DAT4/blob/master/data/drinks.csv
codehttps://github.com/hack-hyc/DAT4/blob/master/code/02_file_io.py
Python exercisehttps://github.com/hack-hyc/DAT4/blob/master/code/02_file_io_homework.py
solutionhttps://github.com/hack-hyc/DAT4/blob/master/code/02_file_io_homework_solution.py
project pagehttps://github.com/hack-hyc/DAT4/blob/master/project.md
projects from past Data Science courseshttps://github.com/justmarkham/DAT-project-examples
A Crash Course in Pythonhttp://nbviewer.ipython.org/gist/rpmuller/5920182
Codecademy's Python coursehttp://www.codecademy.com/en/tracks/python
Google's Python Classhttps://developers.google.com/edu/python/
student projectshttp://cs229.stanford.edu/projects2013.html
Machine Learning coursehttp://cs229.stanford.edu/
Online Python Tutorhttp://pythontutor.com/
https://github.com/hack-hyc/DAT4#class-3-getting-data
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/03_getting_data.pdf
regex codehttps://github.com/hack-hyc/DAT4/blob/master/code/03_re_example.py
web scraping and API codehttps://github.com/hack-hyc/DAT4/blob/master/code/03_getting_data.py
command line tutorialhttp://generalassembly.github.io/prework/command-line/#/
quizhttps://gahub.typeform.com/to/J6xirf
install Githttp://git-scm.com/book/en/v2/Getting-Started-Installing-Git
GitHub accounthttps://github.com/
regex101https://regex101.com/#python
excellent regex lessonhttps://developers.google.com/edu/python/regular-expressions
videohttp://www.youtube.com/watch?v=kWyoYtvJpe4
Mashapehttps://www.mashape.com/explore
Apigeehttps://apigee.com/providers
Python API wrapperhttp://www.pythonforbeginners.com/api/list-of-python-apis
https://github.com/hack-hyc/DAT4#class-4-git-and-github
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/04_git_github.pdf
question and data sethttps://github.com/hack-hyc/DAT4/blob/master/project.md
DAT4-studentshttps://github.com/justmarkham/DAT4-students
Pro Githttp://git-scm.com/book/en/v2
GitRefhttp://gitref.org/
Git quick reference for beginnershttp://www.dataschool.io/git-quick-reference-for-beginners/
Markdown Cheatsheethttps://github.com/adam-p/markdown-here/wiki/Markdown-Cheatsheet
GitHub Flavored Markdownhttps://help.github.com/articles/github-flavored-markdown/
https://github.com/hack-hyc/DAT4#class-5-pandas
codehttps://github.com/hack-hyc/DAT4/blob/master/code/05_pandas.py
Split-Apply-Combinehttp://i.imgur.com/yjNkiwL.png
joins in Pandashttp://www.gregreda.com/2013/10/26/working-with-pandas-dataframes/#joining
Pandas homeworkhttps://github.com/hack-hyc/DAT4/blob/master/homework/05_pandas.md
three-part tutorialhttp://www.gregreda.com/2013/10/26/intro-to-pandas-data-structures/
introductionhttp://nbviewer.ipython.org/urls/raw.github.com/fonnesbeck/Bios366/master/notebooks/Section2_5-Introduction-to-Pandas.ipynb
data wranglinghttp://nbviewer.ipython.org/urls/raw.github.com/fonnesbeck/Bios366/master/notebooks/Section2_6-Data-Wrangling-with-Pandas.ipynb
plottinghttp://nbviewer.ipython.org/urls/raw.github.com/fonnesbeck/Bios366/master/notebooks/Section2_7-Plotting-with-Pandas.ipynb
visualization pagehttp://pandas.pydata.org/pandas-docs/stable/visualization.html
notebook on matplotlibhttp://nbviewer.ipython.org/github/fonnesbeck/Bios366/blob/master/notebooks/Section2_4-Matplotlib.ipynb
Choosing a Good Charthttp://www.extremepresentation.com/uploads/documents/choosing_a_good_chart.pdf
slide deckhttp://www2.research.att.com/~volinsky/DataMining/Columbia2011/Slides/Topic2-EDAViz.ppt
https://github.com/hack-hyc/DAT4#class-6-numpy-machine-learning-knn
codehttps://github.com/hack-hyc/DAT4/blob/master/code/06_numpy.py
codehttps://github.com/hack-hyc/DAT4/blob/master/code/06_iris_prework.py
solutionhttps://github.com/hack-hyc/DAT4/blob/master/code/06_iris_solution.py
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/06_ml_knn.pdf
Understanding the Bias-Variance Tradeoffhttp://scott.fortmann-roe.com/docs/BiasVariance.html
An Introduction to Statistical Learninghttp://www-bcf.usc.edu/~gareth/ISL/
https://github.com/hack-hyc/DAT4#class-7-scikit-learn-model-evaluation-procedures
codehttps://github.com/hack-hyc/DAT4/blob/master/code/07_sklearn_knn.py
user guidehttp://scikit-learn.org/stable/modules/neighbors.html
module referencehttp://scikit-learn.org/stable/modules/classes.html#module-sklearn.neighbors
class documentationhttp://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassifier.html
articlehttp://scott.fortmann-roe.com/docs/BiasVariance.html
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/07_model_evaluation_procedures.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/07_model_evaluation_procedures.py
data exploration and analysis planhttps://github.com/hack-hyc/DAT4/blob/master/project.md
UCI Machine Learning Repositoryhttp://archive.ics.uci.edu/ml/datasets.html
Glass Identification Data Sethttp://archive.ics.uci.edu/ml/datasets/Glass+Identification
30-second explanation of overfittinghttp://www.quora.com/What-is-an-intuitive-explanation-of-overfitting/answer/Jessica-Su
overfitting and train/test splithttps://www.youtube.com/watch?v=_2ij6eaaSl0
cross-validationhttps://www.youtube.com/watch?v=nZAM5OXrktY
An Introduction to Statistical Learninghttp://www-bcf.usc.edu/~gareth/ISL/
excellent, simple example of the bias-variance tradeoffhttp://work.caltech.edu/library/081.html
https://github.com/hack-hyc/DAT4#class-8-linear-regression
IPython notebookhttp://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/08_linear_regression.ipynb
data exploration and analysis planhttps://github.com/hack-hyc/DAT4/blob/master/project.md
An Introduction to Statistical Learninghttp://www-bcf.usc.edu/~gareth/ISL/
related videoshttp://www.dataschool.io/15-hours-of-expert-machine-learning-videos/
quick reference guidehttp://www.dataschool.io/applying-and-interpreting-linear-regression/
simple linear regressionhttp://www.datarobot.com/blog/ordinary-least-squares-in-python/
multiple linear regressionhttp://www.datarobot.com/blog/multiple-regression-using-statsmodels/
introduction to linear regressionhttp://people.duke.edu/~rnau/regintro.htm
assumptions of linear regressionhttp://pareonline.net/getvn.asp?n=2&v=8
https://github.com/hack-hyc/DAT4#class-9-logistic-regression-preview-of-other-models
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/09_logistic_regression.pdf
exercisehttps://github.com/hack-hyc/DAT4/blob/master/code/09_logistic_regression_exercise.py
solutionhttps://github.com/hack-hyc/DAT4/blob/master/code/09_logistic_regression_class.py
first three videoshttps://www.youtube.com/playlist?list=PL5-da3qGB5IC4vaDba5ClatUmFppXLAhE
relationship between probability, odds, and log-oddshttp://www.ats.ucla.edu/stat/mult_pkg/faq/general/odds_ratio.htm
intuition behind "e"http://betterexplained.com/articles/an-intuitive-guide-to-exponential-functions-e/
interpreting logistic regression coefficientshttp://www.unm.edu/~schrader/biostat/bio2/Spr06/lec11.pdf
https://github.com/hack-hyc/DAT4#class-10-model-evaluation-metrics
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/07_model_evaluation_procedures.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/07_model_evaluation_procedures.py
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/10_model_evaluation_metrics.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/10_rmse.py
codehttps://github.com/hack-hyc/DAT4/blob/master/code/10_confusion_roc.py
videohttps://www.youtube.com/watch?v=OAl6eAyP-yo
Model evaluation homeworkhttps://github.com/hack-hyc/DAT4/blob/master/homework/10_model_evaluation.md
Sample solution codehttps://github.com/hack-hyc/DAT4/blob/master/code/10_glass_id_homework_solution.py
Kaggle project presentation videohttps://www.youtube.com/watch?v=HGr1yQV3Um0
Smart Autofillhttp://googleresearch.blogspot.com/2014/10/smart-autofill-harnessing-predictive.html
Kaggle Transforms Data Science Into Competitive Sporthttps://www.youtube.com/watch?v=8w4UY66GKcM
model evaluationhttp://scikit-learn.org/stable/modules/model_evaluation.html
model evaluation metricshttps://www.kaggle.com/wiki/Metrics
simple guide to confusion matrix terminologyhttp://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/
blog post about the ROC videohttp://www.dataschool.io/roc-curves-and-auc-explained/
Sensitivity and Specificityhttps://www.youtube.com/watch?v=U4_3fditnWg&list=PL41ckbAGB5S2PavLIXUETzAmi5reIod23
ROC Curveshttps://www.youtube.com/watch?v=21Igj5Pr6u4&list=PL41ckbAGB5S2PavLIXUETzAmi5reIod23
https://github.com/hack-hyc/DAT4#class-11-working-a-data-problem
datahttps://github.com/hack-hyc/DAT4/blob/master/data/ZYX_prices.csv
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/11_GA_Stocks.pdf
https://github.com/hack-hyc/DAT4#class-12-clustering-and-visualization
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/12_clustering.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code
datahttps://github.com/hack-hyc/DAT4/blob/master/data/songs.csv
A Plan for Spamhttp://www.paulgraham.com/spam.html
Kevin's guidehttp://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/
excellent videohttps://www.youtube.com/watch?v=U4_3fditnWg&list=PL41ckbAGB5S2PavLIXUETzAmi5reIod23
introductory slideshttps://docs.google.com/presentation/d/1cM2dVbJgTWMkHoVNmYlB9df6P2H8BrjaqAcZTaLe9dA/edit#slide=id.gfc3caad2_00
OpenIntro Statistics textbookhttps://www.openintro.org/stat/textbook.php
Introduction to Data Mininghttp://www-users.cs.umn.edu/~kumar/dmbook/index.php
chapter on cluster analysishttp://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf
section on clusteringhttp://scikit-learn.org/stable/modules/clustering.html
https://github.com/hack-hyc/DAT4#class-13-naive-bayes
A Plan for Spamhttp://www.paulgraham.com/spam.html
Slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/13_naive_bayes.pdf
Visualization of conditional probabilityhttp://setosa.io/conditional/
codehttps://github.com/hack-hyc/DAT4/blob/master/code/13_bayes_iris.py
Slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/13_naive_bayes.pdf
Airport security examplehttp://www.quora.com/In-laymans-terms-how-does-Naive-Bayes-work/answer/Konstantin-Tt
codehttps://github.com/hack-hyc/DAT4/blob/master/code/13_naive_bayes.py
SMS Spam Collectionhttps://archive.ics.uci.edu/ml/datasets/SMS+Spam+Collection
CountVectorizerhttp://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html
Naive Bayeshttp://scikit-learn.org/stable/modules/naive_bayes.html
Visualizing Bayes' theoremhttp://oscarbonilla.com/2009/05/visualizing-bayes-theorem/
Bayes' Rule for Duckshttps://planspacedotorg.wordpress.com/2014/02/23/bayes-rule-for-ducks/
5-minute video on conditional probabilityhttps://www.youtube.com/watch?v=Zxm4Xxvzohk
slides on conditional probabilityhttps://docs.google.com/presentation/d/1psUIyig6OxHQngGEHr3TMkCvhdLInnKnclQoNUr4G4U/edit#slide=id.gfc69f484_00
Naive Bayes classifierhttp://en.wikipedia.org/wiki/Naive_Bayes_classifier
Naive Bayes spam filteringhttp://en.wikipedia.org/wiki/Naive_Bayes_spam_filtering
Q&Ahttp://stats.stackexchange.com/questions/21822/understanding-naive-bayes
his follow-up articlehttp://www.paulgraham.com/better.html
related paperhttp://www.merl.com/publications/docs/TR2004-091.pdf
https://github.com/hack-hyc/DAT4#class-14-natural-language-processing
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/14_natural_language_processing.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/14_nlp_class.py
Natural Language Processing with Pythonhttp://www.nltk.org/book/
NLP online coursehttps://www.coursera.org/course/nlp
video lectureshttps://class.coursera.org/nlp/lecture
slideshttp://web.stanford.edu/~jurafsky/NLPCourseraSlides.html
Brief slideshttp://files.meetup.com/7616132/DC-NLP-2013-09%20Charlie%20Greenbacker.pdf
Detailed slideshttps://github.com/ga-students/DAT_SF_9/blob/master/16_Text_Mining/DAT9_lec16_Text_Mining.pdf
A visual survey of text visualization techniqueshttp://textvis.lnu.se/
DC Natural Language Processinghttp://www.meetup.com/DC-NLP/
Stanford CoreNLPhttp://nlp.stanford.edu/software/corenlp.shtml
Python introductory lessonhttps://developers.google.com/edu/python/regular-expressions
reference guidehttps://github.com/justmarkham/DAT3/blob/master/code/99_regex_reference.py
real-time regex testerhttps://regex101.com/#python
in-depth tutorialshttp://www.rexegg.com/
SpaCyhttp://honnibal.github.io/spaCy/
https://github.com/hack-hyc/DAT4#class-15-decision-trees
IPython notebookhttp://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/15_decision_trees.ipynb
these guidelineshttps://github.com/hack-hyc/DAT4/blob/master/peer_review.md
Decision Treeshttp://scikit-learn.org/stable/modules/tree.html
Download and install PKG filehttp://www.graphviz.org/Download_macos.php
Download and install MSI filehttp://www.graphviz.org/Download_windows.php
https://github.com/hack-hyc/DAT4#class-16-ensembling
IPython notebookhttp://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/16_ensembling.ipynb
Ensemble Methodshttp://scikit-learn.org/stable/modules/ensemble.html
How do random forests work in layman's terms?http://www.quora.com/How-do-random-forests-work-in-laymans-terms/answer/Edwin-Chen-1
https://github.com/hack-hyc/DAT4#class-17-databases-and-mapreduce
database codehttps://github.com/hack-hyc/DAT4/blob/master/code/17_sql.py
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/17_db_mr.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/17_map_reduce.py
Forbes: Is it Time for Hadoop Alternatives?http://www.forbes.com/sites/johnwebster/2014/12/08/is-it-time-for-hadoop-alternatives/
IBM: What is MapReduce?http://www-01.ibm.com/software/data/infosphere/hadoop/mapreduce/
Wakari MapReduce IPython notebookhttps://www.wakari.io/sharing/bundle/nkorf/MapReduce%20Example
What Every Data Scientist Needs to Know about SQLhttp://joshualande.com/data-science-sql/
Brandon's SQL Bootcamphttps://github.com/brandonmburroughs/sql_bootcamp
SQLZOOhttp://sqlzoo.net/wiki/Main_Page
Mode Analyticshttp://sqlschool.modeanalytics.com/
https://github.com/hack-hyc/DAT4#class-18-recommenders
slideshttps://github.com/hack-hyc/DAT4/blob/master/slides/18_recommendation_engines.pdf
codehttps://github.com/hack-hyc/DAT4/blob/master/code/18_recommenders_class.py
The Netflix Prizehttp://www.netflixprize.com/
Why Netflix never implemented the winning solutionhttps://www.techdirt.com/blog/innovation/articles/20120409/03412518422/why-netflix-never-implemented-algorithm-that-won-netflix-1-million-challenge.shtml
Visualization of the Music Genome Projecthttp://www.music-map.com/
The People Inside Your Machinehttp://www.npr.org/blogs/money/2015/01/30/382657657/episode-600-the-people-inside-your-machine
https://github.com/hack-hyc/DAT4#class-19-advanced-scikit-learn
codehttps://github.com/hack-hyc/DAT4/blob/master/code/19_advanced_sklearn.py
GridSearchCVhttp://scikit-learn.org/stable/modules/grid_search.html
StandardScalerhttp://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html
Pipelinehttp://scikit-learn.org/stable/modules/pipeline.html
notebookhttp://nbviewer.ipython.org/github/justmarkham/DAT4/blob/master/notebooks/19_regularization.ipynb
Ridge, RidgeCV, Lasso, LassoCVhttp://scikit-learn.org/stable/modules/linear_model.html
LogisticRegressionhttp://scikit-learn.org/stable/modules/linear_model.html
RFE, RFECVhttp://scikit-learn.org/stable/modules/feature_selection.html
A Few Useful Things to Know about Machine Learninghttp://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf
feature scalinghttp://nbviewer.ipython.org/github/rasbt/pattern_classification/blob/master/preprocessing/about_standardization_normalization.ipynb
Clever Methods of Overfittinghttp://hunch.net/?p=22
Common Pitfalls in Machine Learninghttp://danielnee.com/?p=155
https://github.com/hack-hyc/DAT4#class-20-course-review
Data science reviewhttps://docs.google.com/document/d/1XCdyrsQwU5OC5os7RHdVTEtS-tpHBbsoKKWLpYI6Svo/edit?usp=sharing
Comparing supervised learning algorithmshttps://docs.google.com/spreadsheets/d/15_QJXm6urctsbIXO-C_eXrsSffbHedio8z0E5ozxO-M/edit?usp=sharing
Choosing a Machine Learning Classifierhttp://blog.echen.me/2011/04/27/choosing-a-machine-learning-classifier/
scikit-learn "machine learning map"http://scikit-learn.org/stable/tutorial/machine_learning_map/
Machine Learning Done Wronghttp://ml.posthaven.com/machine-learning-done-wrong
Practical machine learning tricks from the KDD 2011 best industry paperhttp://blog.david-andrzejewski.com/machine-learning/practical-machine-learning-tricks-from-the-kdd-2011-best-industry-paper/
An Empirical Comparison of Supervised Learning Algorithmshttp://www.cs.cornell.edu/~caruana/ctp/ct.papers/caruana.icml06.pdf
Getting in Shape for the Sport of Data Sciencehttps://www.youtube.com/watch?v=kwt6XEh7U3g
Resources for continued learning!https://github.com/hack-hyc/DAT4/blob/master/resources.md
https://github.com/hack-hyc/DAT4#class-21-project-presentations
https://github.com/hack-hyc/DAT4#class-22-project-presentations
Readme https://github.com/hack-hyc/DAT4#readme-ov-file
Please reload this pagehttps://github.com/hack-hyc/DAT4
Activityhttps://github.com/hack-hyc/DAT4/activity
0 starshttps://github.com/hack-hyc/DAT4/stargazers
1 watchinghttps://github.com/hack-hyc/DAT4/watchers
0 forkshttps://github.com/hack-hyc/DAT4/forks
Report repository https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fhack-hyc%2FDAT4&report=hack-hyc+%28user%29
Releaseshttps://github.com/hack-hyc/DAT4/releases
Packages 0https://github.com/users/hack-hyc/packages?repo_name=DAT4
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.