| route-pattern | /:user_id/:repository |
| route-controller | files |
| route-action | disambiguate |
| fetch-nonce | v2:97fcefbe-c5e0-23ca-1427-6d85ff837658 |
| current-catalog-service-hash | f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb |
| request-id | CB00:288CD2:BE819:FEFCE:6973E1D8 |
| html-safe-nonce | 11787423952ebc9e94eeae1305848584545988ba709098b56aed50cf88ada54e |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDQjAwOjI4OENEMjpCRTgxOTpGRUZDRTo2OTczRTFEOCIsInZpc2l0b3JfaWQiOiI0MzQxMTE3NDE3MTQ5NDI0MDg5IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0= |
| visitor-hmac | 24b8f283ed73f42c6201e909618670a55b1651a9c312f8820df2dae60a006e90 |
| hovercard-subject-tag | repository:509245682 |
| github-keyboard-shortcuts | repository,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | // |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/PlayingNumbers/ML_Process_Course |
| twitter:image | https://opengraph.githubassets.com/f97e16394ed8a9929cdbcf5a017bca95af11ed1f175617c9513922e600b9b410/PlayingNumbers/ML_Process_Course |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/f97e16394ed8a9929cdbcf5a017bca95af11ed1f175617c9513922e600b9b410/PlayingNumbers/ML_Process_Course |
| og:image:alt | Public repo for the 365 Data Science ML Process Course - PlayingNumbers/ML_Process_Course |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| hostname | github.com |
| expected-hostname | github.com |
| None | bcae444bf5f475d712cb2ad536c10eaebd406c84ae2d3a77c5f05aee5010c786 |
| turbo-cache-control | no-preview |
| go-import | github.com/PlayingNumbers/ML_Process_Course git https://github.com/PlayingNumbers/ML_Process_Course.git |
| octolytics-dimension-user_id | 45129620 |
| octolytics-dimension-user_login | PlayingNumbers |
| octolytics-dimension-repository_id | 509245682 |
| octolytics-dimension-repository_nwo | PlayingNumbers/ML_Process_Course |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 509245682 |
| octolytics-dimension-repository_network_root_nwo | PlayingNumbers/ML_Process_Course |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 7da815895d8ce3ba55f0017efb5a34bac921756a |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
| Skip to content | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#start-of-content |
|
| https://patch-diff.githubusercontent.com/ |
|
Sign in
| https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2FPlayingNumbers%2FML_Process_Course |
| GitHub CopilotWrite better code with AI | https://github.com/features/copilot |
| GitHub SparkBuild and deploy intelligent apps | https://github.com/features/spark |
| GitHub ModelsManage and compare prompts | https://github.com/features/models |
| MCP RegistryNewIntegrate external tools | https://github.com/mcp |
| ActionsAutomate any workflow | https://github.com/features/actions |
| CodespacesInstant dev environments | https://github.com/features/codespaces |
| IssuesPlan and track work | https://github.com/features/issues |
| Code ReviewManage code changes | https://github.com/features/code-review |
| GitHub Advanced SecurityFind and fix vulnerabilities | https://github.com/security/advanced-security |
| Code securitySecure your code as you build | https://github.com/security/advanced-security/code-security |
| Secret protectionStop leaks before they start | https://github.com/security/advanced-security/secret-protection |
| Why GitHub | https://github.com/why-github |
| Documentation | https://docs.github.com |
| Blog | https://github.blog |
| Changelog | https://github.blog/changelog |
| Marketplace | https://github.com/marketplace |
| View all features | https://github.com/features |
| Enterprises | https://github.com/enterprise |
| Small and medium teams | https://github.com/team |
| Startups | https://github.com/enterprise/startups |
| Nonprofits | https://github.com/solutions/industry/nonprofits |
| App Modernization | https://github.com/solutions/use-case/app-modernization |
| DevSecOps | https://github.com/solutions/use-case/devsecops |
| DevOps | https://github.com/solutions/use-case/devops |
| CI/CD | https://github.com/solutions/use-case/ci-cd |
| View all use cases | https://github.com/solutions/use-case |
| Healthcare | https://github.com/solutions/industry/healthcare |
| Financial services | https://github.com/solutions/industry/financial-services |
| Manufacturing | https://github.com/solutions/industry/manufacturing |
| Government | https://github.com/solutions/industry/government |
| View all industries | https://github.com/solutions/industry |
| View all solutions | https://github.com/solutions |
| AI | https://github.com/resources/articles?topic=ai |
| Software Development | https://github.com/resources/articles?topic=software-development |
| DevOps | https://github.com/resources/articles?topic=devops |
| Security | https://github.com/resources/articles?topic=security |
| View all topics | https://github.com/resources/articles |
| Customer stories | https://github.com/customer-stories |
| Events & webinars | https://github.com/resources/events |
| Ebooks & reports | https://github.com/resources/whitepapers |
| Business insights | https://github.com/solutions/executive-insights |
| GitHub Skills | https://skills.github.com |
| Documentation | https://docs.github.com |
| Customer support | https://support.github.com |
| Community forum | https://github.com/orgs/community/discussions |
| Trust center | https://github.com/trust-center |
| Partners | https://github.com/partners |
| GitHub SponsorsFund open source developers | https://github.com/sponsors |
| Security Lab | https://securitylab.github.com |
| Maintainer Community | https://maintainers.github.com |
| Accelerator | https://github.com/accelerator |
| Archive Program | https://archiveprogram.github.com |
| Topics | https://github.com/topics |
| Trending | https://github.com/trending |
| Collections | https://github.com/collections |
| Enterprise platformAI-powered developer platform | https://github.com/enterprise |
| GitHub Advanced SecurityEnterprise-grade security features | https://github.com/security/advanced-security |
| Copilot for BusinessEnterprise-grade AI features | https://github.com/features/copilot/copilot-business |
| Premium SupportEnterprise-grade 24/7 support | https://github.com/premium-support |
| Pricing | https://github.com/pricing |
| Search syntax tips | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
| documentation | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
|
Sign in
| https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2FPlayingNumbers%2FML_Process_Course |
|
Sign up
| https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=PlayingNumbers%2FML_Process_Course |
| Reload | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
| Reload | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
| Reload | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
|
PlayingNumbers
| https://patch-diff.githubusercontent.com/PlayingNumbers |
| ML_Process_Course | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
|
Notifications
| https://patch-diff.githubusercontent.com/login?return_to=%2FPlayingNumbers%2FML_Process_Course |
|
Fork
202
| https://patch-diff.githubusercontent.com/login?return_to=%2FPlayingNumbers%2FML_Process_Course |
|
Star
511
| https://patch-diff.githubusercontent.com/login?return_to=%2FPlayingNumbers%2FML_Process_Course |
|
511
stars
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/stargazers |
|
202
forks
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/forks |
|
Branches
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/branches |
|
Tags
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tags |
|
Activity
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/activity |
|
Star
| https://patch-diff.githubusercontent.com/login?return_to=%2FPlayingNumbers%2FML_Process_Course |
|
Notifications
| https://patch-diff.githubusercontent.com/login?return_to=%2FPlayingNumbers%2FML_Process_Course |
|
Code
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
|
Issues
1
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/issues |
|
Pull requests
0
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/pulls |
|
Actions
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/actions |
|
Projects
0
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/projects |
|
Security
0
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/security |
|
Insights
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/pulse |
|
Code
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
|
Issues
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/issues |
|
Pull requests
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/pulls |
|
Actions
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/actions |
|
Projects
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/projects |
|
Security
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/security |
|
Insights
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/pulse |
| Branches | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/branches |
| Tags | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tags |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/branches |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tags |
| 35 Commits | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/commits/main/ |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/commits/main/ |
| 10_imbalanced_data | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/10_imbalanced_data |
| 10_imbalanced_data | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/10_imbalanced_data |
| 11_modeling | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/11_modeling |
| 11_modeling | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/11_modeling |
| 12_model_evaluation | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/12_model_evaluation |
| 12_model_evaluation | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/12_model_evaluation |
| 5_data_preprocessing | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/5_data_preprocessing |
| 5_data_preprocessing | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/5_data_preprocessing |
| 6_exploratory_data_analysis | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/6_exploratory_data_analysis |
| 6_exploratory_data_analysis | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/6_exploratory_data_analysis |
| 7_feature_engineering | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/7_feature_engineering |
| 7_feature_engineering | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/7_feature_engineering |
| 8_cross_validation | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/8_cross_validation |
| 8_cross_validation | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/8_cross_validation |
| 9_feature_selection | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/9_feature_selection |
| 9_feature_selection | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/tree/main/9_feature_selection |
| .DS_Store | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/.DS_Store |
| .DS_Store | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/.DS_Store |
| 365datascience_ml_process_flashcards.apkg | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/365datascience_ml_process_flashcards.apkg |
| 365datascience_ml_process_flashcards.apkg | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/365datascience_ml_process_flashcards.apkg |
| README.md | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/README.md |
| README.md | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/README.md |
| dataset_simulation.py | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/dataset_simulation.py |
| dataset_simulation.py | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/dataset_simulation.py |
| ml_process_cheat_sheet.py | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/ml_process_cheat_sheet.py |
| ml_process_cheat_sheet.py | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/ml_process_cheat_sheet.py |
| requirements.txt | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/requirements.txt |
| requirements.txt | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/blob/main/requirements.txt |
| README | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#ml-process-course |
| ML Process Course | https://365datascience.com/learn-machine-learning-process-a-z/ |
| The Machine Learning A-Z Bundle | https://bit.ly/3NAZ5oP |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#flashcards |
| Ankiweb.net | https://ankiweb.net |
| here | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/365datascience_ml_process_flashcards.apkg |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#table-of-contents |
| Coding Workbooks for Each Course | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#coding-workbooks-for-each-course |
| Data Science Blogs | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#data-science-blogs |
| Applying ML | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#2-applying-ml |
| Problem Framing | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#3-problem-framing |
| Data Collection | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#4-data-collection |
| Data Preprocessing | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#5-data-preprocessing |
| Exploratory Data Analysis | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#6-exploratory-data-analysis |
| Feature Engineering | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#7-feature-engineering |
| Cross Validation | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#8-cross-validation |
| Feature Selection | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#9-feature-selection |
| Imbalanced Data | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#10-imbalanced-data |
| Modeling | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#11-modeling |
| Model Evaluation | https://github.com/PlayingNumbers/ML_Process_Course/blob/main/README.md#12-model-evaluation |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#coding-workbooks-for-each-course |
| 5-Missing Values | https://www.kaggle.com/code/kenjee/dealing-with-missing-values-section-5-1 |
| 5-Outliers | https://www.kaggle.com/code/kenjee/dealing-with-outliers-section-5-2 |
| 5-Missing Values | https://colab.research.google.com/drive/1P-4i_T1UE8_PLZibNApGbGPDxhOJDnd8?usp=sharing |
| 5-Outliers | https://colab.research.google.com/drive/1e_9VUn48sOebsEmDMRZ2R7OEkJLM9Zxt?usp=sharing |
| 6-EDA | https://www.kaggle.com/code/kenjee/basic-eda-example-section-6 |
| 6-EDA | https://colab.research.google.com/drive/18sWPkf2o6yX2RJu0YRZRCbJqNzc5q8ZS?usp=sharing |
| 7-Categoricals | https://www.kaggle.com/code/kenjee/categorical-feature-engineering-section-7-1 |
| 7-Continuous | https://www.kaggle.com/code/kenjee/numeric-feature-engineering-section-7-2 |
| 7-Categoricals | https://colab.research.google.com/drive/1F94kWYM_GTb-Neh_jCte04BxfcUdBodn?usp=sharing |
| 7-Continuous | https://colab.research.google.com/drive/1SGwguOuloOG7nd3OoOGBALR9jOg26UNt?usp=sharing |
| 8-Cross Validation | https://www.kaggle.com/code/kenjee/cross-validation-foundations-section-8 |
| 8-Cross Validation | https://colab.research.google.com/drive/1xsVT5MWAX1Yq8KXMPbqU6DBti-NN17YH?usp=sharing |
| 9-Feature Selection | https://www.kaggle.com/code/kenjee/feature-selection-section-9 |
| 9-Feature Selection | https://colab.research.google.com/drive/19uXC7Cm_K1FDTjkcRVjxcAY4f7dv6LDV?usp=sharing |
| 10-Imbalanced Data | https://www.kaggle.com/code/kenjee/dealing-with-imbalanced-data-section-10 |
| 10-Imbalanced Data | https://colab.research.google.com/drive/1pulqugw0V1xyoMbQrB3aTdlwO0KDXDFf?usp=sharing |
| 11-Model Selection | https://www.kaggle.com/code/kenjee/model-building-example-section-11 |
| 11-Model Selection | https://colab.research.google.com/drive/1oV675pKGmCLIYE44a_Quw1s66c4LVcOq?usp=sharing |
| 12-Model Evaluation Classification | https://www.kaggle.com/code/kenjee/model-evaluation-classification-section-12 |
| 12-Model Evaluation Regression | https://www.kaggle.com/code/kenjee/model-evaluation-regression-12 |
| 12-Model Evaluation Classification | https://colab.research.google.com/drive/1FYHAL3lbv7Rdh3EV9TooMFkPNv6MEJly?usp=sharing |
| 12-Model Evaluation Regression | https://colab.research.google.com/drive/1_of9a48P-rGkrS8US9YgwQm_58_1unAV?usp=sharing |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#data-science-blogs |
| AirBnB Engineering | https://medium.com/airbnb-engineering |
| Spotify Research | https://research.atspotify.com/blog/ |
| Netflix Research | https://research.netflix.com/ |
| DoorDash ML Blog | https://doordash.engineering/category/data-science-and-machine-learning/ |
| Uber Engineering | https://www.uber.com/blog/honolulu/engineering/ |
| Lyft Engineering | https://eng.lyft.com/tagged/data-science |
| Shopify Engineering | https://shopify.engineering/topics/data-science-engineering |
| Meta Engineering | https://engineering.fb.com/ |
| LinkedIn Engineering | https://engineering.linkedin.com/blog |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#2-applying-ml |
| Analytics Hierarchy of Needs by Ryan Foley | https://towardsdatascience.com/the-analytics-hierarchy-of-needs-6d57d0e205e2 |
| The First Rule of ML by Eugene Yan | https://eugeneyan.com/writing/first-rule-of-ml/ |
| ML for Product Analytics by Ron Tidhar | https://www.youtube.com/watch?v=L8D8LEDmLyE&ab_channel=cnvrg.io |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#3-problem-framing |
| Problem Understanding by Stratechi | https://www.stratechi.com/problem-statement/ |
| How to Define a Product Problem | https://userpilot.com/blog/product-problems/ |
| Intro to Opportunity Sizing | https://medium.com/related-works-inc/intro-to-opportunity-sizing-ce9d7e5a29c4 |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#4-data-collection |
| Getting Data From a Database | https://medium.com/swlh/searching-through-a-database-52eaaf64f89c |
| What is SQL - Video | https://www.youtube.com/watch?v=27axs9dO7AE&ab_channel=DanielleTh%C3%A9 |
| How Data Analysts Use SQL - Video | https://www.youtube.com/watch?v=GEBzsz8ZSXs&ab_channel=LukeBarousse |
| Why Are APIs Important For Data Science - Video | https://www.youtube.com/watch?v=s1gD35Z4eUc&ab_channel=KenJee |
| What is an API? | https://www.freecodecamp.org/news/what-is-an-api-in-english-please-b880a3214a82/ |
| How Companies Caputre Data | https://kb.narrative.io/how-do-companies-collect-data |
| How Cookies Work | https://www.acxiom.com/about-us/privacy/how-cookies-work/#:~:text=What%20Are%20Cookies%2C%20and%20How,some%20useful%20information%20about%20you. |
| Web Scraping Basics | https://towardsdatascience.com/web-scraping-basics-82f8b5acd45c |
| Types of Survey Techniques | https://www.questionpro.com/blog/survey-data-collection/ |
| 365 Data Science SQL Course | https://365datascience.com/courses/sql/ |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#5-data-preprocessing |
| All You Need To Know About Different Types Of Missing Data Values And How To Handle It | https://www.analyticsvidhya.com/blog/2021/10/handling-missing-value/#:~:text=Types%20Of%20Missing%20Values,Missing%20Not%20At%20Random%20(MNAR) |
| 7 Ways to Handle Missing Values in Machine Learning | https://towardsdatascience.com/7-ways-to-handle-missing-values-in-machine-learning-1a6326adf79e |
| Null Values Imputation by Utkarsh Gupta | https://www.kaggle.com/general/248836 |
| What are methods to make a predictive model more robust to outliers? | https://www.quora.com/What-are-methods-to-make-a-predictive-model-more-robust-to-outliers |
| Guidelines for Removing and Handling Outliers in Data by Jim Frost | https://statisticsbyjim.com/basics/remove-outliers |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#6-exploratory-data-analysis |
| 10 Types of Statistical Data Distribution Models | https://www.analyticssteps.com/blogs/10-types-statistical-data-distribution-models |
| What is EDA? | https://towardsdatascience.com/exploratory-data-analysis-8fc1cb20fd15 |
| Is Data Visualization Important for Data Science? - Video | https://www.youtube.com/watch?v=k8YxyrcAXJs&ab_channel=KenJee |
| Box Plot Details | https://byjus.com/maths/box-plot/ |
| Histogram Additional Details | https://towardsdatascience.com/histograms-why-how-431a5cfbfcd5#:~:text=A%20histogram%20provides%20a%20visual,or%20gaps%20in%20the%20data. |
| Types of Data Distributions | https://www.analyticsvidhya.com/blog/2017/09/6-probability-distributions-data-science/ |
| Understanding Skew | https://en.wikipedia.org/wiki/Skewness |
| Scatterplots and Correlation Additional Details | https://www.khanacademy.org/math/statistics-probability/describing-relationships-quantitative-data/introduction-to-scatterplots/a/scatterplots-and-correlation-review |
| Types of Correlations | https://www.ncl.ac.uk/webtemplate/ask-assets/external/maths-resources/statistics/regression-and-correlation/types-of-correlation.html |
| Correlation Matrix Details | https://www.displayr.com/what-is-a-correlation-matrix/ |
| Different correlation matrices in python | https://medium.com/@szabo.bibor/how-to-create-a-seaborn-correlation-heatmap-in-python-834c0686b88e |
| Creating Pivot Tables Documentation | https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.pivot_table.html |
| When to use different types of charts | https://help.flourish.studio/article/25-line-bar-and-pie-charts#:~:text=Line%20charts%20are%20ideal%20for,or%20axis%20labels%20are%20long. |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#7-feature-engineering |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#categorical-feature-engineering |
| All about Categorical Variable Encoding by Baijayanta Roy | https://towardsdatascience.com/all-about-categorical-variable-encoding-305f3361fd02 |
| Ordinal and One-Hot Encodings for Categorical Data by Jason Brownlee | https://machinelearningmastery.com/one-hot-encoding-for-categorical-data/ |
| Feature Engineering Ordinal Variables by Sheng Jun | https://towardsdatascience.com/feature-engineering-ordinal-variables-bfea697f5eee |
| Target Encoding by Ryan Holbrook | https://www.kaggle.com/code/ryanholbrook/target-encoding |
| Weight of Evidence Coding by Bruce Lund | https://www.mwsug.org/proceedings/2016/AA/MWSUG-2016-AA15.pdf |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#continuous-feature-engineering |
| About Feature Scaling and Normalization by Sebastian Raschka | https://sebastianraschka.com/Articles/2014_about_feature_scaling.html |
| Feature Scaling Techniques in Python – A Complete Guide by Eddie_4072 | https://www.analyticsvidhya.com/blog/2021/05/feature-scaling-techniques-in-python-a-complete-guide/ |
| Feature Scaling for Machine Learning: Understanding the Difference Between Normalization vs. Standardization by Aniruddha Bhandari | https://www.analyticsvidhya.com/blog/2020/04/feature-scaling-machine-learning-normalization-standardization/#:~:text=Normalization%20is%20a%20scaling%20technique,known%20as%20Min%2DMax%20scaling.&text=Here%2C%20Xmax%20and%20Xmin%20are,values%20of%20the%20feature%20respectively. |
| Robust Scaler - Sklearn Docs | https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.RobustScaler.html |
| Log Transformation: Purpose and Interpretation by Kyaw Saw Htoon | https://medium.com/@kyawsawhtoon/log-transformation-purpose-and-interpretation-9444b4b049c9 |
| Best exponential transformation to linearize your data with Scipy | https://towardsdatascience.com/best-exponential-transformation-to-linearize-your-data-with-scipy-cca6110313a6 |
| Exponentially scaling your data in order to zoom in on small differences | https://rikunert.com/exponential_scaler |
| Box Cox Transformation by Ted Hessing | https://sixsigmastudyguide.com/box-cox-transformation/ |
| Box-Cox Transformation and Target Variable: Explained | https://builtin.com/data-science/box-cox-transformation-target-variable |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#8-cross-validation |
| Understanding 8 types of Cross-Validation by Satyam Kumar | https://towardsdatascience.com/understanding-8-types-of-cross-validation-80c935a4976d |
| 7 Types of Cross Validation by Soumyaa Rawat | https://www.analyticssteps.com/blogs/7-types-cross-validation |
| k-fold cross-validation explained in plain English by Rukshan Pramoditha | https://towardsdatascience.com/k-fold-cross-validation-explained-in-plain-english-659e33c0bc0 |
| Machine Learning Fundamentals: Bias and Variance by Josh Starmer/Statquest | https://www.youtube.com/watch?v=EuBBz3bI-aA&t=4s&ab_channel=StatQuestwithJoshStarmer |
| A Quick Intro to Leave-One-Out Cross-Validation (LOOCV) by Statology | https://www.statology.org/leave-one-out-cross-validation/ |
| Time Series Cross Validation by Robert Hyndman | https://otexts.com/fpp3/tscv.html |
| Time Based Cross Validation | https://towardsdatascience.com/time-based-cross-validation-d259b13d42b8 |
| KFold vs. Monte Carlo by Rebecca Patro | https://towardsdatascience.com/cross-validation-k-fold-vs-monte-carlo-e54df2fc179b |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#9-feature-selection |
| Everything You Need to Know About Feature Selection In Machine Learning by Kartik Menon | https://www.simplilearn.com/tutorials/machine-learning-tutorial/feature-selection-in-machine-learning |
| A comprehensive guide to Feature Selection using Wrapper methods in Python | https://www.analyticsvidhya.com/blog/2020/10/a-comprehensive-guide-to-feature-selection-using-wrapper-methods-in-python/ |
| What is the difference between filter, wrapper, and embedded methods for feature selection? by Sebastian Raschka | https://sebastianraschka.com/faq/docs/feature_sele_categories.html |
| Introduction to Feature Selection methods with an example (or how to select the right variables?) | https://www.analyticsvidhya.com/blog/2016/12/introduction-to-feature-selection-methods-with-an-example-or-how-to-select-the-right-variables/ |
| Feature selection in Python using the Filter method by Renu Khandelwal | https://towardsdatascience.com/feature-selection-in-python-using-filter-method-7ae5cbc4ee05 |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#10-imbalanced-data |
| 10 Techniques to deal with Imbalanced Classes in Machine Learning by Analytics Vidhya | https://www.analyticsvidhya.com/blog/2020/07/10-techniques-to-deal-with-class-imbalance-in-machine-learning/ |
| Oversampling and Undersampling by Kurtis Pykes | https://towardsdatascience.com/oversampling-and-undersampling-5e2bbaf56dcf |
| Overcoming Class Imbalance using SMOTE Techniques | https://www.analyticsvidhya.com/blog/2020/10/overcoming-class-imbalance-using-smote-techniques/ |
| SMOTE explained for noobs – Synthetic Minority Over-sampling TEchnique line by line by Rich Data | https://rikunert.com/smote_explained |
| SMOTE by Joos Korstanje | https://towardsdatascience.com/smote-fdce2f605729 |
| 5 SMOTE Techniques for Oversampling your Imbalance Data by Cornellius Yudha Wijaya | https://towardsdatascience.com/5-smote-techniques-for-oversampling-your-imbalance-data-b8155bdbe2b5 |
| Fixing Imbalanced Datasets: An Introduction to ADASYN by Rui Nian | https://medium.com/@ruinian/an-introduction-to-adasyn-with-code-1383a5ece7aa |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#11-modeling |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#hyperparameter-tuning |
| What does "baseline" mean in the context of machine learning? | https://datascience.stackexchange.com/questions/30912/what-does-baseline-mean-in-the-context-of-machine-learning |
| Sklearn's Dummy Estimators | https://scikit-learn.org/stable/modules/model_evaluation.html#dummy-estimators |
| 7 Hyperparameter Optimization Techniques Every Data Scientist Should Know | https://towardsdatascience.com/7-hyperparameter-optimization-techniques-every-data-scientist-should-know-12cdebe713da |
| A Comprehensive Guide on Hyperparameter Tuning and its Techniques | https://www.analyticsvidhya.com/blog/2022/02/a-comprehensive-guide-on-hyperparameter-tuning-and-its-techniques/ |
| Hyperparameter tuning in Python by Tooba Jamal | https://towardsdatascience.com/hyperparameter-tuning-in-python-21a76794a1f7 |
| Random Search for Hyper-Parameter Optimization by James Bergestra and Yoshua Bengio | https://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf |
| A Conceptual Explanation of Bayesian Hyperparameter Optimization for Machine Learning by Will Koehrsen | https://towardsdatascience.com/a-conceptual-explanation-of-bayesian-model-based-hyperparameter-optimization-for-machine-learning-b8172278050f |
| Bayesian Optimization Primer by SigOpt | https://static.sigopt.com/b/20a144d208ef255d3b981ce419667ec25d8412e2/static/pdf/SigOpt_Bayesian_Optimization_Primer.pdf |
| Genetic Algorithms by Marcos Del Cueto | https://towardsdatascience.com/genetic-algorithm-to-optimize-machine-learning-hyperparameters-72bd6e2596fc |
| Simulated Annealing From Scratch in Python by Jason Brownlee | https://machinelearningmastery.com/simulated-annealing-from-scratch-in-python/#:~:text=Simulated%20Annealing-,Simulated%20Annealing%20is%20a%20stochastic%20global%20search%20optimization%20algorithm.,it%20easier%20to%20work%20with. |
| Optimization Techniques — Simulated Annealing by Frank Liang | https://towardsdatascience.com/optimization-techniques-simulated-annealing-d6a4785a1de7 |
| Hyperparameter optimization for Neural Networks | http://neupy.com/2016/12/17/hyperparameter_optimization_for_neural_networks.html#id13 |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#ensembling |
| Ensemble Methods: Elegant Techniques to Produce Improved Machine Learning Results | https://www.toptal.com/machine-learning/ensemble-methods-machine-learning#:~:text=Ensemble%20methods%20are%20techniques%20that,than%20a%20single%20model%20would |
| A Gentle Introduction to Ensemble Learning Algorithms by Jason Brownlee | https://machinelearningmastery.com/tour-of-ensemble-learning-algorithms/#:~:text=The%20three%20main%20classes%20of,on%20your%20predictive%20modeling%20project. |
| Types of Ensemble methods in Machine learning by Anju Tajbangshi | https://towardsdatascience.com/types-of-ensemble-methods-in-machine-learning-4ddaf73879db |
| Introduction to Ensembling/Stacking in Python by Anisotropic | https://www.kaggle.com/code/arthurtok/introduction-to-ensembling-stacking-in-python |
| Ensembles and Model Stacking by Eshaan Kirpal | https://www.kaggle.com/code/eshaan90/ensembles-and-model-stacking/notebook |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#12-model-evaluation |
| Model Evaluation Metrics in Machine Learning by Nagesh Singh Chauhan | https://www.kdnuggets.com/2020/05/model-evaluation-metrics-machine-learning.html |
| 11 Important Model Evaluation Metrics for Machine Learning Everyone should know | https://www.analyticsvidhya.com/blog/2019/08/11-important-model-evaluation-error-metrics/ |
| How To Interpret R-squared in Regression Analysis by Jim Frost | https://statisticsbyjim.com/regression/interpret-r-squared-regression/ |
| Know The Best Evaluation Metrics for Your Regression Model by Raghav Agrawal | https://www.analyticsvidhya.com/blog/2021/05/know-the-best-evaluation-metrics-for-your-regression-model/ |
| Recall, Precision, F1, ROC, AUC, and everything by Ofir Shalev | https://medium.com/swlh/recall-precision-f1-roc-auc-and-everything-542aedf322b9 |
| F1 Score vs ROC AUC vs Accuracy vs PR AUC: Which Evaluation Metric Should You Choose? by Jakub Czakon | https://neptune.ai/blog/f1-score-accuracy-roc-auc-pr-auc |
| Intuition behind Log Loss Score by Gaurav Dembla | https://towardsdatascience.com/intuition-behind-log-loss-score-4e0c9979680a#:~:text=is%20dependent%20on.-,What%20does%20log%2Dloss%20conceptually%20mean%3F,is%20the%20log%2Dloss%20value. |
| Why is ROC AUC equivalent to the probability that two randomly-selected samples are correctly ranked? | https://stats.stackexchange.com/questions/190216/why-is-roc-auc-equivalent-to-the-probability-that-two-randomly-selected-samples |
| Man U Whitney Test | https://en.wikipedia.org/wiki/Mann%E2%80%93Whitney_U_test#Area-under-curve_(AUC)_statistic_for_ROC_curves |
| Essential Things You Need to Know About F1-Score | https://en.wikipedia.org/wiki/Mann%E2%80%93Whitney_U_test#Area-under-curve_(AUC)_statistic_for_ROC_curves |
| ROC, AUC, precision, and recall visually explained by Paul Vanderlaken | https://paulvanderlaken.com/2019/08/16/roc-auc-precision-and-recall-visually-explained/ |
| R-squared Is Not Valid for Nonlinear Regression by Jim Frost | https://statisticsbyjim.com/regression/r-squared-invalid-nonlinear-regression/#:~:text=Nonlinear%20regression%20is%20an%20extremely,just%20don%27t%20go%20together. |
| 3 Best metrics to evaluate Regression Model? by Songhao Wu | https://towardsdatascience.com/what-are-the-best-metrics-to-evaluate-your-regression-model-418ca481755b |
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#12-model-productionization |
| Git For Data Scientists - Video | https://www.youtube.com/watch?v=_0rHU6qAQe0&ab_channel=KenJee |
| Git Documentation | https://git-scm.com/doc |
| Git Basics in 10 minutes | https://www.freecodecamp.org/news/learn-the-basics-of-git-in-under-10-minutes-da548267cc91/ |
| 365 Data Science Git & Github Course | https://365datascience.com/courses/git-and-github/ |
| Save and load ml models | https://machinelearningmastery.com/save-load-machine-learning-models-python-scikit-learn/ |
| 5 Different Ways to Save your ML Model | https://towardsdatascience.com/5-different-ways-to-save-your-machine-learning-model-b7996489d433 |
| Improve analytics slide decks | https://towardsdatascience.com/5-tips-to-improve-your-analytics-slide-decks-c5d0559259c0 |
| Streamlit Gallery | https://streamlit.io/gallery |
| Build 12 streamlit apps - Video | https://www.youtube.com/watch?v=JwSS70SZdyM&ab_channel=freeCodeCamp.org |
| Streamlit Project Example - Video | https://www.youtube.com/watch?v=Yk-unX4KnV4&ab_channel=KenJee |
| Build an api in python - Video | https://www.youtube.com/watch?v=5ZMpbdK0uqU&ab_channel=Indently |
| How to create an API in python | https://anderfernandez.com/en/blog/how-to-create-api-python/#:~:text=To%20create%20an%20API%20in%20Python%20with%20Flask%2C%20we%20have,app%20%3D%20Flask()%20%40app. |
| How to create a python library | https://medium.com/analytics-vidhya/how-to-create-a-python-library-7d5aea80cc3f |
|
Readme
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course#readme-ov-file |
| Please reload this page | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
|
Activity | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/activity |
|
511
stars | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/stargazers |
|
5
watching | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/watchers |
|
202
forks | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/forks |
|
Report repository
| https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2FPlayingNumbers%2FML_Process_Course&report=PlayingNumbers+%28user%29 |
| Releases | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/releases |
| Packages
0 | https://patch-diff.githubusercontent.com/users/PlayingNumbers/packages?repo_name=ML_Process_Course |
| Please reload this page | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course |
| Contributors
2 | https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/graphs/contributors |
|
| https://github.com/jeffmli |
|
jeffmli
Jeffrey Li
| https://github.com/jeffmli |
|
| https://github.com/PlayingNumbers |
|
PlayingNumbers
| https://github.com/PlayingNumbers |
|
Jupyter Notebook
97.6%
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/search?l=jupyter-notebook |
|
Python
2.4%
| https://patch-diff.githubusercontent.com/PlayingNumbers/ML_Process_Course/search?l=python |
|
| https://github.com |
| Terms | https://docs.github.com/site-policy/github-terms/github-terms-of-service |
| Privacy | https://docs.github.com/site-policy/privacy-policies/github-privacy-statement |
| Security | https://github.com/security |
| Status | https://www.githubstatus.com/ |
| Community | https://github.community/ |
| Docs | https://docs.github.com/ |
| Contact | https://support.github.com?tags=dotcom-footer |