Skip to content

jacobabhihail-tech/ml-foundation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 

Repository files navigation

🧠 Phase 2: Machine Learning (Days 16–45)

🎯 Objective

This phase focuses on building strong machine learning foundations using Scikit-learn with an engineering mindset.

The goal is not theory memorization —
but building, validating, and saving real ML models.


📅 Phase Breakdown

📌 Week 3–4 (Days 16–30): ML Fundamentals

Core tools:

  • Python
  • NumPy
  • Pandas
  • Scikit-learn
  • Joblib

🏗 Foundation (Days 16–19)

Day 16

  • What is Machine Learning
  • Supervised vs Unsupervised
  • Train/Test Split
  • Pipeline mindset

Day 17

  • Linear Regression implementation

Day 18

  • Regression Metrics:
    • MAE
    • MSE
    • RMSE
    • R² Score

Day 19

  • Feature Scaling
    • StandardScaler
    • Why scaling matters

🧮 Classification (Days 20–22)

Day 20

  • Logistic Regression

Day 21

  • Classification Metrics:
    • Accuracy
    • Precision
    • Recall
    • F1 Score

Day 22

  • Confusion Matrix
  • ROC Curve
  • ROC-AUC Score

🌳 Tree-Based Models (Days 23–24)

Day 23

  • Decision Tree (Regressor & Classifier)

Day 24

  • Random Forest
  • Overfitting vs Underfitting

🔁 Model Validation & Optimization (Days 25–26)

Day 25

  • Cross Validation
  • Why single split is dangerous

Day 26

  • Hyperparameter Tuning
  • GridSearchCV
  • Model comparison

💾 Production Thinking (Days 27–29)

Day 27

  • Model saving using:
    • joblib
    • pickle

Day 28

  • End-to-end ML pipeline notebook
    • Preprocessing
    • Model
    • Evaluation
    • Saving

Day 29

  • Code refactoring
  • Clean structure
  • Removing redundant steps

🎤 Day 30 – Interview Preparation

  • Common ML interview questions
  • Bias-variance tradeoff
  • Overfitting explanation
  • Metric comparison scenarios
  • Model selection reasoning

🧠 Key Concepts Learned

  • Proper train-test workflow
  • Importance of cross-validation
  • Difference between regression & classification metrics
  • Tree models vs linear models
  • Hyperparameter tuning process
  • Pipeline building
  • Model reproducibility

📊 Skills Gained

✔ Implement ML models from scratch using Sklearn
✔ Evaluate using correct metrics
✔ Validate models properly
✔ Tune hyperparameters
✔ Save production-ready models
✔ Explain models confidently


🛠 Tools Used

  • Python
  • Pandas
  • NumPy
  • Scikit-learn
  • Matplotlib
  • Joblib

🚀 Outcome

By the end of this phase:

  • Able to build end-to-end ML workflows
  • Understand evaluation deeply
  • Think in pipeline structure
  • Ready to move toward advanced ML & real-world projects

👨‍💻 Long-Term Direction

This phase builds the foundation for:

  • Advanced ML
  • Model deployment
  • AI Engineering path

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors