Tag | Leah’s AI/ML Notebook 📓

Python 121: Best Time to Buy and Sell Stock / SQL 175,176: Second-highest Salary / DL Review: Transformers, Self-Attention Mechanism & Positional Encoding

Day207 - Leetcode: Python 53 & SQL 185 & DL Review

September 15, 2025 5 minute read

Python 53: Maximum Subarray / SQL 185: Department Top Three Sales / DL Review: RNNs, LSTM Networks & Gradient Vanishing & Exploding

Day206 - Leetcode: Python 217 & SQL 175,176 & DL Review

September 13, 2025 3 minute read

Python 217: Contains Duplicate / SQL 175,176: Second-highest Salary / DL Review: Transfer Learning & Fine-Tuning & CNNs

Day205 - Leetcode: Python 175 & SQL Inner Join & DL Review

September 12, 2025 6 minute read

Python 175: TwoSums / SQL: Inner Join Revisiting / DL Review: Embedding Layers, Autoencoders & Knowledge Distillation

Day204 - DL Review: Revisiting Optimizers, CNNs & Data Drifts

September 9, 2025 5 minute read

Optimizers in Neural Networks, Parameter Sharing in CNNs, and Data & Concept Drifts

Day201-203 - MLops Review: CI/CD with GitHub Actions and Azure (2)

September 5, 2025 5 minute read

Scaling CI/CD: From ACI to Azure Kubernetes Service

Day199-200 - MLops Review: CI/CD with GitHub Actions and Azure (1)

September 2, 2025 5 minute read

Automating Docker image builds with ACR and seamless deployments to ACI

Day191-198 Hands-On Docker: From Basics to Real-World Deployment (Course Completed)

August 11, 2025 2 minute read

Docker Review — From CLI Fundamentals to Multi-Container Orchestration

Day178-190 - SQL Review: SQL Fundamentals for Data Analysis (Course Completed)

July 11, 2025 2 minute read

SQL Mastery in 3 Weeks — From Fundamentals to Analytic SQL

Day177 - MLOps Review: Data Distribution Shifts and Monitoring (4)

July 9, 2025 10 minute read

Designing Machine Learning Systems: Data Distribution Shifts (3) (Understanding Failures: Monitoring and Observability for ML System Reliability)

Day176 - MLOps Review: Data Distribution Shifts and Monitoring (3)

July 8, 2025 7 minute read

Designing Machine Learning Systems: Data Distribution Shifts (2) (Detecting & Addressing Data Distribution Shifts)

Day175 - MLOps Review: Data Distribution Shifts and Monitoring (2)

July 7, 2025 7 minute read

Designing Machine Learning Systems: Causes of ML System Failures (2) (Correcting Degenerate Feedback Loops) & Data Distribution Shifts (1) (Covariate, Co...

Day174 - MLOps Review: Data Distribution Shifts and Monitoring (1)

July 5, 2025 7 minute read

Designing Machine Learning Systems: Causes of ML System Failures (1) (Production data differing from training data, Edge Cases, and Degenerate Feedback Loops)

Day173 - MLOps Review: Model Deployment and Prediction Service (3)

July 3, 2025 4 minute read

Designing Machine Learning Systems: ML on the Cloud and on the Edge & Model Optimization (AutoTVM & WebAssembly)

Day172 - MLOps Review: Model Deployment and Prediction Service (2)

July 2, 2025 5 minute read

Designing Machine Learning Systems: Model Comparison (Low-Rank Factorization, Knowledge Distillation, Pruning, & Quantization)

Day171 - MLOps Review: Model Deployment and Prediction Service (1)

July 1, 2025 7 minute read

Designing Machine Learning Systems: Model Deployment and Batch Prediction Versus Online Prediction (Unifying Batch and Streaming Pipeline)

Day170 - MLOps Review: Model Development and Offline Evaluation (4)

June 30, 2025 6 minute read

Designing Machine Learning Systems: Model Offline Evaluation (Methods: Perturbation Tests, Invariance Tests, etc.)

Day169 - MLOps Review: Model Development and Offline Evaluation (3)

June 29, 2025 6 minute read

Designing Machine Learning Systems: Auto ML (Hyperparameter Tuning & NAS), Model Offline Evaluation (1) (Establishing Baselines)

Day168 - MLOps Review: Model Development and Offline Evaluation (2)

June 28, 2025 6 minute read

Designing Machine Learning Systems: Experiment Tracking, Versioning, and Distributed Training (Data Parallel)

Day167 - MLOps Review: Model Development and Offline Evaluation (1)

June 27, 2025 6 minute read

Designing Machine Learning Systems: Model Selection, Evaluating ML Models, & Ensemble Method (Bagging, Boosting, and Stacking)

Day166 - MLOps Review: Feature Engineering (3)

June 2, 2025 4 minute read

Designing Machine Learning Systems: Data Leakage (Definition, Common Causes, Detecting and Preventing it)

Day165 - MLOps Review: Feature Engineering (2)

May 31, 2025 4 minute read

Designing Machine Learning Systems: Feature Engineering Techniques (2) (Positional Embeddings) & Engineering Good Features

Day164 - MLOps Review: Feature Engineering (1)

May 29, 2025 5 minute read

Designing Machine Learning Systems: Feature Engineering Techniques (1) (Handling Missing Values, Scaling, Normalization, Binning, Encoding Categorical Values...

Day163 - MLOps Review: Training Data (4)

May 26, 2025 4 minute read

Designing Machine Learning Systems: Data Augmentation (Simple Label-Preserving Transformations, Perturbation, and Data Synthesis)

Day162 - MLOps Review: Training Data (3)

May 25, 2025 6 minute read

Designing Machine Learning Systems: Class Imbalance (How to Deal with the problems: Evaluation Metrics, Over & Undersampling, Resampling and Algorithm-le...

Day161 - MLOps Review: Training Data (2)

May 23, 2025 6 minute read

Designing Machine Learning Systems: Labeling (Hand Labels, Natural Labels, & Addressing the Lack of Labels - Active Learning, etc.)

Day160 - MLOps Review: Training Data (1)

May 18, 2025 5 minute read

Designing Machine Learning Systems: Sampling (Nonprobability, Simple Random, Stratified, Weighted, Reservoir, and Importance Sampling)

Day159 - MLOps Review: Data Engineering Fundamentals (2)

May 17, 2025 5 minute read

Designing Machine Learning Systems: Modes of Dataflow & Batch / Real-Time Processing

Day158 - MLOps Review: Data Engineering Fundamentals (1)

April 29, 2025 5 minute read

Designing Machine Learning Systems: Data Formats (JSON, Parquet & Binary Format), Data Models (Relational & NoSQL), and Data Storage Engines (ETL)

Day157 - MLOps Review: Introduction to Machine Learning Systems Design (3)

April 26, 2025 3 minute read

Designing Machine Learning Systems: Framing ML Problems (2) (Types of ML Tasks & Objective Functions)

Day156 - MLOps Review: Introduction to Machine Learning Systems Design (2)

April 25, 2025 5 minute read

Designing Machine Learning Systems: Iterative Process & Framing ML Problems (1)

Day155 - MLOps Review: Introduction to Machine Learning Systems Design (1)

April 24, 2025 3 minute read

Designing Machine Learning Systems: Business and ML Objectives & Requirements for ML Systems

A New Chapter:: Continuing onward to MLOps

April 22, 2025 1 minute read

Designing Machine Learning Systems (MLOPs) Review Begins!

Day154 - STAT Review: Unsupervised Learning (7)

April 20, 2025 6 minute read

Practical Statistics for Data Scientists: Scaling and Categorical Variables (Scaling the Variables, Dominant Variables, Categorical Data, and Gower’s Distanc...

Day153 - STAT Review: Unsupervised Learning (6)

April 15, 2025 4 minute read

Practical Statistics for Data Scientists: Model-Based Clustering (Multivariate Normal Distribution, Mixtures of Normals & Selecting the Number of Cluster...

Day152 - STAT Review: Unsupervised Learning (5)

April 13, 2025 5 minute read

Practical Statistics for Data Scientists: Hierarchical Clustering (A Simple Example, the Dendrogram, the Agglomerative Algorithm & Measures of Dissimilar...

Day151 - STAT Review: Unsupervised Learning (4)

April 8, 2025 4 minute read

Practical Statistics for Data Scientists: K-Means Clustering (2) (Interpreting Clustering Results & Determining the Optimal Number of Clusters K)

Day150 - STAT Review: Unsupervised Learning (3)

April 6, 2025 5 minute read

Practical Statistics for Data Scientists: K-Means Clustering (1) (A Simple Example & K-Means Algorithm Code Source)

Day149 - STAT Review: Unsupervised Learning (2)

April 4, 2025 4 minute read

Practical Statistics for Data Scientists: Principal Components Analysis (2) (Formal Definition, Interpreting Components & Correspondence Analysis)

Day148 - STAT Review: Unsupervised Learning (1)

April 2, 2025 5 minute read

Practical Statistics for Data Scientists: Principal Components Analysis (1) (Unsupervised Learning, A Simple Example and Computing the Principal Components)

Day147 - STAT Review: Statistical Machine Learning (9)

March 31, 2025 6 minute read

Practical Statistics for Data Scientists: Boosting (2) (Regularization, Hyperparameters & Cross-Validation)

Day146 - STAT Review: Statistical Machine Learning (8)

March 30, 2025 6 minute read

Practical Statistics for Data Scientists: Boosting (1) (Key Concepts & XGBoost)

Day145 - STAT Review: Statistical Machine Learning (7)

March 29, 2025 6 minute read

Practical Statistics for Data Scientists: Bagging and the Random Forest (2) (Random Forest II & Variable Importance)

Day144 - STAT Review: Statistical Machine Learning (6)

March 28, 2025 4 minute read

Practical Statistics for Data Scientists: Bagging and the Random Forest (1) (Bagging and Random Forest)

Day143 - STAT Review: Statistical Machine Learning (5)

March 25, 2025 5 minute read

Practical Statistics for Data Scientists: Tree Models (3) (Dealing With Overfitting Problems in R and Python & Predicting a Continuous Value)

Day142 - STAT Review: Statistical Machine Learning (4)

March 21, 2025 5 minute read

Practical Statistics for Data Scientists: Tree Models (2) (A Simple Example, The Recursive Partitioning Algorithm, & Measuring Homogeneity or Impurity)

Day141 - STAT Review: Statistical Machine Learning (3)

March 20, 2025 4 minute read

Practical Statistics for Data Scientists: K-Nearest Neighbors (3) (KNN as a Feature Engine) & Tree Models (1) (Key Concepts)

Day140 - STAT Review: Statistical Machine Learning (2)

March 16, 2025 4 minute read

Practical Statistics for Data Scientists: K-Nearest Neighbors (2) (Standardization & Choosing K)

Day139 - STAT Review: Statistical Machine Learning (1)

March 15, 2025 5 minute read

Practical Statistics for Data Scientists: K-Nearest Neighbors (1) (Example, Distance Metrics, and One Hot Encoder)

Day138 - STAT Review: Classification (8)

March 14, 2025 3 minute read

Practical Statistics for Data Scientists: Strategies for Imbalanced Data (2) (Data Generation, Cost-Based Classification, and Exploring the Predictions)

Day137 - STAT Review: Classification (7)

March 9, 2025 4 minute read

Practical Statistics for Data Scientists: Strategies for Imbalanced Data (1) (Undersampling & Oversampling)

Day136 - STAT Review: Classification (6)

March 8, 2025 6 minute read

Practical Statistics for Data Scientists: Evaluating Classification Models (Confusion Matrix, ROC-AUC & Lift)

Day135 - STAT Review: Classification (5)

March 7, 2025 3 minute read

Practical Statistics for Data Scientists: Logistic Regression (3) Assessing the Model

Day134 - STAT Review: Classification (4)

March 5, 2025 4 minute read

Practical Statistics for Data Scientists: Logistic Regression (2) (GLM, Interpretation, Fitting the Model)

Day133 - STAT Review: Classification (3)

March 3, 2025 5 minute read

Practical Statistics for Data Scientists: Logistic Regression (1) (Mathematical Foundation: Odds, Logit Function, Formula, and Examples)

Day132 - STAT Review: Classification (2)

March 2, 2025 5 minute read

Practical Statistics for Data Scientists: Discriminant Analysis (Covariance, Discriminant Function, and Application: Predicting Default Risk)

Day131 - STAT Review: Classification (1)

March 1, 2025 6 minute read

Practical Statistics for Data Scientists: Naive Bayes (Theoretical Approach, Code Source, & Prediction)

Day130 - STAT Review: Regression and Prediction (8)

February 28, 2025 4 minute read

Practical Statistics for Data Scientists: Weighted Regression, and Interactions and Main Effects in Regression in Depth

Day129 - STAT Review: Regression and Prediction (7)

February 26, 2025 4 minute read

Practical Statistics for Data Scientists: Stepwise Regression & Model Selection in Depth

Day128 - STAT Review: Revisiting Mathematics Theories (4)

February 25, 2025 4 minute read

Mathematical Principles: Non-parametric Inference (Wilcoxon Signed-Rank Test & Wilcoxon Rank-Sum Test)

Day127 - STAT Review: Revisiting Mathematics Theories (3)

February 24, 2025 4 minute read

Mathematical Principles: Inference on Proportions- Sample Size Estimation, Hypothesis Testing, and Chi-Squared Test

Day126 - STAT Review: Revisiting Mathematics Theories (2)

February 23, 2025 4 minute read

Mathematical Principles: Inference for Variance, Chi-Squared Distribution, F-Statistics, and Inference on Proportions (Wald & Wilson)

Day125 - STAT Review: Revisiting Mathematics Theories (1)

February 22, 2025 4 minute read

Mathematical Principles: Hypothesis Testing, Paired Samples, Independent Samples, and additional concepts.

Day124 - STAT Review: Regression and Prediction (6)

February 19, 2025 6 minute read

Practical Statistics for Data Scientists: Partial Residual Plots and Nonlinearity, Polynomial & Spline Regression, and Generalized Additive Models

Day123 - STAT Review: Regression and Prediction (5)

February 18, 2025 6 minute read

Practical Statistics for Data Scientists: Regression Diagnostics- Outliers, Influential Observations, and Heteroskedasticity

Day122 - STAT Review: Regression and Prediction (4)

February 16, 2025 6 minute read

Practical Statistics for Data Scientists: Interpreting the Regression Equation - Correlation, Multicollinearity, Confounding Variables, and Interactions

Day121 - STAT Review: Regression and Prediction (3)

February 15, 2025 4 minute read

Practical Statistics for Data Scientists: Factor Variables in Regression, Dummy Variables, Many Levels, and Ordered Factor Variables

Day120 - STAT Review: Regression and Prediction (2)

February 13, 2025 5 minute read

Practical Statistics for Data Scientists: Assessing the Model, Cross Validation, Model Selection, and Prediction Using Regression

Day119 - STAT Review: Regression and Prediction (1)

February 11, 2025 6 minute read

Practical Statistics for Data Scientists: Simple Linear Regression, Least Squares, and Multiple Linear Regression

Day118 - STAT Review: Statistical Experiments and Significance Testing (5)

February 9, 2025 5 minute read

Practical Statistics for Data Scientists: Chi-Square Theories, Fisher’s Exact Test, Multi-Arm Bandit Algorithm, and Power & Sample Size

Day117 - STAT Review: Statistical Experiments and Significance Testing (4)

February 7, 2025 6 minute read

Practical Statistics for Data Scientists: ANOVA(One & Two-Way), F-statistic, and Chi-Square Test

Day116 - STAT Review: Statistical Experiments and Significance Testing (3)

February 6, 2025 6 minute read

Practical Statistics for Data Scientists: t-Tests, Multiple Testing, False Discovery, and Degrees of Freedom

Day115 - STAT Review: Statistical Experiments and Significance Testing (2)

February 5, 2025 5 minute read

Practical Statistics for Data Scientists: p-Values, Practical Applications, and Type I & Type II Errors

Day114 - STAT Review: Statistical Experiments and Significance Testing (1)

February 1, 2025 7 minute read

Practical Statistics for Data Scientists: A/B Testing, Hypothesis Tests (One-Way & Two-Way), and Permutation Test

Day113 - STAT Review: Data & Sampling Distributions (3)

January 31, 2025 8 minute read

Practical Statistics for Data Scientists: t-Dist, Binomial, Chi-Square, F-Dist, and Poisson Distribution.

Day112 - STAT Review: Data & Sampling Distributions (2)

January 29, 2025 4 minute read

Practical Statistics for Data Scientists: Bootstrap, Confidence Intervals, and Normal Distribution

Day111 - STAT Review: Data & Sampling Distributions (1)

January 28, 2025 5 minute read

Practical Statistics for Data Scientists: Sampling, Bias, and Sampling Distribution(Central Limit Theorem)

Day110 - STAT Review: Exploratory Data Analysis (2)

January 26, 2025 5 minute read

Practical Statistics for Data Scientists: Data Distribution, Correlation, and Various Data Visualization Plots

Day109 - STAT Review: Exploratory Data Analysis (1)

January 25, 2025 5 minute read

Practical Statistics of Data Scientists: Elements of Statistical Terminologies- Data Statistics Fundamentals, Data Types, and Estimates of Location & Var...

Continuing the TIL Project in 2025

January 15, 2025 2 minute read

The Ongoing Chronicles of TIL25 — A Motivating Expedition as a Data Scientist & AI/ML Engineer Candidate

Back to top ↑

mlReview

Day89 ML Review - Ensemble Method (5)

October 11, 2024 7 minute read

Revisiting Ensemble Method, Random Forest, and XGBoost

Day65 ML Review - Ensemble Method (4)

August 29, 2024 5 minute read

Bagging & Boosting : Basic Concepts & Code Implementation

Day64 ML Review - Ensemble Method (3)

August 28, 2024 5 minute read

Using the Majority Voting Principle to Make Predictions, and Evaluating & Tuning the Ensemble Classifier

Day63 ML Review - Ensemble Method (2)

August 27, 2024 7 minute read

Code Structure of Combining Classifiers via Majority Vote

Day62 ML Review - Ensemble Method (1)

August 26, 2024 3 minute read

Key Concepts and Mathematics Explanation

Day61 ML Review - Class Imbalances

August 25, 2024 3 minute read

Use Other Metrics, Assign Different Class Weights, or Upsample the Minority Class

Day60 ML Review - Cross Validation (5)

August 23, 2024 3 minute read

ROC area Under The Curve (ROC AUC)

Day59 ML Review - Cross Validation (4)

August 22, 2024 4 minute read

Confusion Matrix and F1 score

Day58 ML Review - Cross Validation (3)

August 21, 2024 3 minute read

Grid Search for Fine-Tuning Machine Learning Models

Day57 ML Review - Cross Validation (2)

August 20, 2024 7 minute read

Bias & Variance, and Learning & Validation Curves

Day56 ML Review - Cross Validation (1)

August 19, 2024 4 minute read

Model Selection and K-Fold Cross Validation

Day55 ML Review - Pipeline

August 18, 2024 2 minute read

Key Concepts and Example Code with Scikit-learn

Day54 ML Review - Dimensionality Reduction (5)

August 17, 2024 3 minute read

Applying Kernal Principal Component Analysis(KPCA) to New Data Points

Day53 ML Review - Dimensionality Reduction (4)

August 15, 2024 2 minute read

Implementing a Kernal Principal Component Analysis(KPCA) in Python

Day52 ML Review - Dimensionality Reduction (3)

August 14, 2024 5 minute read

Nonlinear Mappings with Kernel Principal Component Analysis

Day51 ML Review - Dimensionality Reduction (2)

August 13, 2024 5 minute read

Compressing Data via Linear Discriminant Analysis

Day50 ML Review - Dimensionality Reduction (1)

August 12, 2024 5 minute read

Compressing Data via Dimensionality Reduction and Summary of PCA

Day49 ML Review - Data Preprocessing (3)

August 8, 2024 4 minute read

Partitioning a Dataset into Training & Test Datasets, Feature Scaling, and Feature Selection

Day48 ML Review - Data Preprocessing (2)

August 7, 2024 4 minute read

Handling Categorical Data - Converting, Ordinal Encoding, and One-Hot Encoding

Day47 ML Review - Data Preprocessing (1)

August 6, 2024 3 minute read

Handling Missing Data - Eliminating and Imputing & Estimators API

Day46 ML Review - K-Nearest Neighbors (3)

August 5, 2024 3 minute read

The Curse of Dimensionality

Day45 ML Review - K-Nearest Neighbors (2)

August 2, 2024 3 minute read

Distance Metrics- Euclidean, Manhattan, Minkowski & Chebyshev Distance, and Cosine Similarity

Day44 ML Review - K-Nearest Neighbors (1)

August 1, 2024 3 minute read

Basic Concepts, How It Works, and Parametric & Non-Parametric Model

Day39-43 Linear Algebra & Matrix Review (Korean)

July 31, 2024 less than 1 minute read

Linear Algebra & Matrix for Programmers

Day38 ML Review - Random Forest (2)

July 31, 2024 2 minute read

Implementation Step by Step

Day37 ML Review - Decision Tree (3) & Random Forest (1)

July 30, 2024 3 minute read

Building a Decision Tree & Random Forest (1) - Key Concepts & How it Works

Day36 ML Review - Decision Tree (2)

July 29, 2024 2 minute read

Information Gain (2) - Entropy & Classification Error

Day35 ML Review - Decision Tree (1)

July 28, 2024 4 minute read

Components, How it Works & Maximizing Information Gain (1) - Gini Impurity

Day34 ML Review - Support Vector Machine (3)

July 25, 2024 2 minute read

Solving Nonlinear Problems - Using a Kernal SVM

Day33 ML Review - Support Vector Machine (2)

July 24, 2024 2 minute read

SVM: Nonlinear Separable Case

Day32 ML Review - Support Vector Machine (1)

July 23, 2024 2 minute read

Basic Concepts and Mathematical Formulations

Day31 ML Review - Logistic Regression (3)

July 22, 2024 2 minute read

How To Train in Scikit-Learn, and Regularization with LR model

Day30 ML Review - Logistic Regression (2)

July 20, 2024 1 minute read

Cost Function of Logistic Regression

Day29 ML Review - Logistic Regression (1)

July 19, 2024 3 minute read

Basic Concepts and Sigmoid Function

Day28 ML Review - Perceptron (3)

July 18, 2024 1 minute read

Step by Step - Training Perceptron (3)

Day27 ML Review - Perceptron (2)

July 17, 2024 1 minute read

Step by Step - Training Perceptron (2)

Day26 ML Review - Perceptron (1)

July 16, 2024 2 minute read

Choosing a Classification Algorithm Step by Step - Training Perceptron (1)

Day25 Statistics Review (4)

June 20, 2024 3 minute read

Population Proportions, p-values & Confidence Intervals, and Type I & II Errors

Day24 Statistics Review (3)

June 19, 2024 3 minute read

Test Statistics (Z-Test, t-Test, and Chi-Squared Test)

Day23 Statistics Review (2)

June 18, 2024 2 minute read

Law of Large Numbers, Central Limit Theorem, and Hypothesis Testing (1) - General Setup

Day22 Statistics Review (1)

June 17, 2024 2 minute read

Properties of Random Variable

Day21 Probability Review (3)

June 14, 2024 2 minute read

Continuous Probability Distribution and Markov Chains

Day20 Probability Review (2)

June 11, 2024 2 minute read

Joint, Marginal, & Conditional Probability Distributions, and Discrete & Poisson Distributions

Day19 Probability Review (1)

June 10, 2024 1 minute read

Basic Probability - Counting and Random Variables

Day18 ML Review - Principle Component Analysis (3)

June 8, 2024 2 minute read

Applying PCA in Machine Learning and Scree Plot

Day17 ML Review - Regularization

June 6, 2024 5 minute read

Concepts, Types of Regularization, and How It Works

Day16 ML Review - R-squared (3)

June 5, 2024 less than 1 minute read

Further Analysis on R-Squared

Day15 ML Review - R-Squared (2)

June 4, 2024 2 minute read

How R-Squared Is Used As a Performance Metric in Machine Learning

Day14 ML Review - R-Squared (1)

June 3, 2024 3 minute read

Concepts Overview, Mathematical Calculation, and Interpretation

Day13 ML Review - Gradient Descent (3)

June 2, 2024 5 minute read

Summarization and Types

Day12 ML Review - Gradient Descent (2)

May 31, 2024 2 minute read

Mathematical Explanation

Day11 ML Review - Gradient Descent (1)

May 28, 2024 4 minute read

Basic Concepts, Steps, and Key Consideration

Day10 ML Review - Gradient

May 25, 2024 3 minute read

Understanding Gradients in Machine Learning Applications

Day09 ML Review - Linear Regression (3)

May 24, 2024 2 minute read

SSE Sum Squared Error and Choosing Cost Function

Day08 ML Review - Linear Regression (2)

May 23, 2024 4 minute read

Cost Function and MSE Mean Squared Error

Day07 ML Review - Linear Regression (1)

May 22, 2024 4 minute read

Concepts Overview and Mathematical Calculation Exercise

Day06 ML Review - Principle Component Analysis (2)

May 21, 2024 3 minute read

Applications on Machine Learning & Further Explanations

Day05 ML Review - Principle Component Analysis (1)

May 20, 2024 5 minute read

Mathematical Definition & Algorithms

The Start of TIL 24

April 30, 2024

The Start of Recording TIL Summer ‘24 - During Summer in Rochester as a Data Scientist Candidate

Back to top ↑

StatReview

Day154 - STAT Review: Unsupervised Learning (7)

April 20, 2025 6 minute read

Practical Statistics for Data Scientists: Scaling and Categorical Variables (Scaling the Variables, Dominant Variables, Categorical Data, and Gower’s Distanc...

Day153 - STAT Review: Unsupervised Learning (6)

April 15, 2025 4 minute read

Practical Statistics for Data Scientists: Model-Based Clustering (Multivariate Normal Distribution, Mixtures of Normals & Selecting the Number of Cluster...

Day152 - STAT Review: Unsupervised Learning (5)

April 13, 2025 5 minute read

Practical Statistics for Data Scientists: Hierarchical Clustering (A Simple Example, the Dendrogram, the Agglomerative Algorithm & Measures of Dissimilar...

Day151 - STAT Review: Unsupervised Learning (4)

April 8, 2025 4 minute read

Practical Statistics for Data Scientists: K-Means Clustering (2) (Interpreting Clustering Results & Determining the Optimal Number of Clusters K)

Day150 - STAT Review: Unsupervised Learning (3)

April 6, 2025 5 minute read

Practical Statistics for Data Scientists: K-Means Clustering (1) (A Simple Example & K-Means Algorithm Code Source)

Day149 - STAT Review: Unsupervised Learning (2)

April 4, 2025 4 minute read

Practical Statistics for Data Scientists: Principal Components Analysis (2) (Formal Definition, Interpreting Components & Correspondence Analysis)

Day148 - STAT Review: Unsupervised Learning (1)

April 2, 2025 5 minute read

Practical Statistics for Data Scientists: Principal Components Analysis (1) (Unsupervised Learning, A Simple Example and Computing the Principal Components)

Day147 - STAT Review: Statistical Machine Learning (9)

March 31, 2025 6 minute read

Practical Statistics for Data Scientists: Boosting (2) (Regularization, Hyperparameters & Cross-Validation)

Day146 - STAT Review: Statistical Machine Learning (8)

March 30, 2025 6 minute read

Practical Statistics for Data Scientists: Boosting (1) (Key Concepts & XGBoost)

Day145 - STAT Review: Statistical Machine Learning (7)

March 29, 2025 6 minute read

Practical Statistics for Data Scientists: Bagging and the Random Forest (2) (Random Forest II & Variable Importance)

Day144 - STAT Review: Statistical Machine Learning (6)

March 28, 2025 4 minute read

Practical Statistics for Data Scientists: Bagging and the Random Forest (1) (Bagging and Random Forest)

Day143 - STAT Review: Statistical Machine Learning (5)

March 25, 2025 5 minute read

Practical Statistics for Data Scientists: Tree Models (3) (Dealing With Overfitting Problems in R and Python & Predicting a Continuous Value)

Day142 - STAT Review: Statistical Machine Learning (4)

March 21, 2025 5 minute read

Practical Statistics for Data Scientists: Tree Models (2) (A Simple Example, The Recursive Partitioning Algorithm, & Measuring Homogeneity or Impurity)

Day141 - STAT Review: Statistical Machine Learning (3)

March 20, 2025 4 minute read

Practical Statistics for Data Scientists: K-Nearest Neighbors (3) (KNN as a Feature Engine) & Tree Models (1) (Key Concepts)

Day140 - STAT Review: Statistical Machine Learning (2)

March 16, 2025 4 minute read

Practical Statistics for Data Scientists: K-Nearest Neighbors (2) (Standardization & Choosing K)

Day139 - STAT Review: Statistical Machine Learning (1)

March 15, 2025 5 minute read

Practical Statistics for Data Scientists: K-Nearest Neighbors (1) (Example, Distance Metrics, and One Hot Encoder)

Day138 - STAT Review: Classification (8)

March 14, 2025 3 minute read

Practical Statistics for Data Scientists: Strategies for Imbalanced Data (2) (Data Generation, Cost-Based Classification, and Exploring the Predictions)

Day137 - STAT Review: Classification (7)

March 9, 2025 4 minute read

Practical Statistics for Data Scientists: Strategies for Imbalanced Data (1) (Undersampling & Oversampling)

Day136 - STAT Review: Classification (6)

March 8, 2025 6 minute read

Practical Statistics for Data Scientists: Evaluating Classification Models (Confusion Matrix, ROC-AUC & Lift)

Day135 - STAT Review: Classification (5)

March 7, 2025 3 minute read

Practical Statistics for Data Scientists: Logistic Regression (3) Assessing the Model

Day134 - STAT Review: Classification (4)

March 5, 2025 4 minute read

Practical Statistics for Data Scientists: Logistic Regression (2) (GLM, Interpretation, Fitting the Model)

Day133 - STAT Review: Classification (3)

March 3, 2025 5 minute read

Practical Statistics for Data Scientists: Logistic Regression (1) (Mathematical Foundation: Odds, Logit Function, Formula, and Examples)

Day132 - STAT Review: Classification (2)

March 2, 2025 5 minute read

Practical Statistics for Data Scientists: Discriminant Analysis (Covariance, Discriminant Function, and Application: Predicting Default Risk)

Day131 - STAT Review: Classification (1)

March 1, 2025 6 minute read

Practical Statistics for Data Scientists: Naive Bayes (Theoretical Approach, Code Source, & Prediction)

Day130 - STAT Review: Regression and Prediction (8)

February 28, 2025 4 minute read

Practical Statistics for Data Scientists: Weighted Regression, and Interactions and Main Effects in Regression in Depth

Day129 - STAT Review: Regression and Prediction (7)

February 26, 2025 4 minute read

Practical Statistics for Data Scientists: Stepwise Regression & Model Selection in Depth

Day128 - STAT Review: Revisiting Mathematics Theories (4)

February 25, 2025 4 minute read

Mathematical Principles: Non-parametric Inference (Wilcoxon Signed-Rank Test & Wilcoxon Rank-Sum Test)

Day127 - STAT Review: Revisiting Mathematics Theories (3)

February 24, 2025 4 minute read

Mathematical Principles: Inference on Proportions- Sample Size Estimation, Hypothesis Testing, and Chi-Squared Test

Day126 - STAT Review: Revisiting Mathematics Theories (2)

February 23, 2025 4 minute read

Mathematical Principles: Inference for Variance, Chi-Squared Distribution, F-Statistics, and Inference on Proportions (Wald & Wilson)

Day125 - STAT Review: Revisiting Mathematics Theories (1)

February 22, 2025 4 minute read

Mathematical Principles: Hypothesis Testing, Paired Samples, Independent Samples, and additional concepts.

Day124 - STAT Review: Regression and Prediction (6)

February 19, 2025 6 minute read

Practical Statistics for Data Scientists: Partial Residual Plots and Nonlinearity, Polynomial & Spline Regression, and Generalized Additive Models

Day123 - STAT Review: Regression and Prediction (5)

February 18, 2025 6 minute read

Practical Statistics for Data Scientists: Regression Diagnostics- Outliers, Influential Observations, and Heteroskedasticity

Day122 - STAT Review: Regression and Prediction (4)

February 16, 2025 6 minute read

Practical Statistics for Data Scientists: Interpreting the Regression Equation - Correlation, Multicollinearity, Confounding Variables, and Interactions

Day121 - STAT Review: Regression and Prediction (3)

February 15, 2025 4 minute read

Practical Statistics for Data Scientists: Factor Variables in Regression, Dummy Variables, Many Levels, and Ordered Factor Variables

Day120 - STAT Review: Regression and Prediction (2)

February 13, 2025 5 minute read

Practical Statistics for Data Scientists: Assessing the Model, Cross Validation, Model Selection, and Prediction Using Regression

Day119 - STAT Review: Regression and Prediction (1)

February 11, 2025 6 minute read

Practical Statistics for Data Scientists: Simple Linear Regression, Least Squares, and Multiple Linear Regression

Day118 - STAT Review: Statistical Experiments and Significance Testing (5)

February 9, 2025 5 minute read

Practical Statistics for Data Scientists: Chi-Square Theories, Fisher’s Exact Test, Multi-Arm Bandit Algorithm, and Power & Sample Size

Day117 - STAT Review: Statistical Experiments and Significance Testing (4)

February 7, 2025 6 minute read

Practical Statistics for Data Scientists: ANOVA(One & Two-Way), F-statistic, and Chi-Square Test

Day116 - STAT Review: Statistical Experiments and Significance Testing (3)

February 6, 2025 6 minute read

Practical Statistics for Data Scientists: t-Tests, Multiple Testing, False Discovery, and Degrees of Freedom

Day115 - STAT Review: Statistical Experiments and Significance Testing (2)

February 5, 2025 5 minute read

Practical Statistics for Data Scientists: p-Values, Practical Applications, and Type I & Type II Errors

Day114 - STAT Review: Statistical Experiments and Significance Testing (1)

February 1, 2025 7 minute read

Practical Statistics for Data Scientists: A/B Testing, Hypothesis Tests (One-Way & Two-Way), and Permutation Test

Day113 - STAT Review: Data & Sampling Distributions (3)

January 31, 2025 8 minute read

Practical Statistics for Data Scientists: t-Dist, Binomial, Chi-Square, F-Dist, and Poisson Distribution.

Day112 - STAT Review: Data & Sampling Distributions (2)

January 29, 2025 4 minute read

Practical Statistics for Data Scientists: Bootstrap, Confidence Intervals, and Normal Distribution

Day111 - STAT Review: Data & Sampling Distributions (1)

January 28, 2025 5 minute read

Practical Statistics for Data Scientists: Sampling, Bias, and Sampling Distribution(Central Limit Theorem)

Day110 - STAT Review: Exploratory Data Analysis (2)

January 26, 2025 5 minute read

Practical Statistics for Data Scientists: Data Distribution, Correlation, and Various Data Visualization Plots

Day109 - STAT Review: Exploratory Data Analysis (1)

January 25, 2025 5 minute read

Practical Statistics of Data Scientists: Elements of Statistical Terminologies- Data Statistics Fundamentals, Data Types, and Estimates of Location & Var...

Continuing the TIL Project in 2025

January 15, 2025 2 minute read

The Ongoing Chronicles of TIL25 — A Motivating Expedition as a Data Scientist & AI/ML Engineer Candidate

Back to top ↑

classifier

Day89 ML Review - Ensemble Method (5)

October 11, 2024 7 minute read

Revisiting Ensemble Method, Random Forest, and XGBoost

Day73 Deep Learning Lecture Review - Lecture 5

September 11, 2024 10 minute read

Transformers and Foundation Models: GELU, Layer Norm, Key Concepts & Workflow

Day71 DL Review - Natural Language Processing (NLP)

September 8, 2024 5 minute read

Primary Goals, Common Tasks, and Deep Learning NLP

Day65 ML Review - Ensemble Method (4)

August 29, 2024 5 minute read

Bagging & Boosting : Basic Concepts & Code Implementation

Day64 ML Review - Ensemble Method (3)

August 28, 2024 5 minute read

Using the Majority Voting Principle to Make Predictions, and Evaluating & Tuning the Ensemble Classifier

Day63 ML Review - Ensemble Method (2)

August 27, 2024 7 minute read

Code Structure of Combining Classifiers via Majority Vote

Day62 ML Review - Ensemble Method (1)

August 26, 2024 3 minute read

Key Concepts and Mathematics Explanation

Day61 ML Review - Class Imbalances

August 25, 2024 3 minute read

Use Other Metrics, Assign Different Class Weights, or Upsample the Minority Class

Day60 ML Review - Cross Validation (5)

August 23, 2024 3 minute read

ROC area Under The Curve (ROC AUC)

Day59 ML Review - Cross Validation (4)

August 22, 2024 4 minute read

Confusion Matrix and F1 score

Day58 ML Review - Cross Validation (3)

August 21, 2024 3 minute read

Grid Search for Fine-Tuning Machine Learning Models

Day57 ML Review - Cross Validation (2)

August 20, 2024 7 minute read

Bias & Variance, and Learning & Validation Curves

Day56 ML Review - Cross Validation (1)

August 19, 2024 4 minute read

Model Selection and K-Fold Cross Validation

Day55 ML Review - Pipeline

August 18, 2024 2 minute read

Key Concepts and Example Code with Scikit-learn

Day54 ML Review - Dimensionality Reduction (5)

August 17, 2024 3 minute read

Applying Kernal Principal Component Analysis(KPCA) to New Data Points

Day53 ML Review - Dimensionality Reduction (4)

August 15, 2024 2 minute read

Implementing a Kernal Principal Component Analysis(KPCA) in Python

Day52 ML Review - Dimensionality Reduction (3)

August 14, 2024 5 minute read

Nonlinear Mappings with Kernel Principal Component Analysis

Day51 ML Review - Dimensionality Reduction (2)

August 13, 2024 5 minute read

Compressing Data via Linear Discriminant Analysis

Day50 ML Review - Dimensionality Reduction (1)

August 12, 2024 5 minute read

Compressing Data via Dimensionality Reduction and Summary of PCA

Day49 ML Review - Data Preprocessing (3)

August 8, 2024 4 minute read

Partitioning a Dataset into Training & Test Datasets, Feature Scaling, and Feature Selection

Day48 ML Review - Data Preprocessing (2)

August 7, 2024 4 minute read

Handling Categorical Data - Converting, Ordinal Encoding, and One-Hot Encoding

Day47 ML Review - Data Preprocessing (1)

August 6, 2024 3 minute read

Handling Missing Data - Eliminating and Imputing & Estimators API

Day46 ML Review - K-Nearest Neighbors (3)

August 5, 2024 3 minute read

The Curse of Dimensionality

Day45 ML Review - K-Nearest Neighbors (2)

August 2, 2024 3 minute read

Distance Metrics- Euclidean, Manhattan, Minkowski & Chebyshev Distance, and Cosine Similarity

Day44 ML Review - K-Nearest Neighbors (1)

August 1, 2024 3 minute read

Basic Concepts, How It Works, and Parametric & Non-Parametric Model

Day38 ML Review - Random Forest (2)

July 31, 2024 2 minute read

Implementation Step by Step

Day37 ML Review - Decision Tree (3) & Random Forest (1)

July 30, 2024 3 minute read

Building a Decision Tree & Random Forest (1) - Key Concepts & How it Works

Day36 ML Review - Decision Tree (2)

July 29, 2024 2 minute read

Information Gain (2) - Entropy & Classification Error

Day35 ML Review - Decision Tree (1)

July 28, 2024 4 minute read

Components, How it Works & Maximizing Information Gain (1) - Gini Impurity

Day34 ML Review - Support Vector Machine (3)

July 25, 2024 2 minute read

Solving Nonlinear Problems - Using a Kernal SVM

Day33 ML Review - Support Vector Machine (2)

July 24, 2024 2 minute read

SVM: Nonlinear Separable Case

Day32 ML Review - Support Vector Machine (1)

July 23, 2024 2 minute read

Basic Concepts and Mathematical Formulations

Day31 ML Review - Logistic Regression (3)

July 22, 2024 2 minute read

How To Train in Scikit-Learn, and Regularization with LR model

Day30 ML Review - Logistic Regression (2)

July 20, 2024 1 minute read

Cost Function of Logistic Regression

Day29 ML Review - Logistic Regression (1)

July 19, 2024 3 minute read

Basic Concepts and Sigmoid Function

Day28 ML Review - Perceptron (3)

July 18, 2024 1 minute read

Step by Step - Training Perceptron (3)

Day27 ML Review - Perceptron (2)

July 17, 2024 1 minute read

Step by Step - Training Perceptron (2)

Day26 ML Review - Perceptron (1)

July 16, 2024 2 minute read

Choosing a Classification Algorithm Step by Step - Training Perceptron (1)

Back to top ↑

MLOpsReview

Day209 - Leetcode: Python 20 & MLOps Review: ML Engineering (1)

September 17, 2025 3 minute read

Python 20: Valid Parentheses & ML Engineering: High-level ML System Design

Day208 - Leetcode: Python 121 & SQL 175,176 & DL Review

September 16, 2025 4 minute read

Python 121: Best Time to Buy and Sell Stock / SQL 175,176: Second-highest Salary / DL Review: Transformers, Self-Attention Mechanism & Positional Encoding

Day207 - Leetcode: Python 53 & SQL 185 & DL Review

September 15, 2025 5 minute read

Python 53: Maximum Subarray / SQL 185: Department Top Three Sales / DL Review: RNNs, LSTM Networks & Gradient Vanishing & Exploding

Day206 - Leetcode: Python 217 & SQL 175,176 & DL Review

September 13, 2025 3 minute read

Python 217: Contains Duplicate / SQL 175,176: Second-highest Salary / DL Review: Transfer Learning & Fine-Tuning & CNNs

Day205 - Leetcode: Python 175 & SQL Inner Join & DL Review

September 12, 2025 6 minute read

Python 175: TwoSums / SQL: Inner Join Revisiting / DL Review: Embedding Layers, Autoencoders & Knowledge Distillation

Day204 - DL Review: Revisiting Optimizers, CNNs & Data Drifts

September 9, 2025 5 minute read

Optimizers in Neural Networks, Parameter Sharing in CNNs, and Data & Concept Drifts

Day201-203 - MLops Review: CI/CD with GitHub Actions and Azure (2)

September 5, 2025 5 minute read

Scaling CI/CD: From ACI to Azure Kubernetes Service

Day199-200 - MLops Review: CI/CD with GitHub Actions and Azure (1)

September 2, 2025 5 minute read

Automating Docker image builds with ACR and seamless deployments to ACI

Day177 - MLOps Review: Data Distribution Shifts and Monitoring (4)

July 9, 2025 10 minute read

Designing Machine Learning Systems: Data Distribution Shifts (3) (Understanding Failures: Monitoring and Observability for ML System Reliability)

Day176 - MLOps Review: Data Distribution Shifts and Monitoring (3)

July 8, 2025 7 minute read

Designing Machine Learning Systems: Data Distribution Shifts (2) (Detecting & Addressing Data Distribution Shifts)

Day175 - MLOps Review: Data Distribution Shifts and Monitoring (2)

July 7, 2025 7 minute read

Designing Machine Learning Systems: Causes of ML System Failures (2) (Correcting Degenerate Feedback Loops) & Data Distribution Shifts (1) (Covariate, Co...

Day174 - MLOps Review: Data Distribution Shifts and Monitoring (1)

July 5, 2025 7 minute read

Designing Machine Learning Systems: Causes of ML System Failures (1) (Production data differing from training data, Edge Cases, and Degenerate Feedback Loops)

Day173 - MLOps Review: Model Deployment and Prediction Service (3)

July 3, 2025 4 minute read

Designing Machine Learning Systems: ML on the Cloud and on the Edge & Model Optimization (AutoTVM & WebAssembly)

Day172 - MLOps Review: Model Deployment and Prediction Service (2)

July 2, 2025 5 minute read

Designing Machine Learning Systems: Model Comparison (Low-Rank Factorization, Knowledge Distillation, Pruning, & Quantization)

Day171 - MLOps Review: Model Deployment and Prediction Service (1)

July 1, 2025 7 minute read

Designing Machine Learning Systems: Model Deployment and Batch Prediction Versus Online Prediction (Unifying Batch and Streaming Pipeline)

Day170 - MLOps Review: Model Development and Offline Evaluation (4)

June 30, 2025 6 minute read

Designing Machine Learning Systems: Model Offline Evaluation (Methods: Perturbation Tests, Invariance Tests, etc.)

Day169 - MLOps Review: Model Development and Offline Evaluation (3)

June 29, 2025 6 minute read

Designing Machine Learning Systems: Auto ML (Hyperparameter Tuning & NAS), Model Offline Evaluation (1) (Establishing Baselines)

Day168 - MLOps Review: Model Development and Offline Evaluation (2)

June 28, 2025 6 minute read

Designing Machine Learning Systems: Experiment Tracking, Versioning, and Distributed Training (Data Parallel)

Day167 - MLOps Review: Model Development and Offline Evaluation (1)

June 27, 2025 6 minute read

Designing Machine Learning Systems: Model Selection, Evaluating ML Models, & Ensemble Method (Bagging, Boosting, and Stacking)

Day166 - MLOps Review: Feature Engineering (3)

June 2, 2025 4 minute read

Designing Machine Learning Systems: Data Leakage (Definition, Common Causes, Detecting and Preventing it)

Day165 - MLOps Review: Feature Engineering (2)

May 31, 2025 4 minute read

Designing Machine Learning Systems: Feature Engineering Techniques (2) (Positional Embeddings) & Engineering Good Features

Day164 - MLOps Review: Feature Engineering (1)

May 29, 2025 5 minute read

Designing Machine Learning Systems: Feature Engineering Techniques (1) (Handling Missing Values, Scaling, Normalization, Binning, Encoding Categorical Values...

Day163 - MLOps Review: Training Data (4)

May 26, 2025 4 minute read

Designing Machine Learning Systems: Data Augmentation (Simple Label-Preserving Transformations, Perturbation, and Data Synthesis)

Day162 - MLOps Review: Training Data (3)

May 25, 2025 6 minute read

Designing Machine Learning Systems: Class Imbalance (How to Deal with the problems: Evaluation Metrics, Over & Undersampling, Resampling and Algorithm-le...

Day161 - MLOps Review: Training Data (2)

May 23, 2025 6 minute read

Designing Machine Learning Systems: Labeling (Hand Labels, Natural Labels, & Addressing the Lack of Labels - Active Learning, etc.)

Day160 - MLOps Review: Training Data (1)

May 18, 2025 5 minute read

Designing Machine Learning Systems: Sampling (Nonprobability, Simple Random, Stratified, Weighted, Reservoir, and Importance Sampling)

Day159 - MLOps Review: Data Engineering Fundamentals (2)

May 17, 2025 5 minute read

Designing Machine Learning Systems: Modes of Dataflow & Batch / Real-Time Processing

Day158 - MLOps Review: Data Engineering Fundamentals (1)

April 29, 2025 5 minute read

Designing Machine Learning Systems: Data Formats (JSON, Parquet & Binary Format), Data Models (Relational & NoSQL), and Data Storage Engines (ETL)

Day157 - MLOps Review: Introduction to Machine Learning Systems Design (3)

April 26, 2025 3 minute read

Designing Machine Learning Systems: Framing ML Problems (2) (Types of ML Tasks & Objective Functions)

Day156 - MLOps Review: Introduction to Machine Learning Systems Design (2)

April 25, 2025 5 minute read

Designing Machine Learning Systems: Iterative Process & Framing ML Problems (1)

Day155 - MLOps Review: Introduction to Machine Learning Systems Design (1)

April 24, 2025 3 minute read

Designing Machine Learning Systems: Business and ML Objectives & Requirements for ML Systems

A New Chapter:: Continuing onward to MLOps

April 22, 2025 1 minute read

Designing Machine Learning Systems (MLOPs) Review Begins!

Back to top ↑

dlReview

Day108 Deep Learning Lecture Review - Advanced Techniques in DL

December 21, 2024 4 minute read

HW5: Out-of-distribution (OOD) Detection (Maximum Softmax Probability & ODIN) and Continual Learning (SLDA & IID Streaming)

Day107 Deep Learning Lecture Review - HW4 - Adjusting Probabilities into Real-World

December 20, 2024 6 minute read

HW4: Model Calibration (Platt Scaling & Label Smoothing) and Conformal Prediction (Naive and Adaptive Predictions Sets)

Day106 Deep Learning Lecture Review- Enhancing DL Workflows (Data Distribution)

December 19, 2024 4 minute read

HW3: Optimization through Data Loading, Profiling, & Scaling, and Comparison of Data Parallel & Distributed Data Parallel

Day105 Deep Learning Lecture Review - Lecture 20 (End)

December 18, 2024 9 minute read

Model Drifting, Periodic Re-Training, Detecting Model Drift, Continual Learning (Pre-Trained Model, NCC), and Real-Time Machine Learning

Day104 Deep Learning Lecture Review - Lecture 19

December 12, 2024 7 minute read

Data-Centric AI: Label Noise, Selection Bias, Data Leakage, and Error Analysis for Model Improvement (Subgroup Errors)

Day103 Deep Learning Lecture Review - Lecture 18 (2)

December 9, 2024 5 minute read

Data-Centric AI: Active Learning, SEALS(Similarity Search for Efficient Active Learning), Dataset Pruning, and Data Engine

Day102 Deep Learning Lecture Review - Lecture 18 (1)

November 29, 2024 6 minute read

Data-Centric AI: Crowdsourcing, Methods to Estimate Annotator Quality, Neural Scaling Laws, Pareto Curves and Power Law

Day101 Deep Learning Lecture Review - Lecture 17 (2)

November 23, 2024 5 minute read

Variation of Conformal Prediction: Size of Calibration Set, Evaluation, and Group-Based & Adaptive Conformal Prediction

Day100 Deep Learning Lecture Review - Lecture 17 (1)

November 12, 2024 7 minute read

Understanding Conformal Prediction: Concepts, Applications, Marginal Coverage, and Recipes In Detail

Day99 Deep Learning Lecture Review - Lecture 16

November 4, 2024 7 minute read

Uncertainty in Deep Learning, Distribution Shifts, Model Calibration, and Out-of-Distribution (OOD) Detection

Day98 DL Review - Revisiting LLM models (Hand-Written Tree Map)

November 3, 2024 1 minute read

Language Models- Transfer Learning, Basic Concepts & Terminologies, Components of NLP Models, and Attention Mechanism

Day97 Deep Learning Lecture Review - Lecture 15 (2)

November 2, 2024 7 minute read

Bias Mitigation Strategies: Loss Reweighting, Sampling & Synthetic Samples and Architectural Changes (OccamNets, Adversarial Training & DANN)

Day96 Deep Learning Lecture Review - Lecture 15 (1)

November 1, 2024 8 minute read

Model Comparison and Bias Mitigation; McNemar’s Test, Dataset Bias, and Bias Detection

Day95 Deep Learning Lecture Review - Lecture 14

October 31, 2024 6 minute read

AI Ethics; AI Safety, Key Issues, AGI (Artificial General Intelligence), and Current AI Models’ Challenges

Day94 Deep Learning Lecture Review - Lecture 13

October 30, 2024 10 minute read

Llama 3: Framework, Workflow (RMSNorm, Grouped Query Attention, RoPE, SwiGLU Attention), Pre-training & Post-training

Day93 Deep Learning Lecture Review - Fine-Tuning Models (2) & Prompt Engineering

October 23, 2024 7 minute read

Comparing Pre-trained model embeddings (ResNet+SBERT vs. CLIP) and Prompt Engineering (Short and Direct, Few-Shot Learning, & Expert Prompting)

Day92 Deep Learning Lecture Review - Fine-Tuning Models (1)

October 17, 2024 8 minute read

HW2: Understanding of LoRA and Pre-trained Model Embeddings (ResNet+SBERT) for Visual Question Answering (VQA)

Day91 Deep Learning Lecture Review - Optimizing Hyperparameters

October 14, 2024 7 minute read

Weights & Biases (W&B) for Monitoring and Fine-Tuning ResNet-18 and Post-Training Evaluations (Dying ReLU, Brightness Robustness)

Day90 Deep Learning Lecture Review - Background Knowledges

October 12, 2024 7 minute read

HW0: Softmax Properties, PyTorch Lightning, and DataLoader

Day88 Deep Learning Lecture Review - Lecture 10-12

October 10, 2024 12 minute read

Deep Learning & Numerical Precision(Floating Point), Hardware Considerations, and Distributed Model Training

Day87 Deep Learning Lecture Review - Lecture 8 (2) & 9

October 9, 2024 6 minute read

LLMs - Speeding Up LLMs (Grouped Query Attention, KV Caches, MoE, and DPO)

Day86 Deep Learning Lecture Review - Lecture 8 (1)

October 8, 2024 8 minute read

LLMs- Generating Texts, Positional Encoding, and Fine-Tuning LLMs (LoRA)

Day85 Deep Learning Lecture Review - Lecture 7

October 7, 2024 6 minute read

LLMs - Perplexity, Tokenizers, Data Cleaning, and Embedding Layer

Day75-84 Introduction to Natural Language Processing using Deep Learning

October 6, 2024 2 minute read

Basic Machine Learning & Deep Learning, Word Embedding, CNNs, RNNs, LSTM and Transformer

Day74 Deep Learning Lecture Review - Lecture 6

September 12, 2024 6 minute read

Large Language Model - BERT, GPT, and GPT-2, 3 & 4

Day73 Deep Learning Lecture Review - Lecture 5

September 11, 2024 10 minute read

Transformers and Foundation Models: GELU, Layer Norm, Key Concepts & Workflow

Day72 Deep Learning Lecture Review - Lecture 4

September 10, 2024 6 minute read

Brief Explanation of Basic Algebra and Machine Learning

Day71 DL Review - Natural Language Processing (NLP)

September 8, 2024 5 minute read

Primary Goals, Common Tasks, and Deep Learning NLP

Day70 DL Review - Transformer

September 7, 2024 8 minute read

Transformer Architecture, How the Models Are Different, and Q,K,V in Self-Attention

Back to top ↑

statReview

Day25 Statistics Review (4)

June 20, 2024 3 minute read

Population Proportions, p-values & Confidence Intervals, and Type I & II Errors

Day24 Statistics Review (3)

June 19, 2024 3 minute read

Test Statistics (Z-Test, t-Test, and Chi-Squared Test)

Day23 Statistics Review (2)

June 18, 2024 2 minute read

Law of Large Numbers, Central Limit Theorem, and Hypothesis Testing (1) - General Setup

Day22 Statistics Review (1)

June 17, 2024 2 minute read

Properties of Random Variable

Day21 Probability Review (3)

June 14, 2024 2 minute read

Continuous Probability Distribution and Markov Chains

Day20 Probability Review (2)

June 11, 2024 2 minute read

Joint, Marginal, & Conditional Probability Distributions, and Discrete & Poisson Distributions

Day19 Probability Review (1)

June 10, 2024 1 minute read

Basic Probability - Counting and Random Variables

Day18 ML Review - Principle Component Analysis (3)

June 8, 2024 2 minute read

Applying PCA in Machine Learning and Scree Plot

Day17 ML Review - Regularization

June 6, 2024 5 minute read

Concepts, Types of Regularization, and How It Works

Day16 ML Review - R-squared (3)

June 5, 2024 less than 1 minute read

Further Analysis on R-Squared

Day15 ML Review - R-Squared (2)

June 4, 2024 2 minute read

How R-Squared Is Used As a Performance Metric in Machine Learning

Day14 ML Review - R-Squared (1)

June 3, 2024 3 minute read

Concepts Overview, Mathematical Calculation, and Interpretation

Day06 ML Review - Principle Component Analysis (2)

May 21, 2024 3 minute read

Applications on Machine Learning & Further Explanations

Day05 ML Review - Principle Component Analysis (1)

May 20, 2024 5 minute read

Mathematical Definition & Algorithms

Day04 Basic Mathematics Review (4)

May 17, 2024 3 minute read

Bayesian Statistics, the Law of Total Probability, and Var-Cov Matrix

Day03 Basic Mathematics Review (3)

May 16, 2024 3 minute read

Eigendecomposition, Symmetric Matrix, and Eigendecomposition

Day02 Basic Mathematics Review (2)

May 15, 2024 2 minute read

Norm(2), Eigenvectors and Eigenvalues

Day01 Basic Mathematics Review (1)

May 1, 2024 2 minute read

Scalar, Norm, Matrix (Inverse, Basic Functions), Rank

The Start of TIL 24

April 30, 2024

The Start of Recording TIL Summer ‘24 - During Summer in Rochester as a Data Scientist Candidate

Back to top ↑

TIL_23

DataMiningPratice(4)-AggregatingData

August 13, 2023

Aggregating data with groupby

DataMiningPratice(3)-RefiningUnnecessaryColumns

August 12, 2023

Adding new columns and refining unnecessary columns.

DataMiningPratice(2)-RefiningDatasets

August 11, 2023

Change data type

DataMiningPratice(1)-BasicImportofDatasets

August 9, 2023

Utilize public data to import two completely different data, pre-process them, and merge them

Data Structure/ Ch03:BasicDataStructure(3)

August 7, 2023

Basic Data Structure - 3. Deque

Data Structure/ Ch03:BasicDataStructure(2)

August 5, 2023

Basic Data Structure - 2. Queue

Data Structure/ Ch03:BasicDataStructure(1)

August 4, 2023

Basic Data Structure - 1. Stacks

Data Structure/ Ch02:AlgorithmAnalysis(2)

August 2, 2023

TIL :CH02. Algorithm Anlaysis (2)

Data Structure/ Ch02:AlgorithmAnalysis

August 1, 2023

TIL: CH02. Algorithm Anlaysis

Data Structure/ Ch01:BasicPython

July 27, 2023

Python Basic Review

Back to top ↑

statistics

(editing) Data Science Interview Prep - Statistics

August 9, 2024 2 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - SQL

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - ML Basic

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - LLM/AI

August 9, 2024 3 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - DL

August 9, 2024 less than 1 minute read

Interview Questions & Answers

Statistics & Probability Lecture Review

August 1, 2024 9 minute read

Introduction of Statistics (2023) Whole Lecture Review

Day25 Statistics Review (4)

June 20, 2024 3 minute read

Population Proportions, p-values & Confidence Intervals, and Type I & II Errors

Day23 Statistics Review (2)

June 18, 2024 2 minute read

Law of Large Numbers, Central Limit Theorem, and Hypothesis Testing (1) - General Setup

Day22 Statistics Review (1)

June 17, 2024 2 minute read

Properties of Random Variable

Back to top ↑

basicDataStructure

Data Structure/ Ch03:BasicDataStructure(3)

August 7, 2023

Basic Data Structure - 3. Deque

Data Structure/ Ch03:BasicDataStructure(2)

August 5, 2023

Basic Data Structure - 2. Queue

Data Structure/ Ch03:BasicDataStructure(1)

August 4, 2023

Basic Data Structure - 1. Stacks

Data Structure/ Ch02:AlgorithmAnalysis(2)

August 2, 2023

TIL :CH02. Algorithm Anlaysis (2)

Data Structure/ Ch02:AlgorithmAnalysis

August 1, 2023

TIL: CH02. Algorithm Anlaysis

Data Structure/ Ch01:BasicPython

July 27, 2023

Python Basic Review

Back to top ↑

crossValidation

Day61 ML Review - Class Imbalances

August 25, 2024 3 minute read

Use Other Metrics, Assign Different Class Weights, or Upsample the Minority Class

Day60 ML Review - Cross Validation (5)

August 23, 2024 3 minute read

ROC area Under The Curve (ROC AUC)

Day59 ML Review - Cross Validation (4)

August 22, 2024 4 minute read

Confusion Matrix and F1 score

Day58 ML Review - Cross Validation (3)

August 21, 2024 3 minute read

Grid Search for Fine-Tuning Machine Learning Models

Day57 ML Review - Cross Validation (2)

August 20, 2024 7 minute read

Bias & Variance, and Learning & Validation Curves

Day56 ML Review - Cross Validation (1)

August 19, 2024 4 minute read

Model Selection and K-Fold Cross Validation

Back to top ↑

DLReview

Day209 - Leetcode: Python 20 & MLOps Review: ML Engineering (1)

September 17, 2025 3 minute read

Python 20: Valid Parentheses & ML Engineering: High-level ML System Design

Day208 - Leetcode: Python 121 & SQL 175,176 & DL Review

September 16, 2025 4 minute read

Python 121: Best Time to Buy and Sell Stock / SQL 175,176: Second-highest Salary / DL Review: Transformers, Self-Attention Mechanism & Positional Encoding

Day207 - Leetcode: Python 53 & SQL 185 & DL Review

September 15, 2025 5 minute read

Python 53: Maximum Subarray / SQL 185: Department Top Three Sales / DL Review: RNNs, LSTM Networks & Gradient Vanishing & Exploding

Day206 - Leetcode: Python 217 & SQL 175,176 & DL Review

September 13, 2025 3 minute read

Python 217: Contains Duplicate / SQL 175,176: Second-highest Salary / DL Review: Transfer Learning & Fine-Tuning & CNNs

Day205 - Leetcode: Python 175 & SQL Inner Join & DL Review

September 12, 2025 6 minute read

Python 175: TwoSums / SQL: Inner Join Revisiting / DL Review: Embedding Layers, Autoencoders & Knowledge Distillation

Day204 - DL Review: Revisiting Optimizers, CNNs & Data Drifts

September 9, 2025 5 minute read

Optimizers in Neural Networks, Parameter Sharing in CNNs, and Data & Concept Drifts

Back to top ↑

dataScience

(editing) Data Science Interview Prep - Statistics

August 9, 2024 2 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - SQL

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - ML Basic

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - LLM/AI

August 9, 2024 3 minute read

Interview Questions & Answers

Statistics & Probability Lecture Review

August 1, 2024 9 minute read

Introduction of Statistics (2023) Whole Lecture Review

Back to top ↑

interviewPrep

(editing) Data Science Interview Prep - Statistics

August 9, 2024 2 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - SQL

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - ML Basic

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - LLM/AI

August 9, 2024 3 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - DL

August 9, 2024 less than 1 minute read

Interview Questions & Answers

Back to top ↑

interviewGoogle

(editing) Data Science Interview Prep - Statistics

August 9, 2024 2 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - SQL

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - ML Basic

August 9, 2024 1 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - LLM/AI

August 9, 2024 3 minute read

Interview Questions & Answers

(editing) Data Science Interview Prep - DL

August 9, 2024 less than 1 minute read

Interview Questions & Answers

Back to top ↑

dimensionalityReduction

Day54 ML Review - Dimensionality Reduction (5)

August 17, 2024 3 minute read

Applying Kernal Principal Component Analysis(KPCA) to New Data Points

Day53 ML Review - Dimensionality Reduction (4)

August 15, 2024 2 minute read

Implementing a Kernal Principal Component Analysis(KPCA) in Python

Day52 ML Review - Dimensionality Reduction (3)

August 14, 2024 5 minute read

Nonlinear Mappings with Kernel Principal Component Analysis

Day51 ML Review - Dimensionality Reduction (2)

August 13, 2024 5 minute read

Compressing Data via Linear Discriminant Analysis

Day50 ML Review - Dimensionality Reduction (1)

August 12, 2024 5 minute read

Compressing Data via Dimensionality Reduction and Summary of PCA

Back to top ↑

NLP

Day88 Deep Learning Lecture Review - Lecture 10-12

October 10, 2024 12 minute read

Deep Learning & Numerical Precision(Floating Point), Hardware Considerations, and Distributed Model Training

Day87 Deep Learning Lecture Review - Lecture 8 (2) & 9

October 9, 2024 6 minute read

LLMs - Speeding Up LLMs (Grouped Query Attention, KV Caches, MoE, and DPO)

Day86 Deep Learning Lecture Review - Lecture 8 (1)

October 8, 2024 8 minute read

LLMs- Generating Texts, Positional Encoding, and Fine-Tuning LLMs (LoRA)

Day85 Deep Learning Lecture Review - Lecture 7

October 7, 2024 6 minute read

LLMs - Perplexity, Tokenizers, Data Cleaning, and Embedding Layer

Day74 Deep Learning Lecture Review - Lecture 6

September 12, 2024 6 minute read

Large Language Model - BERT, GPT, and GPT-2, 3 & 4

Back to top ↑

LLM

Day88 Deep Learning Lecture Review - Lecture 10-12

October 10, 2024 12 minute read

Deep Learning & Numerical Precision(Floating Point), Hardware Considerations, and Distributed Model Training

Day87 Deep Learning Lecture Review - Lecture 8 (2) & 9

October 9, 2024 6 minute read

LLMs - Speeding Up LLMs (Grouped Query Attention, KV Caches, MoE, and DPO)

Day86 Deep Learning Lecture Review - Lecture 8 (1)

October 8, 2024 8 minute read

LLMs- Generating Texts, Positional Encoding, and Fine-Tuning LLMs (LoRA)

Day85 Deep Learning Lecture Review - Lecture 7

October 7, 2024 6 minute read

LLMs - Perplexity, Tokenizers, Data Cleaning, and Embedding Layer

Day74 Deep Learning Lecture Review - Lecture 6

September 12, 2024 6 minute read

Large Language Model - BERT, GPT, and GPT-2, 3 & 4

Back to top ↑

basicDatamining

DataMiningPratice(4)-AggregatingData

August 13, 2023

Aggregating data with groupby

DataMiningPratice(3)-RefiningUnnecessaryColumns

August 12, 2023

Adding new columns and refining unnecessary columns.

DataMiningPratice(2)-RefiningDatasets

August 11, 2023

Change data type

DataMiningPratice(1)-BasicImportofDatasets

August 9, 2023

Utilize public data to import two completely different data, pre-process them, and merge them

Back to top ↑

probability

Statistics & Probability Lecture Review

August 1, 2024 9 minute read

Introduction of Statistics (2023) Whole Lecture Review

Day21 Probability Review (3)

June 14, 2024 2 minute read

Continuous Probability Distribution and Markov Chains

Day20 Probability Review (2)

June 11, 2024 2 minute read

Joint, Marginal, & Conditional Probability Distributions, and Discrete & Poisson Distributions

Day19 Probability Review (1)

June 10, 2024 1 minute read

Basic Probability - Counting and Random Variables

Back to top ↑

logisticRegression

Day49 ML Review - Data Preprocessing (3)

August 8, 2024 4 minute read

Partitioning a Dataset into Training & Test Datasets, Feature Scaling, and Feature Selection

Day31 ML Review - Logistic Regression (3)

July 22, 2024 2 minute read

How To Train in Scikit-Learn, and Regularization with LR model

Day30 ML Review - Logistic Regression (2)

July 20, 2024 1 minute read

Cost Function of Logistic Regression

Day29 ML Review - Logistic Regression (1)

July 19, 2024 3 minute read

Basic Concepts and Sigmoid Function

Back to top ↑

decisionTree

Day38 ML Review - Random Forest (2)

July 31, 2024 2 minute read

Implementation Step by Step

Day37 ML Review - Decision Tree (3) & Random Forest (1)

July 30, 2024 3 minute read

Building a Decision Tree & Random Forest (1) - Key Concepts & How it Works

Day36 ML Review - Decision Tree (2)

July 29, 2024 2 minute read

Information Gain (2) - Entropy & Classification Error

Day35 ML Review - Decision Tree (1)

July 28, 2024 4 minute read

Components, How it Works & Maximizing Information Gain (1) - Gini Impurity

Back to top ↑

ensembleMethod

Day65 ML Review - Ensemble Method (4)

August 29, 2024 5 minute read

Bagging & Boosting : Basic Concepts & Code Implementation

Day64 ML Review - Ensemble Method (3)

August 28, 2024 5 minute read

Using the Majority Voting Principle to Make Predictions, and Evaluating & Tuning the Ensemble Classifier

Day63 ML Review - Ensemble Method (2)

August 27, 2024 7 minute read

Code Structure of Combining Classifiers via Majority Vote

Day62 ML Review - Ensemble Method (1)

August 26, 2024 3 minute read

Key Concepts and Mathematics Explanation

Back to top ↑

fine-tuning

Day95 Deep Learning Lecture Review - Lecture 14

October 31, 2024 6 minute read

AI Ethics; AI Safety, Key Issues, AGI (Artificial General Intelligence), and Current AI Models’ Challenges

Day94 Deep Learning Lecture Review - Lecture 13

October 30, 2024 10 minute read

Llama 3: Framework, Workflow (RMSNorm, Grouped Query Attention, RoPE, SwiGLU Attention), Pre-training & Post-training

Day93 Deep Learning Lecture Review - Fine-Tuning Models (2) & Prompt Engineering

October 23, 2024 7 minute read

Comparing Pre-trained model embeddings (ResNet+SBERT vs. CLIP) and Prompt Engineering (Short and Direct, Few-Shot Learning, & Expert Prompting)

Day92 Deep Learning Lecture Review - Fine-Tuning Models (1)

October 17, 2024 8 minute read

HW2: Understanding of LoRA and Pre-trained Model Embeddings (ResNet+SBERT) for Visual Question Answering (VQA)

Back to top ↑

Notice

A New Chapter:: Continuing onward to MLOps

April 22, 2025 1 minute read

Designing Machine Learning Systems (MLOPs) Review Begins!

Continuing the TIL Project in 2025

January 15, 2025 2 minute read

The Ongoing Chronicles of TIL25 — A Motivating Expedition as a Data Scientist & AI/ML Engineer Candidate

The Start of TIL 24

April 30, 2024

The Start of Recording TIL Summer ‘24 - During Summer in Rochester as a Data Scientist Candidate

Back to top ↑

PCA

Day18 ML Review - Principle Component Analysis (3)

June 8, 2024 2 minute read

Applying PCA in Machine Learning and Scree Plot

Day06 ML Review - Principle Component Analysis (2)

May 21, 2024 3 minute read

Applications on Machine Learning & Further Explanations

Day05 ML Review - Principle Component Analysis (1)

May 20, 2024 5 minute read

Mathematical Definition & Algorithms

Back to top ↑

supportVectorMachine

Day34 ML Review - Support Vector Machine (3)

July 25, 2024 2 minute read

Solving Nonlinear Problems - Using a Kernal SVM

Day33 ML Review - Support Vector Machine (2)

July 24, 2024 2 minute read

SVM: Nonlinear Separable Case

Day32 ML Review - Support Vector Machine (1)

July 23, 2024 2 minute read

Basic Concepts and Mathematical Formulations

Back to top ↑

kNearestNeighbors

Day46 ML Review - K-Nearest Neighbors (3)

August 5, 2024 3 minute read

The Curse of Dimensionality

Day45 ML Review - K-Nearest Neighbors (2)

August 2, 2024 3 minute read

Distance Metrics- Euclidean, Manhattan, Minkowski & Chebyshev Distance, and Cosine Similarity

Day44 ML Review - K-Nearest Neighbors (1)

August 1, 2024 3 minute read

Basic Concepts, How It Works, and Parametric & Non-Parametric Model

Back to top ↑

neural network

Day75-84 Introduction to Natural Language Processing using Deep Learning

October 6, 2024 2 minute read

Basic Machine Learning & Deep Learning, Word Embedding, CNNs, RNNs, LSTM and Transformer

Day69 DL Review - Convolutional Neural Networks (CNNs)

September 6, 2024 8 minute read

Basic Concepts and the Detailed Architecture

Day67 Deep Learning Lecture Review - Lecture 2-3

September 4, 2024 7 minute read

Types of Learning and Neural Net Zoo: Fully Connected Networks (MLPs), Inductive Bias, and Convolutional Neural Networks (CNNs)

Back to top ↑

transformer

Day73 Deep Learning Lecture Review - Lecture 5

September 11, 2024 10 minute read

Transformers and Foundation Models: GELU, Layer Norm, Key Concepts & Workflow

Day71 DL Review - Natural Language Processing (NLP)

September 8, 2024 5 minute read

Primary Goals, Common Tasks, and Deep Learning NLP

Day70 DL Review - Transformer

September 7, 2024 8 minute read

Transformer Architecture, How the Models Are Different, and Q,K,V in Self-Attention

Back to top ↑

CLIP

Day95 Deep Learning Lecture Review - Lecture 14

October 31, 2024 6 minute read

AI Ethics; AI Safety, Key Issues, AGI (Artificial General Intelligence), and Current AI Models’ Challenges

Day94 Deep Learning Lecture Review - Lecture 13

October 30, 2024 10 minute read

Llama 3: Framework, Workflow (RMSNorm, Grouped Query Attention, RoPE, SwiGLU Attention), Pre-training & Post-training

Day93 Deep Learning Lecture Review - Fine-Tuning Models (2) & Prompt Engineering

October 23, 2024 7 minute read

Comparing Pre-trained model embeddings (ResNet+SBERT vs. CLIP) and Prompt Engineering (Short and Direct, Few-Shot Learning, & Expert Prompting)

Back to top ↑

ResNet

Day95 Deep Learning Lecture Review - Lecture 14

October 31, 2024 6 minute read

AI Ethics; AI Safety, Key Issues, AGI (Artificial General Intelligence), and Current AI Models’ Challenges

Day94 Deep Learning Lecture Review - Lecture 13

October 30, 2024 10 minute read

Llama 3: Framework, Workflow (RMSNorm, Grouped Query Attention, RoPE, SwiGLU Attention), Pre-training & Post-training

Day93 Deep Learning Lecture Review - Fine-Tuning Models (2) & Prompt Engineering

October 23, 2024 7 minute read

Comparing Pre-trained model embeddings (ResNet+SBERT vs. CLIP) and Prompt Engineering (Short and Direct, Few-Shot Learning, & Expert Prompting)

Back to top ↑

SBERT

Day95 Deep Learning Lecture Review - Lecture 14

October 31, 2024 6 minute read

AI Ethics; AI Safety, Key Issues, AGI (Artificial General Intelligence), and Current AI Models’ Challenges

Day94 Deep Learning Lecture Review - Lecture 13

October 30, 2024 10 minute read

Llama 3: Framework, Workflow (RMSNorm, Grouped Query Attention, RoPE, SwiGLU Attention), Pre-training & Post-training

Day93 Deep Learning Lecture Review - Fine-Tuning Models (2) & Prompt Engineering

October 23, 2024 7 minute read

Comparing Pre-trained model embeddings (ResNet+SBERT vs. CLIP) and Prompt Engineering (Short and Direct, Few-Shot Learning, & Expert Prompting)

Back to top ↑

prompt engineering

Day95 Deep Learning Lecture Review - Lecture 14

October 31, 2024 6 minute read

AI Ethics; AI Safety, Key Issues, AGI (Artificial General Intelligence), and Current AI Models’ Challenges

Day94 Deep Learning Lecture Review - Lecture 13

October 30, 2024 10 minute read

Llama 3: Framework, Workflow (RMSNorm, Grouped Query Attention, RoPE, SwiGLU Attention), Pre-training & Post-training

Day93 Deep Learning Lecture Review - Fine-Tuning Models (2) & Prompt Engineering

October 23, 2024 7 minute read

Comparing Pre-trained model embeddings (ResNet+SBERT vs. CLIP) and Prompt Engineering (Short and Direct, Few-Shot Learning, & Expert Prompting)

Back to top ↑

randomForest

Day38 ML Review - Random Forest (2)

July 31, 2024 2 minute read

Implementation Step by Step

Day37 ML Review - Decision Tree (3) & Random Forest (1)

July 30, 2024 3 minute read

Building a Decision Tree & Random Forest (1) - Key Concepts & How it Works

Back to top ↑

dataPreprocessing

Day48 ML Review - Data Preprocessing (2)

August 7, 2024 4 minute read

Handling Categorical Data - Converting, Ordinal Encoding, and One-Hot Encoding

Day47 ML Review - Data Preprocessing (1)

August 6, 2024 3 minute read

Handling Missing Data - Eliminating and Imputing & Estimators API

Back to top ↑

DL Review

Day67 Deep Learning Lecture Review - Lecture 2-3

September 4, 2024 7 minute read

Types of Learning and Neural Net Zoo: Fully Connected Networks (MLPs), Inductive Bias, and Convolutional Neural Networks (CNNs)

Day66 Deep Learning Lecture Review - Lecture 1

September 3, 2024 6 minute read

Basic Mathematics, Supervised ML, and Review of Multi-Layered Perceptron

Back to top ↑

DL review

Day69 DL Review - Convolutional Neural Networks (CNNs)

September 6, 2024 8 minute read

Basic Concepts and the Detailed Architecture

Day68 Deep Learning Lecture Review - Lecture 3

September 5, 2024 7 minute read

Neural Net Zoo: Transformers, Recurrent Neural Networks (RNNs) and Graph Neural Networks (GNNs)

Back to top ↑

CNN

Day75-84 Introduction to Natural Language Processing using Deep Learning

October 6, 2024 2 minute read

Basic Machine Learning & Deep Learning, Word Embedding, CNNs, RNNs, LSTM and Transformer

Day69 DL Review - Convolutional Neural Networks (CNNs)

September 6, 2024 8 minute read

Basic Concepts and the Detailed Architecture

Back to top ↑

selfAttention

Day73 Deep Learning Lecture Review - Lecture 5

September 11, 2024 10 minute read

Transformers and Foundation Models: GELU, Layer Norm, Key Concepts & Workflow

Day71 DL Review - Natural Language Processing (NLP)

September 8, 2024 5 minute read

Primary Goals, Common Tasks, and Deep Learning NLP

Back to top ↑

statstics

Day24 Statistics Review (3)

June 19, 2024 3 minute read

Test Statistics (Z-Test, t-Test, and Chi-Squared Test)

Back to top ↑

mathematicsReview

Day39-43 Linear Algebra & Matrix Review (Korean)

July 31, 2024 less than 1 minute read

Linear Algebra & Matrix for Programmers

Back to top ↑

matrix

Day39-43 Linear Algebra & Matrix Review (Korean)

July 31, 2024 less than 1 minute read

Linear Algebra & Matrix for Programmers

Back to top ↑

linearAlgebra

Day39-43 Linear Algebra & Matrix Review (Korean)

July 31, 2024 less than 1 minute read

Linear Algebra & Matrix for Programmers

Back to top ↑

lectureReview

Statistics & Probability Lecture Review

August 1, 2024 9 minute read

Introduction of Statistics (2023) Whole Lecture Review

Back to top ↑

data science

(editing) Data Science Interview Prep - DL

August 9, 2024 less than 1 minute read

Interview Questions & Answers

Back to top ↑

pipeline

Day55 ML Review - Pipeline

August 18, 2024 2 minute read

Key Concepts and Example Code with Scikit-learn

Back to top ↑

kFold

Day56 ML Review - Cross Validation (1)

August 19, 2024 4 minute read

Model Selection and K-Fold Cross Validation

Back to top ↑

Mathematic Review

Day66 Deep Learning Lecture Review - Lecture 1

September 3, 2024 6 minute read

Basic Mathematics, Supervised ML, and Review of Multi-Layered Perceptron

Back to top ↑

Matrix

Day66 Deep Learning Lecture Review - Lecture 1

September 3, 2024 6 minute read

Basic Mathematics, Supervised ML, and Review of Multi-Layered Perceptron

Back to top ↑

Linear Algebra

Day66 Deep Learning Lecture Review - Lecture 1

September 3, 2024 6 minute read

Basic Mathematics, Supervised ML, and Review of Multi-Layered Perceptron

Back to top ↑

transformers

Day68 Deep Learning Lecture Review - Lecture 3

September 5, 2024 7 minute read

Neural Net Zoo: Transformers, Recurrent Neural Networks (RNNs) and Graph Neural Networks (GNNs)

Back to top ↑

neuralNetwork

Day70 DL Review - Transformer

September 7, 2024 8 minute read

Transformer Architecture, How the Models Are Different, and Q,K,V in Self-Attention

Back to top ↑

foundationModels

Day70 DL Review - Transformer

September 7, 2024 8 minute read

Transformer Architecture, How the Models Are Different, and Q,K,V in Self-Attention

Back to top ↑

linear algebra

Day72 Deep Learning Lecture Review - Lecture 4

September 10, 2024 6 minute read

Brief Explanation of Basic Algebra and Machine Learning

Back to top ↑

basic machine learning

Day72 Deep Learning Lecture Review - Lecture 4

September 10, 2024 6 minute read

Brief Explanation of Basic Algebra and Machine Learning

Back to top ↑

BERT

Day74 Deep Learning Lecture Review - Lecture 6

September 12, 2024 6 minute read

Large Language Model - BERT, GPT, and GPT-2, 3 & 4

Back to top ↑

GPT

Day74 Deep Learning Lecture Review - Lecture 6

September 12, 2024 6 minute read

Large Language Model - BERT, GPT, and GPT-2, 3 & 4

Back to top ↑

random forest

Day89 ML Review - Ensemble Method (5)

October 11, 2024 7 minute read

Revisiting Ensemble Method, Random Forest, and XGBoost

Back to top ↑

ensemble method

Day89 ML Review - Ensemble Method (5)

October 11, 2024 7 minute read

Revisiting Ensemble Method, Random Forest, and XGBoost

Back to top ↑

pytorch

Day90 Deep Learning Lecture Review - Background Knowledges

October 12, 2024 7 minute read

HW0: Softmax Properties, PyTorch Lightning, and DataLoader

Back to top ↑

activation function

Day90 Deep Learning Lecture Review - Background Knowledges

October 12, 2024 7 minute read

HW0: Softmax Properties, PyTorch Lightning, and DataLoader

Back to top ↑

data loader

Day90 Deep Learning Lecture Review - Background Knowledges

October 12, 2024 7 minute read

HW0: Softmax Properties, PyTorch Lightning, and DataLoader

Back to top ↑

pytorch lightning

Day90 Deep Learning Lecture Review - Background Knowledges

October 12, 2024 7 minute read

HW0: Softmax Properties, PyTorch Lightning, and DataLoader

Back to top ↑

hyperparameter tuning

Day91 Deep Learning Lecture Review - Optimizing Hyperparameters

October 14, 2024 7 minute read

Weights & Biases (W&B) for Monitoring and Fine-Tuning ResNet-18 and Post-Training Evaluations (Dying ReLU, Brightness Robustness)

Back to top ↑

mlops

Day91 Deep Learning Lecture Review - Optimizing Hyperparameters

October 14, 2024 7 minute read

Weights & Biases (W&B) for Monitoring and Fine-Tuning ResNet-18 and Post-Training Evaluations (Dying ReLU, Brightness Robustness)

Back to top ↑

w&b

Day91 Deep Learning Lecture Review - Optimizing Hyperparameters

October 14, 2024 7 minute read

Weights & Biases (W&B) for Monitoring and Fine-Tuning ResNet-18 and Post-Training Evaluations (Dying ReLU, Brightness Robustness)

Back to top ↑

LoRA

Day92 Deep Learning Lecture Review - Fine-Tuning Models (1)

October 17, 2024 8 minute read

HW2: Understanding of LoRA and Pre-trained Model Embeddings (ResNet+SBERT) for Visual Question Answering (VQA)

Back to top ↑

model comparison

Day96 Deep Learning Lecture Review - Lecture 15 (1)

November 1, 2024 8 minute read

Model Comparison and Bias Mitigation; McNemar’s Test, Dataset Bias, and Bias Detection

Back to top ↑

bias mitigation

Day96 Deep Learning Lecture Review - Lecture 15 (1)

November 1, 2024 8 minute read

Model Comparison and Bias Mitigation; McNemar’s Test, Dataset Bias, and Bias Detection

Back to top ↑

dataset bias

Day96 Deep Learning Lecture Review - Lecture 15 (1)

November 1, 2024 8 minute read

Model Comparison and Bias Mitigation; McNemar’s Test, Dataset Bias, and Bias Detection

Back to top ↑

algorithm bias

Day96 Deep Learning Lecture Review - Lecture 15 (1)

November 1, 2024 8 minute read

Model Comparison and Bias Mitigation; McNemar’s Test, Dataset Bias, and Bias Detection

Back to top ↑

SQL

Day178-190 - SQL Review: SQL Fundamentals for Data Analysis (Course Completed)

July 11, 2025 2 minute read

SQL Mastery in 3 Weeks — From Fundamentals to Analytic SQL

Back to top ↑

DataAnalytics

Day178-190 - SQL Review: SQL Fundamentals for Data Analysis (Course Completed)

July 11, 2025 2 minute read

SQL Mastery in 3 Weeks — From Fundamentals to Analytic SQL

Back to top ↑

Docker

Day191-198 Hands-On Docker: From Basics to Real-World Deployment (Course Completed)

August 11, 2025 2 minute read

Docker Review — From CLI Fundamentals to Multi-Container Orchestration

Back to top ↑

MlOps

Day191-198 Hands-On Docker: From Basics to Real-World Deployment (Course Completed)

August 11, 2025 2 minute read

Docker Review — From CLI Fundamentals to Multi-Container Orchestration

Back to top ↑