Outlier detection and removal: z score, standard deviation | Feature engineering tutorial python # 3
If we have a dataset that follows normal distribution than we can use 3 or more standard deviation to spot outliers in the dataset. Many times these are legitimate values and it really depends on the situation if you want to remove them or not. But removing outliers can significantly increase the statistical power of machine learning model hence it is recommended that you treat outliers before building a model. Z score indicates how many standard deviation away a given sample is. We are going to go through all this theory and write python code to remove outliers from heights dataset that I have taken it from kaggle. Link for kaggle dataset: https://www.kaggle.com/mustafaali96/w... Code & Exercise: https://github.com/codebasics/py/blob... CSV file for exercise: https://github.com/codebasics/py/tree... Topics 00:00 Introduction 00:20 Exploratory analysis on a kaggle dataset 01:14 Plot histogram and bell curve 06:30 Use 3 standard deviation to remove outliers 12:14 Use Z score to remove outliers 17:39 Exercise Do you want to learn technology from me? Check https://resources.codebasics.io/611vqk for my affordable video courses. Website: https://resources.codebasics.io/611vqk Facebook: / codebasicshub Twitter: / codebasicshub

Outlier detection and removal using IQR | Feature engineering tutorial python # 4

Normal Distribution and Z Score | Math, Statistics for data science, machine learning

Machine Learning Tutorial Python 12 - K Fold Cross Validation

Data Science & Machine Learning Project - Part 4 Feature Engineering | Image Classification

Frankreich – Senegal Highlights | Gruppe I, FIFA WM 2026 | sportstudio

Machine Learning Tutorial Python - 19: Principal Component Analysis (PCA) with Python Code

Irak – Norwegen Highlights | Gruppe I, FIFA WM 2026 | sportstudio

Median, Mean, Mode, Percentile | Math, Statistics for data science, machine learning

The StatQuest Introduction to PyTorch

Machine Learning Tutorial Python - 6: Dummy Variables & One Hot Encoding

How to Detect and Remove Outliers in the Data | Python

Top 40 Cypress Interview Questions & Answers 2025 | Cypress Interview Questions | FAQs | MindMajix

Something is jamming GPS over Europe. Here's what we found

Loss or Cost Function | Deep Learning Tutorial 11 (Tensorflow Tutorial, Keras & Python)

Is the AI Boom About to COLLAPSE?

Database Sharding and Partitioning

The A to Z of dealing with Outliers | Data Preprocessing | Data Science

Machine Learning Tutorial Python - 13: K Means Clustering Algorithm

Building and training ML models with Vertex AI

