Introduction to Data Preprocessing
Preparing Data for Analysis
Data Characteristics, Summary and Distribution
Data Preprocessing in the Real World
Welcome to Data Preprocessing
Welcome data scientist! You are now ready to learn all about Python for Data Preprocessing. Data preprocessing is all about getting an overall feel of the look and usability of your dataset.
This workshop will provide you with the foundation you need to become a data scientist. After completing this workshop, you will be able to use Python to perform preprocessing on data. Specifically, you’ll learn: how to handle missing data, how to manage bad data entries, how to estimate data characteristics and summary statistics, and how to identify outliers and estimate data distribution.
Python for Data Science and Python for Data Visualizations workshops are prerequisites for this workshop. If you don’t have any prior Python programming knowledge, it is highly recommended you complete those workshops first.