Python for Data Preprocessing

  • Introduction to Data Preprocessing
  • Preparing Data for Analysis
  • Data Characteristics, Summary and Distribution
  • Data Preprocessing in the Real World

Welcome to Data Preprocessing

Welcome data scientist! You are now ready to learn all about Python for Data Preprocessing. Data preprocessing is all about getting an overall feel of the look and usability of your dataset.

This workshop will provide you with the foundation you need to become a data scientist. After completing this workshop, you will be able to use Python to perform preprocessing on data. Specifically, you’ll learn:  how to handle missing data, how to manage bad data entries, how to estimate data characteristics and summary statistics, and how to identify outliers and estimate data distribution.

Python for Data Science and Python for Data Visualizations workshops are prerequisites for this workshop. If you don’t have any prior Python programming knowledge, it is highly recommended you complete those workshops first.