03 Jan 2024

Exploratory Data Analysis

Exploratory Data Analysis (EDA) is the initial phase of data analysis in which you explore and summarize the main characteristics, patterns, and relationships present in a dataset. The goal of EDA is to gain a better understanding of the data, uncover potential insights, and identify any data quality issues or anomalies before proceeding with further analysis or modeling.

EDA Checklist

  1. What questions are you trying to solve or prove wrong?
  2. What kind of data are you dealing with and how do you treat the different types?
  3. What is missing from the data and how do you deal with it?
  4. What are the outliers and what should we do about them?
  5. How can you add, change or remove features to get more out of your data?