Monday 2-8 pm Berlin time: Course introduction, setup and introduction to Pandas
First we will get set up with Jupyter notebooks and go over how to configure the environment for data analysis and visualization, as well as how to import the Pandas and Matplotlib/Seaborn libraries.
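As a rough idea of what such a setup cell might look like, here is a minimal sketch. The specific display options and figure size are illustrative assumptions rather than the course's actual configuration, and Seaborn is treated as optional in case it is not installed.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Seaborn is optional in this sketch; fall back gracefully if it is missing.
try:
    import seaborn as sns
    sns.set_theme(style="whitegrid")  # apply a default Seaborn look to all plots
except ImportError:
    sns = None

# Illustrative display settings (assumed values, adjust to taste).
pd.set_option("display.max_columns", 50)  # show up to 50 columns when printing
pd.set_option("display.precision", 2)     # show floats with 2 decimal places

# Default figure size for Matplotlib plots, in inches.
plt.rcParams["figure.figsize"] = (8, 4)
```

In a Jupyter notebook this would typically go in the first cell, so that every later cell inherits the same settings.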
We will talk about the datasets that we will be looking at in this course, and how to read the data into Pandas. Then we will dive into the basics of Pandas and talk about DataFrame and Series objects. DataFrames are the core data structure in Pandas, and are similar to spreadsheets with rows and columns.
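To make the DataFrame and Series distinction concrete, the sketch below reads a tiny CSV into Pandas. The data is made up for illustration and is not one of the course datasets; it is read from an in-memory string so the example runs without any file on disk.

```python
import io
import pandas as pd

# Made-up CSV content standing in for a real data file.
csv_text = """item,category,price
notebook,stationery,2.50
pen,stationery,1.20
mug,kitchen,6.00
"""

# read_csv accepts a file path or any file-like object.
df = pd.read_csv(io.StringIO(csv_text))

# A DataFrame is a table with labelled rows and columns.
print(type(df).__name__, df.shape)

# Selecting a single column returns a Series.
prices = df["price"]
print(type(prices).__name__, prices.dtype)
```

With a real file, the `io.StringIO(...)` argument would simply be replaced by the path to the CSV.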
We will read in the course data and learn how to quickly summarize it, and how to perform basic data cleaning such as handling duplicates, missing values, null types and outliers. We will go over working with dates in Pandas, as well as string manipulation. As part of this data exploration, we will look at how to quickly and easily generate some basic plots straight from Pandas.
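The steps listed above could be sketched roughly as follows. The data is invented for illustration, and the specific choices (filling missing values with the median, flagging outliers with the 1.5 × IQR rule) are assumptions picked as common defaults, not necessarily the techniques used in the course.

```python
import numpy as np
import pandas as pd

# Invented data with a duplicate row, a missing value and an extreme value.
raw = pd.DataFrame({
    "name": [" alice ", "Bob", "Bob", "carol", "dave"],
    "signup": ["2023-01-05", "2023-02-10", "2023-02-10", "2023-03-15", "2023-04-20"],
    "spend": [20.0, 25.0, 25.0, np.nan, 900.0],
})

# Quick summaries: numeric statistics and missing-value counts per column.
print(raw.describe())
print(raw.isna().sum())

# Duplicates: drop rows that repeat an earlier row exactly.
clean = raw.drop_duplicates().copy()

# Missing values: fill with the column median (one option among several).
clean["spend"] = clean["spend"].fillna(clean["spend"].median())

# Dates: parse strings into datetimes, then use the .dt accessor.
clean["signup"] = pd.to_datetime(clean["signup"])
clean["signup_month"] = clean["signup"].dt.month

# Strings: the .str accessor applies string methods to the whole column.
clean["name"] = clean["name"].str.strip().str.title()

# Outliers: flag values more than 1.5 * IQR outside the quartiles.
q1, q3 = clean["spend"].quantile([0.25, 0.75])
iqr = q3 - q1
clean["is_outlier"] = (clean["spend"] < q1 - 1.5 * iqr) | (clean["spend"] > q3 + 1.5 * iqr)

# Plotting straight from Pandas (uses Matplotlib under the hood).
ax = clean.plot(x="name", y="spend", kind="bar", legend=False, title="Spend per customer")
```

Whether an outlier should be dropped, capped or kept depends on the dataset; the sketch only flags it.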