Introduction to Pandas

“What matters in life is not what happens to you but what you remember and how you remember it.”
Gabriel García Márquez

The Pandas library provides the DataFrame, a fundamental data structure for storing and manipulating datasets in Python. The name Pandas is derived from the term "panel data". NumPy and Pandas share much of their syntax and functionality. While a NumPy array typically holds homogeneous data for fast numeric computation, a Pandas DataFrame can hold heterogeneous, tabular data, with a different data type in each column.
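The contrast between homogeneous arrays and heterogeneous DataFrames can be illustrated with a minimal sketch. The column names and values below are hypothetical sample data chosen only for illustration; the exact dtype labels printed may vary with the installed NumPy and Pandas versions.

```python
import numpy as np
import pandas as pd

# A NumPy array holds a single dtype. Mixing numbers and a string
# coerces every element to a Unicode string type.
arr = np.array([1, 2.5, "three"])
print(arr.dtype)

# A DataFrame stores a separate dtype for each column, so text,
# integers, and floats can sit side by side in one table.
df = pd.DataFrame(
    {
        "name": ["Ana", "Ben", "Chen"],   # hypothetical sample values
        "age": [31, 45, 27],
        "height_m": [1.62, 1.80, 1.75],
    }
)
print(df.dtypes)
```

Here the array ends up with one string dtype for all elements, while the DataFrame reports an integer dtype for `age` and a floating-point dtype for `height_m` alongside the text column.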

Chapter Objectives

In this chapter, we will learn the following:

  1. Fundamental data structures: Series, DataFrame, and Index

  2. Manipulating indices and columns in datasets

  3. Basic descriptive statistics using Pandas

  4. Acquisition of data in different formats and from different sources