Pandas

This section is currently unfinished and will be updated further!

Pandas is a Python library that makes it easier to work with data.

It helps programmers to store, clean, sort and analyse data - especially data that looks like a table (rows and columns), similar to a spreadsheet in Excel.

Example 1 - Reading a CSV File

Suppose there is a file named products_complete.csv that contains the following data:

ProductID,ProductName,Category,Price,Stock
1,Laptop,Electronics,750,10
2,Mouse,Accessories,25,50
3,Keyboard,Accessories,40,25
4,Monitor,Electronics,150,15
5,Printer,Electronics,120,20

We can use Pandas to load this file and display its contents using the following code:

import pandas as pd

data = pd.read_csv("products_complete.csv")

print(data)

Example 2 - Calculating the Average Price

import pandas as pd

data = pd.read_csv("products_complete.csv")

averagePrice = data["Price"].mean()

print(averagePrice)

Example 3 - Finding the Highest Price

import pandas as pd

data = pd.read_csv("products_complete.csv")

maxPrice = data["Price"].max()

print(maxPrice)

Example 4 - Finding the Lowest Price

import pandas as pd

data = pd.read_csv("products_complete.csv")

minPrice = data["Price"].min()

print(minPrice)

Example 5 - Finding the Most and Least Expensive Products

import pandas as pd

data = pd.read_csv("products_complete.csv")

mostExpensive = data.loc[data["Price"].idxmax(), "ProductName"]
leastExpensive = data.loc[data["Price"].idxmin(), "ProductName"]

print(f"Most Expensive Product: {mostExpensive}")
print(f"Least Expensive Product: {leastExpensive}")