NumPy Intro

Posted under » Python Data Analysis on 14 June 2023

NumPy, which stands for Numerical Python, is a library consisting of multidimensional array objects and a collection of routines for processing those arrays. The array is the central data structure of the NumPy library. It is a grid of values, and it carries information about the raw data, how to locate an element, and how to interpret an element.

When you install pandas, it will install NumPy too, because pandas depends on it. I see pandas as something like Excel, and NumPy as more like matrices of multidimensional data.

The most important object defined in NumPy is an N-dimensional array type called ndarray.

It has a grid of elements that can be indexed in various ways. The elements are all of the same type, referred to as the array dtype.
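As a small sketch of the dtype idea (the variable names here are just for illustration), you can check the dtype of an array with its dtype attribute. Note that the exact integer type can depend on your platform.

```python
import numpy as np

ints = np.array([1, 2, 3])
floats = np.array([1.0, 2.5, 3.0])
# Mixing integers and floats still gives ONE dtype:
# NumPy upcasts the integers to floats.
mixed = np.array([1, 2.5, 3])

print(ints.dtype)    # an integer dtype, typically int64 (platform-dependent)
print(floats.dtype)  # float64
print(mixed.dtype)   # float64
```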

An array can be indexed by a tuple of nonnegative integers, by booleans, by another array, or by integers.
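Here is a rough illustration of those four ways of indexing. The array name grid and its values are made up for this example.

```python
import numpy as np

grid = np.array([[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]])

# By a tuple of nonnegative integers: row 1, column 2
by_tuple = grid[1, 2]
print(by_tuple)  # 7

# By booleans: keep only the elements greater than 6
mask = grid > 6
by_bool = grid[mask]
print(by_bool)  # [ 7  8  9 10 11 12]

# By another array: pick rows 0 and 2
by_array = grid[np.array([0, 2])]
print(by_array)

# By a single integer: the whole row 1
by_int = grid[1]
print(by_int)  # [5 6 7 8]
```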

The rank of the array is the number of dimensions. The shape of the array is a tuple of integers giving the size of the array along each dimension.
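To see this for yourself, the ndim and shape attributes report the rank and the shape. A minimal sketch with a made-up 2 x 3 array:

```python
import numpy as np

m = np.array([[1, 2, 3], [4, 5, 6]])

print(m.ndim)   # 2, the number of dimensions (rank)
print(m.shape)  # (2, 3), 2 rows and 3 columns
```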

One way we can initialize NumPy arrays is from Python lists, using nested lists for two- or higher-dimensional data. In NumPy, dimensions are called axes.

import numpy as np

# 1D, or 1 axis
a = np.array([1, 2, 3])
print(a)

# 2D, or 2 axes
b = np.array([[1, 2], [3, 4]])
print(b)

We can access the elements in the array using square brackets. When you’re accessing elements, remember that indexing in NumPy starts at 0. That means that if you want to access the first element in your array, you’ll be accessing element “0”.

a = np.array([[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]])
print(a[0])

[1 2 3 4]
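The same zero-based counting applies to every axis. As a short sketch using the same values, you can reach a single element by giving one index per axis:

```python
import numpy as np

a = np.array([[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]])

first = a[0, 0]  # first row, first column
last = a[2, 3]   # third row, fourth column

print(first)  # 1
print(last)   # 12
```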

Any single item extracted from an ndarray object (by indexing) is represented by a Python object of one of the array scalar types. The diagram above shows the relationship between the ndarray, the data type object (dtype), and the array scalar type.
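A quick way to check this, as a sketch only: np.generic is the base class of NumPy's array scalar types, so isinstance can tell a single extracted item from a whole row. The exact scalar type printed depends on your platform.

```python
import numpy as np

a = np.array([[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]])

item = a[0, 1]  # one element
row = a[0]      # a whole row

print(type(item))                    # an array scalar type, e.g. numpy.int64
print(isinstance(item, np.generic))  # True
print(item.dtype == a.dtype)         # True, the scalar keeps the array's dtype
print(type(row))                     # <class 'numpy.ndarray'>
```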