Skip to content

naveensiwas/data_science_libraries

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Top Python Libraries Every Data science Aspirant Must know !

1.NumPy

What is NumPy ?

NumPy is a library for the Python programming language, adding support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

  • A matrix is a two-dimensional data structure where numbers are arranged into rows and columns.
  • In Numpy dimensions are called axes. The number of axes is rank.
  • The first axis ( i.e. axis-0 ) is running vertically downwards across rows, and the second (axis-1) running horizontally across columns.

NumPy Topics

  • Array
  • Shape and Reshape
  • Shape and Reshape with Arange
  • Swapaxes
  • Understanding Axes
  • Matrix Operations
  • Slicing - Two-dimensional Numpy Arrays
  • Slice a Range of Values from Two-dimensional Numpy Arrays
  • Masking and Filtering
  • Transposing
  • Sorting
  • Concatenation
  • Ones
  • Zeros

2.Pandas

What is Pandas ?

Pandas is a software library written for the python programming language for data manipulation and analysis.

  • Pandas is built on top of the NumPy package, meaning a lot of the structure of NumPy is used or replicated in Pandas.
  • Data in pandas is often used to feed statistical analysis in SciPy, plotting functions from Matplotlib, and machine learning algorithms in Scikit-learn.
  • The primary two components of pandas are the Series and DataFrame.

1.Pandas Series and Dataframe

2.Pandas Viewing Data