I feel that I characterize Pandas as "improving." Numpy / SciPy skips most of the point. Numpy / Scipy is quite focused on efficient numerical calculation and solving numerical problems, which are often solved by scientists and engineers. If your problem starts with formulas and includes a numerical solution from there, you are probably good with these two.
Pandas is much more consistent with issues that start with data stored in files or databases containing strings as well as numbers. Consider the problem of reading data from a database query. In Pandas, you can read_sql_query directly and have a usable version of the data on one line. Numpy / SciPy has no equivalent functions.
For data containing rows or discrete rather than continuous data, there is no equivalent to the groupby function or joining tables into a database when matching values.
For time series, there is a massive advantage of processing time series of data using the datetime index, which allows you to smoothly drag and drop different intervals, fill in values โโand plot incredibly easily.
Since many of my problems start life in spreadsheets, I am also very grateful for the relatively transparent processing of Excel files in .xls and .xlsx formats with the interface format.
There is also a wider ecosystem with packages such as a boat that provides smoother statistical analysis and model fitting than is possible with numpy / scipy base material.
source share