DataframeDescribe
The describe function from pandas displays common statistics of numerical columns within a dataframe. This includes the columns maximum, minimum, certain quantiles, etc. This kit makes it simple to use the describe function and display the data for either the entire dataframe or a subset of columns.
Options
columns: Specifies columns to describe
where: Specifies a condition for describing data
group: Specifies columns to group by
Examples
Example 1 - Describe All Columns in the Dataframe
When exploring a new dataset, it’s useful to get summary statistics across all numeric and datetime columns. The describe function returns metrics such as count, mean, min, and max for each column. This provides a high-level overview of the dataset and helps identify anomalies or outliers early in the analysis process.
#> DataframeDescribe --print
AFLEFT
bankTransactionsDfDescribe = bankTransactionsDf.describe()
print(bankTransactionsDfDescribe) #)2 AFRIGHT