Spark SQL Aggregation Functions
- groupBy : Groups records by the values of one or more columns.
- count : Counts the number of records (per group when used after groupBy).
- sum : Calculates the sum of the values in a column.
- mean : Calculates the mean (average) of the values in a column.
- min : Finds the minimum value in a column.
- max : Finds the maximum value in a column.
#pyspark #spark #python #sparksql #dataframe #aggregation #groupBy #sum #mean #avg #max #min
PySpark Examples - How to use Aggregation Functions DataFrame (sum,mean,max,min,groupBy) - Spark SQL | NatokHD