mardi 26 juillet 2016

What is the difference between cube and groupBy for operating on DataFrames?

Question is pretty much in the title. I can't find any detailed documentation regarding the differences.

I do notice a difference because when interchanging cube and groupBy function calls, I get different results. I noticed that for the result using 'cube', I got a lot of weird null values on the expressions I often grouped by.

Aucun commentaire:

Enregistrer un commentaire