SQL: difference between PARTITION BY and GROUP BY

Question

I&#8217;ve been using GROUP BY for all types of aggregate queries over the years. Recently, I&#8217;ve been reverse-engineering some code that uses PARTITION BY to perform aggregations. In reading through all the documentation I can find about PARTITION BY, it sounds a lot like GROUP BY, maybe with a little e…

Accepted Answer

They&#8217;re used in different places. GROUP BY modifies the entire query, like:select customerId, count(*) as orderCountfrom Ordersgroup by customerIdBut PARTITION BY just works on a window function, like ROW_NUMBER():select row_number() over (partition by customerId order by orderId)    as OrderNumberForThisCustomerfrom OrdersGROUP BY normally reduces the number of rows returned by rollingthem up and calculating averages or sums for each row.PARTITION BY does not affect the number of rows returned, but itchanges how a window function&#8217;s result is calculated.

Advertisement

Answer