SQL Server: Difference between PARTITION BY and GROUP BY

Question

I've been using GROUP BY for all types of aggregate queries over the years. Recently, I've been reverse-engineering some code that uses PARTITION BY to perform aggregations. In reading through all the documentation I can find about PARTITION BY, it sounds a lot like GROUP BY, maybe with a little extra functionality added in?

nisha · Answer

We can take a simple example. Consider a table named TableA with the following values:

id  firstname                   lastname                    Mark
-------------------------------------------------------------------
1   arun                        prasanth                    40
2   ann                         antony                      45
3   sruthy                      abc                         41
6   new                         abc                         47
1   arun                        prasanth                    45
1   arun                        prasanth                    49
2   ann                         antony                      49
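
To try the queries in this answer, the sample rows can be loaded with a script along these lines. This is a minimal sketch: the column types (INT, VARCHAR(50)) are assumptions, since the answer only lists the values, and there is no primary key because id repeats.

```sql
-- Sketch only: column types are assumed, not taken from the original table.
CREATE TABLE TableA (
    id        INT         NOT NULL,  -- not unique: the same id appears on several rows
    firstname VARCHAR(50) NOT NULL,
    lastname  VARCHAR(50) NOT NULL,
    Mark      INT         NOT NULL
);

INSERT INTO TableA (id, firstname, lastname, Mark)
VALUES (1, 'arun',   'prasanth', 40),
       (2, 'ann',    'antony',   45),
       (3, 'sruthy', 'abc',      41),
       (6, 'new',    'abc',      47),
       (1, 'arun',   'prasanth', 45),
       (1, 'arun',   'prasanth', 49),
       (2, 'ann',    'antony',   49);
```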

GROUP BY

The SQL GROUP BY clause can be used in a SELECT statement to collect data across multiple records and group the results by one or more columns. In simpler terms, GROUP BY is used together with aggregate functions to group the result set by one or more columns.

Syntax:

SELECT expression1, expression2, ... expression_n,
       aggregate_function (aggregate_expression)
FROM tables
WHERE conditions
GROUP BY expression1, expression2, ... expression_n;

We can apply GROUP BY to our table:

SELECT SUM(Mark) AS marksum, firstname
FROM TableA
GROUP BY id, firstname;

Results:

marksum  firstname
----------------
94      ann                      
134     arun                     
47      new                      
41      sruthy   
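
One way to see the roll-up is to add COUNT(*) next to the sum, so each output row reports how many source rows it absorbed. This is a sketch that assumes TableA holds the seven rows listed above; the aliases rows_in_group and marksum are only illustrative.

```sql
-- One output row per (id, firstname) group.
SELECT id,
       firstname,
       COUNT(*)  AS rows_in_group,  -- how many source rows were rolled up
       SUM(Mark) AS marksum
FROM TableA
GROUP BY id, firstname
ORDER BY id;                        -- without ORDER BY the row order is not guaranteed
```

With the sample data this should return four rows: three source rows rolled into id 1 (sum 134), two into id 2 (sum 94), and one each for ids 3 and 6. Any column in the SELECT list that is not inside an aggregate has to appear in the GROUP BY, which is why lastname or Mark cannot simply be added to the output.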

Our table has 7 rows. When we apply GROUP BY id, firstname, the server groups the rows by those columns and returns one row per group, 4 rows here. In simple words: GROUP BY normally reduces the number of rows returned by rolling them up and calculating SUM() for each group.

PARTITION BY

Before going to PARTITION BY, let us look at the OVER clause. According to the MSDN definition:

OVER clause defines a window or user-specified set of rows within a query result set. A window function then computes a value for each row in the window. You can use the OVER clause with functions to compute aggregated values such as moving averages, cumulative aggregates, running totals, or a top N per group results.

PARTITION BY will not reduce the number of rows returned. We can apply PARTITION BY to our example table:

SELECT SUM(Mark) OVER (PARTITION BY id) AS marksum, firstname FROM TableA

Result:

marksum firstname
-------------------
134     arun                     
134     arun                     
134     arun                     
94      ann                      
94      ann                      
41      sruthy                   
47      new  

Look at the results: the query partitions the rows by id and returns all 7 rows, each carrying the sum for its partition, unlike GROUP BY.
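
Because the rows survive, a windowed aggregate can sit next to the detail columns, which a plain GROUP BY cannot do. The sketch below, again assuming TableA holds the seven rows listed above, shows each Mark beside its partition total, a running total, and a position within the id. The ORDER BY Mark inside OVER and the column aliases are choices made for illustration, not part of the original answer, and ordered window aggregates need SQL Server 2012 or later.

```sql
SELECT id,
       firstname,
       Mark,
       -- same total repeated on every row of the id
       SUM(Mark) OVER (PARTITION BY id) AS marksum,
       -- cumulative total within the id, lowest Mark first
       SUM(Mark) OVER (PARTITION BY id
                       ORDER BY Mark
                       ROWS UNBOUNDED PRECEDING) AS running_sum,
       -- 1 = highest Mark for the id
       ROW_NUMBER() OVER (PARTITION BY id
                          ORDER BY Mark DESC) AS rank_in_id
FROM TableA
ORDER BY id, Mark;
```

For id 1, for example, the three rows keep their own Mark (40, 45, 49), all show marksum 134, and running_sum climbs 40, 85, 134. To get a top N per group, the query can be wrapped in a derived table or CTE and filtered on rank_in_id, since window functions are not allowed directly in WHERE.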
