SQL Server Performance: GROUP BY int vs GROUP BY VARCHAR

Question

SQL Server Performance: GROUP BY int vs GROUP BY VARCHAR

Is there a difference in performance when grouping by different data types? For example, if I group INT, do I get better performance than if I group varchar?

+8

performance types sql-server group-by

richard Aug 25 '11 at 21:15

source share

3 answers

Do you define a data type based solely on how the data type performs in GROUP BY ? This is the same data, you just decide how to store 123456, like INT or VARCHAR ? Have you considered other factors, such as the cost of the CPU to convert between numeric and string types, when this might not have been necessary? Additional memory needed to store the entire table in the cache? Overhead string for VARCHAR indicating length? As for storage costs (for example, 1234567890 takes 4 bytes as INT , but "1234567890" takes 10 bytes + line overhead like VARCHAR )? What about compression? How will the index in this column align with the clustered index in the table, which can affect how useful the "already grouped" ones are?

In other words, I would not consider GROUP BY performance in a bubble.

+4

Aaron bertrand Aug 25 '11 at 10:46

source share

Grouping by int will be slightly faster than grouping by varchar, but what really matters is if there is an index in the field that the database can use to group.

+3

Guffa Aug 25 '11 at 9:31

source share

Simon hugs · Accepted Answer · 2011-08-25T21:20:42+0000

I would say that GROUP BY INT is faster, since only 4 bytes are checked by verses of n bytes in the varchar field.

SQL Server Performance: GROUP BY int vs GROUP BY VARCHAR

More articles: