How does the SQL server determine the number of rows?

Question

How does the SQL server determine the number of rows?

I am trying to debug a rather complicated stored procedure, which is combined into many tables (10-11). I see that for a part of the tree, the estimated number of rows is very different from the actual number of rows - in the worst case, the SQL server estimates that 1 row will be returned when 55,000 rows are actually returned!

I am trying to understand why this is so - all my statistics are updated, and I updated the statistics using FULLSCAN on several tables. I do not use any user-defined functions or table variables. As far as I can see, the SQL server should be able to accurately estimate how many rows will be returned, but it continues to choose a plan in which it executes tens of thousands of RDI queries (when it is expected that it will execute only 1 or 2).

What can I do to try to understand why the counted number of lines came out so many?

UPDATE: . Thus, looking at the plan, I found one node, in particular, which seems suspicious - it scanned a table in a table using the following history:

status <> 5
AND [type] = 1
OR [type] = 2

This predicate returns the entire table (630 rows - the table scan itself is NOT a source of poor performance), however the SQL server has an estimated number of rows of only 37. Then the SQL server performs several nested loops using this to search for RDI, scan indexes and search indexes. Could this be the source of my mass miscalculation? How to get it to estimate a more reasonable number of rows?

+5

sql-server sql-server-2005 sql-execution-plan

Justin 25 sept. '09 at 11:13

source share

4 answers

, .

( )

( : . ...)

exec sp_msforeachtable 'UPDATE STATISTICS ?'

(, ), ( , ):

exec sp_msforeachtable "DBCC DBREINDEX('?')"

, Microsoft SQL Server 2008

+3

Mitch Wheat 25 . '09 11:17

, sniffing:

CREATE PROCEDURE xyz
(
    @param1 int
    ,@param2 varchar(10)

)AS

DECLARE @param_1 int
       ,@param_2 varchar(10)

SELECT @param_1=@param1
      ,@param_2=@param2

...complex query here....
...WHERE column1=@param_1 AND column2=@param_2....

go

0

KM. 25 . '09 11:17

0

user55474 04 . '10 18:57

Quassnoi · Accepted Answer · 2009-09-25T11:21:33+0000

SQL Serversplits each index into 200ranges with the following data (from here )

RANGE_HI_KEY
A key value indicating the upper boundary of the histogram step.
RANGE_ROWS
, ( , RANGE_HI_KEY, , RANGE_HI_KEY).
EQ_ROWS
, RANGE_HI_KEY.
AVG_RANGE_ROWS
.
DISTINCT_RANGE_ROWS
, ( RANGE_HI_KEY RANGE_HI_KEY);

RANGE_HI_KEY.

, .

( ):

1          1
2          1
3          10000
4          1

SQL Server : 1 3 4 , :

RANGE_HI_KEY  RANGE_ROWS  EQ_ROWS  AVG_RANGE_ROWS  DISTINCT_RANGE_ROWS
3             2           10000    1               2

, , , 2 1, .

3 , :

RANGE_HI_KEY  RANGE_ROWS  EQ_ROWS  AVG_RANGE_ROWS  DISTINCT_RANGE_ROWS
4             10002       1        3334            3

, 2 3334, .

How does the SQL server determine the number of rows?

More articles: