To add to what Dirk said about the condition number method, the rule is that CN > 30 indicate severe collinearity values CN > 30 indicate severe collinearity . Other methods besides the condition number include:
1) the covariance determinant is a matrix that ranges from 0 (Perfect Collinearity) to 1 (no collinearity)
# using Dirk example > det(cov(mm12[,-1])) [1] 0.8856818 > det(cov(mm123[,-1])) [1] 8.916092e-09
2) Using the fact that the determinant of a diagonal matrix is a product of eigenvalues => The presence of one or more small eigenvalues indicates collinearity
> eigen(cov(mm12[,-1]))$values [1] 1.0876357 0.8143184 > eigen(cov(mm123[,-1]))$values [1] 5.388022e+00 9.862794e-01 1.677819e-09
3) The value of the coefficient of variation (VIF). VIF for the predictor i is 1 / (1-R_i ^ 2), where R_i ^ 2 is R ^ 2 from the regression of the predictor i from the rest of the predictors. Collinearity is present when the VIF for at least one independent variable is large. Thumb Rule: VIF > 10 is of concern . For implementation in R see here . I would also like to comment that using R ^ 2 to determine collinearity should go hand in hand with a visual analysis of scattering patterns, because a single outlier can “cause” collinearity where it does not exist, or it can hide collinearity where it exists ,
George Dontas Jun 15 '10 at 8:23 2010-06-15 08:23
source share