I have a basic linear regression with 80 numerical variables (no classification variables). The training kit has 1600 series, testing 700.
I need a python package that iterates through all combinations of columns to find the best user-defined evaluation function or function with a result similar to AIC. OR If it does not exist, what do people use here to select variables? I know that R has packages like this, but I don't want to deal with Rpy2
I have no preference if LM requires learning scikit, numpy, pandas, statsmodels or another.
source
share