I have a dataset containing an independent variable and a set of dependent variables. I would like to fit a function to each set of independent variables using a bootable non-linear least-squares procedure. In some cases, independent variables are "good quality", that is, they are good enough for the function. In other cases, they are noisy.
In all cases, I can use nls() to get parameter estimates. However, when the data is noisy, the bootstrap generates an Error in nls(...) : singular gradient error. I can understand why nls installation on noisy data will fail, for example. due to the inability to converge after too many iterations, but I donβt understand why this is a singular gradient error, and why I get only oversampled low quality datasets.
the code:
require(ggplot2) require(plyr) require(boot) # Data are in long form: columns are 'enzyme', 'x', and 'y' enz <- read.table("http://dl.dropbox.com/s/ts3ruh91kpr47sj/SE.txt", header=TRUE) # Nonlinear formula to fit to data mmFormula <- formula(y ~ (x*Vmax) / (x + Km))
nls great for setting data (even if in some cases, like a , I doubt that the model is suitable for data.
# Use nls to fit mmFormula to the data - this works well enough fitDf <- ddply(enz, .(enzyme), function(x) coefficients(nls(mmFormula, x, start=list(Km=100, Vmax=0.5)))) # Create points to plot for the simulated fits xGrid <- 0:200 simFits <- dlply(fitDf, .(enzyme), function(x) data.frame(x=xGrid, y=(xGrid * x$Vmax)/(xGrid + x$Km))) simFits <- ldply(simFits, identity) ggplot() + geom_point(data=enz, aes(x=x, y=y)) + geom_line(data=simFits, aes(x=x, y=y)) + facet_wrap(~enzyme, scales="free_y") + aes(ymin=0)

Bootstrapping works great for good quality data:
# Function to pass to bootstrap; returns coefficients of nls fit to formula nlsCoef <- function(df, i) { KmGuess <- median(df$x) VmaxGuess <- max(df$y) dfSamp <- df[i,] nlsCoef <- coefficients(nls(mmFormula, dfSamp, start=list(Km=100, Vmax=0.5))) } eBoot <- boot(subset(enz, enzyme=="e"), nlsCoef, R=1000) #No error
But not for low quality data
dBoot <- boot(subset(enz, enzyme=="d"), nlsCoef, R=10) > Error in nls(mmFormula, dfSamp, start = list(Km = KmGuess, Vmax = VmaxGuess)) : singular gradient
What causes this error? And what should I do with this, given that I would like to use plyr to simultaneously execute many bootstrap syntaxes?