Why the model cannot predict the sine

Question

Why the model cannot predict the sine

I am trying to create the studied time series using RSTN LSTM using Keras, so I want to predict the datapoint and pass it back as input to predict the next, etc., so that I can actually generate timers (for example, given 2000 data points, predict the next year 2000) I'm trying to do it this way, but the RMSE of the test is 1.28, and the prediction is basically a straight line

# LSTM for international airline passengers problem with regression framing import numpy import matplotlib.pyplot as plt from pandas import read_csv import math from keras.models import Sequential from keras.layers import Dense from keras.layers import LSTM from sklearn.preprocessing import MinMaxScaler from sklearn.metrics import mean_squared_error # convert an array of values into a dataset matrix def create_dataset(dataset, look_back=1): dataX, dataY = [], [] for i in range(len(dataset)-look_back-1): a = dataset[i:(i+look_back), 0] dataX.append(a) dataY.append(dataset[i + look_back, 0]) return numpy.array(dataX), numpy.array(dataY) # fix random seed for reproducibility numpy.random.seed(7) # load the dataset dataset = np.sin(np.linspace(0,35,10000)).reshape(-1,1) print(type(dataset)) print(dataset.shape) dataset = dataset.astype('float32') # normalize the dataset scaler = MinMaxScaler(feature_range=(0, 1)) dataset = scaler.fit_transform(dataset) # split into train and test sets train_size = int(len(dataset) * 0.5) test_size = len(dataset) - train_size train, test = dataset[0:train_size,:], dataset[train_size:len(dataset),:] # reshape into X=t and Y=t+1 look_back = 1 trainX, trainY = create_dataset(train, look_back) testX, testY = create_dataset(test, look_back) # reshape input to be [samples, time steps, features] trainX = numpy.reshape(trainX, (trainX.shape[0], 1, trainX.shape[1])) testX = numpy.reshape(testX, (testX.shape[0], 1, testX.shape[1])) # create and fit the LSTM network model = Sequential() model.add(LSTM(16, input_shape=(1, look_back))) model.add(Dense(1)) model.compile(loss='mean_squared_error', optimizer='adam') model.fit(trainX, trainY, epochs=10, batch_size=1, verbose=2) # make predictions trainPredict = model.predict(trainX) testPredict = list() prediction = model.predict(testX[0].reshape(1,1,1)) for i in range(trainX.shape[0]): prediction = model.predict(prediction.reshape(1,1,1)) testPredict.append(prediction) testPredict = np.array(testPredict).reshape(-1,1) # invert predictions trainPredict = scaler.inverse_transform(trainPredict) trainY = scaler.inverse_transform([trainY]) testPredict = scaler.inverse_transform(testPredict) testY = scaler.inverse_transform([testY]) # calculate root mean squared error trainScore = math.sqrt(mean_squared_error(trainY[0], trainPredict[:,0])) print('Train Score: %.2f RMSE' % (trainScore)) testScore = math.sqrt(mean_squared_error(testY[0], testPredict[:,0])) print('Test Score: %.2f RMSE' % (testScore)) # shift train predictions for plotting trainPredictPlot = numpy.empty_like(dataset) trainPredictPlot[:, :] = numpy.nan trainPredictPlot[look_back:len(trainPredict)+look_back, :] = trainPredict # shift test predictions for plotting testPredictPlot = numpy.empty_like(dataset) testPredictPlot[:, :] = numpy.nan testPredictPlot[len(trainPredict)+(look_back*2)+1:len(dataset)-1, :] = testPredict # plot baseline and predictions plt.plot(scaler.inverse_transform(dataset)) plt.plot(trainPredictPlot) plt.plot(testPredictPlot) plt.show()

What am I doing wrong?

+5

python keras lstm recurrent-neural-network

Luca thiede May 09 '17 at 23:32

source share

2 answers

I see several problems with your code. Your value for look_back is 1, which means that LSTM sees only one pattern at a time, which is clearly not enough to learn anything about the sequence.

You probably did it so that in the end you could make the final prediction, submitting the prediction from the previous step as a new input. To fix this, you need to train with a lot of timestamps, and then switch to the network on LSTM with one type of time.

In addition, when you make the final prediction, you need to show the network more than one sample of the first truth. Otherwise, the position on the sine is ambiguous. (does it go up or down in the next step?)

I hit quickly. This is how I created the data:

 import numpy as np numSamples = 1000 numTimesteps = 50 width = np.pi/2.0 def getRandomSine(numSamples = 100, width = np.pi): return np.sin(np.linspace(0,width,numSamples) + (np.random.rand()*np.pi*2)) trainX = np.stack([getRandomSine(numSamples = numTimesteps+1) for _ in range(numSamples)]) valX = np.stack([getRandomSine(numSamples = numTimesteps+1) for _ in range(numSamples)]) trainX = trainX.reshape((numSamples,numTimesteps+1,1)) valX = valX.reshape((numSamples,numTimesteps+1,1)) trainY = trainX[:,1:,:] trainX = trainX[:,:-1,:] valY = valX[:,1:,:] valX = valX[:,:-1,:]

Here I trained the model:

 import keras from keras.models import Sequential from keras import layers model = Sequential() model.add(layers.recurrent.LSTM(32,return_sequences=True,input_shape=(numTimesteps, 1))) model.add(layers.recurrent.LSTM(32,return_sequences=True)) model.add(layers.wrappers.TimeDistributed(layers.Dense(1,input_shape=(1,10)))) model.compile(loss='mean_squared_error', optimizer='adam') model.summary() model.fit(trainX, trainY, nb_epoch=50, validation_data=(valX, valY), batch_size=32)

And here I modified the trained model to continue the prediction:

 # serialize the model and get its weights, for quick re-building config = model.get_config() weights = model.get_weights() config[0]['config']['batch_input_shape'] = (1, 1, 1) config[0]['config']['stateful'] = True config[1]['config']['stateful'] = True from keras.models import model_from_config new_model = Sequential().from_config(config) new_model.set_weights(weights) #create test sine testX = getRandomSine(numSamples = numTimesteps*10, width = width*10) new_model.reset_states() testPredictions = [] # burn in for i in range(numTimesteps): prediction = new_model.predict(np.array([[[testX[i]]]])) testPredictions.append(prediction[0,0,0]) # prediction for i in range(numTimesteps, len(testX)): prediction = new_model.predict(prediction) testPredictions.append(prediction[0,0,0]) # plot result import matplotlib.pyplot as plt plt.plot(np.stack([testPredictions,testX]).T) plt.show()

Here is the result. Prediction errors add up and diverge very quickly from the input sine. But he clearly recognized the general shape of the sinuses. Now you can try to improve this by trying different levels, activating functions, etc.

+2

sietschie May 12, '17 at 8:53

source share

Luca thiede · Accepted Answer · 2017-05-23T22:20:16+0000

I worked a bit on a different architecture and uploaded it to github .

So, for all the people who are looking for point-by-point time series prediction, I hope this helps.

The results look like this:

Why the model cannot predict the sine

More articles: