Comparisons

ARIMA vs LSTM

Fight!

Step 1
We would say---

LSTM is basically a bunch of ARIMA gathering together.

Compare to ARIMA, LSTM works WAY FASTER.
With the same idea, forecasting the new n+1 time lags by the previous n lages values.
However, different as what we expected,
LSTM did not work as well as we thought. (But WHY?)
But we definitely cannot use ARIMA forever---

Let's see the advantages and weakness of these two models!

Step 2!

Step 2
Their Pros?

ARIMA

● Relatively High SMAPE scores
Ends up with 39.6649 mean SMAPE score.

● Works well for short-term
For short-run forecasts with high frequency data.

LSTM

● A LOT FASTER
One model for all the pages (145k) at once and take just 20 mins for 20 epochs.

● Not so sensitive to non-stationary data.

Step 3!

Step 3
Their Cons?

ARIMA

● High Cost and SUPER SLOW
1 min for each page, needs 100 days for 145k pages.

● Unstable
Too many assumptions to satisfy before using it. Warning if any of it violated.

LSTM

● Relatively lower score due to one model for all?

● Start to forget what happened very long ago (limit is 400 days)

Step 4!

Step 4
Wanna do further---

Fibonacci Sequence

Filling in the missing values by fibonacci sequence, which assumes the visits gradually increase.

SSANOVA

Try to fit in smoothing spline analysis of variance (SSANOVA).
Will need to be careful of overfitting.

Amnesia

LSTM works poorly since it easily forgets long-term trend.
Try to use attention to fix this problem.
Attention can bring useful information from a distant past to the current RNN cell.

Tell us more!