Review of forecasting
Understanding prediction intervals
Time-series cross-validation
Project 1 reminder and details
Let’s recall the process of breaking down time series data.
We used forecasting to predict trend and seasonality into the future.
We modeled residuals separately using an ARIMA model.
Final forecast = Trend + Seasonality + Residual Model
What are prediction intervals?
The interval within which we expect
values of our \(y\) variable to lie at
a future point in time.
Last week, we forecasted carrot prices from August 2024 to January 2028.
The predicted price for:
But how confident are we? Instead of a single value, we might say:
These are our prediction intervals.
“We are 95% confident that the true price trend is between $0.05 and $0.10 per month.”
CIs are calculated using:
Example:
Prediction intervals are always wider than confidence intervals because they capture both:
The most frequently used prediction (and confidence) interval is 95%.
The intuitive interpretation: We are 95% confident that our prediction interval contains the value of our outcome variable that will be realized.
In Tableau (and most other software) prediction intervals are shown with shaded regions around the prediction.
Which of the following best defines confidence intervals?
A. A range of values that is likely to contain an unknown population parameter with a specified level of confidence.
B. A range of values that represents the mean of a sample.
C. A range of values used to estimate a future value of a variable with a specified level of confidence.
D. A range of values that is used to test a hypothesis about a population parameter.
Which of the following best defines prediction intervals?
A. A range of values that is likely to contain an unknown population parameter with a specified level of confidence.
B. A range of values that represents the mean of a sample.
C. A range of values used to estimate a future value of a variable with a specified level of confidence.
D. A range of values that is used to test a hypothesis about a population parameter.
Last week, we generated forecasts using forecast()
in R.
Let’s look at the prediction interval for the carrot forecast:
The shaded region shows the prediction interval range.
How are prediction intervals calculated?
The formula for a 95% prediction interval for the \(h\)-step forecast is:
\[\text{Prediction Interval for }{y}_{T+h|T} = \widehat{y}_{T+h|T} \pm 1.96 \cdot \widehat{\sigma}_h\]
where
\(\widehat{y}_{T+h|T}\) is our prediction of \(y\) at time \(T\), \(h\) periods into the future
\(h\) represents the forecast horizon, or the specific number of intervals (e.g., days, months, years) from the last observed data point
1.96 is the multiplier for a 95% prediction interval assuming that our sampling distribution of future observations is normal
\(\widehat{\sigma}_h\) is an estimate of the standard deviation for the \(h\)-step forecast
We adjust the interval width based on how confident we want to be:
Confidence Level | Multiplier |
---|---|
80% | 1.28 |
85% | 1.44 |
90% | 1.64 |
95% | 1.96 |
99% | 2.58 |
The equation for a prediction interval can be written more generally as: \[ \widehat{y}_{T+h|T} \pm c\cdot \widehat{\sigma}_h \]
where the multiplier \(c\) depends on the prediction interval you desire.
Note that these all assume a normal distribution of future observations.
The standard deviation of the forecast errors measures how much our predicted values tend to vary from actual observations.
The further we forecast into the future, the greater this uncertainty becomes.
We adjust our prediction intervals based on the standard deviation of the forecast distribution.
Let’s look at how this is done mathematically.
When forecasting 1 step ahead (\(h=1\)), we will estimate the standard deviation of the forecast distribution according to:
\[ \widehat{\sigma} = \sqrt{\frac{1}{T-K}\sum^T_{t=1}e^2_t} \]
where
\(T\) is the number of data points in the time series
\(K\) is the number of parameters estimated in the forecasting method
\(e_t\) is the residual at time \(t\)
The residual \(e_t\) in the equation of the standard error is not the same as the residual \(R_t\) from the time series decomposition.
Residuals (\(R_t\)) in Time Series Decomposition
Residuals (\(e_t\)) in Forecasting & Prediction Intervals
When forecasting \(h\) steps ahead (\(h\geq 1\)), we can estimate the standard deviation of the forecast distribution according to:
\[ \widehat{\sigma}_h = \widehat{\sigma}\sqrt{h} \]
This formula highlights how uncertainty grows over time.
Note that there are many other methods for estimating the standard deviation of the forecasting distribution. This formula is for the Naïve method, and is among the simplest.1
In lab this week, we will show you how to estimate prediction intervals in R and plot them in Tableau.
How do we validate
our forecast model?
The objective:
We want to show our audience
that they should trust our predictions.
The solution:
Time series cross-validation
evaluates forecasting accuracy.
Train-Test Splits
Measure Accuracy
Construct a series of “training” and “test” sets of data.
Use your “training” set to construct a forecast (following the same methods in your main forecast).
Evaluate the accuracy of your forecast using your “test” set.
Training data = blue
Test data = orange
Imagine you’re keeping a diary of the daily high temperature in your town.
\(\rightarrow\) Each day, you train a model on all previous days and test your prediction on the next.
\(\rightarrow\) Over time, your predictions (hopefully) get better as you learn from more past observations!
That’s exactly how rolling-origin cross-validation works in time series forecasting.
How do we measure how well our model predicts future values?
Metric | Formula | Interpretation |
---|---|---|
Mean Forecast Error (MFE) | \(\sum_{t=1}^{H} \frac{\hat{y}_t - y_t}{H}\) | Average difference between predictions & actual values. |
Mean Absolute Error (MAE) | \(\sum_{t=1}^{H} \frac{| \hat{y}_t - y_t | }{H}\) | Measures average absolute error size. |
Root Mean Squared Error (RMSE) | \(\sqrt{\sum_{t=1}^{H} \frac{ (\hat{y}_t - y_t )^2 }{H}}\) | Penalizes larger errors more heavily. |
Mean Absolute Percentage Error (MAPE) | \(\sum_{t=1}^{H} \frac{ (\hat{y}_t - y_t )/y_t }{H}\) | Expresses error as a percentage of actual values. |
Choose the forecasting model that minimizes prediction error according to these measures.
This is a useful resource for these evaluation methods: https://medium.com/analytics-vidhya/basics-of-forecast-accuracy-db704b0b001b
Groups of 2
Choose an ag biz or enre management question to answer with time series data
Collect time series data
Analyze trends
Generate a forecast
\(\rightarrow\) Generate prediction intervals
Present results in a recorded video
I am currently growing almonds and walnuts, and in the coming season I will plant a new orchard.
Almond trees need to mature for 3-4 years before they are ready for harvest.
Walnut trees need to mature for 4-7 years before they are ready for harvest.
Both trees will continue producing for roughly 25 years.
How will I decide which trees to plant?
DATA: Time series data on almond and walnut prices.
ANALYSIS: Forecast prices for the next 10 years (really 25 would better answer your Q, but is probably too long of a prediction period)
ANSWER: I will plant walnuts because over a 10 year horizon they would yield higher revenues than almond trees, even though I would have to wait longer before selling them.
Your presentation videos should be roughly 8 minutes. (Minimum: 6 min, Maximum 9 min)
You should begin your presentation with the following:
Who are you? (what business, organization, etc. wants an answer to your question?)
Who is your audience? (business executive team, policy makers, consumers?)
You should then:
Give us any relevant background info that is necessary to understanding your question
State your question
Data:
Describe where your data is from
Provide summary statistics and visualizations
Analysis:
What forecasting model did you use? Why?
Show your forecast and prediction intervals
Discussion/Conclusion:
How did you use your forecast to answer your question?
What are the key take-aways for your intended audience?
What are limitations to your analysis?
The video presentation is part 1 of the project.
A report summarizing the presentation content is part 2 of the project. Your report should summarize the same information as your presentation, and can include additional information and visualizations if they are helpful for answering your question.
Each group member will be responsible for uploading the report onto their Google site. You can both have an identical report, but both must post it to your personal webpage.
Hyndman, R.J., & Athanasopoulos, G. (2021) Forecasting: principles and practice, 3rd edition, OTexts: Melbourne, Australia. https://otexts.com/fpp3/. Accessed on 02-28-2023.
A useful blog post: https://mins.space/blog/2020-07-30-forecasting-through-decomposition/
Another useful blog post: https://medium.com/analytics-vidhya/basics-of-forecast-accuracy-db704b0b001b