Time Series Models
Definition
Models for sequential data where the order of observations matters and temporal dependencies carry predictive information. Covers classical statistical methods (ARIMA, state-space models) and modern neural approaches.
Intuition
A time series is a sequence of observations indexed by time. The key challenge is that observations are not i.i.d.: knowing yesterday’s value tells you something about today’s. The goal is to model this autocorrelation structure to make forecasts.
Formal Description
Stationarity
A time series $\{y_t\}$ is weakly stationary if:
- $\mathbb{E}[y_t] = \mu$ (constant mean)
- $\operatorname{Cov}(y_t, y_{t-k})$ depends only on the lag $k$, not on $t$
Most classical methods assume stationarity. The Augmented Dickey-Fuller (ADF) test checks for a unit root (non-stationarity).
Differencing to achieve stationarity: $y'_t = y_t - y_{t-1}$. The order of differencing $d$ is chosen such that $\Delta^d y_t$ is stationary.
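A minimal numpy sketch of why differencing works, using a simulated random walk (toy data, assumed for illustration): the walk has a unit root, but its first difference recovers the underlying white noise.

```python
# A random walk y_t = y_{t-1} + eps_t is non-stationary (its variance grows
# over time); first differencing recovers the stationary noise sequence.
import numpy as np

rng = np.random.default_rng(0)
noise = rng.normal(size=500)
walk = np.cumsum(noise)      # y_t = y_{t-1} + eps_t  (unit root)

diff = np.diff(walk)         # y'_t = y_t - y_{t-1}

# The raw walk's values spread far more widely than the differenced series.
print(walk.var(), diff.var())
```

In practice the ADF test (e.g. `statsmodels.tsa.stattools.adfuller`) would be applied before and after differencing rather than eyeballing variances.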
ARIMA
AR($p$) — Autoregressive: $y_t = c + \sum_{i=1}^{p} \phi_i y_{t-i} + \varepsilon_t$
MA($q$) — Moving Average: $y_t = \mu + \varepsilon_t + \sum_{j=1}^{q} \theta_j \varepsilon_{t-j}$
ARMA($p,q$): $y_t = c + \sum_{i=1}^{p} \phi_i y_{t-i} + \sum_{j=1}^{q} \theta_j \varepsilon_{t-j} + \varepsilon_t$
ARIMA($p,d,q$): Apply $d$-th differencing to $y_t$, then fit ARMA($p,q$).
SARIMA($p,d,q$)($P,D,Q$)$_s$: adds seasonal AR, I, MA terms at lag $s$ (e.g., $s = 12$ for monthly data).
Order selection:
- ACF (autocorrelation function): cuts off after lag $q$ → MA($q$)
- PACF (partial autocorrelation): cuts off after lag $p$ → AR($p$)
- Use AIC/BIC for model selection; `auto_arima` from pmdarima automates this.
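A sketch of the core estimation step, assuming simulated AR(2) data with made-up coefficients: fitting the AR part reduces to ordinary least squares on lagged values, which is essentially what ARIMA packages do under the hood for the autoregressive terms.

```python
# Fit an AR(2) model by OLS on lagged values and recover the coefficients.
import numpy as np

rng = np.random.default_rng(1)
phi1, phi2 = 0.6, 0.3            # true coefficients (stationary: phi1+phi2 < 1)
y = np.zeros(2000)
for t in range(2, len(y)):
    y[t] = phi1 * y[t - 1] + phi2 * y[t - 2] + rng.normal(scale=0.5)

# Regress y_t on (y_{t-1}, y_{t-2}); lstsq gives the least-squares phi estimates.
X = np.column_stack([y[1:-1], y[:-2]])
coeffs, *_ = np.linalg.lstsq(X, y[2:], rcond=None)
print(coeffs)                     # estimates should be near (0.6, 0.3)
```

A full ARIMA fit additionally estimates the MA terms by maximum likelihood; `auto_arima` searches over $(p, d, q)$ and picks the order minimising AIC/BIC.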
Exponential Smoothing (ETS)
Weighted average of past observations, with exponentially decaying weights.
Simple ES: $\ell_t = \alpha y_t + (1 - \alpha)\ell_{t-1}$, with forecast $\hat{y}_{t+1} = \ell_t$.
Holt-Winters (Triple ES): models level, trend, and seasonality. An ETS model is specified by:
- Error type (Additive/Multiplicative)
- Trend type (None/Additive/Additive-damped)
- Seasonal type (None/Additive/Multiplicative)
Multiplicative seasonality is appropriate when seasonal fluctuations are proportional to the level.
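The simple exponential smoothing recursion above is a few lines of code; a direct transcription (the value of $\alpha$ here is illustrative):

```python
# Simple exponential smoothing: l_t = alpha * y_t + (1 - alpha) * l_{t-1}.
# l_t serves as the one-step-ahead forecast for time t+1.
import numpy as np

def simple_es(y, alpha=0.3):
    level = np.empty(len(y))
    level[0] = y[0]                                   # initialise at first obs
    for t in range(1, len(y)):
        level[t] = alpha * y[t] + (1 - alpha) * level[t - 1]
    return level

levels = simple_es(np.array([10.0, 12.0, 11.0, 13.0]))
print(levels)
```

Holt-Winters adds analogous recursions for the trend and seasonal components; in practice `statsmodels`' ETS implementations also estimate the smoothing parameters by maximum likelihood rather than fixing them.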
State-Space Models (Local Linear Trend)
Local linear trend model: $y_t = \mu_t + \varepsilon_t$, with $\mu_{t+1} = \mu_t + \beta_t + \xi_t$ and $\beta_{t+1} = \beta_t + \zeta_t$ (level $\mu_t$ and slope $\beta_t$ evolve stochastically). The Kalman filter computes the filtering distribution $p(\mu_t, \beta_t \mid y_{1:t})$ analytically for linear-Gaussian systems.
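A minimal Kalman filter sketch for the simplest special case, the local level model (the local linear trend adds a slope state; omitted here for brevity, and the noise variances are illustrative):

```python
# Kalman filter for the local level model:
#   y_t = mu_t + eps_t,   mu_{t+1} = mu_t + xi_t
import numpy as np

def local_level_filter(y, var_eps=1.0, var_xi=0.1):
    mu, P = 0.0, 1e6                 # diffuse initial state mean / variance
    filtered = []
    for obs in y:
        K = P / (P + var_eps)        # Kalman gain: trust obs vs prior
        mu = mu + K * (obs - mu)     # update state mean with the new obs
        P = (1 - K) * P              # update state variance
        filtered.append(mu)
        P = P + var_xi               # predict: random-walk state adds noise
    return np.array(filtered)

y = np.array([1.0, 1.2, 0.9, 1.1, 5.0])
est = local_level_filter(y)
print(est)                           # final estimate is pulled toward, not onto, 5.0
```

Note how the final outlying observation shifts the filtered level only partially — the gain balances observation noise against state noise.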
VAR (Vector Autoregression)
Multivariate extension of AR for $k$ time series: $\mathbf{y}_t = \mathbf{c} + A_1 \mathbf{y}_{t-1} + \dots + A_p \mathbf{y}_{t-p} + \boldsymbol{\varepsilon}_t$, where each $A_i$ is a $k \times k$ coefficient matrix.
Models cross-series dependencies. Used in macroeconomics; Granger causality tests check whether one series helps predict another.
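A toy VAR(1) sketch with assumed coefficients: simulate two series where series 2 influences series 1 but not vice versa (a zero entry in $A_1$), then recover the coefficient matrix by least squares.

```python
# VAR(1): y_t = A @ y_{t-1} + eps_t for k=2 series; fit A by OLS.
import numpy as np

rng = np.random.default_rng(2)
A = np.array([[0.5, 0.2],
              [0.0, 0.4]])          # A[1,0]=0: series 1 doesn't help predict series 2
y = np.zeros((3000, 2))
for t in range(1, len(y)):
    y[t] = A @ y[t - 1] + rng.normal(scale=0.5, size=2)

# Regress y_t on y_{t-1}: lstsq solves y[:-1] @ X = y[1:], i.e. X = A^T.
A_hat, *_ = np.linalg.lstsq(y[:-1], y[1:], rcond=None)
A_hat = A_hat.T
print(A_hat)                         # estimate should be close to A
```

The zero entry that OLS recovers here is the kind of restriction a Granger causality test checks formally (e.g. via `statsmodels`' VAR results).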
Neural Time Series Models
LSTM/GRU: sequence-to-sequence models that learn long-range dependencies from data (see recurrent_networks).
Temporal Convolutional Networks (TCN): dilated causal convolutions; can outperform LSTMs with faster training.
Transformer-based: Informer, Autoformer, PatchTST for long-horizon forecasting; attention captures long-range dependencies without sequential processing.
N-BEATS, N-HiTS: pure neural, interpretable forecasting; state of the art on M4 benchmark.
Evaluation Metrics
| Metric | Formula | Properties |
|---|---|---|
| MAE | $\frac{1}{T}\sum_t \lvert y_t - \hat{y}_t \rvert$ | Scale-dependent; robust to outliers |
| RMSE | $\sqrt{\frac{1}{T}\sum_t (y_t - \hat{y}_t)^2}$ | Scale-dependent; penalises large errors |
| MAPE | $\frac{100}{T}\sum_t \frac{\lvert y_t - \hat{y}_t \rvert}{\lvert y_t \rvert}$ | Scale-free; undefined for $y_t = 0$ |
| MASE | MAE normalised by naive in-sample MAE | Scale-free; values $< 1$ beat the naive forecast |
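The four metrics in the table translate directly into numpy (for MASE, the naive forecast is taken as the one-step lag, the standard choice for non-seasonal series):

```python
import numpy as np

def mae(y, yhat):
    return np.mean(np.abs(y - yhat))

def rmse(y, yhat):
    return np.sqrt(np.mean((y - yhat) ** 2))

def mape(y, yhat):
    return 100 * np.mean(np.abs(y - yhat) / np.abs(y))   # undefined if any y == 0

def mase(y, yhat, y_train):
    naive_mae = np.mean(np.abs(np.diff(y_train)))        # in-sample naive errors
    return mae(y, yhat) / naive_mae

y_train = np.array([10.0, 12.0, 11.0, 13.0])
y_true = np.array([12.0, 14.0])
y_pred = np.array([11.0, 13.0])
print(mae(y_true, y_pred), rmse(y_true, y_pred),
      mape(y_true, y_pred), mase(y_true, y_pred, y_train))
```

A MASE below 1 here means the forecast beats simply carrying the last training observation forward.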
Applications
- Insurance: claims reserving (chain-ladder, stochastic development methods)
- Finance: volatility forecasting (GARCH), algorithmic trading signals
- Demand forecasting: inventory management, supply chain
- Anomaly detection: detecting unusual spikes in time series metrics
Trade-offs
- ARIMA: interpretable, fast, principled; limited to linear dependencies and stationary processes.
- LSTM/Transformers: capture non-linear temporal patterns; require more data and tuning.
- Choose classical methods when data is limited or explainability is required; neural methods for large-scale, complex patterns.