How can we make sense of data collected over time—stock prices, daily temperatures, brain signals—when each observation is not independent but carries a memory of the past? This question has driven the development of time series analysis, a subfield of statistics that builds models for the temporal dependence structure in sequential data. The history of the field is a story of successive frameworks that each addressed a limitation of its predecessors, while many remain in active use today for different kinds of problems.
The first systematic framework for modeling time dependence was the Autoregressive Moving-Average (ARMA) Models, developed between 1927 and 1970. ARMA models describe a time series as a linear combination of its own past values (the autoregressive part) and past forecast errors (the moving-average part). The core insight was that a relatively small number of parameters could capture the persistence and shock-propagation patterns in stationary series. ARMA provided a parsimonious time-domain representation, but it assumed that the underlying statistical properties—mean, variance, autocorrelation—did not change over time.
While ARMA was being formalized, a fundamentally different perspective emerged: Spectral Analysis (1930–Present). Instead of modeling how a series evolves from one time step to the next, spectral analysis decomposes the series into a sum of sinusoidal components at different frequencies. This frequency-domain approach reveals periodic cycles that are invisible to ARMA's time-domain lens. Spectral analysis and ARMA are not rivals; they are complementary tools. Spectral analysis excels at detecting hidden periodicities (e.g., business cycles, seasonal patterns), while ARMA is better suited for short-term forecasting. Today, spectral methods remain a standard diagnostic tool, especially in geophysics, engineering, and economics, where understanding cyclical behavior is paramount.
A major leap came with State-Space Models and the Kalman Filter (1960–Present). These models represent an observed time series as a noisy measurement of an unobserved "state" that evolves over time according to its own dynamics. The Kalman filter provides a recursive algorithm to estimate the hidden state as new data arrive. This framework absorbed and generalized earlier ARMA models: any ARMA process can be written in state-space form. But state-space models go far beyond ARMA by handling missing observations, multiple series, and time-varying parameters naturally. They became the backbone of navigation systems, control engineering, and macroeconometrics.
In the 1970s, the Box-Jenkins Methodology (1970–2000) systematized the practical use of ARMA models. Box and Jenkins proposed a three-stage iterative cycle—identification, estimation, diagnostic checking—that turned ARMA modeling from an art into a disciplined procedure. They also extended ARMA to handle non-stationary series through differencing, creating the ARIMA (Autoregressive Integrated Moving Average) class. The Box-Jenkins methodology was not a new model but a methodological school that narrowed the focus to a specific workflow. It coexisted with state-space models, which offered more flexibility but required more computational effort. By the 1990s, the Box-Jenkins approach declined as software made state-space and other methods easier to apply, but its emphasis on iterative model building remains influential.
A persistent limitation of ARMA and state-space models was their assumption of constant variance. Financial returns, however, exhibit volatility clustering—periods of high turbulence followed by calm. The ARCH and GARCH Models (1982–Present) addressed this by modeling the conditional variance as a function of past squared shocks (ARCH) and past variances (GARCH). GARCH extended ARCH by adding a moving-average component for the variance, analogous to how ARMA extended pure autoregression for the mean. These models transformed financial econometrics, enabling risk assessment and option pricing. They remain the standard toolkit for volatility modeling, though they have been extended into many variants (EGARCH, GJR-GARCH) to capture asymmetries and long memory.
Around the same time, a different problem came into focus: many economic time series (e.g., consumption and income) drift together over time, but simple regression on such non-stationary series produces spurious results. Cointegration (1987–Present) provided a rigorous framework for modeling these long-run equilibrium relationships. Two or more non-stationary series are cointegrated if a linear combination of them is stationary. This concept transformed econometrics by allowing researchers to separate short-run dynamics from long-run attractors. Cointegration coexists with GARCH and state-space models: a typical macro-finance study might use cointegration for the long-run structure, GARCH for volatility, and state-space methods for unobserved components.
While the frameworks above were largely frequentist, Bayesian Time Series Analysis (1990–Present) brought a different inferential philosophy. Instead of point estimates and confidence intervals, Bayesian methods treat parameters and latent states as random variables with prior distributions, updated via Bayes' theorem as data arrive. The Kalman filter, originally derived as a frequentist algorithm, was reinterpreted as a Bayesian updating scheme for linear Gaussian state-space models. This connection deepened with the development of Markov chain Monte Carlo (MCMC) methods in the 1990s, which allowed Bayesian inference for complex nonlinear and non-Gaussian state-space models. Bayesian time series analysis now coexists with frequentist approaches, offering advantages when prior information is strong, when models are highly parameterized, or when uncertainty quantification is critical. It has absorbed and extended state-space modeling, and it often incorporates GARCH and cointegration structures within a Bayesian framework.
The most recent wave, Nonparametric and Machine Learning Approaches (2000–Present), relaxes the parametric assumptions that constrained earlier methods. Instead of specifying a fixed functional form (e.g., linear autoregression), these methods let the data determine the shape of the dependence. Gaussian processes, neural networks, random forests, and gradient boosting have been adapted for time series forecasting, often outperforming classical models on large, high-dimensional datasets. However, this flexibility comes at a cost: interpretability is lower, and uncertainty quantification is less straightforward than in Bayesian or state-space frameworks. These approaches do not replace earlier methods but rather extend the toolkit. For example, a hybrid model might use a neural network to capture nonlinear patterns in the residuals of an ARMA-GARCH fit. The leading frameworks today—spectral analysis, state-space models, GARCH, cointegration, Bayesian methods, and machine learning—coexist in a division of labor: spectral analysis for periodic diagnostics, state-space for dynamic systems, GARCH for volatility, cointegration for long-run relations, Bayesian methods for principled uncertainty, and machine learning for flexible prediction.
Despite their differences, today's leading frameworks agree on several core principles. First, all recognize that time series data violate the independence assumption of standard statistics, so models must explicitly account for temporal dependence. Second, all acknowledge the importance of model checking—whether through diagnostic plots, cross-validation, or posterior predictive checks. Third, there is broad consensus that no single framework dominates all problems; the choice depends on the data's characteristics (stationarity, sample size, dimensionality) and the goal (forecasting, inference, or description).
Disagreements persist on fundamental questions. Frequentist and Bayesian approaches differ on how to treat uncertainty: frequentists rely on sampling distributions, while Bayesians use posterior distributions. Machine learning methods often prioritize predictive accuracy over interpretability, whereas classical econometric models emphasize parameter interpretability. There is also active debate about how to handle non-stationarity: differencing (Box-Jenkins), cointegration, time-varying parameters (state-space), or regime-switching models each make different assumptions about how the world changes. These disagreements are not signs of weakness but of a healthy, pluralistic field where different frameworks illuminate different aspects of temporal data.