Start here

Choose a framework

The timeline is your map. Pick a school or approach to see what it argues, what came before it, and what sits inside it.

Start from1960

Select a framework above to see its date range, sources, and workspace.

Spatial Statistics

When data points are collected across a landscape, a brain scan, or a city map, the fundamental assumption of classical statistics—that observations are independent—breaks down. Locations close together tend to be more alike than locations far apart, a property known as spatial dependence. Spatial statistics is the branch of statistics that builds models and methods specifically to account for this dependence, turning what would otherwise be a violation of assumptions into the central object of study. The field has developed four major frameworks, each addressing a different kind of spatial data: continuous fields, discrete areal units, event locations, and data that also unfold through time.

Geostatistics: Modeling Continuous Spatial Fields (1960–Present)

The first framework to crystallize was geostatistics, which emerged from mining and petroleum engineering in the 1960s. The practical pressure was straightforward: given a few drill-core samples, how do you predict the grade of ore at an unsampled location? Geostatistics treats the spatial phenomenon as a continuous random field—a surface of values that varies smoothly across space. The key innovation was the variogram, a function that describes how the variance between two measurements grows with the distance between their locations. Using the variogram, the method of kriging (named after South African engineer Danie Krige) produces optimal linear predictions at unsampled points, along with a measure of prediction uncertainty.

Geostatistics remains a leading framework today, especially in earth sciences, environmental monitoring, and natural resource estimation. Its strength lies in interpolation and uncertainty quantification for continuous variables. Later frameworks did not replace geostatistics; rather, they coexisted with it by addressing data types that geostatistics was not designed for. Geostatistics assumes a continuous underlying surface, but many spatial problems involve data aggregated into discrete zones or patterns of points.

Lattice Data Analysis: Spatial Dependence in Discrete Areal Units (1970–Present)

By the 1970s, researchers working with data aggregated into regions—counties, census tracts, grid cells—needed a different approach. In lattice data, the spatial locations are fixed polygons or pixels, and the question is how to model the dependence between neighboring areas. The foundational contribution was the conditional autoregressive (CAR) model, which specifies the value at each region as conditionally dependent on its neighbors. This framework is closely tied to Markov random fields and is often implemented in a Bayesian hierarchical setting.

Lattice data analysis differs from geostatistics in a fundamental way: it does not assume a continuous field that can be interpolated anywhere. Instead, it models the correlation structure among a fixed set of areal units. This makes it the natural framework for disease mapping, social epidemiology, and spatial econometrics. The CAR model and its extensions (e.g., the intrinsic CAR and Besag-York-Mollié model) remain workhorses in public health and criminology. The framework coexists with geostatistics by occupying a distinct data regime—discrete regions rather than continuous surfaces.

Spatial Point Process Analysis: Patterns of Event Locations (1970–Present)

Also emerging in the 1970s, spatial point process analysis addresses a third type of data: the locations of events themselves. Instead of measuring a value at every location, the data are a set of points—tree positions in a forest, earthquake epicenters, crime incidents. The central question is whether the points exhibit clustering, regularity, or complete spatial randomness. The key tool is the Ripley's K-function (and its derivative, the L-function), which compares the observed distribution of inter-point distances to what would be expected under a Poisson process.

This framework differs from both geostatistics and lattice analysis in that the locations are random, not fixed. The focus is on the stochastic process generating the points, not on interpolating a continuous field or modeling dependence among fixed regions. Spatial point process analysis has been extended to include covariates (via the Cox process and Gibbs processes) and to handle marked points (where each event carries additional attributes). It remains active in ecology, seismology, and epidemiology, often used alongside geostatistics when both event locations and continuous covariates are present.

Spatiotemporal Statistics: Adding the Time Dimension (1990–Present)

By the 1990s, the growing availability of repeated measurements over time—satellite imagery, climate records, disease surveillance—pushed spatial statistics to incorporate the temporal dimension explicitly. Spatiotemporal statistics extends all three earlier frameworks to data that vary in both space and time. The distinctive shift is that dependence must now be modeled across two dimensions: locations close in space and time are expected to be more similar than those far apart in either dimension.

Spatiotemporal models often build on geostatistical foundations, using space-time variograms and spatiotemporal kriging. For lattice data, dynamic CAR models and spatiotemporal autoregressive models allow the spatial dependence structure to evolve over time steps. For point patterns, spatiotemporal point processes (e.g., the space-time Poisson process and the epidemic-type aftershock sequence model) capture how event clusters propagate. A major practical pressure driving this framework was the need for real-time disease surveillance and environmental monitoring, where predictions must be updated as new data arrive.

Spatiotemporal statistics did not replace the earlier frameworks; it absorbed and extended them. Today, it is the fastest-growing area within spatial statistics, with applications in climate science, air quality monitoring, epidemiology, and remote sensing. Its relationship to the other frameworks is one of transformation: it takes the core ideas of geostatistics, lattice analysis, and point process analysis and adds a temporal dimension, often requiring new computational strategies (such as dimension reduction and Gaussian process approximations) to handle the massive datasets involved.

How the Frameworks Relate Today

The four frameworks are not competing paradigms but a division of labor based on data type. Geostatistics is the method of choice when the goal is interpolation of a continuous field from sparse samples. Lattice data analysis is used when data come pre-aggregated into discrete regions and the focus is on spatial autocorrelation and areal smoothing. Spatial point process analysis is applied when the locations themselves are the data and the interest is in clustering or regularity. Spatiotemporal statistics extends all three when time is a factor.

Despite their different starting points, the frameworks share a core commitment: they all model spatial dependence explicitly rather than treating it as a nuisance. They also increasingly borrow tools from each other. For example, geostatistical methods are now routinely used to smooth lattice data, and point process models often include geostatistically interpolated covariates. The main disagreement among practitioners is not about which framework is correct but about how to handle computational scalability and model choice in large datasets.

Connections to Neighboring Subfields

Spatial statistics has deep ties to statistical learning, particularly through kernel methods and Gaussian processes. Gaussian process regression, a staple of modern machine learning, is essentially a geostatistical kriging model with a flexible covariance function. Bayesian learning frameworks are widely used in spatial statistics for hierarchical modeling, especially in lattice data analysis and spatiotemporal settings. Conversely, spatial statistics has influenced statistical learning by contributing structured covariance models and methods for handling non-Euclidean spaces.

Another important connection is to survival analysis. Spatial survival models combine frailty models (random effects that account for unobserved heterogeneity) with spatial correlation structures. For example, in epidemiology, the time until disease onset may depend on both individual risk factors and the spatial location of exposure. These models typically use a lattice or geostatistical framework to capture spatial dependence in the baseline hazard or frailty term. The integration of spatial and survival methods remains an active research frontier, especially in cancer registry studies and environmental health.

What the Leading Frameworks Agree and Disagree On

Today, all four frameworks agree on the fundamental principle that spatial dependence must be modeled, not ignored. They also agree on the value of hierarchical Bayesian approaches for quantifying uncertainty and incorporating prior information. Where they disagree is on the most appropriate representation of space: continuous random fields (geostatistics) versus discrete adjacency structures (lattice) versus stochastic point processes (point patterns). This disagreement is not a conflict but a reflection of the different data types each framework was designed for. The leading frameworks coexist because real-world spatial problems rarely fit neatly into one category; a single study might involve interpolating a continuous pollutant surface (geostatistics), mapping disease rates by county (lattice), and analyzing the clustering of case locations (point process), all within a spatiotemporal framework. The future of the field lies in hybrid models that combine elements of all four traditions, often powered by advances in computation and machine learning.

Was this useful?

Sources

Framework Workspace

Choose a framework to open this workspace

Select a framework from the timeline above to see what it is, what it responds to, and which concepts sit inside it.

Overview

What this framework argues, what problem it answered, and why it mattered.

Concept map

Core concepts and the prerequisite order inside the framework.

Workflow tools

Verification, sources, generation status, and correction tools.

Loading vocabulary comparison...

Spatial Statistics timeline

Spatial Statistics

Geostatistics: Modeling Continuous Spatial Fields (1960–Present)

Lattice Data Analysis: Spatial Dependence in Discrete Areal Units (1970–Present)

Spatial Point Process Analysis: Patterns of Event Locations (1970–Present)

Spatiotemporal Statistics: Adding the Time Dimension (1990–Present)

How the Frameworks Relate Today

Connections to Neighboring Subfields

What the Leading Frameworks Agree and Disagree On

Spatial Statistics timeline

Spatial Statistics

Geostatistics: Modeling Continuous Spatial Fields (1960–Present)

Lattice Data Analysis: Spatial Dependence in Discrete Areal Units (1970–Present)

Spatial Point Process Analysis: Patterns of Event Locations (1970–Present)

Spatiotemporal Statistics: Adding the Time Dimension (1990–Present)

How the Frameworks Relate Today

Connections to Neighboring Subfields

What the Leading Frameworks Agree and Disagree On