Computational imaging emerged from a simple but stubborn fact: every physical camera trades one kind of image quality for another. A wide aperture gathers light but narrows depth of field; a small aperture sharpens focus but dims the image; a high-resolution sensor demands a large, expensive lens. For most of the twentieth century, the only way to push past these trade-offs was to build better optics. Computational imaging inverts that logic: instead of asking the lens to do everything, it co-designs the optical hardware and the digital software as a single system, letting computation compensate for optical compromises and, in some cases, replace the lens entirely.
The first framework to act on this co-design principle was Coded Aperture Imaging, which appeared in the early 1990s. Instead of a lens that focuses light onto a sensor, a coded aperture uses a patterned mask—a sheet with a carefully designed array of holes—placed in front of a sensor. Each point in the scene casts a shadow of the mask pattern onto the sensor, producing a scrambled image that looks nothing like the scene. A reconstruction algorithm then decodes the pattern to recover a sharp image. The method was originally developed for X-ray and gamma-ray astronomy, where conventional lenses are impractical, but it demonstrated a radical idea: the optical element no longer had to form a recognizable image; it only had to encode information that computation could later decode.
At roughly the same time, the broader philosophy that made coded aperture work was being articulated as Joint Optical-Digital Design. This framework is not a single technique but a coordinating research program: it insists that the optical system and the digital processing pipeline should be designed together, not sequentially. A traditional camera designer first builds the best possible lens and then, as an afterthought, applies digital sharpening or denoising. Joint Optical-Digital Design treats the lens and the algorithm as two halves of a single optimization problem. Coded aperture imaging was an early, dramatic instantiation of this philosophy, but the framework also encompasses designs where a conventional lens is deliberately made simpler or cheaper because the algorithm will correct its aberrations. The key claim is that the optimal optical design changes when you know what computation will follow.
By the early 2000s, digital cameras had become ubiquitous, and the co-design philosophy migrated from specialized scientific instruments into consumer imaging. Computational Photography took the Joint Optical-Digital Design program and applied it to the problems that ordinary photographers face: motion blur, limited dynamic range, shallow depth of field, and low-light noise. Instead of a single snapshot, a computational camera might capture a burst of images with different exposures and fuse them into a high-dynamic-range result, or it might use a coded exposure pattern—a "fluttered shutter"—to preserve high-frequency detail that would otherwise be blurred by motion. The key shift was that the camera now actively controlled the capture process in ways that a conventional film camera could not, and the final image was assembled by software rather than by the optics alone.
Light Field Photography emerged in the same period but took a different path. Rather than enhancing a conventional 2D image, it aimed to capture the full 4D plenoptic function—the intensity of light traveling along every ray through space. The canonical implementation places a microlens array between the main lens and the sensor, so that each microlens records the direction of incoming light as well as its total intensity. The result is a raw data set that can be computationally refocused after the fact, or even used to synthesize novel viewpoints. Light Field Photography did not supersede Computational Photography; the two frameworks coexisted and, in some cameras, even complemented each other. Computational Photography improved the quality of a single image; Light Field Photography traded some spatial resolution for the ability to change the focus or perspective after capture. The tension between them was a practical one: could you afford to sacrifice resolution for flexibility, or was it better to use computation to wring the most out of a conventional sensor?
The 2010s brought a sharp methodological divide. Data-Driven Computational Imaging introduced a fundamentally new way to solve the reconstruction problem. Earlier frameworks had relied on hand-designed models of the optical system and hand-crafted priors about natural images—smoothness, sparsity, or edge preservation. Data-driven methods replaced those models with deep neural networks trained on large datasets of input-output pairs. Given a blurred or noisy raw capture, a network learned to predict the clean image directly, often producing results that surpassed model-based methods in speed and perceptual quality. This shift changed the joint-design process itself: instead of optimizing the optical system for a known reconstruction algorithm, researchers could now optimize the optics and the neural network together, end-to-end, using gradient descent. The framework did not reject the Joint Optical-Digital Design philosophy; it radicalized it by making the reconstruction algorithm a learned black box rather than an explicit physical model.
Fourier Ptychography, introduced around 2013, took the co-design idea in a different direction, returning to the coherent wavefronts of microscopy. The technique uses an array of LEDs to illuminate a sample from different angles, capturing a sequence of low-resolution images through a low-numerical-aperture objective. Each image records a different region of the sample's Fourier spectrum. A phase-retrieval algorithm then stitches these overlapping spectral patches together to reconstruct a high-resolution complex field—both amplitude and phase—far beyond the native resolution of the objective lens. Fourier Ptychography shares a deep kinship with Coded Aperture Imaging: both use non-traditional optical elements (a coded mask in one case, an LED array in the other) to encode information that computation later decodes. But where Coded Aperture Imaging was designed for incoherent X-rays, Fourier Ptychography works with coherent visible light and exploits the phase information that conventional sensors discard. It also inherits the Joint Optical-Digital Design commitment: the LED array and the reconstruction algorithm are co-designed to maximize resolution and field of view simultaneously.
Today, the field is organized around a central debate. On one side, Data-Driven Computational Imaging continues to push the limits of what learned reconstruction can achieve, especially in consumer applications like smartphone photography, where a lightweight network can replace bulky optics. On the other side, model-based frameworks—Coded Aperture Imaging, Fourier Ptychography, and the broader Joint Optical-Digital Design program—argue that physically grounded models offer interpretability, robustness, and sample efficiency that pure learning cannot match. A hybrid approach is emerging: neural networks that incorporate the physical forward model of the camera as a differentiable layer, so that the network learns only the prior or the correction, not the entire physics. This hybrid strategy preserves the co-design principle while keeping the reconstruction grounded in optics.
Computational Photography and Light Field Photography remain active, but their roles have shifted. Computational Photography is now the dominant paradigm in mobile imaging, where multi-frame fusion, depth-from-defocus, and learned denoising are standard. Light Field Photography, after a wave of consumer cameras in the early 2010s, has retreated to specialized applications in microscopy and industrial inspection, where the ability to refocus after capture outweighs the loss of resolution. The two frameworks no longer compete directly; they serve different niches.
What the leading frameworks agree on is that the optical system and the digital algorithm must be designed as a single unit—the Joint Optical-Digital Design principle is now nearly universal. Where they disagree is on the nature of the reconstruction algorithm. Should it be a transparent physical model whose parameters are calibrated once, or a learned function that adapts to data? The answer is not settled, and the tension between model-based and data-driven approaches is the engine driving the field forward. Each new framework has been an attempt to push the boundary of what can be captured by co-designing optics and computation, and the current frontier is about how much of that computation should be learned rather than derived from first principles.