Health insurance confronts economists with a persistent dilemma. Pooling risk across a population protects individuals from financial catastrophe, but the very act of insuring changes how people behave and which risks they reveal. The history of health insurance theory is the story of how economists have built, refined, and contested frameworks to manage this tension between financial protection and incentive distortion.
In 1963, Kenneth Arrow published a landmark analysis that established the starting point for all subsequent health insurance theory. Arrow argued that healthcare markets are fundamentally different from the textbook ideal because uncertainty is pervasive and information is distributed unequally. Patients cannot judge the quality of care they need, insurers cannot perfectly predict who will become sick, and the product itself—health—is unpredictable. These features, Arrow showed, create a cascade of market failures: missing markets for risk, moral hazard in consumption, and barriers to entry that prevent competitive pricing. Arrow’s framework did not prescribe a single policy solution; instead, it provided a diagnostic lens that identified why insurance markets require institutional intervention. Every later framework in health insurance theory has either refined Arrow’s diagnosis or proposed mechanisms to correct the failures he identified.
Arrow’s diagnosis was broad. The next generation of theorists broke it into two distinct incentive problems, each with its own formal model and policy implications.
Mark Pauly’s 1968 moral hazard framework narrowed Arrow’s insight about insurance-induced demand into a precise microeconomic model. Pauly argued that once individuals are insured, the price they face at the point of care drops to near zero, so they consume more care than they would if they paid the full cost. This overconsumption is not irrational; it is a rational response to a subsidy embedded in insurance. The moral hazard framework transformed the policy question from “should we insure?” to “how should we design cost-sharing to balance risk protection against wasteful overuse?” Deductibles, copayments, and coinsurance all emerged from this logic. Moral hazard remains the dominant framework for analyzing demand-side incentives in insurance design.
A few years later, the adverse selection framework, formalized by Michael Rothschild and Joseph Stiglitz in 1976, addressed a different information problem. Where moral hazard concerns hidden actions (what people do after buying insurance), adverse selection concerns hidden types (what people know about their own health before buying insurance). Rothschild and Stiglitz showed that when insurers cannot distinguish high-risk from low-risk individuals, offering a single pooled price drives low-risk customers away, potentially causing the market to unravel entirely. Their model predicted that competitive insurance markets would either fragment into separate contracts for different risk types or fail to cover the low-risk population. This framework shifted policy attention toward risk adjustment, guaranteed issue, and community rating—mechanisms designed to prevent the unraveling that adverse selection predicts.
Moral hazard and adverse selection are often taught together, but they make different assumptions about what information is hidden and what behavior is distorted. Moral hazard assumes insurers know the population risk but cannot observe effort; adverse selection assumes insurers cannot observe pre-existing risk. Both frameworks share a rational-choice foundation: individuals respond optimally to the incentives insurance creates. That shared assumption would later become a target for behavioral critics.
By the 1980s, the theoretical understanding of adverse selection had outpaced institutional practice. Alain Enthoven’s managed competition framework, developed from 1980 onward, was a direct attempt to design a market structure that could contain adverse selection while preserving consumer choice. Enthoven proposed that a neutral sponsor—an employer or a government agency—would manage the market by standardizing benefit packages, collecting risk-adjusted premiums, and requiring insurers to accept all applicants. The goal was to channel competition toward quality and efficiency rather than toward cream-skimming the healthiest enrollees.
Managed competition did not reject the adverse selection framework; it absorbed it as a design constraint. Enthoven’s model assumed that without active management, insurers would compete by avoiding sick people rather than by improving care. The framework’s influence is visible in the design of Medicare Advantage, the Swiss health insurance system, and the Affordable Care Act’s insurance exchanges. Managed competition remains a living tradition, especially in policy circles, where it coexists with ongoing debates about whether sponsors can actually measure risk well enough to prevent selection.
As theoretical frameworks multiplied, a methodological divide opened over how to test and quantify their predictions. Two competing schools emerged in the 1990s and 2000s, each with a different philosophy about what empirical evidence should look like.
Structural econometric modeling, which gained prominence in health insurance research around 1990, treats insurance choice and healthcare demand as outcomes of an explicit economic model. Structural researchers specify utility functions, budget constraints, and information sets, then estimate the deep parameters—risk preferences, price sensitivity, health risk—that govern behavior. Once estimated, the model can simulate counterfactual policies: what would happen if deductibles were raised, or if a public option were introduced? The strength of the structural approach is its ability to predict behavior outside the range of observed data. Its vulnerability is that its predictions depend on the model’s assumptions, which may be wrong.
The credibility revolution in health economics, which accelerated after 2000, took the opposite tack. Influenced by the broader movement in empirical microeconomics, credibility-revolution researchers prioritize causal identification over model structure. They use randomized experiments, natural experiments, and quasi-experimental designs—such as the Oregon Health Insurance Experiment, which randomly assigned Medicaid coverage to low-income adults—to estimate the causal effect of insurance on outcomes like health, financial strain, and labor supply. The credibility revolution’s strength is that its estimates are transparent and assumption-light. Its limitation is that it can only answer questions about policies that have actually been implemented, not about novel designs.
These two methodological frameworks coexist in productive tension. Structural modelers argue that the credibility revolution’s estimates are local and cannot guide broad policy design; credibility-revolution advocates counter that structural models are only as good as their untestable assumptions. Many contemporary studies combine both approaches, using quasi-experimental variation to discipline structural parameters or using structural models to extrapolate experimental results to new populations.
Around 2000, a new wave of theory began questioning the rational-choice foundation shared by moral hazard, adverse selection, and managed competition. Behavioral health insurance theory draws on psychology and behavioral economics to show that consumers do not always choose insurance plans or use healthcare in the way that rational-choice models predict. People exhibit inertia, sticking with the same plan year after year even when switching would save money. They are loss-averse, overvaluing small copayments relative to large premiums. They misunderstand complex insurance contracts, underestimating the probability of rare but expensive events.
Behavioral theory does not replace the earlier frameworks; it complicates them. Moral hazard models assume that consumers respond to the marginal price of care, but behavioral evidence shows that many consumers do not know what that price is. Adverse selection models assume that consumers know their own risk and act on that knowledge, but behavioral research finds that people are overconfident about their health and fail to anticipate future needs. Managed competition assumes that consumers will shop for the best value among standardized plans, but behavioral inertia undermines that assumption.
The behavioral critique has led to practical innovations: default enrollment in high-value plans, simplified choice architectures, and “nudges” to encourage preventive care. It has also opened a live disagreement about how much weight to give rational-choice versus behavioral models in policy design.
Today, no single framework dominates health insurance theory. Researchers and policymakers draw on multiple frameworks depending on the question they are asking.
Arrowian market failure remains the default justification for government intervention in insurance markets. Moral hazard is the standard lens for designing cost-sharing and evaluating demand-side responses. Adverse selection guides risk adjustment and market regulation. Managed competition provides the blueprint for exchange design. Structural econometric models are used to forecast the effects of proposed reforms, while credibility-revolution methods are used to evaluate existing programs. Behavioral theory adds a layer of realism about how consumers actually behave.
On points of agreement, most frameworks accept that insurance markets are prone to failure without regulation, that information asymmetry is central, and that both demand-side and supply-side incentives matter. On points of disagreement, the frameworks split along two axes. The first axis is methodological: structural modelers and credibility-revolution empiricists disagree about how much assumption-driven modeling is acceptable. The second axis is behavioral: rational-choice frameworks and behavioral frameworks disagree about whether consumers are competent decision-makers or need active guidance.
The field’s current vitality comes from this pluralism. Researchers routinely combine structural estimation with quasi-experimental validation, or test behavioral predictions against rational-choice benchmarks. The core tension between risk protection and incentive distortion has not been resolved—it has been refined into a set of specialized tools, each suited to a different part of the problem.