How does the human brain enable language? That question has driven neurolinguistics since its earliest days, but the answers have changed dramatically as new methods and theoretical commitments reshaped the field. The history of neurolinguistics is not a steady accumulation of facts about brain-language relations; it is a series of competing frameworks, each offering a different account of what kind of neural organization supports language and how that organization can be studied. Five major frameworks have defined this trajectory: the Lesion-Deficit Model, Generative Linguistics, the Modularity Hypothesis, the Cognitive Neuroscience Approach, and Connectionist Models. Their relationships—replacement, coexistence, challenge, and methodological infrastructure—tell the story of how the field arrived at its current state.
The first systematic framework for linking language to the brain emerged from clinical observations in the nineteenth century. The Lesion-Deficit Model, anchored by the work of Paul Broca (1861) and Carl Wernicke (1874), treated language as a set of functions that could be localized to specific cortical regions. Damage to a particular area, the reasoning went, produced a predictable deficit—loss of speech production after left frontal lesions, loss of comprehension after left temporal lesions. This model dominated neurolinguistics for over a century because it offered a clear, testable hypothesis: language is organized as a mosaic of specialized centers connected by white-matter pathways.
Yet the Lesion-Deficit Model carried a fundamental inferential limitation. It could only observe what happened when a region was missing; it could not directly observe what a healthy region did during normal processing. Lesion studies conflated the functions of a damaged area with the compensatory reorganization of surrounding tissue, and they could not resolve fine-grained temporal dynamics. By the mid-twentieth century, researchers began to feel the need for a framework that could ask more precise questions about the computational architecture of language, not just its gross anatomical correlates.
The rise of Generative Linguistics in the 1960s, spearheaded by Noam Chomsky, reframed the entire enterprise. Where the Lesion-Deficit Model had focused on mapping observable speech behaviors onto brain regions, generative theory argued that the primary object of study should be the abstract, unconscious knowledge that underlies language—a system of rules and representations. This shift had profound consequences for neurolinguistics: it transformed the questions researchers asked of the brain. Instead of asking "Where is speech located?", they began asking "Where in the brain are syntactic rules computed?" and "How does the neural substrate implement the distinction between competence and performance?"
At roughly the same time, the Modularity Hypothesis—most influentially articulated by Jerry Fodor (1983)—provided a specific neural interpretation of the generative program. Fodor argued that the mind is composed of specialized, informationally encapsulated modules, each dedicated to a particular cognitive domain. Language, in this view, was a prime candidate for modular organization: a fast, automatic, domain-specific processor that operates independently of general intelligence. The Modularity Hypothesis coexisted with Generative Linguistics, sharpening its predictions about brain organization. Where generative theory described a formal system, modularity claimed that this system is physically instantiated as a dedicated neural circuit, isolable from other cognitive systems.
Both frameworks, however, remained largely theoretical. They generated predictions about where language functions should be found, but they lacked the tools to test those predictions with the spatial and temporal precision that the claims demanded. The Lesion-Deficit Model could provide coarse localization, but it could not distinguish between a dedicated language module and a more distributed network that happened to be vulnerable to focal damage.
The Cognitive Neuroscience Approach, emerging around 1980, did not begin as a theoretical competitor to generative or modular accounts. Instead, it introduced a new methodological infrastructure that transformed how all earlier frameworks could be evaluated. The key innovation was the development of non-invasive neuroimaging techniques—first positron emission tomography (PET) and then functional magnetic resonance imaging (fMRI)—that allowed researchers to observe healthy brains engaged in language tasks. Where the Lesion-Deficit Model inferred function from pathology, neuroimaging made it possible to watch the intact brain at work. Where generative and modular theories made predictions about specialized circuits, neuroimaging could test whether those circuits actually existed.
The Cognitive Neuroscience Approach is methodologically dominant today because it provides the empirical platform on which all other frameworks must now make their case. It does not prescribe a particular theory of language; rather, it offers a set of tools—fMRI, EEG/MEG, diffusion tractography, and increasingly multivariate pattern analysis—that can be used to test predictions from any theoretical perspective. A researcher working within the generative tradition can use fMRI to look for brain regions that respond selectively to syntactic violations. A proponent of the Modularity Hypothesis can use the same technique to ask whether those regions are informationally encapsulated. The Cognitive Neuroscience Approach thus functions as a common empirical language, but it also carries its own assumptions: that the hemodynamic response measured by fMRI is a reliable proxy for neural computation, and that subtraction paradigms can isolate cognitive components. These assumptions are themselves debated, but the approach has become the default mode of inquiry in neurolinguistics.
While neuroimaging was providing new ways to observe the brain, a different kind of challenge to the generative-modular consensus was emerging from computational modeling. Connectionist Models, which gained prominence in the 1980s, proposed that language processing could be explained without innate rules or dedicated modules. Instead, connectionism treated language as the emergent product of distributed, parallel processing in networks of simple neuron-like units. Learning, in this view, is a matter of adjusting connection weights through exposure to input, not of acquiring a pre-specified set of rules.
Where the Modularity Hypothesis claimed that language is domain-specific and informationally encapsulated, connectionism argued for domain-general learning mechanisms and distributed representations. Where Generative Linguistics posited abstract symbolic rules, connectionism claimed that apparent rule-governed behavior could arise from statistical regularities in the input. This was not a minor disagreement; it was a fundamental clash over the very architecture of the language faculty. Connectionist models of past-tense acquisition (e.g., Rumelhart & McClelland, 1986) directly challenged the generative claim that children must innately know a rule for forming regular past tenses. The debate between symbolic and connectionist accounts of language processing remains one of the most active fault lines in neurolinguistics.
Today, the five frameworks coexist in a complex division of labor. The Lesion-Deficit Model has not disappeared; modern lesion-behavior mapping, using high-resolution structural MRI and voxel-based lesion-symptom mapping, has revived the logic of the classical model with much greater precision. Lesion studies remain uniquely valuable for establishing causal necessity—if a region is damaged and a function is lost, that region is necessary for that function in a way that correlational neuroimaging cannot demonstrate.
Generative Linguistics and the Modularity Hypothesis remain active as theoretical frameworks, but they have been transformed by their engagement with neuroimaging data. Many researchers now treat modularity as a graded property rather than an all-or-nothing feature, and generative models are increasingly constrained by what is known about neural computation. The Cognitive Neuroscience Approach provides the methodological backbone for most current work, but it does not settle the theoretical debates. Connectionist Models continue to offer a powerful alternative, especially in domains like speech perception and reading, where distributed representations and statistical learning are well supported.
What the leading frameworks agree on is that language is implemented in a large-scale network of left-hemisphere regions—including Broca's area, Wernicke's area, the superior temporal gyrus, and the arcuate fasciculus—and that this network interacts with other cognitive systems. They disagree sharply on the nature of that implementation. Modularity advocates see dedicated circuits for syntax, phonology, and semantics; connectionists see overlapping, task-sensitive networks shaped by experience. Generative linguists argue for innate, domain-specific principles; connectionists argue for domain-general learning mechanisms. The Cognitive Neuroscience Approach, for its part, provides the data that both sides use to support their claims, but it does not—and cannot—resolve the debate on its own. The tension between modular and distributed architectures, and between symbolic and statistical accounts of linguistic knowledge, remains the central unresolved problem of neurolinguistics.