How can a chemist predict the outcome of a reaction without ever stepping into a laboratory? That question has driven theoretical chemistry for more than three centuries. The field has always been an attempt to replace trial-and-error experimentation with mathematical and physical reasoning—first through qualitative rankings of affinity, then through thermodynamic laws, then through quantum mechanical equations, and most recently through data-driven models trained on quantum calculations. Each step forward has not simply replaced what came before; it has added a new layer of explanation, a new set of tools, and often a new set of trade-offs between accuracy, generality, and computational cost.
The earliest systematic attempt to predict chemical behavior was Chemical Affinity Theory (1700–1850). Chemists like Geoffroy and Bergman constructed tables ranking substances by their tendency to combine, treating affinity as a kind of chemical gravity. These tables could predict which reactants would displace others in simple exchanges, but they offered no explanation for why one substance had greater affinity than another. Affinity was a black-box property: useful for prediction within a narrow range, but incapable of generalization to new reactions or to the forces that held atoms together.
Daltonian Atomic Theory (1803–1900) transformed the problem by introducing a particulate model of matter. Dalton argued that each element consists of identical atoms with fixed relative weights, and that chemical combination occurs in simple whole-number ratios. This framework made sense of the law of definite proportions and gave chemists a language for describing reactions as rearrangements of atoms. Yet Dalton's atoms were featureless spheres; the theory said nothing about why atoms stick together or why some combinations are stable while others are not. Affinity remained a mystery, now attached to invisible particles.
Chemical Thermodynamics (1850–Present) sidestepped the question of microscopic forces entirely. Gibbs, Helmholtz, and others showed that the direction and extent of a reaction could be predicted from macroscopic properties—enthalpy, entropy, and free energy—without any knowledge of atomic structure. Thermodynamics gave chemists a powerful tool: the ability to determine whether a reaction would occur spontaneously and where equilibrium would lie. But it was a science of what happens, not why. It could not explain rates, mechanisms, or the shapes of molecules.
Statistical Mechanics (1870–Present) closed that explanatory gap by connecting macroscopic thermodynamic quantities to the statistical behavior of large ensembles of molecules. Boltzmann and Gibbs showed that entropy, for example, could be understood as a measure of the number of microscopic arrangements consistent with a given macroscopic state. Statistical mechanics did not replace thermodynamics; it provided a molecular foundation for it. The two frameworks now coexist: thermodynamics remains the practical tool for predicting reaction spontaneity, while statistical mechanics explains why those predictions work and extends them to systems far from equilibrium.
The first three frameworks—affinity, atomic theory, and thermodynamics—could describe what happened in a reaction but could not explain the chemical bond itself. That changed with the arrival of quantum mechanics in the 1920s.
Valence Bond Theory (1927–Present), developed primarily by Heitler and London and later extended by Pauling, treated the chemical bond as the overlap of atomic orbitals on adjacent atoms. In this picture, a bond forms when two electrons with opposite spins occupy the overlapping region, lowering the system's energy. Valence bond theory preserved the intuitive idea of localized bonds between specific pairs of atoms. It was remarkably successful at explaining the geometries of simple molecules and the directional character of bonds, and it became the language of organic chemistry through Pauling's concept of hybridization.
Molecular Orbital Theory (1928–Present), advanced by Hund, Mulliken, and later by Hückel and others, took a different starting point. Instead of building bonds from atomic orbitals, it treated the entire molecule as a single quantum system in which electrons occupy delocalized molecular orbitals that extend over the whole structure. This approach naturally explained phenomena that valence bond theory struggled with, such as the electronic structure of conjugated systems, the colors of transition-metal complexes, and the existence of molecules with unpaired electrons.
The two theories were not simply alternative formulations of the same physics; they embodied different commitments about how to think about molecules. Valence bond theory was chemically intuitive and worked well for ground-state geometries, but it became cumbersome for excited states and delocalized systems. Molecular orbital theory was mathematically more systematic and could handle a wider range of electronic states, but its delocalized orbitals were harder to map onto the familiar picture of individual bonds. For decades the two frameworks coexisted in a productive rivalry. Today, most practical quantum chemistry uses molecular orbital methods, but valence bond concepts—resonance, hybridization, bond order—remain indispensable for qualitative reasoning and for interpreting molecular orbital results.
By the 1960s, wavefunction-based methods (rooted in molecular orbital theory) could in principle solve the Schrödinger equation for any molecule, but the computational cost grew steeply with system size. A many-electron wavefunction depends on 3N coordinates (for N electrons), and accurate calculations quickly became impractical for molecules with more than a handful of atoms.
Density Functional Theory (1964–Present) offered a radical alternative. Hohenberg and Kohn proved that the ground-state energy of a many-electron system is a unique functional of the electron density—a function of just three spatial coordinates, regardless of the number of electrons. Kohn and Sham then provided a practical scheme for approximating this functional. DFT replaced the intractable wavefunction with a much simpler quantity, making it possible to treat systems of hundreds of atoms with reasonable accuracy.
DFT did not make wavefunction methods obsolete. Post-Hartree-Fock methods like coupled cluster theory remain more accurate for small molecules and for systems where electron correlation is critical. But DFT's favorable scaling—roughly cubic in system size, compared to exponential or factorial scaling for many wavefunction methods—made it the workhorse of computational chemistry. The trade-off is that DFT relies on approximate exchange-correlation functionals, and no single functional works well for all problems. Choosing the right functional for a given system is itself a skilled judgment. Despite this limitation, DFT dominates modern theoretical chemistry, especially for materials, surfaces, and large biomolecules.
Even DFT becomes expensive for systems of tens of thousands of atoms, such as an enzyme in solution or a nanoparticle. The response has been pragmatic integration rather than replacement.
Hybrid QM/MM Methods (1976–Present), pioneered by Warshel and Levitt, divide a system into a small quantum region (treated with DFT or a wavefunction method) and a large classical region (treated with molecular mechanics, a descendant of statistical mechanics). The quantum region captures bond breaking and formation, while the classical region provides the environmental electrostatic and steric effects. QM/MM did not reject either quantum or classical frameworks; it absorbed both into a single computational strategy. This hybrid approach is now standard for studying enzymatic reactions, photochemical processes in proteins, and reactions at solid-liquid interfaces.
Machine Learning Potentials (2007–Present) represent the newest layer. Instead of solving the Schrödinger equation directly, these methods train neural networks or other machine learning models on thousands of DFT calculations. Once trained, the model can predict energies and forces for new configurations at a tiny fraction of the cost of DFT. Machine learning potentials do not replace DFT; they depend on DFT for their training data. Their accuracy is limited by the quality and coverage of that data, and they cannot yet reliably extrapolate to regions of chemical space far from their training set. But for applications where speed is paramount—molecular dynamics simulations of millions of atoms, high-throughput screening of catalysts—they are becoming indispensable.
Theoretical chemistry today is a pluralistic enterprise. No single framework dominates all problems. Chemical thermodynamics and statistical mechanics remain essential for understanding reaction spontaneity, phase behavior, and transport properties. Valence bond theory still provides the conceptual vocabulary for organic chemistry. Molecular orbital theory and post-Hartree-Fock methods are the gold standard for small-molecule accuracy. DFT is the default for most routine calculations on medium to large systems. QM/MM methods bridge the gap between quantum and classical descriptions. Machine learning potentials are rapidly expanding the scale of what can be simulated.
What the leading frameworks agree on is that the underlying physics is quantum mechanical and that approximations are unavoidable. Where they disagree is on which approximations are worth making. Wavefunction methods prioritize accuracy and systematic improvability, accepting high computational cost. DFT prioritizes a favorable cost-accuracy balance, accepting that errors are system-dependent and hard to predict. Machine learning potentials prioritize speed and scalability, accepting that they are interpolative and require extensive training data. These are not disagreements about fundamental theory; they are practical judgments about how to allocate limited computational resources for different scientific questions. The history of theoretical chemistry suggests that this pluralism will persist, with each framework finding its niche and occasionally borrowing ideas from its neighbors.