How can the vast, messy record of scholarly communication be measured, mapped, and understood? That question has driven bibliometrics for nearly a century, and it has never received a single settled answer. The field's history is a series of methodological frameworks, each responding to the limits of earlier tools while preserving what worked. Some frameworks now coexist as complementary approaches; others remain in productive tension. The central pressure has always been the same: quantitative methods promise objectivity and scale, but what they measure—citations, links, downloads—is never a pure signal of quality or influence.
Before bibliometrics had a name, it had a set of powerful descriptive laws. In the 1920s and 1930s, researchers like Alfred Lotka, Samuel Bradford, and George Zipf observed regular patterns in the distribution of scientific productivity, article dispersion across journals, and word frequencies. Lotka's law described how a small fraction of authors produce most publications; Bradford's law showed that a core of journals accounts for most articles in a field; Zipf's law captured word-frequency distributions. These were not explanatory theories but empirical regularities, and they gave the field its first quantitative toolkit. The label "Statistical Bibliography" captured this early identity: a descriptive, document-centered enterprise that counted and distributed rather than evaluated or mapped.
Statistical Bibliography was never really replaced. Instead, its laws were absorbed into later frameworks as background assumptions. When later bibliometricians built citation networks or mapped science, they still relied on the skewed distributions that Lotka, Bradford, and Zipf had documented. The shift was not a rejection but a transformation: the field moved from describing static patterns to analyzing dynamic relationships.
The decisive break came in the 1960s with Eugene Garfield's creation of the Science Citation Index (SCI). For the first time, citations could be systematically tracked across millions of articles. This single innovation split bibliometrics into two enduring traditions.
Evaluative citation analysis used citation counts as proxies for impact, influence, or quality. Universities, journals, and individual researchers began to be ranked by how often they were cited. The method was powerful and seductive, but it also attracted criticism: citations could be strategic, self-serving, or simply conventional. The evaluative tradition never claimed that citation counts equaled truth, but in practice they were often treated that way.
Relational citation analysis took a different path. Instead of counting citations, it traced the links between documents to reveal the structure of scientific communities. This relational tradition became the foundation for network-based approaches that followed. Citation Analysis, in both its evaluative and relational forms, remains active today. The evaluative branch is embedded in research assessment systems worldwide; the relational branch continues to evolve through network science and visualization.
Two frameworks extended Citation Analysis in complementary directions, and they are best understood together. Bibliographic Coupling, introduced by Myer Kessler in 1963, linked two documents if they shared references in their bibliographies. The coupling strength was fixed at the moment of publication: two papers that cited the same set of earlier works were permanently related, regardless of how later scholars used them.
Co-citation Analysis, developed by Henry Small and Irina Marshakova in 1973, inverted the logic. Two documents were linked if they were cited together in later publications. This meant the relationship was dynamic: as scholarly attention shifted, co-citation patterns changed. A classic paper could become co-cited with a new discovery, revealing emerging specialties.
These two frameworks are not competitors but complementary tools. Bibliographic Coupling captures stable, author-intended relationships; Co-citation Analysis captures evolving, community-driven connections. Together, they gave bibliometrics a way to study both the fixed structure of a field's literature and the fluid boundaries of its research fronts.
Science Mapping emerged as a distinct methodological school that absorbed and coordinated the relational techniques developed by Citation Analysis, Bibliographic Coupling, and Co-citation Analysis. Its core commitment was not just to measure relationships but to visualize them as maps of scientific fields. By the 1990s and 2000s, Science Mapping had developed its own toolkit: multidimensional scaling, cluster analysis, and network layouts that turned citation networks into two-dimensional landscapes.
Science Mapping differs from earlier relational frameworks in its explicit goal of creating interpretable visual representations. A co-citation map of a research field, for example, shows clusters of closely related documents, with distances representing similarity. Science Mapping remains a leading framework today, especially in research evaluation and science policy, where maps of scientific activity help identify emerging fields, interdisciplinary bridges, and national research strengths.
Scientometrics, which took shape in the 1970s, broadened bibliometrics' scope beyond documents and citations. It asked questions about the entire research enterprise: how funding correlates with output, how collaboration patterns affect productivity, how national science systems compare. Scientometrics absorbed the evaluative tradition of Citation Analysis but extended it to patents, grants, policy documents, and institutional data.
This broadening created a tension. Scientometrics became the framework most closely tied to research evaluation and science policy, and with that came responsibility. By the 2010s, the misuse of metrics—especially the journal impact factor and the h-index—had sparked a backlash. The Leiden Manifesto (2015) and the San Francisco Declaration on Research Assessment (DORA) called for more responsible use of quantitative indicators. Scientometrics today is not just a set of methods but a field engaged in self-criticism about the social consequences of its own tools.
Altmetrics emerged around 2010 from a simple observation: scholarly influence increasingly happens outside traditional citation channels. Articles are tweeted, blogged, bookmarked, downloaded, and discussed on social media. Altmetrics proposed to capture these diverse traces of attention, offering a faster and more democratic alternative to citation-based measurement.
Altmetrics did not reject Citation Analysis outright; it challenged its monopoly. The two frameworks share an infrastructure assumption that quantitative traces can indicate influence. But where Citation Analysis relies on the slow, curated process of formal citation, Altmetrics embraces the noisy, real-time world of online engagement. This has led to ongoing disagreements about what altmetrics actually measure—attention, not necessarily quality—and whether they are vulnerable to gaming. Altmetrics remains a living tradition, coexisting uneasily with evaluative citation analysis while pushing the field to consider broader definitions of scholarly impact.
Today, bibliometrics is a field of coexisting frameworks, each with its own strengths and blind spots. Citation Analysis (both evaluative and relational), Bibliographic Coupling, Co-citation Analysis, Science Mapping, Scientometrics, and Altmetrics are all active. They agree on one fundamental point: quantitative methods can reveal patterns in scholarly communication that are invisible to qualitative judgment alone. They disagree on what those patterns mean and which traces are worth counting.
The deepest disagreement is between the evaluative and relational traditions. Evaluative approaches treat citation counts as proxies for impact and are embedded in institutional decision-making. Relational approaches treat citations as traces of intellectual structure and resist their use as performance indicators. Altmetrics adds a third axis, arguing that the web has created new forms of influence that neither tradition captures.
This pluralism is not a sign of weakness. It reflects the field's recognition that no single metric or map can answer every question about scholarly communication. The responsible metrics movement, which grew out of Scientometrics' self-critique, now shapes how all frameworks are applied. The challenge for students of bibliometrics is not to choose one framework but to understand what each reveals, what each obscures, and how they can be used together without conflating measurement with meaning.