For over a century, scholarly editing has been shaped by a fundamental question: what is a text, and how should an edition represent its authority? The answer has shifted repeatedly, driven by debates over authorial intention, the social nature of textual production, and the possibilities of digital representation. The subfield of text encoding and scholarly editing is best understood as a sequence of frameworks, each responding to the limitations of its predecessors, and many continuing to coexist in productive tension.
The earliest systematic framework, Lachmannian Stemmatics (1830–Present), aimed to reconstruct a lost original text by comparing surviving manuscripts and building a family tree (stemma) of textual relationships. Its goal was to recover the author's original words by eliminating scribal errors through recension. This reconstructionist approach assumed that a single authoritative version could be recovered through rigorous method.
Bédierist Best-Text Editing (1928–Present) emerged as a direct critique of Lachmannian stemmatics. Joseph Bédier argued that the stemmatic method often produced artificial, circular results—editors unconsciously shaped the stemma to favor their preferred readings. Instead, Bédier advocated selecting a single, historically attested manuscript as the base text and correcting it only where it was clearly corrupt. This framework preserved the documentary integrity of a specific witness rather than attempting to reconstruct a hypothetical original.
Documentary Editing (1930–Present) extended the Bédierist emphasis on historical witnesses but focused on non-literary texts such as letters, diaries, and legal documents. Its practitioners aimed to reproduce the source text faithfully, often with minimal intervention, preserving spelling, punctuation, and layout. Documentary editing coexisted with reconstructionist approaches, serving a different purpose: preserving evidence rather than restoring an ideal text.
New Bibliography (1900–1950) brought a new rigor to the study of printed books, particularly Shakespeare. Scholars like W. W. Greg and A. W. Pollard analyzed the physical processes of printing—typesetting, proofreading, press variants—to infer authorial intention from the material evidence of early editions. New Bibliography treated the printed book as a physical object whose features could reveal the author's original intentions, often obscured by compositors and publishers.
Copy-Text Editing (1950–Present), formalized by W. W. Greg, provided a practical rule for choosing a base text: the edition that best preserves the author's accidentals (spelling, punctuation) should serve as copy-text, while substantive readings could be drawn from other authoritative editions. Greg's rationale was that accidentals were most likely to reflect the author's habits in the earliest edition, while substantive changes might be authorial revisions. This framework became dominant in Anglo-American editing, especially for modern authors, and remains influential in print and digital editions.
Genetic Criticism and Editing (1970–Present) shifted attention from the final published text to the process of writing itself. Drawing on French literary theory, genetic critics study manuscripts, drafts, and notes—the "avant-texte"—to understand how a work came into being. Genetic editions present multiple stages of composition, often in parallel, rather than a single authoritative version. This framework challenged the final-text orientation of both reconstructionist and copy-text editing, arguing that the creative process is itself the object of study.
Social Textual Criticism (1986–Present), articulated by D. F. McKenzie and Jerome McGann, argued that texts are produced through collaborative social processes involving authors, editors, publishers, and readers. McKenzie's "sociology of texts" and McGann's "textual condition" emphasized that meaning is shaped by the entire network of production and reception. Social textual criticism rejected the notion of a single authorial intention, instead viewing texts as inherently unstable and collaborative. This framework provided the theoretical foundation for later digital and social editing practices.
The arrival of digital methods transformed editorial practice by making it possible to represent textual complexity in machine-readable form. TEI Encoding (1987–Present) established a community-governed standard for marking up texts in XML. The Text Encoding Initiative's Guidelines provided a vocabulary for encoding structural, semantic, and editorial features—from paragraphs and headings to variant readings, deletions, and annotations. TEI's key contribution was interoperability: editions encoded in TEI could be shared, searched, and processed across platforms. The framework operationalized earlier editorial theories by allowing editors to tag multiple witnesses, record variant readings, and encode genetic layers within a single digital file.
Descriptive Markup and OHCO (1990–Present) provided the theoretical justification for TEI's hierarchical structure. The Ordered Hierarchy of Content Objects (OHCO) model, proposed by Steven DeRose and others, argued that text is inherently hierarchical—chapters contain sections, sections contain paragraphs, and so on. This model aligned with the tree structure of XML, making TEI's nested tags a natural fit for representing textual structure. However, the OHCO model soon faced criticism for its inability to handle overlapping hierarchies, such as a verse that spans two paragraphs or a marginal annotation that interrupts a sentence.
Digital Scholarly Editing (1990–Present) emerged as a practical synthesis that could accommodate genetic, social, and documentary approaches within a digital environment. Unlike print editions, digital editions could present multiple versions, link transcriptions to facsimiles, and include multimedia annotations. The digital medium allowed editors to escape the linear constraints of the book, offering readers the ability to navigate between a diplomatic transcription, a critical text, and a genetic reconstruction. Digital scholarly editing did not replace earlier frameworks but absorbed them: a digital edition could encode a copy-text, record variant readings from multiple witnesses, and display the genetic process, all within a single TEI-encoded file. This flexibility made digital editing the dominant practice for new scholarly editions, though it also raised new questions about sustainability, interface design, and long-term preservation.
Stand-Off Markup (1997–Present) directly addressed the limitations of the OHCO model. Instead of embedding all annotations within the text's hierarchical structure, stand-off markup stores annotations in separate files that point to locations in the base text via XPath or other addressing schemes. This approach allows overlapping hierarchies, multiple annotation layers, and independent management of different editorial perspectives. Stand-off markup coexists with TEI's inline encoding; many projects use a hybrid approach, embedding structural markup while storing critical annotations externally.
Social Scholarly Editing (2012–Present) extends the insights of social textual criticism into the digital realm. Where earlier digital editions were typically produced by a single editor or small team, social scholarly editing invites broader participation—crowdsourced transcription, community annotation, and collaborative revision. Platforms like Wikisource and the Ben Jonson's Walk project exemplify this model. Social scholarly editing does not replace traditional editorial authority but distributes it, acknowledging that textual meaning is negotiated among many stakeholders.
Linked-Data Scholarly Editions (2015–Present) represent a shift from document-centric to data-centric models. Instead of encoding an edition as a single TEI XML file, linked-data editions publish structured data using RDF and ontologies, connecting people, places, works, and events to external resources like VIAF or GeoNames. This framework diverges from TEI's XML tree structure by treating textual information as a graph of interrelated entities. Linked-data editions enable querying across multiple editions and integration with the semantic web, but they also require different editorial workflows and raise questions about how to represent textual nuance in a triple-based model.
Today, multiple frameworks remain active, each serving different editorial needs. Lachmannian stemmatics continues to be used for classical and medieval texts where reconstruction is the goal. Bédierist best-text editing and documentary editing remain standard for historical documents where fidelity to a single witness is paramount. Copy-text editing still guides many modern author editions. Genetic criticism and social textual criticism inform editorial theory and digital design. TEI encoding is the de facto standard for digital scholarly editions, though stand-off markup and linked-data approaches are gaining traction for complex projects.
The leading frameworks today—Digital Scholarly Editing, TEI Encoding, and Linked-Data Scholarly Editions—agree that digital editions should be open, interoperable, and sustainable. They disagree on the best data model: TEI's hierarchical XML versus linked data's graph structure. They also disagree on the role of editorial authority: social scholarly editing advocates for distributed participation, while traditional digital editing maintains the editor's central role. These tensions are productive, driving innovation in how we represent, share, and interpret texts in the digital age.