Developing a virtual research environment for scholarly editing. Arthur Schnitzler: Digitale Historisch-Kritische Edition
July 17, 2013, 10:30 | Short Paper, Embassy Regents F
Introduction
Arthur Schnitzler: Digitale historisch-kritische Edition. Werke 1904-1931 is a long-term project founded by the Nordrhein-Westfälische Akademie der Wissenschaften with a runtime of 18 years. The aim is to create a digital critical edition of both the majority of works published during Schnitzler's lifetime and his literary estate, providing scholars with an up-to-date, philologically dependable textual basis.
The first project phase (2012-14) is primarily about conceptional groundwork and focuses on two texts, Fräulein Else and Komödie der Verführung which were chosen both for their literary value and their usefulness as a testbed. Editing them will go hand in hand with finalizing the editorial principles and establishing the digital workflow.
To this end, the team of editors at Bergische Universität Wuppertal closely cooperates with computer scientists at Trier Center for Digital Humanities who are developing a customized virtual research environment. Based upon a unified data model, it will support all editorial steps from entering the meta data through transcribing the textual witnesses to preparing the online publication and printouts. This paper shall outline the philological challenges of the project and show how the projected platform is to meet them.
Initial Situation and Objectives
For scholars working on Schnitzler’s writings, the availability of a philologically well founded textual basis is currently a key desideratum. Arthur Schnitzler is probably the only German-language author of literary world rank of whom there are to this day neither a commented student edition nor a (historical) critical edition. The project seeks to remedy this situation by:
- Providing the first edition of most of the works published during Arthur Schnitzler‘s lifetime with an authentic, critically constituted text.
- Providing a genetic edition of the major part of the literary estate (belonging to both published and unpublished work).
- Making the whole edited material accessible by text-critical comments and contextualizing its literary-historical background
Given this constellation, the integral approach of the projected edition has to be emphasized: The edited material in its diversity is to be united within a homogeneous central data pool, allowing for different types of output. Following the principle of single-source publishing, the projected public online edition could be completed in the future by an annotated student edition or even a critical edition.
Challenges in Scholarly Editing
The new edition of Schnitzler’s — published as unpublished — oeuvre is rich in philological challenges. It presents itself as a dynamic yet coherent system of most diverse transformation processes, branchings and cross-links. The individual literary "work" proves to be an unreliable unit; Schnitzler used to work for several years or even decades on a text. In this process, not only the genre would occasionally change, but there also may occur genetic bifurcations (one draft becoming multiple works) or fusions (several drafts being merged into one work).The project thus chooses the text witness in its materiality as a basis, prioritizing the categories of topography, genesis and intertextuality. Hence, the individual witness has to be described not only in its semantics, but also in its physical form and its processuality (writing stages). Moreover, the cross-links and correspondences of sequences and fragments between text witnesses are of particular interest.
Literary Computing and Software Development
In terms of literary computing, two aspects are to be emphasized: Firstly, the various requirements in scholarly editing (meta data capture, transcription, annotation, linking text and facsimile, linking text sequences with each other, collation, preparation of indices and critical apparatus) can only be met with a series of appropriate tools.
Secondly, these tools must be coordinated. In practical terms, this means above all that they have to fit into a unified philological workflow. This brings to the fore a facet that is often considered of minor interest in digital editorial projects: the aspect of the data format.
It is essential to determine a data model which can serve as a basis for the whole philological process, possibly without laborious intermediate conversions. From meta data collection through transcription and annotation to topography, networking and publication, all editorial stages need to be supported. This is complicated all the more by a set of overlapping hierarchies:
Firstly, there is the documentary view (with reference to the material witness, i.e. the single sheet), secondly the semantic-textual view (the text in its philological structure), thirdly sequences of any size have to be linked. This is not a trivial constellation for the XML-format, and the project currently evaluates how different standards deal with this problem. The benchmark are both the guidelines of the TEI and their concrete implementation in current projects (such as the digital Faust-edition). However, some of the proposed strategies, such as a splitting up into multiple files with different markup or the breaking up of subsidiary hierarchy levels, may add great complexity to the editor’s work. So we currently test if the workflow could alternatively build on a database system, out of which the current processing status can be exported or archived any time into an international, platform-independent standard.
Virtual Research Environment
The tools needed for the project are currently being developed and made interoperable at the Trier Center for Digital Humanities. As for the meta data management, an adapted module of the Forschungsnetzwerk und Datenbanksystem FuD is used which allows the physical description of textual witnesses and their often complex tradition via specially customized input masks. For some other tasks (see below), currently existing solutions will be evaluated and tested in terms of usability. However, the core functionality is to be covered largely by Transcribo, an entirely new transcription, topography and edition software.
Transcribo
Transcribo is developed in close collaboration between computer scientists and philologists of the project partners. The graphical user interface is centered around the digital facsimile, i.e. usually the scanned witness. Units of varying extent (e.g. words, lines or paragraphs) can be marked by means of a rectangle-or polygon-tool, and then be transcribed and annotated. Each image file is displayed twice: the original can be examined on the left while the view on the right is used as a work space, where the transcribed text is placed topographically exact on top of the slightly attenuated facsimile. Where the spatial arrangement does not match the textual word order, words can be combined into sequences to preserve the semantic relationships in the transcript. As a central feature, the program further allows to mark genetic and philologically relevant phenomena in each transcribed unit and add annotations to them. This is done by a context menu with a selection of project-specific options. These include so far different variants of corrections (such as immediate corrections or late corrections with single, double or multiple strikethroughs and overwritings), the marking of highlightings, uncertain readings and non-identifiable graphs. This selection, however, can be extended and will be adjusted over the entire course of the project to meet the requirements of the underlying material.
Fig. 1:
Transcription and annotation in Transcribo.
Further Extensions
FuD and Transcribo lay the basis for scholarly editing in an online environment by covering the key requirements: meta data management, transcription, topography description and basic annotations. This nucleus will be expanded over the next years to a virtual research environment that supports the entire project-specific workflow in a unified interface. This implies the integration of a number of additional features:
- Collation; here it has to be stressed that we will not only have to compare fixed texts, but also to identify similar sequences in a large data pool.
- The linking of freely definable sequences within or between text witnesses.
- The identification of dependencies between witnesses, in particular the creation of genetic paths.
- A publishing program for printouts.
- August Wilhelm Schlegel "Kritische Ausgabe der Vorlesungen“ Band IV–VI
- Digitalisierung und elektronische Edition der Korrespondenz August Wilhelm Schlegels
- Wolfgang Koeppens „Jugend“