I'm currently involved with a project which is moving towards marking-up
transcriptions of some Greek papyrii (and fragments), to enable the papyrii
(and hopefully the metadata associated with them) to be searched and
We have (or so we thought) established a DTD for this purposes, basically by
using the same TEI Tagsets as used by the Duke Databank of Documentart
Papyrii in the Perseus system
TEI.prose, TEI.names.dates, TEI.linking, TEI.transcr, TEI.figures,
TEI.textcrit, and then writing our own encoding guidelines (though
following, of course, the general TEI guidlelines) on how the tagging should
be applied to represent specific features of these texts (changes of hand,
dubious characters, reading marks, accents, orthographical phenomena, etc. -
not all of these features are current;ly marked up in the Duke texts). It
should be noted that the Greek itself is encoded in betacode (as used by the
TLG), but without any of the special betacode "escapes" (the TEI mark-up is
used instead). It should also be noted that ideally XML rather than SGML
documents are to be produced, to make subsequent processing simpler.
However, it has come to my attention that there appear to be a number of
other guidelines (all TEI-based) for marking up documents such as these,
1) Epidoc ("guidelines for structured markup of epigraphic texts":
http://www.stoa.org/markup/epidoc02.html) - whilst the markup requirements
of epigraphers may be different from that of papyrologists, it would seem,
however, that many of the features marked up are the same.
2) Digitial Scriptorium transcription dtd
(http://sunsite.berkeley.edu/Scriptorium/transcription.html) - whilst this
is primarily for the transcription of medieval manuscripts, many of the
kinds of things marked up are the same as those that would be marked-up for
3) TLG dtd? - as I understand from Nick Nicholas' email to TEI-L on 3 March,
the TLG are converting their holdings to XML, and are selecting (TEI?)
tagsets to use for this, and will presumably be writing encoding guidelines
appropriately. As the TLG is concerned with Greek texts too, obviously this
is of interest to us.
What I would like to know before proceeding with large scale mark-up of the
papyrii is - should I be recommending use of one of these DTDs and guidlines
instead of the ones we have developed? I believe what we have developed will
do the job, but I'm worried about doing something one way if everyone else
is using something else!
Is anyone else involved in any similar projects in this area (i.e.
papyrii) - if so, what DTD and guidelines are you using? Any help or
suggestions would be gratefully received!
Paul Groves, HCDT Project Officer
Humanities Computing Development Team
OUCS, 13 Banbury Rd, Oxford, OX6 2NN.
Tel: +44 (0)1865 273 290 Web: