Thanks for all your suggestions.
I should explain that I already have marked up a
fairly large character set on a stroke by stroke
basis. In this case, it was characters from texts from
Vindolanda, a Roman fort on Hadrians wall, in Old
Roman Cursive script from 90-103 AD. The dataset has
over 2000 characters, and comprises of nearly 5000
individual strokes. Yes, it took me months!
I devised the encoding scheme myself from information
that the papyrologists use when trying to read such
texts. It is a fairly large tag set. The texts were
encoded by using a simple graphical tool, which
resulted in XML represenations of the stroke data.
The reason for doing this was to provide an AI
character "agent" with data to make models of
characters, and compare unknown characters to these
models. Marking up the texts in XML provided a bridge
between image and textual data. The resulting system
propogates possible interpretations of unknown
characters: a tool to help papyrologists read damaged
texts. We have had some early success with this so
I confess that my XML markup is not TEI compliant -
yet. I think that it wouldnt take too much work to
bash it into shape, and to change my data files. I
guess my question to the list is, is there any point
in making such a markup TEI compliant? (I am aware of
the preservation of data issues). Is anyone else doing
such work, or does textual encoding tend to stop at
the character level? Is there room for markup on a
stroke level in the TEI, or would I be making
unnecessary work for myself ;)
--- Tim Romano <[log in to unmask]> wrote: > Melissa,
> Are you referring to the lines in printed glyphs
> (e.g. Caslon lower case
> Latin a) or scribal ductus?
> Tim Romano
> At 05:40 AM 2/5/03, Melissa Terras wrote:
> >Does anybody know of any work that
> >is being done on the markup of texts, not on a
> >character level, but an individual stroke level?
> >For example, the letter "L" has two strokes, each
> >which has a variety of properties that could be
Do You Yahoo!?
Everything you'll ever need on one web page
from News and Sport to Email and Music Charts