I have a question: Does anybody know of any work that
is being done on the markup of texts, not on a
character level, but an individual stroke level?
For example, the letter "L" has two strokes, each one
which has a variety of properties that could be
In case you think I'm as mad as a box of frogs, the
reason for doing this is to provide the data to train
a stochastic AI architecture to effectively "read"
textual data. As a newbie to TEI I would like to ask
if anyone (else) has come up with an encoding scheme
to markup text in this manner, on a stroke by stroke
Also, is there room for such an encoding scheme in the
TEI? Where does "text" stop - on the character level,
or below, on the stroke level?
I'd be really interested in hearing your reponses.
Dr Melissa Terras
Royal Academy of Engineering
29 Great Peter Street
Do You Yahoo!?
Everything you'll ever need on one web page
from News and Sport to Email and Music Charts