Marcus Bingenheimer writes:
> There is a markup scheme for furigana (http://www.w3.org/TR/ruby/) which could be bound into TEI by namespaces, I guess.
> Since Annotation seems to be the common denominator of both the
> syntactic kaeriten and the phonological information in the furigana, I
> would think that kaeriten too count as some kind of Ruby (the W3C def
> seems to be: "Ruby" are short runs of text alongside the base text,
> typically used in East Asian documents to indicate pronunciation or to
> provide a short
Not really, see below.
> That seems a starting point.
It certainly would be one. Another possible route within the TEI
would be to employ the <choice> element together with <seg>. There
has been some work on similar cases, e.g. linguistic annotation of
Japanese, or Sanskrit segmentation (see
http://www.tei-c.org/Activities/CE/cew12.pdf).
However, the issue David brought up is in fact a bit more complex (and
that is why I said it had not been done to my knowledge), since it
(specifically, the "kaeriten" mentioned above) also involves
reordering chunks of the text to account for the differences between
Chinese and Japanese sentence order.
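
To make the reordering point concrete, here is a rough sketch in Python of
what a processor would have to do for the simplest kaeriten, the "re" mark
(レ), which says that the following character is read before the one the
mark is attached to. Everything here is an assumption made for
illustration: the inline encoding (a literal レ placed after the marked
character), the function name reorder_re_ten, and the restriction to the
re mark alone. The ichi-ni and jo-ge marks, which move larger chunks, are
not handled, and the base text is assumed not to contain a katakana レ of
its own.

```python
RE_MARK = "レ"  # assumed inline encoding of the re-ten; not a TEI or W3C convention


def reorder_re_ten(annotated: str) -> str:
    """Return the base characters in Japanese reading order.

    Hypothetical input format: a RE_MARK directly after a character means
    "read the next character before this one". Characters linked by
    consecutive marks form a chain that is read back to front.
    """
    result = []
    chain = []  # characters linked together by re marks
    i = 0
    while i < len(annotated):
        chain.append(annotated[i])
        if i + 1 < len(annotated) and annotated[i + 1] == RE_MARK:
            # The chain continues with the next base character; skip the mark.
            i += 2
        else:
            # Chain ends here: emit it in reverse, i.e. in reading order.
            result.extend(reversed(chain))
            chain = []
            i += 1
    # Flush a chain left open by a trailing mark with nothing after it.
    result.extend(reversed(chain))
    return "".join(result)


if __name__ == "__main__":
    print(reorder_re_ten("不レ読レ書"))
    print(reorder_re_ten("我読レ書"))
```

The point of the sketch is only that the markup has to carry enough
information to derive a second reading order from the same base text,
which is more than a ruby-style gloss alongside the text provides.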
Institute for Research in Humanities, Kyoto University
47 Higashiogura-cho, Kitashirakawa, Sakyo-ku, Kyoto 606-8265, JAPAN