Assuming names are not words seems like a strong statement, because of course they are. So in that sense is your encoding not misrepresenting something--especially if you are tagging everything else as words?
The question I would ask is 'is the issue you are trying to resolve a feature of the fact that the names are in fact words after all, or is it inherent to the the fact that they are names and just happen to share a feature with words.
Of the solutions, I actually prefer your option 2, since it sounds to me like the proposed feature request is really a request for sugar for w/name
Dan @ Treo
Mijn buurman is een beetje kaal. Misschien heeft hij minder angst.
-JMA Biesheuvel, De Angstkunstenaar
From: "Gabriel Bodard" <[log in to unmask]>
Subj: Re: segmented names
Date: Tue 11 Nov 2008 11:41
To: "[log in to unmask]" <[log in to unmask]>
I think this message was lost in the rush of the TEI MM. (Either that or
it is genuinely uninteresting to everyone.)
Let me state my options in the simplest possible terms:
(1) I assume (as I have been up to now) that a name is not a word, and
put in a feature request to add att.segLike attributes to name and
(2) I tag all segmented or incomplete names additionally with a <seg>
element. I don't much like it, but I can do.
Gabriel Bodard a écrit :
> If a word is segmented or the <w> tag contains an incomplete word, the
> segLike attribute @part is available to mark this word as segmented. The
> att.segLike class is not available on elements such as <name>,
> <persName>, <placeName>, however.
> It would seem to me that anything one might want to say about a word or
> other grammatical segment (that it is divided, that it has metre or
> rhyme or other function, that it has a lemmatized or normalized
> headword) one will also want to be able to say about names. In our
> corpus all strings of transcribed characters that we have been able to
> so resolve are tagged either as <w> or as <name>, with the result that I
> can segments words but not names.
> Any advice?
> Dr Gabriel BODARD
> (Epigrapher & Digital Classicist)
> Centre for Computing in the Humanities
> King's College London
> 26-29 Drury Lane
> London WC2B 5RL
> Email: [log in to unmask]
> Tel: +44 (0)20 7848 1388
> Fax: +44 (0)20 7848 2980
Dr Gabriel BODARD
(Epigrapher & Digital Classicist)
Centre for Computing in the Humanities
King's College London
26-29 Drury Lane
London WC2B 5RL
Email: [log in to unmask]
Tel: +44 (0)20 7848 1388
Fax: +44 (0)20 7848 2980