On Mon, 2004-06-28 at 11:26, Sebastian Rahtz wrote:
> Peter Flynn wrote:
> >I am looking for an XML set of character entity declarations
> >for the TEI Extra Greek Classical Letters
> I am confused. what extra Greek letters are you talking about?
Those declared in "-//TEI TR1 W4:1992//ENTITIES Extra Classical Greek
> if the characters are in Unicode, why not just encode them directly?
> why go via character entities?
They are *already* in existing (SGML) files as character entities.
I want to automate their conversion to Unicode characters for XML,
rather than hand-edit the files.
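A minimal sketch of that kind of automated conversion, in Python, might look like the following. It assumes every entity reference in the files is written in full with a closing semicolon, and that a lookup table from entity name to Unicode character is available. The table here is illustrative only: "agr" and "bgr" are the ISOgrk1 names for alpha and beta, while "example.apsili" is a hypothetical name standing in for whatever the TEI set actually declares.

```python
import re

# Illustrative mapping from entity name to Unicode character.
# "agr" and "bgr" come from ISOgrk1; "example.apsili" is a made-up
# placeholder, NOT a name taken from the TEI entity set.
ENTITY_TO_CHAR = {
    "agr": "\u03b1",             # GREEK SMALL LETTER ALPHA
    "bgr": "\u03b2",             # GREEK SMALL LETTER BETA
    "example.apsili": "\u1f00",  # GREEK SMALL LETTER ALPHA WITH PSILI
}

# Matches references of the form &name; (assumes the semicolon is present).
ENTITY_REF = re.compile(r"&([A-Za-z][A-Za-z0-9.\-]*);")


def replace_entities(text, table=ENTITY_TO_CHAR):
    """Replace known entity references with characters; leave others alone."""
    def substitute(match):
        name = match.group(1)
        # Unknown names (e.g. &amp;) are returned unchanged.
        return table.get(name, match.group(0))
    return ENTITY_REF.sub(substitute, text)
```

References not found in the table pass through untouched, so predefined XML entities such as &amp; survive, and anything left unconverted can be found afterwards by searching the output for "&".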
Having said that, there are many reasons why one might want to keep
ASCII encoding for archive copies of TEI documents, rather than use
UTF-8 files, not the least of which is that many potential users of
TEI documents are not in a position to use UTF-8 conformant software.
> >If it's not in existence yet, we have some resources here
> >to do it, assuming all the required code points are indeed
> >defined in Unicode Greek Extended (1F00-1FFF)...are they?
> I don't think, myself, that the TEI is in the business of maintaining
> character entity sets.
I wasn't implying that the TEI should: I was asking the TEI-using
community if they knew of anyone who had already done this task.
> Especially since there is nowhere to put them in a schema world :-}
All the more reason why I need to convert the existing documents to