> I understand the TEI has adopted RFC3066 for language tagging
> within archived documents. Can you tell me whether the TEI use the
> Language tagging standard (RFC) just within software (thus
> automated tagging) or do TEI members tag documents "by hand".
I can't say with any confidence what the majority of TEI projects
might do. However, I can say with certainty that the vast majority,
if not all, of the projects with which I have been directly involved
either do no language tagging at all, or apply language identification
to passages of text by hand. In P5 encoding this is accomplished
using the RFC 3066 (soon to be 4646 or BCP 0047 or whatever) tag on
an xml:lang= attribute. P4 encoding does not explicitly make use of
standard language tags.
> We are currently discussing the Registry format (currently Record
> Jar) on the IETF-LTRU and I was wondering whether anyone here has
> any opinions on the format.
I'm sorry, I'm afraid I don't understand what the Registry format is
or is for. Does this have to do with the "Language tags in HTML and
XML (Working Draft in review)" W3C document that I've only just begun
I hope this helps (even though I doubt it)-: