I was reusing an EEBO TEI file as a template for another context and I
noticed a couple of unusual features. I'm not sure whether these are
design features or accidents of history
My working example (picked at random) is
*) The primary identifier within the XML-based system is A00005, but
there is no xml:id="A00005" or similar easily XSLT-matchable attribute
(I was expecting to see it on the root element).
*) There is a complete lack of xml:id / id tags
*) There is no xml:lang tag covering the textual content in the header
*) The <availability/> tag has a license URL, but doesn't use the
<licence/> tag as the container for it. This is a significant barrier
to automated license detection and thus automated reuse.
*) The <language> tag and xml:lang both use three letter language
codes (I believe that http://tools.ietf.org/html/bcp47 forces us to
prefer two letter language codes where available)
*) The textual content of the header contains a LOT of proper nouns
with corresponding websites, but none are linked using <name/> or
Some of these are probably pretty trivial fixes, if they're
brokenness, maybe they could be considered for the next times the
files are being updated?
...let us be heard from red core to black sky