I am a little worried about this. Do you have practical evidence, Martin, that people
who are willing to correct the XML files will only do so if they can work with
fragments of the form <PAGE>…..</PAGE>? I am sure a system _could_ be set up
to extract the fragments into well-formed expanded XML, and then put back in
the originals, but checking the put-back hasn’t corrupted the non-page structure
seems quite problematic.
My feeling is that anyone capable of editing a TEI XML file at all is also
capable of finding the right <pb/> for the facsimile image they are staring at,
and editing the right XML.
May I, with respect, suggest that you are over-problematizing this?
Chief Data Architect
University of Oxford IT Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431