Not all of these defects disqualify files from being TEI, and some of them have relative straight forward technical fixes...
If you commit your script to the github repo, I'll see what I can do.
> On 22/11/2013, at 3:46, Sebastian Rahtz <[log in to unmask]> wrote:
> I amused myself looking at these 258 examples and seeing whether they were valid TEI.
> It is rather sad to say that only about 30% seem to pass this simple test. Some reasons include
> * TEI P4 not P5
> * incomplete fragment
> * no namespace
> * reference to non-existent DTD
> as well as the straightforward “wrong against the TEI” ones.
> What conclusions one can draw is another matter :-}