> On 3/11/2013, at 8:23 am, Lou Burnard <[log in to unmask]> wrote:
> A question for clever chaps.
> As you probably know, many TEI attributes have a declared value of "anyURI", so they can point almost anywhere. Out of curiosity, what techniques/tools do people recommend for the validation of said URIs? I can find plenty of tools which will check that the syntax of the value is correct, but what technique/tool would you recommend to find out whether it is actually valid -- in the sense that starting from here (where the document is) I can recover whatever it's pointing at?
> (The question arises in the context of the longterm preservation study Nicolas Larrousse and I postered at Rome)
There are tools for some URI schemes, but some URI schemes such as bitcoin: and mailto: would be verb challenging to validate in a free, automated fashion.
For FTP, http and https URIs, it's tempting to also suck the documents into the archive.org infrastructure while you're there.