I don’t want to stop you having fun, Eric, but some of this is already done for you. From
you can download CSV and JSON versions of the whole metadata catalogue, and has a browsable, sortable
HTML table with all that data, though perhaps not all the data categories you want. The ability of jQuery DataTables to cope with 60,000 rows of data
is rather awesome.

and of course HTML for all the texts is available from  too.

I did try putting up an eXist database with all the texts for searching, but couldn’t justify to myself the effort of maintaining it.
Wolfgang Meier did a better job of this, but there isn’t enough space in the margin to remember the URL for that.

the more text analysis the better, natch :-}

Sebastian Rahtz
Chief Data Architect
University of Oxford IT Services
+44 1865 283431