Dear Matthew,in german there is the German Text Archive that provides complete dumps of the text, see: http://www.deutsches-textarchiv.de/download
[log in to unmask]" type="cite">
Apologies for any duplicates received due to cross-posting.
I am collecting links for publicly accessible, computable TEI (or other similar xml markup such as SGM, LMNL) files. In order to be included, archives/collections/datasets/corpora must have meet one of the two criteria:
Bulk download of raw xml (not html transformed)
Xml fully accessible via predictable url structure (an example of this would be the Walk Whitman archive, which as a “raw xml” link on every transformed html page)
Please note that I am not interested in sample xml, only collections with some kind of curatorial or scholarly focus. Thank you all for any leads!
Clinical Assistant Professor of English and Director of Digital Media Lab
University of Pittsburgh