I am currently working on a visualisation system to explore XML corpora.
I would be interested in testing it with serveral richly encoded corpus.
By corpus, I don't mean a <teiCorpus> but rather a collection of similar
documents encoded in TEI preferably or XML.
I haven't been able to find such corpora by following links on the TEI
page because, even when tei documents are available, they are not
presented as Corpus and should be downloaded using a tedious process.
I have already used my visualisation system on the shakespeare plays
distributed by Jon Bosak and on a local corpus of 100 transcripted
manuscripts. Ideally, a corpus of 50 to 400 homogeneous documents
encoded with details would be perfect. This is only for research
purposes and contributors will be notified of the results.
I hope such corpora exist. Thanks in advance to providers.
Ecole des Mines de Nantes, 4 rue Alfred Kastler, La Chantrerie,
BP 20722, 44307 Nantes Cedex 03, France
Voice: +33-2-51-85-82-08 | Fax: +33-2-51-85-82-49
[log in to unmask] | http://www.emn.fr/fekete/