Is there an obvious place in the file headers of texts from a language corpus
to put standard biographical information about the authors? This information
would be of use to linguists interested in the features of the language of
authors of a specific age-group, place of birth etc.
Looking at the P2 Guidelines, possibilities include (all in file header):
1) As part of a taxonomy declaration within <encodingDesc>. See section 22.3.5
of chapter 5. Not ideal, and possibly not in the spirit of the tag.
2) By defining keywords as part of <profileDesc> <textClass>. See section
22.4.3 of chapter 5. Not easy to see how this would work for the author's
3) As attributes on the <author> tag in <sourceDesc>. However, only global
attributes are available there, so this would be a fudge really.
4) Using a special tag from the corpora DTD, e.g. <textDesc>. See section
26.2.1 of chapter 26. This seems a logical place, but none of the suggested
tags are relevant. An alternative might be to use <particDesc>
<participant>. See section 26.2.2 of chapter 26. However, an author is not
really a participant in a language interaction.
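
To make option 4 concrete, here is a rough sketch of what recording an author under <particDesc> might look like. Only <profileDesc>, <particDesc> and <participant> are named above; the attributes on <participant>, the <birth> child element and all the values are assumptions made up for illustration and have not been checked against the P2 DTD.

```xml
<!-- Hypothetical sketch: the attributes and the birth element are
     assumed for illustration and are not taken from the P2 DTD.
     All values are placeholders. -->
<profileDesc>
  <particDesc>
    <participant id="AUTH1" age="35-44">
      <birth>PLACE-OF-BIRTH</birth>
    </participant>
  </particDesc>
</profileDesc>
```

If something along these lines were valid, the <author> in <sourceDesc> could carry a matching global id so that the two descriptions can be linked, though that link is also only an assumption.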
King's College London