On 2011-08-26 23:15, Jens Østergaard Petersen wrote:
> On Aug 26, 2011, at 2:54 PM, Martin Holmes wrote:
>> I thought what he was basically saying was that _if_ this separation were rigid:
>> - elements (in<text> at least) contain transcribed content
>> - attributes contain editorial/interpretive/metadata content
>> then indexing and searching the original text would be much simpler.
> Hi Martin,
> I try to guess at the design principles behind TEI, seen as an XML application. To me it looks as if this separation is one of the guiding principles, though I don't believe I have ever heard anyone say so. Following discussions on the list, it appears to be second nature to the TEI community.
The above was more or less the case up to P4, after which we had the "war on
attributes" (ca. 2002; the attributes lost), nowadays in P5 there are many
cases, where you can't just take the stuff that is in black in Oxygen and
run with it as your text.
As Sebastian mentioned, it was at some time considered to go further down
this road and allow markup to be expressed either as attribute or as
element. At that time, rather early in the development of the ODD language,
the idea was abandoned (to my relief). I think the type of things you want
to express do belong in a layer different from the "text as such" and are
better handled using stand-off constructs.
Christian Wittern, Kyoto