Sure, I mean this kind of structure:
No me digas.
No me dig-as
NEG 1SG.ACC say-SBJN.2SG
Don't tell me.
One or more 'source' lines, with one or more annotation lines.
The basic trick involves capturing the particular properties of
alignment between the tiers. The example above is very basic, a more
complex example might have POS tags on a tier and syntactic structure on
three cats danced the waltz
NUM NN VBP DET NN
0,5=S 0,2=NP 3,5=NP 2,5=VP
I can provide some references that describe this stuff in more detail if
it's helpful for anyone. In fact, I'm currently working on a review of
formats for this kind of data, so I'm trying to figure out how TEI fits
into that review.
On 04/18/2013 11:49 PM, Laurent Romary wrote:
> One possibility would be to use<cit> as a construct for representing the source text and further glosses, annotations, etc. Maybe you could provide an example to see the kind of use case you have in mind.
> Le 19 avr. 2013 � 07:22, Joshua Crowgey a �crit :
>> I'm researching XML formats for interlinear glossed text as found in
>> linguistic corpora. I've found a few pre-xml TEI discussions on the web:
>> Is the feature structure tagset still the relevant one for IGT? What's
>> the current best practice? Are there any linguistic projects using TEI
>> of note?
>> Happy Friday,
> Laurent Romary
> INRIA& HUB-IDSL
> [log in to unmask]