For Martin Mueller:
I confirm that we are going to create a XML document of some books of
bible conforming to a particular print edition.
Lou Burnard wrote:
>Let's get this clear.
><seg> is the generic tag for linguistic segmentation, however motivated.
><s> is a special case of <seg> (it is in fact syntactic sugar for <seg
>type="s">. Its purpose is to mark up linguistic segmentation of some
>There are two important differences between <s> and <seg>:
>(a) if <s> is used, it must segment the whole of the text end-to-end. It
>was originally conceived of as being used for language corpora, where
>you want to be sure that every word is inside *something* so you can use
>that to reference it.
>(b) following on from that, <s> cannot appear within <s>. If you want to
>segment units within <s>, therefore, you have to use some other element
>(several are proposed: <cl>, <phr>, <w>, <m>, <c> ... all of which are
>syntactic sugar for <seg type="cl"> (etc.)
>Inasmuchas verses do indeed segment the bible end-to-end, I see no
>problem with using <s> to represent them. But it then means <s> won't be
>available should you decide to do some kind of linguistic analysis of
>your text. And you won't be able to deal with those (not a few) verses
>of the bible which contain more than one para or verse line.
We have understund that <s> is a particular case of <seg>, and that we
can use seg instead of s.
But both of them can't contain more than one para (cant contain <p>),
how we can resolve this problems? In the next example
<seg n='1'><p><lb n='2'/> IN THE beginning was the Word,
<lb n='3'/>and the Word was with God,</p>
<p><lb n='4'/> and the Word was God Himself.</p>
<p><seg n='2'><lb n='5'/>He was present originally with God.
We have put <p> as father of <seg> when possible, and we have put <p> as
son of <seg> when seg contain more then one para.
In the next paragraph you write about <ab>, that substitute <p> tag.
Since we want inser <p> infromation toghether with verse, I suppose that
<ab> can't be e solution.
>It was for reasons such as these that, lo these many years ago (1997 in
>fact) the Core Committee of the first TEI Technical Review Committee,
>proposed the addition of an element <ab> to be added unto the existing
>TEI universe. And the committee laboured and set forth their thinking in
>a document (http://www.tei-c.org/Vault/TC/tccw04.html). And the Editors
>looked upon it and saw that it was Good, and said "So we shall implement
>it, yea, in P4, and every TEI version thereafter". And so it came to
>pass that the <ab> element entered into the vocabulary of the TEI. And
>great was the rejoicing thereof, amongst those of the children of Israel
>who had noticed.
>Sorry, it is a bit hot here.
I'd like to stress that while the guideline, at
http://www.tei-c.org/P4X/AI.html#AILC, said that s can't be nested,
" The *<s>* may be thought of as providing an abbreviated version
of the tag <seg type='s-unit'>, with the important additional
proviso that (unlike *<seg>* elements) *<s>* elements may not be
nested within each other. "
so is not valid:
<s> .... <s> ... </s> ... </s>
but the DTD generated with pizza software, and also this page:
http://e-xml.cribecu.sns.it/risorse/TEI-p4html/ref-S.html, said that s
can contain it-self.
Who have reason?
>On Tue, 2003-07-15 at 14:23, Martin Mueller wrote:
>>I thought the <s> tag was a set of tags, including <c> and <w>, used for
>>linguistic segmentation. <s> stands for sentence. A biblical verse is
>>usually, but not always, identical with a sentence. <seg type="verse"> may
>>be a better choice: after all the division of the bible into chapter and
>>verse is an arbitrary process of segmentation.
>>I don't see the point of all those <lb> tags. Do they mirror the line breaks
>>of a particular print edition, or is the idea to tag the text down to the
>>level of the clause?
>>From: TEI (Text Encoding Initiative) public discussion list
>>[mailto:[log in to unmask]] On Behalf Of Michele Paoli
>>Sent: Tuesday, July 15, 2003 2:52 AM
>>To: [log in to unmask]
>>We are going to markup some books of blible.
>>We are going to use the tag <s> for the verse.
>>We show you e small example:
>> <s n='1'><lb n='2'> IN THE beginning was the Word,
>> <lb n='3'>and the Word was with God,
>> <lb n='4'> and the Word was God Himself.
>> <s n='2'><lb n='5'>He was present originally with God.
>>Have you any suggestion?
>>You think that tag 's' together with attribute 'n' is the right way to
>>markup biblical verse?
>>Thank you very much in advance.
>> Michele Paoli - Computer Research for Cultural Heritage Centre
>> via della Faggiola, 19 - 56127 Pisa (Italy)
>> Phone: +39 (0)50 509409
>> e-mail: [log in to unmask]
Michele Paoli - Computer Research for Cultural Heritage Centre
via della Faggiola, 19 - 56127 Pisa (Italy)
Phone: +39 (0)50 509409
e-mail: [log in to unmask]