In all the examples given in this thread, <lb/> and its siblings (imagined as well as real) have been placed in the text stream, that is, as marking transitions in the text of an element which with mixed contents.
One thing that worries me a little is that the schema allows me to place (the empty) milestone elements among the element-only top-level elements such as <text>, <body>, and <div>, that is, outside the text stream. This is very common with <pb/>, but it is also found with <lb/> (see e.g <http://www.deutschestextarchiv.de/book/download_xml/dilthey_geisteswissenschaften_1883>).
The reason I write this is that I have my own problems related to milestones and the question "which elements are block-level elements (and how do I retrieve them)", and this may not be relevant to the present discussion, but I still ask myself, how can a line break occur where there is no text to form lines? does a page break at the divisions we see in the text or at the point where the characters in one page end and those in the next page begins?
A related question is why anchors are allowed to stand outside the text stream, as children or siblings of elements that could easily have an xml:id themselves?
Sorry for sidetracking the white-space discussion!
On 10 Feb 2014, at 15:48, Sebastian Rahtz <[log in to unmask]> wrote:
> On 10 Feb 2014, at 14:39, Scott Derrick <[log in to unmask]> wrote:
>> Having a tag or set of tags that absolutely encapsulates a line of text would eliminate half the kludgy(my words) code dealing with it and the output would be more consistent and correct.
> there are (so they say!) many ways to skin cats. including:
> * use semantic tags like <l> or <seg>
> * use an authoring schema which makes life easier for humans to read XML (o nefas)
> * use another markup language, and keep TEI for non-human interchange
> * use milestones and anchors
> * use transcription surface/zone/line markup
> so you have to decide what’s best for a particular project
> If we _did_ have the tag for line-end, as in:
> <p> and so he said
> <sl/>this is the way the world ends<el/>
> <sl/>not with a bang but a whimper<el/>
> in a quiet voice</p>
> I wonder what you think the whitespace between <el/> and <sl/> means?
> Sebastian Rahtz
> Director (Research) of Academic IT
> University of Oxford IT Services
> 13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431
> Não sou nada.
> Nunca serei nada.
> Não posso querer ser nada.
> À parte isso, tenho em mim todos os sonhos do mundo.