The answer to the question of how to handle whitespace depends upon what
you're trying to do. Do you want to encode the text in such a way as to be able
(1) to render it correctly (presentation)
(2) to query the encoded data to determine something about how
whitespace is used (data)
I raised an analogous issue a month or so ago in regard to some insular
medieval manuscripts in which the width of white-space (or the lack of
white space) between words can indicate the words are pronounced or
performed, e.g. elision of sounds, metrical emphasis. If such manuscripts
were encoded with data as well as presentation in mind, one could query the
text-base to gather, say, a list of words ending in -a- which are followed
by a word beginning with -w- where there is no white-space or very little
white-space separating the two words.
At 08:57 PM 11/26/02 +0000, chris.hamilton-emery wrote:
>A few quick questions . . .
>How does TEI handle indentation patterns and whitespace?
>If a stanza has a regular stanzaic form can one capture indentation? Or do
>we encode unicode tabs (is that 	 ?). I'm working in XML so I presume
>I can ignore named entities and just code everything in UTF-8?
>If one has poetry which has to preserve whitespace and say typographic
>changes (Black Mountain stuff, "Composition By Field that kind of thing) how
>does one align random elements or position things against each other? Is it
>a case of using non-breaking spaces and hoping for the best? Can one specify
>a font and font size or FO characteristics?
>Poetry can stretch the idea of separating content and style, as the style
>can be the content in experimental poetry.