On 3 Apr 2013, at 18:20, Kevin Hawkins wrote:
> There are different types of page breaks in word processors, so I'd like to clarify for everyone involved.
> There's the kind that forces a break at that point, which you insert manually. This has various subtypes: some start a new "section" (important for the size of margins etc.), whereas others continue in the same section.
> But then there's also the kind of breaks that just happen to occur because that's where the text flows. If you edit the text, it reflows, and the break occurs at a different point.
True. Both of these can be captured in the Word -> TEI XML conversion. I currently deal with the former, but the latter can also be found, e.g.:
<w:t>of Baltimore, or the above reward if taken out of the State of Maryland. In either case he must be secured in jail so that I get him again.</w:t>
What do people think I should turn that w:lastRenderedPageBreak into? I.e., how should it be distinguished from a forced page break?
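
For reference, here is a minimal sketch of how the two kinds can be told apart on the Word side. It assumes the usual WordprocessingML markup, where a forced page break is a w:br with w:type="page" and a flow-dependent one is a w:lastRenderedPageBreak. The sample fragment, the function name classify_breaks and the labels "forced" and "rendered" are made up for illustration; they are not taken from the actual conversion stylesheets, and the sketch says nothing about what the TEI output should be.

```python
import xml.etree.ElementTree as ET

# Main WordprocessingML namespace.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

# Made-up fragment for illustration only; not the document quoted above.
SAMPLE = f"""
<w:p xmlns:w="{W}">
  <w:r><w:t>first page</w:t></w:r>
  <w:r><w:br w:type="page"/><w:t>after a forced break</w:t></w:r>
  <w:r><w:lastRenderedPageBreak/><w:t>after a soft break</w:t></w:r>
  <w:r><w:br/><w:t>after a plain line break</w:t></w:r>
</w:p>
"""


def classify_breaks(xml_text):
    """Return 'forced' or 'rendered' for each page break, in document order."""
    root = ET.fromstring(xml_text)
    kinds = []
    for el in root.iter():
        # A manually inserted page break: <w:br w:type="page"/>.
        if el.tag == f"{{{W}}}br" and el.get(f"{{{W}}}type") == "page":
            kinds.append("forced")
        # Where Word last laid out a page boundary while flowing the text.
        elif el.tag == f"{{{W}}}lastRenderedPageBreak":
            kinds.append("rendered")
    return kinds


print(classify_breaks(SAMPLE))
```

A plain w:br without w:type="page" is a line break, so it is deliberately ignored here. Section breaks are recorded separately (in w:sectPr) and are not covered by this sketch.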
Director (Research) of Academic IT
University of Oxford IT Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431