I'd be strongly inclined to store the text twice: once
broken down by pages, once by structural divisions.
That way the hideous task of isolating the interesting
elements between two <pb> tags is done once,
not on every retrieval.
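
To make the "done once" idea concrete, here is a rough sketch of the
page-wise pass in Python. It is only an illustration under assumptions of
my own: the markup is un-namespaced, each <pb> is an empty milestone
carrying an n attribute, and only the plain text of each page is wanted.
The name pages_from_tei and the sample string are made up for the example.

```python
import xml.etree.ElementTree as ET


def pages_from_tei(xml_text):
    """Return {page number: text}, computed once at load time.

    Assumes un-namespaced markup and <pb n="..."/> milestones
    (both are assumptions for this sketch).
    """
    root = ET.fromstring(xml_text)
    pages = {}
    current = None  # text before the first <pb> is ignored in this sketch

    def add(text):
        # Append text to the page we are currently on, if any.
        if current is not None and text:
            pages[current] = pages.get(current, "") + text

    def walk(elem):
        nonlocal current
        if elem.tag == "pb":
            # A page break starts a new page; it has no content of its own.
            current = elem.get("n")
            pages.setdefault(current, "")
        else:
            add(elem.text)
            for child in elem:
                walk(child)
        # The tail is the text that follows this element inside its parent.
        add(elem.tail)

    walk(root)
    return pages


SAMPLE = (
    "<text><div>"
    '<p>alpha <pb n="1"/>beta </p>'
    '<p>gamma <pb n="2"/>delta</p>'
    "</div></text>"
)

page_store = pages_from_tei(SAMPLE)
print(page_store)  # {'1': 'beta gamma ', '2': 'delta'}
```

The resulting dictionary would be stored alongside the structurally
divided copy, so a request for a given page becomes a simple lookup
instead of a fresh walk between milestones. Note that page 1 here spans
two <p> elements, which is exactly the case that makes the page view
awkward to derive on the fly.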
Information Manager, Oxford University Computing Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431