> Sebastian Rahtz wrote:
> I'd feel very inclined to store the text twice, once broken
> down by pages, one by structural divisions.
> That way the hideous task of isolating the interesting
> elements in between two <pb> is done once, not on every retrieval.
But wouldn't breaking down the text by pages require an XPath
expression of equal hideousness to the one required for querying