An automated solution for removing initial articles from <head>s
would solve only one of the problems I face when trying to generate a table
of contents or index from the documents in my project. I also adjust other
parts of the <head> content, in ways that an automated process could not
handle unless it were very cleverly scripted.

For example, in my Shaker journals project I have articles with titles such
as these, which need more than just the leading article removed:

  The Shaker Manifesto--our standard bearer
    (=> level1="Shaker Manifesto: our standard bearer")
  The Shaker Village, N.H., Apr 1878
    (=> level1="Shaker Village, New Hampshire, April 1878")
  Christian culture, 1 (=> level1="Christian culture, no. 1")
  CHRISTIAN CULTURE, #2 (=> level1="Christian culture, no. 2")
  CHAP I (=> level1="Chapter I")
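
To illustrate how quickly the rules pile up, here is a minimal Python
sketch that covers only the five examples above. The function name
normalize_head and the ABBREVIATIONS table are hypothetical, invented for
this illustration; it is not an existing tool, and a real script would
need many more rules and exceptions than these.

```python
import re

# Hypothetical lookup table (an assumption for this sketch); a real
# project would need many more entries.
ABBREVIATIONS = {
    "N.H.": "New Hampshire",
    "Apr": "April",
}


def normalize_head(head: str) -> str:
    """Turn a raw <head> string into a level1-style value (illustrative rules only)."""
    text = head.strip()

    # Rule 1: drop a leading article.
    text = re.sub(r"^(?:The|An|A)\s+", "", text, flags=re.IGNORECASE)

    # Rule 2: expand "CHAP I" and similar to "Chapter I", keeping the numeral upper-case.
    chapter = re.fullmatch(r"chap\.?\s+([ivxlc]+)", text, flags=re.IGNORECASE)
    if chapter:
        return f"Chapter {chapter.group(1).upper()}"

    # Rule 3: replace a double hyphen with a colon and a space.
    text = re.sub(r"\s*--\s*", ": ", text)

    # Rule 4: expand known abbreviations when they stand alone as tokens.
    for short, full in ABBREVIATIONS.items():
        text = re.sub(r"(?<!\w)" + re.escape(short) + r"(?!\w)", full, text)

    # Rule 5: convert an all-capitals title to sentence case.
    if text.isupper():
        text = text.capitalize()

    # Rule 6: rewrite a trailing ", 1" or ", #2" as ", no. 1" or ", no. 2".
    text = re.sub(r",\s*#?(\d+)$", r", no. \1", text)

    return text
```

Even this small set of rules depends on hand-maintained lists and on the
order in which the rules are applied, and each new journal volume is
likely to turn up titles that none of them anticipate.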

Given the number of adjustments required to prepare an attractive table of
contents or index, I don't mind manually keying the <head>s in my
documents. However, with over 8,000 pages of encoding still ahead of me,
I'm not exactly looking forward to it.

Library Information Systems Specialist
Hamilton College Library