I can not necessarily answer your first question without the DTD but the
following emacs questions are possible.
Ingo Mittendorf wrote:
> 2. Notation of empty elements
> I suppose I'm the 1273rd user to ask this: With XML TEI, should I write
> <pb> or <pb/>?
> Emacs complains about <pb> (no tag omissability etc).
> Should I completely switch over to <pb/>? What can I do if I start off
> the transcription with a word processor (I'm not happy using emacs for
> this) and already have <pb n="175">, which it is impossible to just search
> and replace. Do I have to do make the change manually?
For it to be XML compliant you should use <pb/>.
You can use regular expression search and replace in emacs to accomplish
Under the search menu select query-replace-regexp.
The regexp syntax can be a bit much if you are used to other languages
because there are characters to be escaped. The book "Learning Emacs"
(O'Reilly) is a handy guide.
In your case you can use:
\(<pb n="[0-9]+"\)> and replace it with \1/>
\( and \) are the begining and ends of a group of characters that you
want to assign to a variable
[0-9]+ means one or more digits between 0 and 9
The replace expression
\1 variable 1 (the string that was enclosed in the first (and in this
case only) \( \)
/> the replacement string
This should replace all instances of <pb n="XXX"> with <pb n="XXX"/>
where XXX is an arbitary number.
> 4. Number of displayed errors in emacs
> Emacs restricts the number of errors to 200 (I had loads of empty elements
> in my text, which pushed the number of mistakes well over 200 and made up
> for about 98% of the mistakes). It says this can be changed with the -E option
> [?]. I can't find anything about such an option.
I presume you are using emacs with some distribution of Jim Clark's sp
or jade (or just nsgmls) and it is actually nsgmls that is performing
the validation. If so, somewhere in your configuration files (likely
your .emacs files unless someone else has modified in the psgml
distribution) you have a call to nsgmls for validation purposes. You can
modify the command there.
Or when you use the validation command just jump down into the mini
buffer and modify the command line to read something like :
nsgmls -E 500 -wxml -s /usr/lib/sgml/xml/xml.dcl file.xml
where 500 is the maximum number of errors your would like reported
(instead of the default 200)
However many of the "errors" disappear when you fix early errors in the
document since once an error occurs it can throw off the rest of the
This and answers about the validation would be in the nsgmls
> 5. Declaring character entities with an ISO set
SDATA is no longer used in XML. You probably want to use an XML version
of the entity set.
For alternate XMLified :) ent files check out:
http://www.oasis-open.org/cover/xml-ISOents.zip (collection of
School of Information Sciences
University of Pittsburgh
Pittsburgh, PA 15260