LISTSERV 16.5 - TEI-L Archives

Subscriber's Corner

Email Lists

TEI-L Archives

TEI-L@LISTSERV.BROWN.EDU

View:

Message:

[

First

Last

]

By Topic:

[

First

Last

]

By Author:

[

First

Last

]

Font:

Proportional Font

		LISTSERV Archives
		TEI-L Home
		TEI-L August 2003

Subject:

Re: encoding declarations

From:

Burnard Towers <[log in to unmask]>

Reply-To:

Burnard Towers <[log in to unmask]>

Date:

Fri, 1 Aug 2003 09:24:16 +0100

Content-Type:

text/plain

Parts/Attachments:

text/plain (24 lines)

Michael Beddow remarks:

> That means that any entities intended as generic
> boilerplate constituents for a wide variety of documents should
> always have
> a (correct!) text declaration. That way, they will be usable no
> matter what
> the encoding of the document into which they are included. Similarly,
> precisely in the context of a corpus where more than one encoding may have
> been employed, it is wise to take precautions against
> hard-to-trace encoding
> muddles by furnishing each entity with an appropriate text declaration.
>

A further aspect of this which has always perplexed me is that (unless I'm
mistaken) the encoding of an entity which embeds another one does *not*
become the default for the embedded entity. In other words, if I have a
corpus of non-UTF-8 encoded entities, it is not enough simply to stick an
appropriate encoding declaration on the outermost entity which embeds all
the others: if they don't have their own declarations, they will default to
UTF8 and things will go Horribly Wrong.

L

Top of Message | Previous Page | Permalink

Search Archives

Advanced Options

Options

		Log In
		Get Password

		Subscribe or Unsubscribe