TEI-L Archives

TEI-L@LISTSERV.BROWN.EDU


TEI-L September 2003

Subject: Re: Converting from any HTML to TEI
From: Al Magary <[log in to unmask]>
Reply-To: Al Magary <[log in to unmask]>
Date: Sun, 31 Aug 2003 20:38:27 -0700
Content-Type: text/plain

I introduced myself a couple of months ago as a newbie to TEI and
have been trying valiantly to follow the traffic, particularly
this thread, as it seems learnable.  Now, in Conal Tuohy's post
(below), he describes a process that begins in MS Word--which,
shall we say, is EZ to understand--and ends with a properly TEI
encoded (?) document.  That certainly looks like a useful
lesson--but would it be possible for someone to step through
such a process here and explain what is happening with each
step?

And I just do not understand your last sentence.

My project is a new edition of Hall's Chronicle (1550), a
700,000-word chronicle covering 1399-1547.  There are
practically no commercial opportunities, so I expect that it
will be a self-financed web edition--so, of course, I will have
to do the markup myself.

I understand that TEI-L is devoted to solving problems at the
expert level, but we newbies could sure use some occasional
instruction, sometimes in words of one syllable or less!

Thanks,
Al Magary

----- Original Message -----
From: "Conal Tuohy" <[log in to unmask]>
To: <[log in to unmask]>
Sent: Sunday, August 31, 2003 3:51 PM
Subject: Re: Converting from any HTML to TEI


> Hi Eric
>
> I have been doing this for a while: I use MS-Word and save the
> document as HTML. This HTML is as ugly as sin, of course. Then I
> use JTidy to convert it to XHTML, and then various XSLT
> transformations to produce TEI. First the MS-Word-flavoured HTML
> is converted to a more standard HTML and from there to a simple
> TEI. The whole conversion process is hosted inside Cocoon2
> running as a Servlet inside Tomcat. I'm happy to share the XSL
> transforms if you like.
>
> Cheers
>
> Con
>
> > -----Original Message-----
> > From: Eric Frigot [mailto:[log in to unmask]]
> > Sent: Saturday, 30 August 2003 1:30 a.m.
> > To: [log in to unmask]
> > Subject: Re: Converting from any HTML to TEI
> >
> >
> > Thanks for this answer, but I would like to handle this process
> > in a Java application. I cannot understand why there is no
> > simple way to do that!
> >
> > Eric.
> >
>
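
The last stage of the pipeline quoted above (XHTML to a simple TEI via
XSLT) can be sketched in plain Java with the JDK's built-in
javax.xml.transform API. This is only an illustration under stated
assumptions, not Conal's actual transforms: the stylesheet, the class
name HtmlToTeiSketch and the method toTei are hypothetical, the input is
assumed to be well-formed XHTML already (the Word "Save as HTML" and
JTidy clean-up steps are not shown), and the output has no teiHeader, so
it is not yet a valid TEI document.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class HtmlToTeiSketch {

    // Hypothetical, deliberately tiny stylesheet: maps XHTML <h1> to TEI <head>
    // and XHTML <p> to TEI <p>. A real stylesheet would handle many more elements.
    static final String XSLT =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
      + " xmlns:h='http://www.w3.org/1999/xhtml'"
      + " exclude-result-prefixes='h'>"
      + "<xsl:output method='xml' omit-xml-declaration='yes'/>"
      // Wrap the converted body in a bare TEI skeleton (no teiHeader here).
      + "<xsl:template match='/h:html'>"
      + "<TEI.2><text><body><div>"
      + "<xsl:apply-templates select='h:body/*'/>"
      + "</div></body></text></TEI.2>"
      + "</xsl:template>"
      + "<xsl:template match='h:h1'><head><xsl:value-of select='.'/></head></xsl:template>"
      + "<xsl:template match='h:p'><p><xsl:value-of select='.'/></p></xsl:template>"
      + "</xsl:stylesheet>";

    // Applies the stylesheet to a well-formed XHTML string and returns the result.
    static String toTei(String xhtml) throws TransformerException {
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(XSLT)));
        StringWriter out = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xhtml)),
                              new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for the XHTML that a tidy-up step would produce.
        String xhtml = "<html xmlns='http://www.w3.org/1999/xhtml'><body>"
            + "<h1>Title</h1><p>Hello</p></body></html>";
        System.out.println(toTei(xhtml));
    }
}
```

Each step in the quoted process then corresponds to one stage: Word
produces HTML, a tidy-up tool turns it into well-formed XHTML, and one or
more stylesheets like the one above rename HTML elements to their TEI
counterparts.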
