I introduced myself a couple months ago as a newbie to TEI and
have been trying valiantly to follow the traffic, particularly
this thread as it seems learnable. Now, in Conal Tuohy's post
(below), he describes a process that begins in MS Word--which,
shall we say, is EZ to understand--and end with a properly TEI
encoded (?) document. That certainly looks like a useful
lesson--but would it be possible for someone to step through
such a process here and explain what is happening with each
step?
And I just do not understand your last sentence.
My project is a new edition of Hall's Chronicle (1550), a
700,000-word chronicle covering 1399-1547. There are
practically no commercial opportunities, so I expect that it
will be a self-financed web edition--so, of course, I will have
to do the markup myself.
I understand that TEI-L is devoted to solving problems at the
expert level, but we newbies could sure use some occasional
instruction, sometimes in words of one syllable or less!
Thanks,
Al Magary
----- Original Message -----
From: "Conal Tuohy" <[log in to unmask]>
To: <[log in to unmask]>
Sent: Sunday, August 31, 2003 3:51 PM
Subject: Re: Converting from any HTML to TEI
> Hi Eric
>
> I have been doing this for a while: I use MS-Word and save the
document as HTML. This HTML is as ugly as sin, of course. Then I
use JTidy to convert it to XHTML, and then various XSLT
transformations to produce TEI. First the MS-Word-flavoured HTML
is converted to a more standard HTML and from there to a simple
TEI. The whole conversion process is hosted inside Cocoon2
running as a Servlet inside Tomcat. I'm happy to share the XSL
transforms if you like.
>
> Cheers
>
> Con
>
> > -----Original Message-----
> > From: Eric Frigot [mailto:[log in to unmask]]
> > Sent: Saturday, 30 August 2003 1:30 a.m.
> > To: [log in to unmask]
> > Subject: Re: Converting from any HTML to TEI
> >
> >
> > Thanks for this answer, but i would to handle this process
in a Java
> > application. I cannot understand why there is no simple way
> > to do that !
> >
> > Eric.
> >
>
|