We can't see the original, but I'm guessing that there is no
straightforward translation relationship between paragraph 1 and 2.
Rather, 1 is vaguely lexico-grammatical in the source language (right?
xml:lang?), and 2 is a paraphrase in the target language. Or I
I'd consider using <ab> instead of <p>, to minimize the interpretational
Other than that, it's hard to tell, not enough data...
On 11/06/2013 07:13 PM, Lou Burnard wrote:
> Is the repetition in the second paragraph intentional? Seems a bit odd
> if it's meant to be translating the first para. If they are alternative
> translations, I'd suggest wrapping them in separate <s> elements.
> You probably want type="lemmatized" and type="translated" rather than
> xml:id -- I'm guessing you'll have more than one trans;lated div to deal
> with in the final document!
> On 06/11/13 16:54, Vicky Phillips wrote:
>> I'm new to TEI so was hoping that someone could provide some
>> guidance. I have an xml file which has come back from translators
>> which contains lemmatized and translated text. I would like to
>> convert this to basic TEI if possible but I'm not sure which elements
>> I should be using for this. This is what I've come up with..
>> <div xml:id="ZONE1-1">
>> <div xml:id="lemmatized">
>> <p>dewis y parch W S jones M a yn cadeirydd bwrdd
>> ysgol newydd machynlleth a r
>> parch Josiah jones yn is cadeirydd</p>
>> <div xml:id="translated" xml:lang="en">
>> <p>The Parch. Dewiswyd W. S. JONES, M.A., THE
>> CHAIR OF THE NEW SCHOOL
>> MACHYNLLETH, A'R PARCH JOSIAH JONES
>> IS-GADEIRYDD. W. S. JONES, M.A., THE
>> CHAIR OF THE NEW SCHOOL MACHYNLLETH, A'R
>> PARCH JOSIAH JONES IS-GADEIRYDD. W.
>> S. Jones, M.A., the chair of the new School
>> Machynlleth, a'r Parch Josiah
>> Jones is-gadeirydd.</p>
>> The original OCR text will be kept in the ALTO file and so the ZONE
>> ids are references to the ALTO. Is this way of doing it correct or is
>> there a better way of doing this. Any help would be very much