TEI-L@LISTSERV.BROWN.EDU


Subject: SGML Update: a conference report
From: Lou Burnard
Date: Mon, 20 May 91 09:32:00 BST


SGML in Europe: a conference report
 
The Dutch SGML Users Group hosted a two-day international
conference in Amsterdam on 16-17 May under the general title `SGML
Update: consultancy, tools, courses'. This attracted over a
hundred delegates, by no means all from the Benelux area, though
mostly from European publishing and software houses. There were
two keynote speakers (Sperling Martin for the AAP, and myself for
the TEI), about a dozen presentations from manufacturers or
consultants and a well-arranged software exhibit in which all the
major SGML software vendors were represented, with the
conspicuous exception of Software Exoterica who had apparently
had to withdraw at the last minute. There was ample opportunity
for discussion and argument between presentations, over an
excellent buffet lunch and in the evenings.
 
Sperling Martin, as one of the chief progenitors of the AAP
standard, was happy to report that it was now in use by more than
25 major publishers, with a further forty planning to adopt it
over the next twelve months. He gave brief overviews of three
particularly successful applications on the fringes of
conventional publishing. Firstly, the Association for Computing
Machinery, which has just developed a five year strategic plan
with the AAP standard at the centre of several dozen new print
products, on demand reprint facilities, optically stored
databases, hypertext products etc. Perhaps more interestingly,
the ACM plans to mandate the AAP standard as the interchange
format of preference for its army of unpaid professional
contributors, reviewers and referees in the future. Secondly, the
Society of Automotive Engineers, which is adapting the AAP
standard for use in something called a `Global Mobility
Technology Information Center' or in plainer English, a database
of information about all sorts of transport systems. The
interesting thing here was the convergence between SGML and
object-oriented databases -- as well as manuals of technical
information, SGML was being used as the vehicle for data to be
transferred directly into CAD/CAM systems. Sperling's third AAP
success story was a similarly hybrid development: a new legal
database system developed for the Clark Boardman Company,
providing integrated information services derived from legal
journals, statutes and regulations, a body of case law together
with interpretation and annotation, usable by traditional print
journals or electronic hypertexts. Of course, the AAP project had
not been an unmitigated success: it had begun at a time when SGML
was barely established, and some aspects, notably those concerned
with maths, formulae and tables have never been finished
properly. Moreover, there are a few deliberate errors in the
standard, introduced (said Sperling ingenuously) as `reader
tests'. He also called attention to some image problems -- all
too familiar to TEI ears -- such as the perceived conflict
between TeX and SGML, or ODA and SGML, and the intimidating
nature of SGML so long as its cause is left to the purists and
the evangelists. Looking to the future, Martin predicted an
increased awareness of SGML within the library community as a
practical means of coping with the explosive growth of published
materials, particularly in Science and Medicine. The AAP standard
was to be assessed for suitability as a `non-proprietary
information exchange vehicle' for electronically networked
journals, by the 110-member Association of Research Libraries,
under a scheme for which the National Science Foundation had
recently provided $0.75m seed funding. His presentation concluded
with some sound advice for those developing a strategic business
plan in which SGML featured (concentrate on the business asset,
don't expect technology to do everything, expect to spend at
least $5 a page to get electronically tractable text...) and some
predictions for future AAP work. A corrected version of the AAP
standard would be re-submitted to ANSI and a summary of needed
corrections to the published dtds would appear in EPSIG news at
the end of this year.
 
Seamus McCague gave an impressively detailed description of two
practical applications of SGML in work undertaken by his company,
ICPC, a fifteen year old Dublin-based specialist typesetting
company. One, for Elsevier, involved the production of about
100,000 pages of high quality camera-ready copy from SGML encoded
text annually; the other, for Delmar, the conversion of an
existing reference book into an electronic resource. Details of
the two projects provided interesting contrasts in production
methods; they also showed how the SGML solution was equally
applicable to two very different scale operations. For Elsevier,
the use of SGML greatly simplified both process and quality
control, by facilitating the automatic extraction of data for the
publisher's control database; for Delmar, it had made possible
significant improvements to the product (a drug handbook) by
automating the production of a variety of indexes.
 
Francois Chahuneau of AIS, the thinking man's Antoine de Caunes,
gave a characteristically ebullient presentation about the
relationship between SGML documents and database systems. He
distinguished four characteristic modes of action: simple storage
of documents in a database, where typically only a limited amount
of header type information is visible to the database; database-
driven document extraction, where documents are synthesized from
information held in a database as a specialised form of report;
tightly coupled systems in which highly volatile document and
database systems share information; and the true document
database in which all the information and structure of a document
are represented by isomorphic database constructs, thus combining
the well-understood strengths of database systems in such matters
as concurrency control, security and resilience with the
flexibility and multiple-indexing capabilities of document
processing systems. As examples of this last mode, he then
described in some detail two products: his own company's
SGML-Search, which is based on PAT, and Electronic Book
Technologies' Dynatext. He also demonstrated a beta-test version
of Dynatext for MS-Windows. Dynatext uses an interesting scripting
language based in part on DSSSL, which enables it to be
configured to look more or less like anything, whereas
SGML-Search is command-line driven, using a fairly rebarbative
syntax.
 
The interface between SGML and database systems was also touched
on by Jan Grootenhuis of CIRCE, the doyen of Dutch SGML
consultancies. Speaking of his experience in teaching SGML, he
remarked that people with a typographic background found SGML
almost as difficult to understand as people with a computer
science background found the requirements of typography, which
struck a familiar chord. He then briefly described a recent
project in which documents had been converted automatically into
an Oracle database, using a database model defined by Han
Schouten. The project had shown that database definitions could
be automatically generated from a DTD; the complete suite of
Oracle manuals, created as Ventura or WordPerfect documents, had
been loaded into an Oracle-Freetext database, using SGML as an
intermediary. He noted that the tendency of technical writers to
use descriptive tagging to bring about formatting effects had
made this task unnecessarily difficult, and argued for better
enforcement of descriptive standards. He also outlined some
experiences in using SGML for CD-ROM publication of journals at
Samson, and of legal and other regulations published by the Dutch
government, and the updating problems involved. His conclusion
was that SGML was now past the point of no return. It was no
longer being used in pilot projects only, but as an integral part
of real work. Its use was no longer regarded as worthy of
comment; moreover, because its evangelists were too busy doing
real work to try to publicise it, the task was being taken on by
professional teachers and educators.
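
The point that database definitions can be generated automatically from a DTD can be illustrated with a toy sketch. The Python below is not the Schouten model, whose details are not reported here. It assumes a deliberately simplified mapping (one table per element type, one foreign-key column per possible parent) and copes only with flat content models; the sample DTD and all table and column names are hypothetical.

```python
import re

# Matches simple declarations such as <!ELEMENT chapter - - (title, para+)>.
# The optional "- -" / "- O" pair is SGML tag-minimisation info, ignored here.
# Nested groups in the content model are NOT handled by this sketch.
ELEMENT_DECL = re.compile(
    r"<!ELEMENT\s+(\w+)\s+(?:[-O]\s+[-O]\s+)?\(([^)]*)\)[*+?]?\s*>",
    re.IGNORECASE,
)

# Hypothetical sample DTD, invented for illustration.
SAMPLE_DTD = """
<!ELEMENT manual  - - (title, chapter+)>
<!ELEMENT chapter - - (title, para+)>
<!ELEMENT title   - O (#PCDATA)>
<!ELEMENT para    - O (#PCDATA)>
"""


def parse_dtd(dtd_text):
    """Return {element_name: [child_element_names]} for simple declarations."""
    model = {}
    for name, content in ELEMENT_DECL.findall(dtd_text):
        children = [
            token.lower()
            for token in re.findall(r"#?\w+", content)
            if not token.startswith("#")  # skip #PCDATA and similar keywords
        ]
        model[name.lower()] = children
    return model


def to_sql(model):
    """Emit one CREATE TABLE statement per element type (assumed mapping)."""
    parents = {}
    for parent, children in model.items():
        for child in children:
            parents.setdefault(child, []).append(parent)

    statements = []
    for name in model:
        columns = ["id INTEGER PRIMARY KEY", "seq INTEGER", "content TEXT"]
        for parent in parents.get(name, []):
            columns.append(f"{parent}_id INTEGER REFERENCES {parent}(id)")
        statements.append(f"CREATE TABLE {name} ({', '.join(columns)});")
    return statements


if __name__ == "__main__":
    for statement in to_sql(parse_dtd(SAMPLE_DTD)):
        print(statement)
```

A real conversion would also have to deal with attributes, mixed content and nested content models, all of which this sketch ignores.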
 
The first day of the conference concluded with manufacturers'
presentations. Tim Toussaint (MID) and Paul Grosso (Arbortext)
gave a joint presentation. Toussaint revealed that MID, formerly
Dutch and now German, is now 26% French. They used Arbortext as
an SGML editor, and Exoterica's XTRAN to convert the resulting text
for loading into an unspecified relational database. Applications included
standard reference works such as the Brockhaus Duden and a
database of standards documentation. Grosso gave a good sales
pitch for Arbortext, which is a luxuriously appointed SGML editor
intended for use primarily in an electronic publishing
environment and described as non-intimidating and user-congenial.
It includes a specialised WYSIWYG editor for tables and formulae
from which AAP-conformant marked up text is generated, has good
browsing and outlining facilities and its own script language.
 
Hugo Sleimer, European Sales Director for Verity (a spinoff from
Advanced Decision Systems) gave a classy presentation of a
product called TOPIC, the only relevance of which seemed to be
that it supported a wide variety of document formats, including
SGML. Much of his presentation dealt exhaustively with the
problems of text retrieval by boolean logic, at a level which did
not show much respect for his audience's intelligence.  Tibor
Tscheke, from Sturtz Electronic Publishing, was due to talk about
his company's work in creating an electronic version of the
Brockhaus Encyclopedia, but had unfortunately been forbidden to
do so by Brockhaus. He was therefore reduced to some generalities
about the role of information within an enterprise, the
integration of SGML systems into mainstream information
processing and so forth, which was a pity.
 
I opened the second day of the conference by summarising the
current status of the TEI and discussing some of the technical
problem areas we had so far identified, in particular those
raised by historians and linguists for whom any tagging is an
interpretation which must be defensible. This being the second
time I had done it in two weeks, I managed to get through most of
my material within a reasonable approximation to the time
allocated me.
 
Yuri Rubinsky (SoftQuad Inc) gave an entertaining and
wide-ranging talk, picking up some of the technical issues
I had raised rather than simply presenting a product review,
though he did mention in passing (and also demonstrated) that
Author/Editor was now available under Windows and Motif as well
as for the Mac. The theme of his talk was that SGML could be used
to describe more than just documents, and that several of its
capabilities were under-used. There was more to an SGML document
than its element structure. Among specific examples he mentioned
were customised publication, for example by extracting `technical
data packages' geared to a specific maintenance task from CALS-
compliant documentation in the Navair database; using attribute
values to generate documentation at different user levels from a
common source; an ingenious use of entity references within
`boiler plate text fragments' in General Motors manuals; and the
assembly of customised DTDs from sets of DTD fragments by a use
of parameter entities strikingly similar to that proposed by the
TEI, or by use of marked sections. For the GM application, this
approach had reportedly saved the cost of its implementation
within six months.
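
One of these examples, generating documentation at different user levels from a common source by means of attribute values, can be sketched as follows. This is illustrative only: the actual markup used was not reported, so the `level` attribute, its values and the element names are all hypothetical, and Python's XML parser stands in for an SGML parser.

```python
import copy
import xml.etree.ElementTree as ET

# Hypothetical common source: paragraphs without a `level` attribute are
# meant for every reader; the others are meant for one audience only.
SOURCE = """
<manual>
  <para>Switch the unit off before opening the case.</para>
  <para level="expert">Jumper J4 selects the supply voltage.</para>
  <para level="novice">Ask a technician if you are unsure.</para>
</manual>
"""


def extract_for(root, level):
    """Return a pruned copy of the tree for one user level.

    Elements with no `level` attribute are kept; elements whose `level`
    differs from the requested one are dropped with all their content.
    The original tree is left untouched.
    """
    result = copy.deepcopy(root)
    _prune(result, level)
    return result


def _prune(element, level):
    for child in list(element):  # iterate over a copy: we remove as we go
        wanted = child.get("level")
        if wanted is not None and wanted != level:
            element.remove(child)
        else:
            _prune(child, level)


if __name__ == "__main__":
    common = ET.fromstring(SOURCE)
    for audience in ("novice", "expert"):
        version = extract_for(common, audience)
        print(audience, [p.text for p in version.iter("para")])
```

The design choice worth noting is that the source is never edited per audience: each version is derived by selection, so a correction to a shared paragraph reaches every version.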
 
Pamela Gennusa (Database Publishing Systems) also picked up the
recurrent theme of this conference: that SGML was uniquely
appropriate to database publishing. She gave a good description
of the major issues in preparing text for publication in database
format and of the strengths of SGML as a means of making explicit
the information content of texts in a neutral way, which was
essential given that authors and consumers had different
requirements of it. She also touched on the problems of security,
high volume and time sensitivity which characterise database
publishing as an industry, and gave a good overview of the
capabilities of the new version of Datalogics' set of SGML
products, notably WriterStation, an impressive authoring tool
with several new facilities, and DMA (Document Management
Architecture), a complex set of object-oriented tools providing
database management facilities for SGML material, which also
includes full text searching facilities like those described
earlier by Chahuneau.
 
Ruud Loth (IBM Netherlands) gave a workmanlike presentation of
IBM's SGML product range, which now includes a context-sensitive
editor for OS/2 called TextWrite, a formatter for VM or MVS
called BookMaster and a new range of products called BookManager
to deal with `softcopy books' (IBMese for `electronic texts').
BookManager Build runs under VM and MVS and generates `softcopy'
from GML or SGML documents; BookManager Read runs additionally
under DOS or OS/2 and has impressive facilities for
hypertext-style browsing, intelligent text retrieval, indexing and
annotation. IBM documentation (47,000 titles, 9 milliard pages)
would soon be available in this new form.
 
Bruce Wolman of Texcel AS then gave a detailed product
description of the Avalanche `FastTag' automatic tagging system
which, it is claimed, can handle almost any kind of text and
automatically insert usable markup into it. The product has two
components, a `visual recognition engine' which searches for
visually distinct entities in a document, as defined by a set of
rules encoded in a language confusingly called Inspec, and
another language, called Louise, which defines the form in which
these objects should be encoded. Things like tables, footnotes,
horizontal lines, running headers or footers or special control
sequences could all be automatically tagged as well as objects
defined by regular expressions or specific keywords in the text.
The product had just been launched in Europe and was available
for MSDOS, VMS, Ultrix and Macintosh.
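
The two-part design described here, one set of rules to recognise objects and another to say how they should be encoded, can be sketched in miniature. The Python below does not reproduce Inspec or Louise, whose syntax was not shown; the patterns, tag names and templates are hypothetical, and the sketch works only on plain lines of text.

```python
import re

# Recognition rules (hypothetical): what each kind of object looks like.
RECOGNISERS = [
    ("heading",  re.compile(r"^[A-Z][A-Z0-9 ,'-]+$")),  # a line set in capitals
    ("rule",     re.compile(r"^-{5,}$")),               # a horizontal line
    ("footnote", re.compile(r"^\[(\d+)\]\s+(.*)$")),    # "[1] some text"
]

# Encoding rules (hypothetical): how each recognised object is marked up.
# {0} is the whole line; {1}, {2}, ... are the groups of the matching pattern.
TEMPLATES = {
    "heading":  "<head>{0}</head>",
    "rule":     "<hr>",
    "footnote": '<note n="{1}">{2}</note>',
    "para":     "<p>{0}</p>",
}


def tag_line(line):
    """Tag one line using the first recogniser that matches it."""
    text = line.strip()
    for kind, pattern in RECOGNISERS:
        match = pattern.match(text)
        if match:
            return TEMPLATES[kind].format(text, *match.groups())
    return TEMPLATES["para"].format(text)  # fallback: an ordinary paragraph


def tag_text(text):
    """Tag every non-blank line of a plain-text document."""
    return [tag_line(line) for line in text.splitlines() if line.strip()]


if __name__ == "__main__":
    sample = "INTRODUCTION\n-----\nPlain text here.\n[1] See the manual.\n"
    print("\n".join(tag_text(sample)))
```

Keeping recognition and encoding in separate tables means the same recognisers could feed a different target markup simply by swapping the templates.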
 
John Mackenzie Owen of the Dutch consultancy Pandata gave a brief
description of the SGML handling capabilities of BasisPlus,
stressing however its strengths as a document management system
rather than its admittedly limited SGML features. Bev Nichols of
Shaffstall described the Shaffstall-6000, an all-singing,
all-dancing document conversion system based on a package called
CopyMaster which included SGML among its 800,000 claimed
`document-to-document' pairings but which (I had the impression)
would really rather be operating on a proprietary format called
the Shaffstall Document Standard. The last presentation of the
day was from Ian Pirie of Yard Software Systems who described the
successful Protos project carried out by Sema Group and Pandata
for the CEC. The project handled proposals for funding from DG 13,
which had to be distributed to member states for comment, together
with the ensuing comments. MarkIt had been used to validate the format of
the messages passed in either direction, its regular expression
facilities being particularly useful in automatically encoding
the content of telex messages, and its application language to
encode the messages for storage in a Basis database. The whole
operation had been carried out with minimal disruption of the
message system.
 
Aside from the presentations, the conference provided an
excellent opportunity to catch up on the expanding world of
SGML-aware software. Among products demonstrated were new versions
of MarkIt and WriteIt from Sema Group, of Author/Editor from
SoftQuad, of Arbortext and of WriterStation from Datalogics, and an
interesting new product, an SGML editor called EASE from a Dutch
company called E2S. Delegates were also given a copy of the first
fruits from the European Workgroup on SGML, a consortium of
European publishers which has been working on a set of
AAP-inspired DTDs for scientific journals. This took the form of a
very well designed and produced booklet documenting a DTD for
scientific article headers. I came away from the conference
reassured that SGML was alive and well and living somewhere in
Europe.
 
Lou Burnard
Text Encoding Initiative
