LibreOffice has just released a recent version (3.3) which allows for 1
million rows in a spreadsheet, so it might be worth trying that instead
of Excel. It can also save to XML, and if you use the FODS file format,
you'll get a single XML file which is easier to process than the zipped
package which constitutes an ODS file.
On 11-08-16 06:25 AM, Croenen, Godfried wrote:
> Hi all,
> I apologise for this question which is a bit off topic, but maybe
> someone on the list can enlighten me.
> I have recently been playing around with the XML functions of Excel. I
> have been extracting TEI elements from a large corpus and loaded that
> into an Excel spreadsheet for editing and sorting in a tabular format
> and then processing it further. The Excel file contains about half a
> million <name> elements with their attributes.
> However, when I try to save the edited file as XML data (NOT XML
> spreadsheet) only the first 24,528 rows are saved. There is no
> indication that there is something wrong with some of the rows, as the
> number of rows in the saved file stays the same however I sort the data.
> An Excel spreadsheet can contain up to 1,048,576 rows, and I assume that
> there is no different upper limit for XML data (at least I found nothing
> about this in the Excel Help). When I import the XML data, Excel
> certainly reads in all the rows.
> The people in my university's computing service have no idea why this is
> happening and the only thing they have been able to suggest is to break
> the data file down in smaller chuncks because with smaller files the
> problem does not seem happen.
> Is this a known bug/feature in Excel? Is there a way around it?
> Dr. Godfried Croenen
> (currently Lester Cappon Fellow at the Newberry Library, Chicago)
> School of Cultures, Languages and Area Studies
> University of Liverpool
> Chatham Street
> L69 7ZR
> United Kingdom
> Tel: +44 (0)151 794 2763
> Fax: +44 (0)151 794 2357
> e-mail: [log in to unmask] <mailto:[log in to unmask]>
> Personal webpage: http://www.liv.ac.uk/~gcroenen/index.htm
> Online Froissart: http://www.hrionline.ac.uk/onlinefroissart/
University of Victoria Humanities Computing and Media Centre
([log in to unmask])