[tex4ht] [bug #618] Incomplete XML Document, domfilter error, truncated build on large file.
Nasser M. Abbasi
puszcza-hackers at gnu.org.ua
Tue Dec 12 02:04:12 CET 2023
URL:
<http://puszcza.gnu.org.ua/bugs/?618>
Summary: Incomplete XML Document, domfilter error, truncated
build on large file.
Project: tex4ht
Submitted by: nma123
Submitted on: Tue Dec 12 01:04:12 2023
Category: None
Priority: 5 - Normal
Severity: 7 - Important
Status: None
Privacy: Public
Assigned to: None
Originator Email:
Open/Closed: Open
Discussion Lock: Any
_______________________________________________________
Details:
I have been working with Michal on this via private email but thought to enter
a bug report on this just for tracking and documentation.
I have one large file (57,000 PDF pages) that when compiled with tex4ht (takes
14 hrs), and at about 10% when generating the final HTML pages, it gets XML
error and stops.
i.e. the 90% rest of the sections are missing from the final web pages.
-------------------------------------------------------
[INFO] make4ht-lib: parse_lg process file: reportsubsection1100.htm
[WARNING] domfilter: DOM parsing of reportsubsection1100.htm failed:
[WARNING] domfilter:
...ive/2023/texmf-dist/tex/luatex/luaxml/luaxml-mod-xml.lua:175: Incomplete
XML Document [char=33675]
[INFO] make4ht-lib: parse_lg process file: reportsubsection1100.htm
[WARNING] domfilter: DOM parsing of reportsubsection1100.htm failed:
[WARNING] domfilter:
...ive/2023/texmf-dist/tex/luatex/luaxml/luaxml-mod-xml.lua:175: Incomplete
XML Document [char=33675]
[INFO] make4ht-lib: parse_lg process file: reportsubsection1100.htm
----------------------------------
I've just send Michal a link to complete self contained ZIP file (450 MB) with
instructions how to run as standalone in order to see these errors on his end.
I tried this on latest texlive 2023 on new Linux installation.
I will work with Michal to provide any additional information he needs from
me, to hopefully find the cause of this problem.
This happens only on this file. I think may be due to the large size, since
the Latex code is all generated by same program and only this file gives this
error.
--Nasser
_______________________________________________________
Reply to this item at:
<http://puszcza.gnu.org.ua/bugs/?618>
_______________________________________________
Message sent via/by Puszcza
http://puszcza.gnu.org.ua/
More information about the tex4ht
mailing list.