I am trying to parse an input XML file that is 1,300,000 lines long (56 MB) using xsltproc. I get the error below:
input.xml:245393: parser error : internal error: Huge input lookup
"description" : "List of values for possible department codes"
^
unable to parse input.xml
The same xsltproc was able to process an XML file that was 930,000 lines long (48 MB).
I also tried cutting the failing file down to 600,000 lines by removing unnecessary parts. I still get the same error, which is strange, because xsltproc can parse 900,000 lines but not 600,000.
How do I resolve this issue?
`maxLength` as `30` sounds rather like an XSD schema related problem. Is that document referring to a schema? Is the error occurring with some `xsl:key` processing?
Set the `XML_PARSE_HUGE` parser option (which I think then disables any security-based restrictions/limits normally set by default). Then you need to recompile.
Or try one of the languages that use libxml2, such as Python or PHP. They seem to have options to disable the security-based limits at run time (e.g. lxml declares `huge_tree`, see lxml.de/parsing.html#parser-options).