Closed dclong closed 3 years ago
How big is report.json?
About 13G.
The underlying library will refuse to parse JSON documents larger than 4 GB. It will support large inputs, but only if they are made of a stream of JSON documents (e.g., ndjson).
Ingesting a single 13 GB document all at once in a DOM tree is a performance and interoperability anti-pattern. I recommend against it.
I cannot speak pysimdjson but I expect that it works as is expected.
As Lemire said, this isn't supported in the underlying library. When/if simdjson gets streaming support, we'll definitely implement it.
I encounter the following issue when parsing a huge (>10G) JSON file.