sepinf-inc / IPED

IPED Digital Forensic Tool. It is an open source software that can be used to process and analyze digital evidence, often seized at crime scenes by law enforcement or in a corporate investigation by private examiners.
Other
893 stars 214 forks source link

Increase Tika's SAXParser default pool size #1965

Closed lfcnassif closed 8 months ago

lfcnassif commented 8 months ago

In machines with dozens of CPU cores, default Tika SAXParser objects pool size may not be enough and can cause a bottleneck, like reported at https://github.com/sepinf-inc/IPED/discussions/1964

paulobreim commented 8 months ago

Do you think if I limited it to, say, 8 or 16 cores, it would be a good test?

lfcnassif commented 8 months ago

Do you think if I limited it to, say, 8 or 16 cores, it would be a good test?

There is no need, Paulo. Default Tika SAXParser pool size is 10, I increased it to the number of available logical CPU cores. A snapshot should be ready in 10min here if you would like to test: https://github.com/sepinf-inc/IPED/actions/runs/6750453327

paulobreim commented 8 months ago

Great, I will test now

paulobreim commented 8 months ago

@lfcnassif finished well, no more problems with SAXParser.

tks

lfcnassif commented 8 months ago

Thanks for testing @paulobreim!