gliwka / hyperscan-java

Match tens of thousands of regular expressions within milliseconds - Java bindings for Intel's hyperscan 5
BSD 3-Clause "New" or "Revised" License
175 stars, 46 forks

Create a compressed BASE64 encoded version of the hyperscan database. #223

Open apismensky opened 7 months ago

apismensky commented 7 months ago

We need to store a Hyperscan database so that it can be easily serialized and sent as part of a payload (JSON, XML, etc.). The ideal solution would be to compress the stream produced by Database's save method and then Base64-encode the result. Compression can reduce the DB size by roughly 10x on average, as we have seen for some of our DBs, which can be critical for large databases. A corresponding static method to read a serialized DB (Base64-decode and decompress) is also needed. PR: https://github.com/gliwka/hyperscan-java/pull/224
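To illustrate the round trip described above, here is a minimal sketch using only the JDK. It is not the code from the PR: the class name `CompressedBase64` and its methods are hypothetical, GZIP is an assumed choice of compression, and the input bytes are assumed to be whatever Database's save method wrote into a `ByteArrayOutputStream`.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.Base64;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

/** Hypothetical helper for illustration; not part of hyperscan-java. */
public final class CompressedBase64 {
    private CompressedBase64() {}

    /** GZIP-compresses the raw bytes, then Base64-encodes the compressed result. */
    public static String encode(byte[] raw) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        // Closing the GZIP stream writes the trailer, so close it before reading the buffer.
        try (GZIPOutputStream gzip = new GZIPOutputStream(buffer)) {
            gzip.write(raw);
        }
        return Base64.getEncoder().encodeToString(buffer.toByteArray());
    }

    /** Reverses encode(): Base64-decodes, then GZIP-decompresses. */
    public static byte[] decode(String encoded) throws IOException {
        byte[] compressed = Base64.getDecoder().decode(encoded);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPInputStream gzip = new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            byte[] chunk = new byte[8192];
            int read;
            while ((read = gzip.read(chunk)) != -1) {
                out.write(chunk, 0, read);
            }
        }
        return out.toByteArray();
    }
}
```

The resulting string can be embedded directly in a JSON or XML field. On the receiving side, the bytes returned by `decode` could be wrapped in a `ByteArrayInputStream` and handed to whatever load method the library provides. Note that Base64 adds about 33% overhead on top of the compressed size, so the net saving depends on how well a given database compresses.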

apismensky commented 7 months ago

@gliwka - please take a look and let me know