explosion / srsly

🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
MIT License

Add use_gzip flag for json read/write #9

Closed polm closed 5 years ago

polm commented 5 years ago

The basic idea here is to use gzipped JSON for disk-efficient storage of simply structured data. In particular, this should come in handy for shrinking the language data in spaCy.
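To make the idea concrete, here is a rough sketch using only the standard library. The helper names `write_json` and `read_json`, and how the `use_gzip` flag is wired in, are assumptions for illustration; they are not necessarily how this PR or srsly implements it.

```python
import gzip
import json
import os
import tempfile


def write_json(path, data, use_gzip=False):
    """Hypothetical helper: write data as JSON, gzip-compressed if requested."""
    # gzip.open in text mode mirrors the built-in open() interface.
    opener = gzip.open if use_gzip else open
    with opener(path, "wt", encoding="utf8") as f:
        json.dump(data, f)


def read_json(path, use_gzip=False):
    """Hypothetical helper: read JSON, decompressing first if requested."""
    opener = gzip.open if use_gzip else open
    with opener(path, "rt", encoding="utf8") as f:
        return json.load(f)


# Simply structured, repetitive data (made up for this example).
data = {"words": ["the", "of", "and"] * 1000}

tmp_dir = tempfile.mkdtemp()
plain_path = os.path.join(tmp_dir, "data.json")
gzip_path = os.path.join(tmp_dir, "data.json.gz")

write_json(plain_path, data)
write_json(gzip_path, data, use_gzip=True)

# Compare on-disk sizes and check that the gzipped file round-trips.
plain_size = os.path.getsize(plain_path)
gzip_size = os.path.getsize(gzip_path)
roundtrip = read_json(gzip_path, use_gzip=True)
```

How much space this saves depends on the data; highly repetitive content like the toy example compresses far better than typical files would.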

As of this commit this works, and there's a basic test. A couple of things to improve: