DistrictDataLabs / baleen

An automated ingestion service for blogs to construct a corpus for NLP research.
MIT License
86 stars 38 forks source link

move sanitize to its own exporter option #89

Open janetriley opened 7 years ago

janetriley commented 7 years ago

export.py's --scheme argument accepts json and html, as well as sanitize levels raw, safe, and text.

Move sanitize levels to their own argument and ensure they get passed in properly to Post.htmlize().

janetriley commented 7 years ago

What should the single-letter flag be? I was thinking of --sanitize but -S is already taken by scheme. --level and -L don't seem intuitive enough.

bbengfort commented 7 years ago

Hmm, this is a tricky one. Would it be too weird to do --sanitize and -L? What about --sanitize and -z?