DistrictDataLabs / baleen

An automated ingestion service for blogs to construct a corpus for NLP research.
MIT License
86 stars 38 forks source link

Move html sanitization to Post #87

Closed janetriley closed 7 years ago

janetriley commented 7 years ago

HTML sanitization functions are in the exporter and only applied at export. We may want to sanitize Posts in other situations.

Make Posts sanitizable and let the exporter ask for the level the user requested.

bbengfort commented 7 years ago

I really like how you did this in htmlize()!

janetriley commented 7 years ago

Branch merged, closing the issue.