DistrictDataLabs / baleen

An automated ingestion service for blogs to construct a corpus for NLP research.
MIT License

Add tests to sanitize #82

Closed janetriley closed 7 years ago

janetriley commented 7 years ago

#70: write tests for sanitize.

Added constants for the sanitize levels. Previously, sanitize_html didn't return anything if the sanitize level wasn't recognized. It now returns the unmodified content if the level is None and raises an error if the level is unknown.
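A rough sketch of the behavior described above, for illustration only. The constant names (`RAW`, `SAFE`, `TEXT`), the `ValueError` exception type, and the standard-library stand-ins for the actual sanitizing steps are all assumptions, not the code in this PR:

```python
from html import escape
from html.parser import HTMLParser

# Hypothetical level constants; the names and values used in the PR may differ.
RAW = "raw"
SAFE = "safe"
TEXT = "text"


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def sanitize_html(html, level=None):
    """Illustrative stand-in: sanitize `html` according to `level`."""
    # No level given (or raw, by assumption): hand back the content unmodified.
    if level is None or level == RAW:
        return html
    # Stand-in for a real HTML cleaner: escape the markup.
    if level == SAFE:
        return escape(html)
    # Stand-in for text extraction: drop the tags, keep the text.
    if level == TEXT:
        parser = _TextExtractor()
        parser.feed(html)
        parser.close()
        return "".join(parser.chunks)
    # Unknown level: fail loudly instead of silently returning None.
    raise ValueError("unknown sanitize level: {!r}".format(level))
```

Raising on an unknown level means a typo in the level name surfaces right away, rather than as `None` content further down the export.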

coveralls commented 7 years ago

Coverage increased (+1.6%) to 70.952% when pulling ba6de5303321f7debbcbf9ce3989df8957b18cc2 on janetriley:feature-tweak_export_sanitize_and_add_tests into f16b2a3167d956eecdf927283831d86699bfbd01 on DistrictDataLabs:develop.

will2041 commented 7 years ago

Looks like the pep8 check that runs with the tests is failing the Travis build. I'll address that in a different commit.

I'm digging the changes here. I'll pull this in then fix up the build issue.