uchicago-capp-30320 / CivicLens

Putting the public back in public commenting
https://civic-lens.org
GNU Affero General Public License v3.0
2 stars 1 forks source link

Clean existing comment data #204

Closed andrewjtdunn closed 6 months ago

andrewjtdunn commented 6 months ago

I've built text cleaning into the methods so this should happen on an ongoing basis, but we need to go back and clean the text of comments already in the database.

Just a one-time run of some quick code to apply civiclens.utils.text.clean_text to the comment field in regulations_comment

andrewjtdunn commented 6 months ago

Decision 5.17.2024: this is addressed in the NLP portion in the pipeline, so nothing new needs to be done