Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
We can refactor
src/preprocessing.py
and make it slightly more readable/time efficient, as well as easy to add more transformations to the text