nikwilms / ESG-Score-Prediction-from-Sustainability-Reports

This repository contains code and data for a machine learning model that predicts ESG (Environmental, Social, and Governance) scores based on sustainability reports and company data. It's a valuable resource for researchers, investors, and sustainability professionals interested in ESG score prediction using machine learning techniques.
MIT License
15 stars 2 forks source link

Remove sparse terms #18

Closed mariusbosch closed 10 months ago

mariusbosch commented 10 months ago

If a word or term appears very infrequently across the documents, it might not be very informative. Consider setting a threshold and removing words/terms that appear less frequently than this threshold.