nikwilms / ESG-Score-Prediction-from-Sustainability-Reports

This repository contains code and data for a machine learning model that predicts ESG (Environmental, Social, and Governance) scores based on sustainability reports and company data. It's a valuable resource for researchers, investors, and sustainability professionals interested in ESG score prediction using machine learning techniques.
MIT License
15 stars 2 forks source link

Tokenization #14

Closed mariusbosch closed 10 months ago

mariusbosch commented 10 months ago

Break text into words, phrases, symbols, or other meaningful elements (tokens) to make it easier to analyze. Libraries like NLTK and spaCy offer good tokenization tools.