nestauk / industrial_taxonomy

Refactor of nestauk/industrial-taxonomy, which it will replace upon completion.
MIT License

Label sectors with names #22

Open Juan-Mateos opened 2 years ago

Juan-Mateos commented 2 years ago

Do we want to do any additional filtering of text sectors to remove noisy ones?

Devise a strategy to name sectors.

In the previous version of the taxonomy we used TF-IDF at the text sector level, i.e. each text sector was compared against all other text sectors. Alternatively, we could do this inside each SIC4, i.e. compare a text sector only to the other text sectors generated from the same SIC4.
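To make the two options concrete, here is a rough sketch of how they could differ. It is only an illustration, not the code from the previous taxonomy: the function name `tfidf_labels`, the input layout (one pooled token list per text sector), the sector ids and the smoothed IDF formula are all assumptions. Passing a sector-to-SIC4 mapping restricts the document frequencies to sectors sharing a SIC4; leaving it out compares every sector against all others.

```python
import math
from collections import Counter, defaultdict


def tfidf_labels(sector_tokens, sector_to_sic4=None, top_n=3):
    """Return the top_n highest-scoring TF-IDF terms for each text sector.

    sector_tokens: dict of sector id -> list of tokens pooled over the sector
        (hypothetical input layout).
    sector_to_sic4: optional dict of sector id -> SIC4 code. If given, each
        sector is only compared to sectors generated from the same SIC4.
    """
    # Build the comparison groups: one per SIC4, or a single global group.
    groups = defaultdict(list)
    for sector in sector_tokens:
        key = sector_to_sic4[sector] if sector_to_sic4 else "all"
        groups[key].append(sector)

    labels = {}
    for members in groups.values():
        n = len(members)
        # Document frequency: number of sectors in the group containing a term.
        df = Counter()
        for sector in members:
            df.update(set(sector_tokens[sector]))
        for sector in members:
            counts = Counter(sector_tokens[sector])
            total = sum(counts.values())
            # Relative term frequency times a smoothed IDF (assumed formula).
            scores = {
                term: (c / total) * (math.log((1 + n) / (1 + df[term])) + 1)
                for term, c in counts.items()
            }
            # Sort by descending score, breaking ties alphabetically.
            ranked = sorted(scores, key=lambda t: (-scores[t], t))
            labels[sector] = ranked[:top_n]
    return labels


# Toy data with made-up sector ids and tokens.
tokens = {
    "6201_0": ["software", "cloud", "cloud", "hosting"],
    "6201_1": ["software", "games", "games", "mobile"],
    "5610_0": ["restaurant", "restaurant", "food", "software"],
}
sic4 = {"6201_0": "6201", "6201_1": "6201", "5610_0": "5610"}

within_sic4 = tfidf_labels(tokens, sic4, top_n=2)
across_all = tfidf_labels(tokens, top_n=2)
```

Within a SIC4, terms shared by all of its text sectors (here "software" inside 6201) are down-weighted, so the labels should emphasise what distinguishes a text sector from its siblings rather than from the whole corpus. One caveat is that a SIC4 with a single text sector gets no contrast at all, so its scores reduce to raw term frequencies.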

We could also explore alternative strategies, e.g. using GPT-3?