Eventually, we'll need to make a table with organization names and attributes such as:
Which names does the organization use in each country where it is active (e.g. Letzte Generation, Ultima Generazione)? We already have a little of this in climate_orgs.py.
What abbreviation does the organization have?
Is the organization better identified by its (English) name or by its abbreviation? E.g. "Fridays for Future" is better than "FFF", and "Last Generation" is better than "Letzte Generation" (for English readers), but "NABU" is probably better than whatever this abbreviation stands for.
Is the organization uniquely identified by its name? This is usually the case, but not e.g. for "BUND", which is just a common German word and may require a language model to identify it as a protest organization from context.
We don't need a table covering all groups. Instead, we should devise one standard way of taking the info from ACLED and then define a table of exceptions, similar to the climate_orgs.py approach but better.
This is especially relevant when scaling to other countries later, not so much at the moment.
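A minimal sketch of the "default rule plus exceptions table" idea, to make the discussion concrete. Everything here is an assumption for illustration: the names `OrgException`, `EXCEPTIONS` and `normalize_org` are hypothetical and not taken from climate_orgs.py, the default rule (only collapsing whitespace) is a placeholder, and the input is assumed to be a single organization name string already extracted from ACLED.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class OrgException:
    """Override for an organization that the default rule handles badly."""

    display_name: str               # preferred name for English readers
    aliases: tuple[str, ...] = ()   # local names and abbreviations
    ambiguous: bool = False         # True if the name is also a common word


# Hypothetical exceptions table; entries come from the examples in this note.
EXCEPTIONS = (
    OrgException("Last Generation", aliases=("Letzte Generation", "Ultima Generazione")),
    OrgException("Fridays for Future", aliases=("FFF",)),
    OrgException("NABU"),  # abbreviation preferred over the full name
    OrgException("BUND", ambiguous=True),  # needs a context check
)

# Case-insensitive lookup from any known name to its exception entry.
_LOOKUP = {
    name.casefold(): exc
    for exc in EXCEPTIONS
    for name in (exc.display_name, *exc.aliases)
}


def normalize_org(raw_name: str) -> tuple[str, bool]:
    """Return (display_name, needs_context_check) for a raw organization name."""
    # Default rule (placeholder): collapse whitespace, keep the name as given.
    cleaned = " ".join(raw_name.split())
    exc = _LOOKUP.get(cleaned.casefold())
    if exc is None:
        return cleaned, False
    return exc.display_name, exc.ambiguous
```

With this layout, most groups never need an entry: `normalize_org("Some Local Group")` just returns the cleaned name, while `normalize_org("Letzte Generation")` maps to "Last Generation" and `normalize_org("BUND")` is flagged for a context check.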