Sparsh1212 / gsocanalyzer

A blazingly fast tool to analyze all the selected organizations in Google Summer of Code in the form of graphical analytics.
MIT License
75 stars, 39 forks

Duplicate name under topics category. #77

Open sauravrajleaf opened 2 years ago

sauravrajleaf commented 2 years ago

Description

The same topic appears under multiple names in the topics category.

Relevant files

I've attached screenshots of some of the names I found.

Screenshot

screenshot-localhost_3000-2021 12 30-19_05_40 screenshot-localhost_3000-2021 12 30-19_06_37


sauravrajleaf commented 2 years ago

If this issue is legitimate, I would like to work on it and find a solution.

Sparsh1212 commented 2 years ago

I think this is because there are near-duplicate entries (not exact duplicates) caused by organizations changing their names in recent years. To tackle this, we'll have to merge each set of those organizations into a single one. To merge them, you can use the script (merge.py) and follow the instructions. If you are interested in working on this issue, kindly create a list of all the duplicate orgs that you can find and merge all of them in a single PR. Let me know if you want me to assign this issue to you.
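
For anyone compiling that list, here is a minimal sketch of one way to surface candidate duplicates before merging. It is not merge.py, whose interface is not shown in this thread. The function names (`normalize`, `find_candidate_duplicates`), the 0.85 threshold, and the sample names are all assumptions for illustration. Every pair it reports still needs manual review, since a renamed organization may share little text with its old name.

```python
import re
from difflib import SequenceMatcher
from itertools import combinations


def normalize(name: str) -> str:
    """Lowercase a name and collapse punctuation/whitespace so trivial variants compare equal."""
    return re.sub(r"[^a-z0-9]+", " ", name.lower()).strip()


def find_candidate_duplicates(names, threshold=0.85):
    """Return (name_a, name_b, score) for pairs whose normalized forms look alike.

    The threshold is an arbitrary starting point (assumption), not a tuned value.
    """
    candidates = []
    for a, b in combinations(sorted(set(names)), 2):
        score = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
        if score >= threshold:
            candidates.append((a, b, round(score, 2)))
    return candidates


if __name__ == "__main__":
    # Hypothetical sample names; replace with the names loaded from the project's data.
    sample = ["Machine Learning", "machine-learning", "Web", "Compilers"]
    for a, b, score in find_candidate_duplicates(sample):
        print(f"{a!r} <-> {b!r} (similarity {score})")
```

String similarity only catches spelling and formatting variants. Renames that change the name entirely would have to be found by hand and added to the list.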