Sparsh1212 / gsocanalyzer

A blazingly fast tool to analyze all the selected organizations in Google Summer of Code in the form of graphical analytics.
MIT License
75 stars 39 forks source link

Duplicate entries for organizations. #66

Closed letsintegreat closed 2 years ago

letsintegreat commented 2 years ago

Description

As some organizations used different names while participating in GSoC, there are multiple entries for them in finalData.json.

Like:

Stony Brook University Biomedical Informatics Stony Brook University, Biomedical Informatics

GFOSS - Open Technology Alliance GFOSS - Open Technologies Alliance Open Technologies Aliance - GFOSS Open Technologies Alliance - GFOSS

FOSSology FOSSology Project

However, the data provided here by Google, contains one single entry for each organization. So we can recompile the entire finalData.json from the given link to remove all the discrepancies at once.

ShivaRapolu01 commented 2 years ago

If this issue is not assigned , I can work on this

letsintegreat commented 2 years ago

I'd like to work on this issue.

ShivaRapolu01 commented 2 years ago

Yeah @letsintegreat , you pointed out this issue so you must go ahead 👍

Sparsh1212 commented 2 years ago

@ShivaRapolu01 Thanks for showing interest in the issue. But since @letsintegreat pointed this out and he is willing to fix that, we have to assign him. Feel free to comment on any unassigned issue.