Open MarcoAJanssen opened 7 years ago
This is possible but would require a fair amount of development time to build properly. I think we will want to redo the curator form at some point and this could fall into that larger rework.
In the meantime Calvin has developed some deduplication tools that can help with removing the annoying Netlogo, NetLogo rNetLogo RNetLogo dupes from the command line and we'll start to apply those more regularly.
This is not a high priority, but will be great to resolve before we get more people being part in adding meta data.
The entries for sponsors and platforms include many typos and this becomes an issue if we want to visualize the data. Manual data cleaning is an option, but we need to reduce mistakes being made. A possible solution might be that when we want to add data to sponsors or platform, we get a list of the most frequent options plus the option "other". If the sponsor or platform is not in the list one can type in the option via other. This may reduce the diversity in which popular sponsors and platforms are entered and reduce the need for manual datacleaning.