comses / catalog

Web tools to annotate publications related to computational modeling
http://catalog.comses.net
GNU General Public License v3.0
3 stars 3 forks source link

Improve workflow to reduce mistakes in data entry #104

Open MarcoAJanssen opened 7 years ago

MarcoAJanssen commented 7 years ago

The entries for sponsors and platforms include many typos and this becomes an issue if we want to visualize the data. Manual data cleaning is an option, but we need to reduce mistakes being made. A possible solution might be that when we want to add data to sponsors or platform, we get a list of the most frequent options plus the option "other". If the sponsor or platform is not in the list one can type in the option via other. This may reduce the diversity in which popular sponsors and platforms are entered and reduce the need for manual datacleaning.

alee commented 5 years ago

This is possible but would require a fair amount of development time to build properly. I think we will want to redo the curator form at some point and this could fall into that larger rework.

In the meantime Calvin has developed some deduplication tools that can help with removing the annoying Netlogo, NetLogo rNetLogo RNetLogo dupes from the command line and we'll start to apply those more regularly.

MarcoAJanssen commented 5 years ago

This is not a high priority, but will be great to resolve before we get more people being part in adding meta data.