Open iDigBioBot opened 7 years ago
we need a label for DQ issues, and perhaps a skill needs label? This ticket is really about strategies for cleaning data (tools and skills needs).
Added data quality tag
i think we need a space (on Twitter? or maybe biology.stackexchange or?) where we can post these questions to lots more people. There must be scripts out there people are using right now that we could point/link to
Just some references to existing examples of data cleaning using scripts:
Maybe good to list them elsewehere together with other examples?
A user submitted this information via the Darwin Core Hour webform: Timestamp: 2/8/2017 12:44:55 Please provide a topic of interest: How to clean/reformat data efficiently and en-masse. EG: Depth measurements in many formats- convert all to meters Are you capable of and interested in participating: No Who else would you recommend to participate in the presentation: John or David Bloom What resources can you point to: OpenRefine? Your name: Ben Frable Your email: