USEPA / Phytoplankton-Data-Analysis

Phytoplankton Data Analysis

Use of # symbol #19

Closed mjpdenver closed 10 years ago

mjpdenver commented 10 years ago

Checking records, I noticed taxa fields that use the "#" symbol. Some examples follow.

Chroomonas #1, Pseudoanabaena #1, Cosmarium #1, Cosmarium #2

Since "#" is the comment character in R, these records are read in without error, but the "#" and the number after it are stripped. This needs to be fixed.

jbeaulie commented 10 years ago

Can this be handled by setting (comment.char = "") in the read.table command?
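A minimal sketch of what that suggestion would look like, assuming the data were read from a tab-separated text file with read.table. The file contents and column names below are made up for illustration and are not the project's actual data or import code.

```r
# Hypothetical sample data: a count column and a taxa column containing "#".
sample_lines <- c("count\ttaxa",
                  "12\tChroomonas #1",
                  "7\tCosmarium #2")
tmp <- tempfile(fileext = ".txt")
writeLines(sample_lines, tmp)

# Default behaviour: comment.char = "#", so the "#" and everything after it
# on the line are discarded while reading.
default_read <- read.table(tmp, header = TRUE, sep = "\t",
                           stringsAsFactors = FALSE)

# comment.char = "" turns comment handling off, so "#1" and "#2" are kept.
fixed_read <- read.table(tmp, header = TRUE, sep = "\t",
                         comment.char = "", stringsAsFactors = FALSE)

print(default_read$taxa)
print(fixed_read$taxa)
```

Note that read.csv already defaults to comment.char = "", so the stripping would only be expected on the read.table path.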

mjpdenver commented 10 years ago

Looking into this more, the "#" actually often gets read into data.frames by readWorksheet, so I may have misidentified the problem. More soon.

Matt

Date: Tue, 25 Mar 2014 10:46:16 -0700
Subject: Re: [Phytoplankton-Data-Analysis] Use of # symbol (#19)

mjpdenver commented 10 years ago

I think the broader question here is how to handle non-ASCII characters. Many taxa names contain umlauts and non-ASCII punctuation, and they seem to get mangled.
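One way to check whether such names survive a write and read cycle is sketched below. It assumes the names pass through a UTF-8 text file, which is not the Excel/readWorksheet path discussed earlier in this thread, and the taxon labels are invented for illustration.

```r
# Hypothetical taxon labels with an umlaut and an en dash, written with
# \u escapes so this script itself stays plain ASCII.
taxa <- c("Example t\u00e4xon #1", "Example tax\u00f6n \u2013 2")
tmp <- tempfile(fileext = ".csv")

# Write the raw UTF-8 bytes, independent of the session's locale.
writeLines(enc2utf8(c("taxa", taxa)), tmp, useBytes = TRUE)

# encoding = "UTF-8" tells R to mark the strings it reads as UTF-8
# instead of assuming the platform's native encoding.
dat <- read.csv(tmp, encoding = "UTF-8", stringsAsFactors = FALSE)

# TRUE if every name came back exactly as it was written.
round_trip_ok <- identical(dat$taxa, taxa)
print(round_trip_ok)
```

If the check fails, comparing Encoding(dat$taxa) with Encoding(taxa) is a reasonable first step for finding where the names were reinterpreted.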

mjpdenver commented 10 years ago

I believe this issue has been resolved. Non-ASCII punctuation now seems to be saved correctly.