inbo / data-publication

🔓 Open biodiversity data publication by the INBO
https://ipt.inbo.be
Creative Commons Attribution 4.0 International
16 stars 7 forks source link

remove linebreaks from input csv #180

Closed PietrH closed 1 year ago

PietrH commented 1 year ago

As requested, a little script to remove linebreaks from an input csv, replacing them with spaces. Also checks to see if the resulting file has one record per line compared to the data.table csv parser.

PietrH commented 1 year ago

Dimitri had a csv occurrence file that had linebreaks in some records that break the csv parsing of the IPT, as a quick and dirty fix, we want to replace all line breaks with spaces, for all fields. This pull request includes a quick and dirty R script that does this, and a processed csv file.

I also changed .gitignore to include the output csv file