I was inspired by https://github.com/johnmyleswhite/CorruptCSVs to have a go at packaging the TSVs and using datapkg to read, but noticed different behavior from read.csv() when reading the r_example_1.tsv. R's read.csv() behavior seems preferable especially as the datapkg version simply discards that (badly formatted) second column.
Maybe relevant to @roll @georgiana-b @akariv @pwalsh as well?
#r_example_1.tsv
1
2
3
4
5 0
R's read.csv():
> read.csv("r_example_1.tsv", header = FALSE, sep = "\t")
V1 V2
1 1 NA
2 2 NA
3 3 NA
4 4 NA
5 5 0
I was inspired by https://github.com/johnmyleswhite/CorruptCSVs to have a go at packaging the TSVs and using
datapkg
to read, but noticed different behavior fromread.csv()
when reading ther_example_1.tsv
. R'sread.csv()
behavior seems preferable especially as thedatapkg
version simply discards that (badly formatted) second column.Maybe relevant to @roll @georgiana-b @akariv @pwalsh as well?
R's
read.csv()
:The
datapkg
approach:https://github.com/danfowler/CorruptCSVs