mlr-org / farff

a faster arff parser
Other
11 stars 6 forks source link

Quoted strings in nominal values #14

Closed jakobbossek closed 8 years ago

jakobbossek commented 8 years ago

ds = getOMLDataSet(71) # no problems with RWeka setOMLConfig(arff.reader = "farff") ds = getOMLDataSet(71)

Data '71' file 'description.xml' found in cache. Data '71' file 'dataset.arff' found in cache. Fehler in consume(x, "^\s}\s", no.match.error = TRUE) : Error while parsing factor levels in line: @attribute carbon {'\'B1of3\'','\'B2of3\'','\'B3of3\''}

jakobbossek commented 8 years ago

Might be related to #10.

berndbischl commented 8 years ago

it not related to 10. I occurs also with readr and is a bug when we parse the header / factor levels

berndbischl commented 8 years ago

This works now. I have added a similar unit test, converted from the same data did 71. Because that data set is just too large for a repeated test.