renatopp / liac-arff

A library for read and write ARFF files in Python
MIT License
98 stars 49 forks source link

Allow tab separated instance data #43

Open jondo opened 8 years ago

jondo commented 8 years ago

I have got a lot of ARFF files with tab separated instance data, and Weka can read them. Could you please also read them? (I know that officially "Attribute values for each instance are delimited by commas.")

mfeurer commented 8 years ago

Hi, I don't think liac-arff should support anything which is not in the documentation. Regarding that, I filed an issue at the wekalist.

mfeurer commented 8 years ago

Arff specs are updated. Do you want to work on this and submit a pull request?

jondo commented 8 years ago

Great to hear! Sorry - I have switched to using pandas.read_table for now, and I will stay with that because this fits well to my next step of calling pandas.merge for joining tables.

renatopp commented 8 years ago

load and loads could receive a parameter delimiter (just like csv module), with defaults to ,, and simply redirect it to the data conversion procedure (which uses csv).