frictionlessdata / tableschema-go

A Go library for working with Table Schema.
MIT License
46 stars, 10 forks

Add xls/xlsx data processing #68

Open endersonmaia opened 6 years ago

endersonmaia commented 6 years ago

A lot of data is made public in .xls/.xlsx format, and it would be great to free this data by converting it to more open formats ;)

danielfireman commented 6 years ago

That is indeed a super nice to have!

The first thing to do is to implement a Table and an Iterator. You can follow the CSV implementation.
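As a rough sketch of what a Table and Iterator over spreadsheet rows could look like: the `Table` and `Iterator` interfaces below are hypothetical stand-ins defined only for this example (the library's real interfaces may have more methods or different signatures), and the rows are assumed to have already been decoded into strings by whichever XLS/XLSX reader ends up being used.

```go
package main

import "fmt"

// Iterator and Table are hypothetical interfaces defined for this sketch;
// the library's real interfaces may differ.
type Iterator interface {
	Next() bool    // advances to the next row; false when exhausted
	Row() []string // current row; nil before the first call to Next
}

type Table interface {
	Headers() []string
	Iter() Iterator
}

// sheetTable holds one worksheet's cells, already decoded into strings.
// The first row is treated as the header row.
type sheetTable struct {
	rows [][]string
}

// NewSheetTable wraps decoded worksheet rows in a Table.
func NewSheetTable(rows [][]string) Table {
	return &sheetTable{rows: rows}
}

func (t *sheetTable) Headers() []string {
	if len(t.rows) == 0 {
		return nil
	}
	return t.rows[0]
}

// Iter returns an iterator over the data rows (everything after the header).
func (t *sheetTable) Iter() Iterator {
	if len(t.rows) == 0 {
		return &sheetIterator{}
	}
	return &sheetIterator{rows: t.rows[1:]}
}

type sheetIterator struct {
	rows [][]string
	pos  int // number of rows consumed so far
}

func (it *sheetIterator) Next() bool {
	if it.pos >= len(it.rows) {
		return false
	}
	it.pos++
	return true
}

func (it *sheetIterator) Row() []string {
	if it.pos == 0 {
		return nil
	}
	return it.rows[it.pos-1]
}

func main() {
	tab := NewSheetTable([][]string{
		{"id", "name"},
		{"1", "foo"},
		{"2", "bar"},
	})
	fmt.Println(tab.Headers())
	for it := tab.Iter(); it.Next(); {
		fmt.Println(it.Row())
	}
}
```

Keeping the decoded rows in memory is the simplest option; a streaming iterator would be preferable for large workbooks, if the chosen reader supports it.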

One way of approaching this problem is generating a CSV from the XLS/XLSX and from there re-use the CSV implementation.
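A minimal sketch of that conversion step, using only the Go standard library: `RowsToCSV` is a hypothetical helper name, and the input rows are assumed to come from an XLS/XLSX reader that yields each worksheet row as a slice of strings. The resulting bytes could then be handed to whatever already parses CSV.

```go
package main

import (
	"bytes"
	"encoding/csv"
	"fmt"
)

// RowsToCSV is a hypothetical helper: it serializes spreadsheet rows
// (already decoded into strings) as CSV so that existing CSV handling
// can be reused unchanged.
func RowsToCSV(rows [][]string) ([]byte, error) {
	var buf bytes.Buffer
	w := csv.NewWriter(&buf)
	// WriteAll writes every record and flushes the writer.
	if err := w.WriteAll(rows); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	data, err := RowsToCSV([][]string{
		{"id", "name"},
		{"1", "foo, bar"}, // the embedded comma is quoted by the writer
	})
	if err != nil {
		panic(err)
	}
	fmt.Print(string(data))

	// Reading the bytes back shows the round trip preserves the cells.
	records, err := csv.NewReader(bytes.NewReader(data)).ReadAll()
	if err != nil {
		panic(err)
	}
	fmt.Println(len(records), records[1][1])
}
```

The trade-off is that every cell is flattened to text, so formatting and cell types are lost; that matches the "read without formatting" requirement discussed later in this thread.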

Furthermore, you could re-use functions like Remote and FromFile. It is quite simple to create a new package, source, and move Source and those functions into it, since they are likely to be re-used in this and other implementations.
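To illustrate why a shared source abstraction is reusable across formats, here is a small sketch. The `Source` type and the `FromFile`, `FromString` and `ReadSource` functions below are assumptions written for this example; they borrow the names mentioned above but are not copied from the library, whose actual signatures may differ.

```go
package main

import (
	"fmt"
	"io"
	"os"
	"strings"
)

// Source is a hypothetical type for this sketch: a function that opens
// the underlying data on demand, regardless of its format.
type Source func() (io.ReadCloser, error)

// FromFile returns a Source that opens a local file (hypothetical signature).
func FromFile(path string) Source {
	return func() (io.ReadCloser, error) {
		f, err := os.Open(path)
		if err != nil {
			return nil, err
		}
		return f, nil
	}
}

// FromString returns a Source backed by an in-memory string, handy in tests.
func FromString(s string) Source {
	return func() (io.ReadCloser, error) {
		return io.NopCloser(strings.NewReader(s)), nil
	}
}

// ReadSource opens a Source, reads everything, and closes it. A CSV or
// XLS/XLSX table implementation could consume a Source the same way.
func ReadSource(src Source) (string, error) {
	rc, err := src()
	if err != nil {
		return "", err
	}
	defer rc.Close()
	b, err := io.ReadAll(rc)
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	content, err := ReadSource(FromString("id,name\n1,foo\n"))
	if err != nil {
		panic(err)
	}
	fmt.Print(content)
}
```

Because the consumer only sees an `io.ReadCloser`, the same constructors would serve both a CSV reader and a spreadsheet reader, which is the point of moving them into their own package.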

Care to take a stab at this issue?

endersonmaia commented 6 years ago

I already started searching for some libs to deal with .xls, and it's hard to find something useful.

I'm experimenting with extrame/xls, and will post something here ASAP.

danielfireman commented 6 years ago

I usually start searching at https://golanglibs.com, and extrame/xls is actually the top hit there.

The good news is that it seems it can read XLS without formatting (which suffices for our needs). I hope your tests go well. Let me know if you run into any problems or have any questions.

endersonmaia commented 4 years ago

Taking note: https://github.com/avelino/awesome-go#microsoft-excel

endersonmaia commented 4 years ago

Using the GitHub stars metric to decide:

endersonmaia commented 3 years ago

https://github.com/pbnjay/grate seems like a great option to use.