okfn / handbook

Guides and introductions for participating in Labs and some of its projects.
http://okfnlabs.org/handbook/

Tabular data #4

Closed timClicks closed 10 years ago

timClicks commented 13 years ago

tablib and pandas seem to be the best Python libraries for dealing with tabular data. tablib has a really lovely API, whereas pandas is much more memory-efficient when doing lots of number crunching.

Perhaps we could include a chapter on processing .xls documents. For example, we could use tablib to convert an .xls file to .csv and then do the heavy data processing with pandas. Alternatively, we could just use tablib for everything and mention pandas and other libraries as alternatives.