Open bbengfort opened 8 years ago
I'd like to work on this! How do I become a collaborator on the project?
Sorry, just going through all the issues now. Are you at our sprint?
On Thursday, June 2, 2016, W. Lee Pang, PhD notifications@github.com wrote:
I'd like to work on this! How do I become a collaborator on the project?
Unfortunately, I had to leave the conference before the sprints.
Would you be opposed to using pandas for this?
Not necessarily, though I think messytables does a better job than pandas. Not sure if we're ready for the pandas dependency or not. I'd hold off on this though; it seems like someone is working on it.
See issue #14: research will be needed before proceeding with this issue. Once #14 is complete (possibly by just ripping the messytables implementation), close it and proceed with this one.
Create a module in Trinket for auto analysis (we need a good name for it). It should expose a single function or class at its root that can be used within a Celery task.
This function/class should take as input a file-like object and generic keyword arguments (`**kwargs`). As output, the function should return a tuple/list whose length is the (maximum) number of columns in the dataset, and whose values contain the datatype of each column, ordered by column index.
Other stuff: