Implement beta auto analysis

DistrictDataLabs / cultivar

Multidimensional data explorer and visualization tool.

http://trinket.districtdatalabs.com

Apache License 2.0


Implement beta auto analysis #43

Open bbengfort opened 8 years ago

bbengfort commented 8 years ago

Create a module in Trinket for auto analysis (we need a good name for it). It should expose a single function or class at its root that can be used within a celery task.

This function/class should take as input a file-like object and generic keyword arguments (**kwargs).

As output, the function should return a tuple/list whose length is the (maximum) number of columns in the dataset, and whose values contain the datatype of each column, ordered by column index.

Other stuff:

No third party dependencies except unicodecsv and numpy.
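
A minimal sketch of what such a function could look like, under stated assumptions: the name `infer_column_types`, the `has_header` keyword, and the string type labels (`"int"`, `"float"`, `"str"`, `"empty"`) are all hypothetical and not part of Trinket. It uses the standard library `csv` module on a text-mode file-like object in place of unicodecsv so that it runs without extra dependencies, and it widens each column to the broadest type seen in any row.

```python
import csv
import io


def _infer_value_type(value):
    """Return the narrowest type label that can represent a single cell."""
    text = value.strip()
    if text == "":
        return "empty"
    try:
        int(text)
        return "int"
    except ValueError:
        pass
    try:
        float(text)
        return "float"
    except ValueError:
        return "str"


# Labels ordered from narrowest to widest; a column keeps the widest one seen.
_WIDTH = {"empty": 0, "int": 1, "float": 2, "str": 3}


def infer_column_types(fobj, **kwargs):
    """
    Hypothetical entry point: read delimited text from a file-like object
    and return a tuple with one type label per column, ordered by column
    index. Remaining keyword arguments are passed through to csv.reader.
    """
    has_header = kwargs.pop("has_header", True)  # assumed option, not from the issue
    reader = csv.reader(fobj, **kwargs)
    if has_header:
        next(reader, None)  # skip the header row if there is one

    types = []
    for row in reader:
        # Grow the result so its length is the maximum column count seen so far.
        if len(row) > len(types):
            types.extend(["empty"] * (len(row) - len(types)))
        for idx, cell in enumerate(row):
            seen = _infer_value_type(cell)
            if _WIDTH[seen] > _WIDTH[types[idx]]:
                types[idx] = seen
    return tuple(types)


# Ragged sample: the last row has one more column than the header.
sample = io.StringIO("name,age,score\nalice,31,4.5\nbob,27,3\ncarol,40,5.0,extra\n")
result = infer_column_types(sample)
print(result)
```

Because the function only needs an iterable of text lines and returns a plain tuple, it could be called from inside a celery task without further changes; how Trinket would actually wire that up is left open here.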

wleepang commented 8 years ago

I'd like to work on this! How do I become a collaborator on the project?

bbengfort commented 8 years ago

Sorry, just going through all the issues now. Are you at our sprint?

On Thursday, June 2, 2016, W. Lee Pang, PhD notifications@github.com wrote:

I'd like to work on this! How do I become a collaborator on the project?


wleepang commented 8 years ago

Unfortunately, I had to leave the conference before the sprints.

wleepang commented 8 years ago

Would you be opposed to using pandas for this?

bbengfort commented 8 years ago

Not necessarily, though I think messytables does a better job than pandas. Not sure if we're ready for the pandas dependency or not. I'd hold off on this, though; it seems like someone is working on it.

rebeccabilbro commented 7 years ago

See issue #14: research will be needed to proceed with this issue. Once #14 is completed (possibly by just ripping the messytables implementation), close it and proceed with this one.