stat157 / background

0 stars 4 forks source link

Curated Data to Analyzers #9

Closed j-zhang closed 10 years ago

j-zhang commented 10 years ago

How is your data curation horizontal group planning to present the data to the analyzers? What format will the data be in? Where will you be storing the data, as mentioned in issue #1?

tristantao commented 10 years ago

Currently, we've thought about storing the data in a certain format (you don't have to know this), and then have a function that retrieves/parses that data into one of the followings:

  1. List of Dicts so, [ {K:V, K:V, K:V}, {...}], with each dict representing an entry
  2. Dataframe like object, as we did in the last class room.
  3. 2D array, representing a table.

Either way, you can assume that you'll get a function that you'll call and receive some kind of object back. We'll provide usage cases. The raw (but cleaned) data will probably be stored (still tentative) as CSV in a separate directory.

bonghyun5 commented 10 years ago

That sounds good.