red-bin / metadata_grapher


Standardize dataset format #5

Open dataders opened 5 years ago

dataders commented 5 years ago

What is the optimal way to serve the dataset(s)?

The size of the Seattle dataset makes processing on my local machine pretty resource intensive. I'm also not sure how to do network centrality calculations in a parallel/distributed way...
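As a rough sketch of what the parallel part could look like: some centrality measures, closeness among them, are computed independently per source node, so the node list can be split into chunks and each chunk handed to a separate worker. The issue doesn't say which measure is needed, so closeness on an unweighted graph is an assumption here, and the adjacency-dict layout and the names `closeness_for_nodes` and `closeness_in_chunks` are hypothetical, not part of this repo.

```python
from collections import deque
from functools import partial


def closeness_for_nodes(adj, nodes):
    """Closeness centrality for each node in `nodes`, via BFS on an unweighted graph.

    `adj` maps each node to an iterable of its neighbours (assumed layout).
    """
    result = {}
    for source in nodes:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total = sum(dist.values())
        # Number of reachable nodes (excluding the source) over the summed distance.
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result


def closeness_in_chunks(adj, n_chunks=4, executor=None):
    """Split the nodes into `n_chunks` groups and score each group separately.

    With `executor=None` the chunks run serially; passing a
    concurrent.futures executor runs them through its `map` instead.
    """
    nodes = list(adj)
    chunks = [nodes[i::n_chunks] for i in range(n_chunks)]
    worker = partial(closeness_for_nodes, adj)
    mapper = executor.map if executor is not None else map
    merged = {}
    for part in mapper(worker, chunks):
        merged.update(part)
    return merged
```

To use several cores, the same call could be wrapped as `with ProcessPoolExecutor() as pool: closeness_in_chunks(adj, executor=pool)`, using `concurrent.futures.ProcessPoolExecutor`. One caveat: the whole graph gets pickled and sent along with every chunk, so for a dataset that already strains local memory this trades CPU time for extra memory rather than solving the size problem.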

Here are the three approaches I see, but I haven't looked at Houston's data yet.