frictionlessdata / website-old

The Frictionless Data website.
http://old.frictionlessdata.io/
31 stars 28 forks source link

New Section: How Do Data Packages Compare To... #37

Open danfowler opened 8 years ago

danfowler commented 8 years ago

A typical question when explaining the concept of Data Packages to people who already have some familiarity with "containerization", "metadata", and schema languages is how Data Packages compare, especially when considering the work involved in supporting an additional standard.

A comparison table listing the features supported by different formats seems like a good place to point people. In addition, it could make clear potential ways in which two standards can complement each other.

Potential comparisons to make:

Related: https://github.com/frictionlessdata/project/issues/274

danfowler commented 8 years ago

@sje30 hi Stephen, this particular task I've set for myself was inspired by your comments on HDF5. Do you think there's anything worth adding here?

sje30 commented 8 years ago

hi @danfowler this comparison list looks good, but the problem I find with comparison tables is that they tend to be seen (by me at least) as not objective unless done by an independent person/group.

What might also work is the "killer features" that you can highlight to convince people why to switch to yet anotherr technology.

(on a minor topic, are things like sql/xml worth comparing too?)

danfowler commented 8 years ago

Thanks @sje30 yeah, I understand. I thinking less about marketing and more in terms of actually explaining the thing. For instance, I'll be talking to people who know HDF intimately (https://github.com/frictionlessdata/project/issues/290) soon and it would be good to the best way to describe the way these formats relate.

are things like sql/xml worth comparing too?

I've added to these to the issue. Thanks!

danfowler commented 8 years ago

Related: https://discuss.okfn.org/t/w3c-csv-for-the-web-how-does-it-relate-to-data-packages/1715

sje30 commented 8 years ago

Another data format to think about - its quite new, but mentions one thing that is a concern, performance. https://blog.rstudio.org/2016/03/29/feather/

danfowler commented 8 years ago

Thanks @sje30 I have included it above among the serialization formats!