jazzband / tablib

Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
https://tablib.readthedocs.io/
MIT License
4.63k stars 593 forks source link

CSVW: CSV on the Web #305

Open westurner opened 7 years ago

westurner commented 7 years ago

CSVW

Homepage: https://w3c.github.io/csvw/ Standard: https://www.w3.org/TR/tabular-data-primer/ Standard: https://www.w3.org/TR/tabular-data-model/ Standard: https://www.w3.org/TR/tabular-metadata/ Standard: https://www.w3.org/TR/csv2json/ Standard: https://www.w3.org/TR/csv2rdf/ Namespace: http://www.w3.org/ns/csvw# xmlns: @prefix csvw: <http://www.w3.org/ns/csvw#> . @context: http://www.w3.org/ns/csvw.jsonld

CSVW (CSV on the Web) is a set of relatively new standards for representing CSV rows and columns as RDF (and JSON / JSON-LD) along with metadata.

.

westurner commented 7 years ago

From https://twitter.com/westurner/status/901988850330996736 :

@kennethreitz SQL Query (from any DB) ⟿ Pandas DataFrame, in one line of code: https://github.com/kennethreitz/records#records-sql-for-humans

@westurner What about #CSVW #JSONLD? (#nbmeta #linkedreproducibility)

https://w3c.github.io/csvw/primer/

@kennethreitz want to send a pull request?

@westurner I'd like to. It may be good to get to 1.0 w/ #nbmeta (maybe retool w/ #attrs) result objects w/ _reprhtml (_repr_rdfa_andjsonld) be4

@kennethreitz that'd be good! i just want to make sure curl/requests and friends get back data, not html

@westurner "Model for Tabular Data and Metadata on the Web" (#JSONLD, #RDFa HTML) is for Data on the Web #dwbp #linkeddata http://www.w3.org/TR/tabular-data-model/

@kennethreitz sounds like something to add to tablib! http://docs.python-tablib.org/en/master/

@westurner The challenge/opportunity is where to add the additional columnar metadata: {@sqlalchemy, #tablib,}

@westurner "7 metadata header rows (column label, property URI path, DataType, unit, accuracy, precision, significant figures)" https://wrdrd.github.io/docs/consulting/linkedreproducibility#csv-csvw-and-metadata-rows

@kennethreitz hmm, tablib is quite dumb when it comes to that stuff.... could autodetect if all columns match

@kennethreitz or we just support strings out the gate and call it v0

@westurner

CSVW defaults to xsd:string if unspecified.

"How do you support units of measure?" #qudt https://www.w3.org/TR/tabular-data-primer/#units-of-measure

@kennethreitz
i say we stick to strings and call it v0 :)

westurner commented 7 years ago

@westurner I'd like to. It may be good to get to 1.0 w/ #nbmeta (maybe retool w/ #attrs) result objects w/ repr_html (repr_rdfa_and_jsonld) be4

nbmeta: https://github.com/westurner/nbmeta

@westurner

CSVW defaults to xsd:string if unspecified.

From https://www.w3.org/TR/tabular-data-primer/#datatypes :+1:

NOTE You don't have to include "datatype": "string" for columns that are strings — columns are assumed to hold strings if no datatype is explicitly specified.

kennethreitz commented 7 years ago

this sounds awesome!

westurner commented 7 years ago

Unfortunately I don't know when I'll be able to get to this while job searching.

On Thursday, August 31, 2017, Kenneth Reitz notifications@github.com wrote:

this sounds awesome!

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/kennethreitz/tablib/issues/305#issuecomment-326480489, or mute the thread https://github.com/notifications/unsubscribe-auth/AADGy9bcXuifI956S5qCu2vlKN2f0TmRks5sd388gaJpZM4PJoIM .

westurner commented 7 years ago

@westurner I'd like to. It may be good to get to 1.0 w/ #nbmeta (maybe retool w/ #attrs) result objects w/ repr_html (repr_rdfa_and_jsonld) be4

attrs

farhaanbukhsh commented 5 years ago

hey @westurner and @hugovk, I would like to collaborate on this feature and get it up and going. Can you guys guide me or may be point me in the right direction to start this. Thanks in advance. :smiley:

hugovk commented 5 years ago

Have a look at the dev docs:

farhaanbukhsh commented 5 years ago

@hugovk thanks for pointing this to me will try to do a WIP and then we can take this ahead.

56789KD commented 2 months ago

Autho comment

56789KD commented 2 months ago

Comment

hugovk commented 2 months ago

@56789KD Please stop spamming.