Update licences

Stephen-Gates commented 7 years ago

Started work on updating licenses.csv.

will source new licences from https://github.com/okfn/opendefinition/tree/gh-pages/licenses
not sure if that source has values for all columns in csv
does scrape.py do this?

Added Community files

feedback welcome on code of conduct, contributing and issue template
hopefully this is generic enough to use in other repos.

Separated out (unmaintained) Changelog from Readme

suggest making a release of changes since 0.7
do you want to keep a changelog? If so is this updated with PR or afterwards?

Other questions:

do you want datapackage.json improved to add constraints? (related https://github.com/okfn/licenses/issues/58)
in https://github.com/okfn/licenses/issues/54
in https://github.com/okfn/licenses/issues/5
assume deploy.py is run after PR accepted and not as part of PR?

mlinksva commented 7 years ago

> will source new licences from https://github.com/okfn/opendefinition/tree/gh-pages/licenses
> not sure if that source has values for all columns in csv
> does scrape.py do this?

Entirely possible that opendefinition.org/licenses/ has changed so that scrape.py won't work.

FWIW the approach I was going to take, or investigate whether it'd work, would be to make a Jekyll collection of licenses on opendefinition.org so that it would be machine-readable at the source. See https://github.com/github/choosealicense.com/tree/gh-pages/_licenses for an example of that concept, even with licenses as the topic. :smile:

Stephen-Gates commented 7 years ago

As discussed I can do some basic things (like fix a CSV) but not code (unless we're talking COBOL and Mainframes 25 years ago). So if the scraper is broken, I'm eyeballing the changes so I'll close this PR.

I was trying to progress this to support another project so it could draw from a list of authoritative open licences to make Frictionless Data using Data Packages that require an open licence according to the specification.

If this list of open licences is not going to be maintained, then I'll create a (smaller) drop down list in my App of "preferred" open licences. I'll avoid external dependencies that way.

rufuspollock commented 7 years ago

@Stephen-Gates i don't think we need to scrape in the first instance - we can just maintain the CSV list here. (Even if there is some duplication with the opendefinition list atm - ultimately we obviously want one place that is authoritative).

@mlinksva amazing work on the opendefinition licenses directory.

Summary: I think we want this repo (for now) as a simple data package that people can reuse in other projects. That means it wants to be lightweight, submodulable and standalone ...

This explains why we wouldn't want to use opendefinition as it has a bunch of other stuff in it.

Does that clarify things?

@Stephen-Gates we therefore do want to maintain this repo - but it shouldn't need (much) automation and we don't need to scrape - we can just update the licenses.csv.

Stephen-Gates commented 7 years ago

@rufuspollock sure. The scripts categorised the licences into groups. The OD web site shows other groups. Do you want columns for that?

rufuspollock commented 7 years ago

I'd start with what we have here for now and think about the main od website differences later once this first piece of work is done.

Stephen-Gates commented 7 years ago

restarting work on this

Stephen-Gates commented 7 years ago

@mlinksva @rufuspollock this is ready for review.

Very tempted to delete the family column in licenses.csv - thoughts?

Edit: forgot we chatted about that in #54

mlinksva commented 7 years ago

Looks like an improvement overall.

Stephen-Gates commented 7 years ago

Is the CSV the master and the JSON generated? I assume deploy.py is run after PR accepted and not as part of PR?

mlinksva commented 7 years ago

Just skimming deploy.py, CSV is unused. The script generates jsonp. Probably worth running and adding a commit to this with any updates generated.
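For anyone unfamiliar with the format, JSONP is just a JSON payload wrapped in a JavaScript function call. The following is a minimal sketch of that wrapping, not the actual code in deploy.py; the function name `to_jsonp`, the default callback name and the sample record are all hypothetical.

```python
import json


def to_jsonp(data, callback="license_callback"):
    """Wrap a JSON-serialisable object in a JSONP callback invocation.

    Both the function and the default callback name are illustrative
    assumptions; deploy.py may build its output differently.
    """
    # sort_keys keeps the output stable between runs, which makes diffs
    # of the generated files easier to review in a PR.
    return "{}({});".format(callback, json.dumps(data, sort_keys=True))


# Illustrative record only, not taken from the repository's data.
print(to_jsonp({"id": "EXAMPLE-1.0", "status": "active"}, callback="cb"))
```

Sorting the keys is a design choice for reviewability: regenerating the files should then only produce a diff where the data actually changed.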

Stephen-Gates commented 7 years ago

Another suggested change, can OGL-UK-2.0 be marked as superseded by OGL-UK-3.0?

mlinksva commented 7 years ago

I guess it can be marked as superseded, as is OGL-UK-1.0. There's no mechanism to denote by what, is there?

Stephen-Gates commented 7 years ago

Not that I'm aware of. For OGL-UK-2.0, being superseded by OGL-UK-3.0 is obvious. It is less clear that GeoGratis was superseded by OGL-Canada-2.0.

Perhaps a bit of a re-think is needed as there is some good information captured by http://opendefinition.org/licenses/ and http://opendefinition.org/licenses/nonconformant/ that isn't captured in the csv or json. E.g.

why the licence doesn't conform
the licence was discontinued
level of use
superseded by
(in)compatibility with other licences
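To illustrate what a "superseded by" value could enable, here is a rough sketch that follows such links to the current licence. It assumes a hypothetical `superseded_by` mapping that does not exist in licenses.csv today. The OGL-UK-2.0 and GeoGratis entries come from this thread; the OGL-UK-1.0 entry is an assumption added for illustration.

```python
def latest_version(license_id, superseded_by):
    """Follow hypothetical 'superseded_by' links until reaching a licence
    that has no successor, and return that licence's id."""
    seen = set()
    current = license_id
    while current in superseded_by:
        if current in seen:
            # Guard against data errors such as A -> B -> A.
            raise ValueError("cycle in superseded_by data at " + current)
        seen.add(current)
        current = superseded_by[current]
    return current


# Hypothetical mapping; the column does not exist in the CSV at present.
SUPERSEDED_BY = {
    "OGL-UK-1.0": "OGL-UK-2.0",  # assumption, not stated in this thread
    "OGL-UK-2.0": "OGL-UK-3.0",
    "GeoGratis": "OGL-Canada-2.0",
}

print(latest_version("OGL-UK-1.0", SUPERSEDED_BY))
```

An explicit link would cover cases like GeoGratis, where the successor cannot be guessed from the identifier.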

Stephen-Gates commented 7 years ago

@mlinksva reverted to WIP.

By adding status of "superseded" to OGL-UK-2.0, I should have got an error in GoodTables.io due to the enum constraint.

"superseded" is in the csv for two entries, one on purpose, one by mistake.

This led me to the question of the correct spelling of "superceded" <- GitHub's autocorrect is telling me "not this".

Are you happy to leave it as "superceded"?

I'll explore the GoodTables.io issue and then correct the schema/data
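As a local cross-check while the GoodTables.io behaviour is being looked at, the enum constraint could be tested with a few lines of Python. This is only a sketch: the field descriptor below is a hypothetical stand-in for the one in datapackage.json (the "active" and "retired" values are assumptions), and the sample rows are made up to show the check, not copied from licenses.csv.

```python
import csv
import io

# Hypothetical Table Schema field descriptor; the real one lives in
# datapackage.json and may list different enum values.
STATUS_FIELD = {
    "name": "status",
    "type": "string",
    "constraints": {"enum": ["active", "retired", "superceded"]},
}


def find_enum_violations(csv_text, field):
    """Return (line_number, id, value) for every row whose value in the
    given field is not one of the values allowed by its enum constraint."""
    allowed = set(field["constraints"]["enum"])
    violations = []
    reader = csv.DictReader(io.StringIO(csv_text))
    # Data rows start on line 2 because line 1 is the header.
    for line_number, row in enumerate(reader, start=2):
        value = row[field["name"]]
        if value not in allowed:
            violations.append((line_number, row["id"], value))
    return violations


# Made-up sample rows, for illustration only.
SAMPLE = """id,status
OGL-UK-1.0,superceded
OGL-UK-2.0,superseded
OGL-UK-3.0,active
"""

for violation in find_enum_violations(SAMPLE, STATUS_FIELD):
    print(violation)
```

With these assumed values, only the "superseded" spelling is reported, which is the error I expected to see.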

mlinksva commented 7 years ago

Makes sense to use the more common spelling, unless something that we can't change is depending on supercede.

Stephen-Gates commented 7 years ago

GoodTables issue raised at https://discuss.okfn.org/t/launching-goodtables-io-tell-us-what-you-think/5165/35

Stephen-Gates commented 7 years ago

Can I suggest that an owner of this repo set up an account in GoodTables.io to check licenses.csv against the datapackage.json and place a badge in the readme.

That would resolve #58, progress #45 and complement this PR

Stephen-Gates commented 7 years ago

After reading Leigh Dodds' post, The state of open licensing, 2017 edition, I wonder if the following licences should be added to licenses.csv as not reviewed?

open data licence for Mexico
the Linux Foundation Community Data License Agreement – Sharing, Version 1.0
the Linux Foundation Community Data License Agreement – Permissive, Version 1.0

@mlinksva @rufuspollock

Stephen-Gates commented 7 years ago

@roll has fixed the schema validation on GoodTables.io (see Job History).

This PR is now good to go unless you'd like the open licences mentioned above included.

I would still like Continuous Data Integration set up as suggested.

rufuspollock commented 7 years ago

> After reading Leigh Dodds' post, The state of open licensing, 2017 edition, I wonder if the following licences should be added to licenses.csv as not reviewed?

Yes, but that is a separate PR / issue so let's do it after this gets merged.

rufuspollock commented 7 years ago

@mlinksva is this good to go in your opinion? If so let's get it merged 👍 😄

And huge well done and thanks to @Stephen-Gates for his awesome contribution here. 💯 🥇

rufuspollock commented 7 years ago

@mlinksva 👍 👏

@Stephen-Gates 👏 👏 🥇 💯 🎱

okfn / licenses

Update licences #57