UniversalDependencies / docs

Universal Dependencies online documentation
http://universaldependencies.org/
Apache License 2.0
267 stars 246 forks source link

Consider GPL → CC #296

Closed stranak closed 7 years ago

stranak commented 8 years ago

GPL license for data is confusing and hard to interpret at best. Please consider changing the license to CC-BY-SA, or at least adding it as an alternative.

This concerns following treebanks:

spyysalo commented 8 years ago

Good idea, I would like to see this change. @stranak: would you mind writing a few sentences with links explaining why CC licenses are better for data that could be used to explain this issue to the corpus creators?

(Not sure why Finnish is listed, both Finnish treebanks are CC..?)

stranak commented 8 years ago

I contacted proper professionals, colleagues from CLARIN-D Legal Helpdesk (http://www.clarin-d.de/en/help/legal-information-platform) and they were kind enough to promise to formulate the argument properly. Paweł Kamocki should write his reply here shortly.

(I have no idea about the Finnish in the list, although it was clearly me who put it in. Maybe some previous versions of treebanks used GPL? Maybe just my error, please disregard it.)

pkamocki commented 8 years ago

Well, technically you can license non-software works under GPL. What is really impossible is to do it the other way round (license software under a data licence like CC). Licensing data or articles, or other non-software works under GPL, however, is not a good idea, and for several reasons:

Licenses for Other Types of Works We believe that published software and documentation should be free software and free documentation. We recommend making all sorts of educational and reference works free also, using free documentation licenses such as the GNU Free Documentation License(GNU FDL). For essays of opinion and scientific papers, we recommend either the Creative Commons Attribution-NoDerivs 3.0 United States License, or the simple “verbatim copying only” license stated above. We don't take the position that artistic or entertainment works must be free, but if you want to make one free, we recommend the Free Art License.

Finally, please keep in mind that in case of doubt (e.g. is an .xml file "software" or "data"?), dual (or multiple) licensing might be a good solution, as in fact it creates an alternative for the user (allowing him to chose which of the licenses he wants to comply with).

Hope that helps!

dan-zeman commented 8 years ago

I think Finnish is there because FTB has a dual license. We could probably just ignore the other one.

pkamocki commented 8 years ago

Yes, if there are several licenses, you can chose the one you prefer and comply with it.

dan-zeman commented 8 years ago

Thanks @pkamocki for all the insights. I have now tried to contact the people who I believe have the power to allow a CC license for Alpino, AnCora, DDT and Składnica. Asked them for that, and referred them to this discussion. So let's hope they find it a good idea, too.

spyysalo commented 7 years ago

@pkamocki: thank you for the detailed argument!

@dan-zeman : did you hear anything back?

dan-zeman commented 7 years ago

Partially. I got permission to relicense Danish and Dutch, and I have already modified these two. I got a reply from Poland but they are not sure whether they need to acquire a permission further up the chain, so that is on hold. No response from Spain so far (Spanish + Catalan).

spyysalo commented 7 years ago

As there is no recent activity, I'll go ahead and close this. If anyone is willing to continue the discussion with the copyright holders of the treebanks that remain GPL-licensed (Catalan, Faroese, Galician-TreeGal, Polish, and Spanish-AnCora), feel free to reopen.

dan-zeman commented 7 years ago

Galician-TreeGal is actually a different case, they use LGPLLR. Since that license's name says it's for language resources, it hopefully does not have the issues that GNU GPL has.