kba / hocr-spec

The hOCR Embedded OCR Workflow and Output Format
http://kba.github.io/hocr-spec/1.2/
72 stars 20 forks source link

License? #5

Open amitdo opened 8 years ago

amitdo commented 8 years ago

If you can reach Thomas Breuel, ask him to License his spec under open source license, preferably CC0.

kba commented 7 years ago

We've exchanged e-mails and Thomas Breuel supports the project and license. He even forwarded hocr.info to this site and links to the current version from the old Google Doc. :tada: :balloon:

amitdo commented 7 years ago

CC0 To the extent possible under law, the editors have waived all copyright and related or neighboring rights to this work. In addition, as of 29 September 2016, the editors have made this specification available under the Open Web Foundation Agreement Version 1.0, which is available at http://www.openwebfoundation.org/legal/the-owf-1-0-agreements/owfa-1-0.

So it's a dual-license?

Parts of this work may be from another specification document. If so, those parts are instead covered by the license of that specification document.

This is not very clear. Which parts exactly?

What about also putting this license info in the README or in a new LICENSE / COPYING file?

kba commented 7 years ago

This is the standard licensing of web standards in WHATWG specs. Mozilla also recommended to publish standards CC0 + OWFa

Parts of this work may be from another specification document. If so, those parts are instead covered by the license of that specification document.

If the part is not explicitly specified, the clause has no meaning. You that in RFC sometimes that it is acknowledged that section XY was taken verbatim from some other spec, but mostly other RFC, so there's no licensing issue.

What about also putting this license info in the README or in a new LICENSE / COPYING file?

That is a good point, we should. I'll reopen this lest we forget. Since this isn't software, I'm not sure what best practices are, but we'll find out.

amitdo commented 7 years ago

Since this isn't software, I'm not sure what best practices are, but we'll find out.

https://github.com/w3c/html

kba commented 7 years ago

https://github.com/w3c/html has this LICENSE

All documents in this Repository are licensed by contributors under the W3C Software and Document License.

WHATWG uses CC0, e.g.

But in many cases, they do not have a LICENSE or COPYING.

The ALTO board favors CC-BY to make sure people link back to the spec.

I'd prefer CC0, that's as free as it gets.

kba commented 7 years ago

https://github.com/kba/hocr-spec/blob/master/images/baseline.png This one I have from the Tesseract wiki which has a good section on baseline. Note to self: Find out the author and ask for permission to incorporate in a FAQ section.

zuphilip commented 7 years ago

See https://github.com/tesseract-ocr/tesseract/wiki/FAQ/_history# The author of that paragraph is @StefRe the two changes on June 28th.

amitdo commented 7 years ago

belongs to #15...