ibm-aur-nlp / PubLayNet

License terms of the PubLayNet dataset #7

Closed snavavf closed 5 years ago

snavavf commented 5 years ago

Hi, the document images are from the commercial use collection of PMC OA. What, then, are the license terms of the PubLayNet dataset? Can the whole dataset also be used for commercial purposes?

Thanks a lot!

ajjimeno commented 5 years ago

PMC OA is distributed by the US NIH/National Library of Medicine (NLM) under a Creative Commons license. We are distributing derivative work from the PMC OA collection under a specific license, as described in this repository. The derivative work includes, among other things, images generated from the PDF documents and the locations of document layout components, using the generated images as reference, so it is not the same as what the NLM distributes with PMC OA. The license under which we distribute the derivative work should be quite permissive, but please check with your company's legal team.