Closed faph closed 3 months ago
As a secondary issue, the copyright statement at the top of each file has the words "All rights reserved." I think that statement conflicts with the Apache license statement directly underneath that. Some copyrights that are granted by law to authors are actually explicitly surrendered under that license.
As a tertiary issue, the citation file at https://github.com/huggingface/transformers/blob/main/CITATION.cff lists another set of authors. I am not sure about exactly what the citation file references, but I do note that Github labels this in the sidebar as "Cite this repository". The file itself also contains the message "If you use this software, please cite it using these metadata."
It probably does not hurt reviewing whether/how all software authors can be referenced here.
@stevhliu | @MKhalusova Any feedback on this please?
cc @amyeroberts who can help answer this better than I can :)
Hi @faph, thanks for raising this issue!
I actually don't know all the details about the repo's copyright and its headers. I'm going to ping @LysandreJik here who knows more about it here.
A few things to note, based on what's raised here (@LysandreJik can correct me if I'm wrong):
transformers
is licensed under Apache 2.0, there are special cases (certain files) where a different license applies. For example, some models were released under a different license, and this is reflected in the modeling file e.g. for BLIP text. CITATION.cff
are existing and former employees of Hugging Face who have contributed significantly to building and maintaining the repo. I don't think it makes sense to add all people who added a commit here - as this doesn't reflect the same contribution as maintaining and deciding on the design of the library.Hi @amyeroberts that's helpful, thanks.
So to be super clear: is there a copyright assignment to Hugging Face or is there not?
Yes, my understanding is that:
@amyeroberts Thanks. In that case, please clarify the legal mechanism whereby individual contributors surrender copyrights to Hugging Face. There is no Contributors License Agreement to be signed as far as I can see?
Hi @faph, I spoke with the legal team, and have some updates / clarifications:
Correct, we don't have a CLA
Based on the documents available, there is no assignment in transformers and all contributions are subject to the same Apache license:
- Grant of Copyright License. Subject to the terms and conditions of this License, each Contributor hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable copyright license to reproduce, prepare Derivative Works of, publicly display, publicly perform, sublicense, and distribute the Work and such Derivative Works in Source or Object form.
- Submission of Contributions. Unless You explicitly state otherwise, any Contribution intentionally submitted for inclusion in the Work by You to the Licensor shall be under the terms and conditions of this License, without any additional terms or conditions. Notwithstanding the above, nothing herein shall supersede or modify the terms of any separate license agreement you may have execute with Licensor regarding such Contributions.
The people cited in CITATION.cff
are the authors of the transformers paper
The All rights reserved statement doesn't make sense, but it shouldn't really affect the license below
"Hugging Face team" refers to the company, Hugging Face
@amyeroberts That's great, many thanks for clarifying.
If and when there is an opportunity to fix the copyright statement in each file, it wouldn't hurt phrasing the owners as "Hugging Face and contributors" or similar?
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
System Info
N/A
Who can help?
@stevhliu | @MKhalusova
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
I understand
transformers
is licensed under the Apache 2.0 license.However, it is not entirely clear whether authors retain copyrights for their contributions. As far as I can see there is no CLA defined in Github. Yet, the copyright statements in each file suggest the copyrights are owned by "The Hugging Face team". It's not clear what that "team" refers to.
Could we clarify any copyright assignment, please?
Expected behavior
Unambiguous copyright statements :smile: