hplt-project / sacremoses

Python port of Moses tokenizer, truecaser and normalizer
MIT License

Add a license file and copyright information to the repository #7

Closed: BLKSerene closed this issue 5 years ago

BLKSerene commented 5 years ago

Hi, I'm using the Python port of the Moses tokenizer in my project. I would appreciate it if you could add a LICENSE file and copyright information to the repository.

Also, what's the official name for this Python port of the Moses tokenizer: sacremoses (the module name) or mosestokenizer (as in the title of the doc)?

alvations commented 5 years ago

The license file is left out on purpose. I don't own the Moses project, and this port was done out of convenience. I guess the @moses-smt maintainers know that this repo exists and have let it continue to exist.

The issue with the LGPL from Moses' license is:

  1. Conveying Modified Versions.

    If you modify a copy of the Library, and, in your modifications, a facility refers to a function or data to be supplied by an Application that uses the facility (other than as an argument passed when the facility is invoked), then you may convey a copy of the modified version:

    a) under this License, provided that you make a good faith effort to ensure that, in the event an Application does not supply the function or data, the facility still operates, and performs whatever part of its purpose remains meaningful, or

    b) under the GNU GPL, with none of the additional permissions of this License applicable to that copy.

I don't want to change the license from LGPL to GPL unnecessarily, out of respect for the original Moses maintainers' stance on keeping the LGPL. So, as a layman who doesn't know legalese, I have chosen to just repeat the license heading in the README.md:

GNU Lesser General Public License version 2.1 or, at your option, any later version.

I'm not a lawyer or well-versed in licensing, but if any lawyer would like to give their professional advice on what should be done about the licensing of this repository, please feel free to comment on this issue.


BTW, mosestokenizer is another library: a Python wrapper around the Perl code, also in https://github.com/moses-smt/mosesdecoder/tree/master/scripts/tokenizer/mosestokenizer. I've changed the header to sacremoses to avoid confusion.

BLKSerene commented 5 years ago

Well, the licensing details seem a little tricky... My project is open source and licensed under the GNU GPL version 3, so I guess it is okay for me to use sacremoses in my own project regardless of whether Sacremoses later picks the LGPL or the GPL?