mbzuai-nlp / LaMini-LM

LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions

The data is very nice, but why is it only available for non-commercial use? #1

Closed by huseinzol05 1 year ago

huseinzol05 commented 1 year ago

Thanks for the data! As the title says, why is the data available only for non-commercial purposes? Again, thank you.

minghao-wu commented 1 year ago

Hi @huseinzol05 ,

Thank you for your interest in our work.

We wish we could offer a license with fewer limitations. However, some of the instructions and all of the responses in the dataset were generated by ChatGPT, so the dataset is subject to OpenAI's terms of use. As a result, it can only be used for non-commercial purposes.

huseinzol05 commented 1 year ago

But according to https://openai.com/policies/terms-of-use:

(a) Your Content. You may provide input to the Services (“Input”), and receive output generated and returned by the Services based on the Input (“Output”). Input and Output are collectively “Content.” As between the parties and to the extent permitted by applicable law, you own all Input. Subject to your compliance with these Terms, OpenAI hereby assigns to you all its right, title and interest in and to Output. This means you can use Content for any purpose, including commercial purposes such as sale or publication, if you comply with these Terms. OpenAI may use Content to provide and maintain the Services, comply with applicable law, and enforce our policies. You are responsible for Content, including for ensuring that it does not violate any applicable law or these Terms.

I might have missed something here.

minghao-wu commented 1 year ago

However, OpenAI also mentions this:

(c) Restrictions. You may not (i) use the Services in a way that infringes, misappropriates or violates any person’s rights; (ii) reverse assemble, reverse compile, decompile, translate or otherwise attempt to discover the source code or underlying components of models, algorithms, and systems of the Services (except to the extent such restrictions are contrary to applicable law); (iii) use output from the Services to develop models that compete with OpenAI; (iv) except as permitted through the API, use any automated or programmatic method to extract data or output from the Services, including scraping, web harvesting, or web data extraction; (v) represent that output from the Services was human-generated when it is not or otherwise violate our Usage Policies; (vii) buy, sell, or transfer API keys without our prior consent; or (viii), send us any personal information of children under 13 or the applicable age of digital consent. You will comply with any rate limits and other requirements in our documentation. You may use Services only in geographies [currently supported by OpenAI](https://platform.openai.com/docs/supported-countries).

Under restriction (iii), output may not be used to develop models that compete with OpenAI, so our dataset cannot be used for commercial purposes.