Hello,
Is the code in the training dataset permissively licensed?
The description says that the same filtering rules as StarCoder were applied to this dataset. Was non-permissively licensed code also filtered out, as in StarCoder?
Is there more information about the complete dataset?