CarperAI / Code-Pile

This repository contains all the code for collecting large amounts of code from GitHub.
MIT License

Catalog Licenses/Copyright for each data source #46

Closed by ncoop57 1 year ago

ncoop57 commented 1 year ago

For every data source, we need to keep track of the license to ensure we are not violating it, especially around redistribution.
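As a rough illustration of what tracking this could look like in code, here is a minimal sketch. Everything in it is an assumption made for illustration: the `SourceLicense` record, the `blocked_sources` helper, and the placeholder source names are hypothetical and do not come from the repository or from any actual license review.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceLicense:
    """One catalog row (hypothetical schema, not the project's actual one)."""
    source: str            # name of the data source
    license_id: str        # license identifier, or "unknown" if not yet reviewed
    redistributable: bool  # outcome of a manual review, not a legal determination


def blocked_sources(catalog):
    """Return the names of sources that are not cleared for redistribution."""
    return [entry.source for entry in catalog if not entry.redistributable]


# Placeholder entries for illustration only.
catalog = [
    SourceLicense("example-source-a", "MIT", True),
    SourceLicense("example-source-b", "unknown", False),
]

print(blocked_sources(catalog))
```

Treating an unreviewed license as not redistributable is the conservative default here: a source only gets released once someone has explicitly cleared it.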

The main sources we need to catalog for the first thrust of Code Pile are the following:

ncoop57 commented 1 year ago

Completed: https://docs.google.com/spreadsheets/d/19IAFhqRvhRxdUj-df8PUOBI2W8aEqGmJmBcZvXOuDZY/edit?usp=sharing