The CodeSearchNet corpus is a dataset of 2 million (comment, code) pairs from open-source libraries hosted on GitHub. It contains code and documentation for six programming languages: Go, Java, JavaScript, PHP, Python, and Ruby.
The dataset can be found here: https://huggingface.co/datasets/code_search_net
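
As a rough illustration of what a (comment, code) pair looks like in practice, the sketch below pulls pairs out of dataset records. It assumes the records expose the fields `func_documentation_string` and `func_code_string`, and that the Hugging Face `datasets` library can load the corpus by the name `code_search_net` with a language configuration such as `"python"`. The helper names `iter_pairs` and `load_pairs` are hypothetical, and depending on the installed `datasets` version the loading call may need extra arguments or may not be supported at all.

```python
from typing import Dict, Iterable, Iterator, Tuple


def iter_pairs(records: Iterable[Dict[str, str]]) -> Iterator[Tuple[str, str]]:
    """Yield (comment, code) pairs, skipping records with an empty comment.

    Assumes each record has the fields `func_documentation_string` and
    `func_code_string`.
    """
    for record in records:
        comment = record["func_documentation_string"].strip()
        code = record["func_code_string"]
        if comment:
            yield comment, code


def load_pairs(language: str = "python", split: str = "train") -> Iterator[Tuple[str, str]]:
    """Hypothetical helper: load one language split and iterate over its pairs.

    Requires the `datasets` package and network access. The dataset name and
    configuration are assumptions based on the dataset URL.
    """
    from datasets import load_dataset  # imported lazily so iter_pairs works without it

    dataset = load_dataset("code_search_net", language, split=split)
    return iter_pairs(dataset)
```

`iter_pairs` also works on any list of dictionaries with the same two fields, which makes it easy to try out on a handful of hand-written records before downloading the full corpus.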