🤗 [REQUEST] - IBM Granite Code Models

ethanc8 commented 3 months ago

Model introduction

The models are code generation models created by IBM. They include both base models and instruct models. They are new models that have been trained from scratch, except that the 34B version is based on the 20B version.

Model URL

https://huggingface.co/collections/ibm-granite/granite-code-models-6624c5cec322e4c148c8b330

Additional information (Optional)

No response

Decontamination

They have not specified any decontamination information. Information about the training data can be found at https://arxiv.org/pdf/2405.04324#page=4.

Author

No

Data

No

Security

[X] I confirm that the model is safe to run which is not designed to produce malicious code or content.

Integrity

[X] I confirm that the model comes from unique and original work and does not contain any plagiarism.

ethanc8 commented 3 months ago

Here are the reported MBPP and MBPP+ scores:

They have not reported HumanEval+ scores, but they have reported HumanEvalSynthesize and MultiPL-E scores at https://arxiv.org/pdf/2405.04324#page=10.

ethanc8 commented 3 months ago

They did not specify which version of MBPP+ they're using here, so they might be using v0.1.0.

Rubiel1 commented 1 month ago

Hello, I am also interested on the evaluation of the Granite models.

evalplus / evalplus