Failed to do inference after done the Instruction Tuning to Align with Natural Language Instructions

I have completed to do the instracution tuning with code_alpaca_20k.json.

deepspeed instruct_tune_codet5p.py \
  --load /home/ubuntu/ChatGPT/Models/Salesforce/codet5p-6b --save-dir output/instruct_codet5p_6b --instruct-data-path /home/ubuntu/ChatGPT/Data/alpaca-data/CodeAlpaca-20k/code_alpaca_20k.json \
  --fp16 --epochs 5 --deepspeed deepspeed_config.json

And the final model is in the folder of "/home/ubuntu/ChatGPT/CodeGen/CodeT5/CodeT5+/output/instruct_codet5p_6b/final_checkpoint", I tried to do inference, there have following issue:

And this is the inference code: (codegen) ubuntu@chatbot-a10:~/ChatGPT/CodeGen/CodeT5/CodeT5+$ cat cli1.py

from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
import torch

checkpoint = "/home/ubuntu/ChatGPT/Models/Salesforce/codet5p-6b"
checkpoint = "/home/ubuntu/ChatGPT/CodeGen/CodeT5/CodeT5+/output/instruct_codet5p_6b/final_checkpoint"
device = "cuda" # for GPU usage or "cpu" for CPU usage

tokenizer = AutoTokenizer.from_pretrained(
        checkpoint,
        trust_remote_code=True
)

model = AutoModelForSeq2SeqLM.from_pretrained(
        checkpoint,
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
    trust_remote_code=True).to(device)

inputs = tokenizer(
        "def print_hello_world():",
        return_tensors="pt").to(device)

inputs['decoder_input_ids'] = inputs['input_ids'].clone()

outputs = model.generate(**inputs, max_length=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

salesforce / CodeT5

Failed to do inference after done the Instruction Tuning to Align with Natural Language Instructions #131