salesforce / CodeTF

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
Apache License 2.0
1.45k stars 100 forks source link

ValueError: Config name is missing. #48

Open Martin7-1 opened 11 months ago

Martin7-1 commented 11 months ago

Hi! I am tring using CodeXGLUE refinement dataset, but I encounter error below:

Traceback (most recent call last):
  File "/Users/zhengyi/Desktop/code/python/llm-projects/CodeT5-fine-tune/lora.py", line 37, in <module>
    main()
  File "/Users/zhengyi/Desktop/code/python/llm-projects/CodeT5-fine-tune/lora.py", line 17, in main
    train, test, validation = dataset.load(subset="code-refinement", config_name="small")
  File "/Users/zhengyi/Desktop/code/python/framework-projects/CodeTF/codetf/data_utility/codexglue_dataset.py", line 19, in load
    return self.load_funcs[subset](*args, **kwargs)
  File "/Users/zhengyi/Desktop/code/python/framework-projects/CodeTF/codetf/data_utility/codexglue_dataset.py", line 80, in load_codexglue_code_refinement_dataset
    dataset = load_dataset(dataset)
  File "/opt/homebrew/Caskroom/miniconda/base/envs/CodeTF/lib/python3.9/site-packages/datasets/load.py", line 1785, in load_dataset
    builder_instance = load_dataset_builder(
  File "/opt/homebrew/Caskroom/miniconda/base/envs/CodeTF/lib/python3.9/site-packages/datasets/load.py", line 1540, in load_dataset_builder
    builder_instance: DatasetBuilder = builder_cls(
  File "/opt/homebrew/Caskroom/miniconda/base/envs/CodeTF/lib/python3.9/site-packages/datasets/builder.py", line 355, in __init__
    self.config, self.config_id = self._create_builder_config(
  File "/opt/homebrew/Caskroom/miniconda/base/envs/CodeTF/lib/python3.9/site-packages/datasets/builder.py", line 487, in _create_builder_config
    raise ValueError(
ValueError: Config name is missing.
Please pick one among the available configs: ['medium', 'small']
Example of usage:
    `load_dataset('code_x_glue_cc_code_refinement', 'medium')`

It seems that the function load_codexglue_code_refinement_dataset in codexglue.py doesn't pass the extra value to load dataset. Hope to fix it like #32. Thanks a lot!