Open ReamonYim opened 1 month ago
This may be because your script trafficllm_stage1.sh has no permission to access the /cache
path in the config of cache_dir. You can try to change the cache_dir in the trafficllm_stage1.sh (not in the main.py
) to solve the problem. For instance, change the path /cache
to ../cache
in the config of cache_dir.
Alternatively, you can also try deleting the cache_dir field in trafficllm_stage1.sh to store the cache in the default path. Refer to ChatGLM2's config.
I hope the above solutions work for you.
Problem Description:
While running the
dual-stage-tuning
training scripttrafficllm_stage1.sh
, I encountered the following error during the dataset loading phase:This error occurred when calling the
load_dataset
function in thedatasets
library, indicating that it does not support loading datasets cached using the local file system.Attempts and Outcomes:
I have tried several approaches to resolve this issue, but unfortunately, none of them have worked. Below is a summary of my efforts and their corresponding results:
Clearing the
cache
Directory:dual-stage-tuning/cache
directory to eliminate any potential issues caused by corrupted or incompatible cache files.NotImplementedError
.Changing the
cache_dir
to Different Directories (e.g.,/root/.cache/huggingface/datasets
):main.py
script to set thecache_dir
parameter to~/.cache/huggingface
and/cache
, hoping that changing the cache directory might resolve the file system compatibility issue.Forcing
LocalFileSystem
Usage:fs=LocalFileSystem()
inmain.py
and passed thefs
parameter to theload_dataset
call to explicitly enforce the use of the local file system.TypeError: __init__() got an unexpected keyword argument 'fs'
, preventing the program from continuing.Removing the
fs
Parameter:fs
parameter led to another issue, I removed it to restore the original code.NotImplementedError
reappeared, indicating that the core issue remained unresolved.Changing
tmp
Cache Directory:cache_dir
to a temporary directory (/tmp/hf_cache
), manually created this directory, and specified it in the script.LocalFileSystem
error persisted.Misconfiguration Leading to
FileExistsError
:/tmp
) to/dev/null
, resulting inFileExistsError: [Errno 17] File exists: '/dev/null'
.Attempting to Modify the
datasets
Source Code:datasets
library source code to remove thefs
parameter inbuilder.py
, but this approach could potentially break other dependencies withindatasets
.Request for Help:
I would greatly appreciate any guidance or suggestions on how to resolve this
LocalFileSystem
issue. It is currently preventing me from loading the dataset and proceeding with the model training. Your assistance would mean a lot, and I’m eager to hear any potential solutions or workarounds.Thank you so much in advance for your help and support!