Closed Dinaelin-Yip closed 4 weeks ago
Meanwhile, it seems that there's no neg
directory in the processed dataset.
When I run the command
python worker.py \
--embed config/exp/llama-split.yaml \
--model config/model/llama-naml.yaml \
--exp config/exp/llama-split.yaml \
--data config/data/mind-llama.yaml \
--version small \
--llm_ver 7b \
--hidden_size 64 \
--layer 0 \
--lora 0 \
--fast_eval 0 \
--embed_hidden_size 4096
The error is imported,
FileNotFoundError: [Errno 2] No such file or directory: 'data/MIND-small/neg/meta.data.json'
Hi, please use processor_unitokv3.py
if your unitok
version is 3.x.x
. The processor.py
only support unitok==2.x.x
.
Also, processor_unitokv3.py
will generate neg
data.
Sorry, I just realized that you are using GNRS
repo. We will no longer maintain this repository. Please use Legommeneders instead!
For the generated data, please refer to: https://github.com/Jyonn/Legommenders/issues/8
Jyonn, Thank you for your quick reply! processor_unitokv3.py
works! Yes, I am using GNRS
repo, and sorry that I opened the issue here.
I would like to ask one more question. I want to conduct the experiments with llama to test the DIRE module. Shall I use large_llama_processor.py
to preprocess the MIND data?
You can use llama_processor.py
. large_llama_processor.py
is for the MIND-large dataset.
Thank you for your work! In
process/mind/processor.py
, row 128-142,The error is imported as follows,
Could you tell me how to fix this?