locuslab / wanda

A simple and effective LLM pruning approach.
https://arxiv.org/abs/2306.11695
MIT License
660 stars 87 forks source link

Why Does ExpectedMoreSplits Error Occur? #71

Open cleverchloe opened 2 weeks ago

cleverchloe commented 2 weeks ago

Hi, I'm trying to sparsify my model using your code, but I'm encountering the "ExpectedMoreSplits Error" shown below.

The error arises from this line when loading the training data.

Could you help me understand the reason for this error and suggest how I might fix it?

image
    raise ExpectedMoreSplits(str(set(expected_splits) - set(recorded_splits)))
datasets.utils.info_utils.ExpectedMoreSplits: {'validation'}
TagoreZhao commented 1 week ago

I followed this post to resolve this issue: https://huggingface.co/datasets/allenai/c4/discussions/7

edit the two load_dataset function to look like this: image

If this does not work(The error should change), try to install the newest huggingface dataset by running this: conda install -c huggingface -c conda-forge datasets