Closed: StableFluffy closed this 7 months ago
This is great @StableFluffy -- can you run
make style
make quality
and fix any minor things :)
Done!
@StableFluffy have you seen this error upon loading?
[INFO|modeling_utils.py:1491] 2024-04-11 08:46:15,579 >> Instantiating DebertaV2ForSequenceClassification model under default dtype torch.float16.
Traceback (most recent call last):
  File "/net/nfs.cirrascale/allennlp/nathanl/herm/scripts/run_rm.py", line 336, in <module>
    main()
  File "/net/nfs.cirrascale/allennlp/nathanl/herm/scripts/run_rm.py", line 169, in main
    model = model_builder(args.model, **model_kwargs, trust_remote_code=trust_remote_code)
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
    return model_class.from_pretrained(
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3671, in from_pretrained
    ) = cls._load_pretrained_model(
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3947, in _load_pretrained_model
    model.apply(model._initialize_weights)
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 897, in apply
    module.apply(fn)
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 897, in apply
    module.apply(fn)
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 897, in apply
    module.apply(fn)
  [Previous line repeated 4 more times]
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 898, in apply
    fn(self)
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/transformers/modeling_utils.py", line 1704, in _initialize_weights
    self._init_weights(module)
  File "/net/nfs.cirrascale/allennlp/nathanl/miniconda3/envs/herm/lib/python3.10/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 919, in _init_weights
    module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
RuntimeError: "normal_kernel_cpu" not implemented for 'Char'
Another issue, when trying to run with --pref_sets:
AssertionError: <source> id not in input_ids
Added Better-PairRM support.
Changed the result saving path to a relative path.
Benchmark Result