ValueError: Out shape is mismatched #11137

Closed cliuxinxin closed 2 years ago

cliuxinxin commented 2 years ago

How to reproduce the behaviour

Your Environment

Some weights of the model checkpoint at hfl/chinese-roberta-wwm-ext were not used when initializing BertModel: ['cls.predictions.transform.LayerNorm.bias', 'cls.seq_relationship.weight', 'cls.predictions.transform.dense.weight', 'cls.predictions.bias', 'cls.predictions.transform.dense.bias', 'cls.predictions.transform.LayerNorm.weight', 'cls.predictions.decoder.weight', 'cls.seq_relationship.bias']

After I updated spaCy to the latest version, training no longer works.

When I change back to 3.3.1, it works.

I packed all the code and data. Just run the cmd

task = "news"
os.environ['TASK'] = task
!python -m spacy project run all

test.zip

danieldk commented 2 years ago

Thanks for reporting this issue. Could you include all the files necessary to reproduce the issue? For example, the conversion scripts are missing in the provided zip files.

cliuxinxin commented 2 years ago

Thank you. I have packed the data into the zip file and uploaded it.

danieldk commented 2 years ago

Thanks! I can now run the project, but it does the initialization without any issues and proceeds to finetune the transformer model. Any chance you can post the output of pip list, so that I can try to reproduce it with the same versions of the spaCy/Thinc dependencies?

cliuxinxin commented 2 years ago

!pip install -U pip setuptools wheel
!pip install -U 'spacy[transformers,lookups]'
!python -m spacy download zh_core_web_lg
!pip install doccano-client
!pip install datasets
!pip install seqeval
!pip install spacy-transformers
!pip install umap-learn

I am using Colab.

Package Version

[extensive package list truncated for brevity]

absl-py 1.1.0
...
cupy-cuda111 9.4.0
...
deprecat 2.1.1
descartes 1.1.0
...
osqp 0.6.2.post0
...
ptyprocess 0.7.0
pymc3 3.11.5
...
pytest 3.6.4
...
spacy 3.4.0
spacy-transformers 1.1.7
...
statsmodels 0.10.2
...
Werkzeug 1.0.1
wheel 0.37.1
widgetsnbextension 3.6.1
...
zipp 3.8.0

danieldk commented 2 years ago

Ah, I can reproduce it when downgrading to the CuPy version that you have (9.4.0). Can you try upgrading the CuPy package?

pip install --upgrade cupy-cuda111

I still have to look into what the issue is with the older CuPy version, but upgrading might at least solve the problem for you.
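
A quick way to confirm which versions are actually installed, before and after the upgrade, is to query the package metadata from Python. This is a minimal sketch, assuming Python 3.8 or newer (for `importlib.metadata`); the helper name `installed_versions` and the list of packages are illustrative choices and not part of spaCy.

```python
from importlib import metadata

# Packages mentioned in this thread. The selection is illustrative;
# adjust the CuPy distribution name to match your CUDA version.
PACKAGES = ["spacy", "spacy-transformers", "thinc", "cupy-cuda111"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


for name, version in installed_versions(PACKAGES).items():
    print(f"{name}: {version or 'not installed'}")
```

In a notebook this can be run in a cell right after the `pip install --upgrade` command; the runtime may need a restart before an upgraded package is picked up.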

cliuxinxin commented 2 years ago

Yes, it is working. Thank you very much. I hope you can fix the problem with the older version.

miguelwon commented 2 years ago

Hi, I'm having the same issue, although I have already installed cupy-cuda111. This is my log:

(env) mwon@sebruno2:~/data-mwon/TC/src$ python -m spacy train config.cfg --output ../results_train_fase_1_809sample/ --paths.train  ../data/train_val_test/conll/train_fase_1.spacy --paths.dev ../data/train_val_test/conll/dev_fase_1.spacy -g 0
ℹ Saving to output directory: ../results_train_fase_1_809sample
ℹ Using GPU: 0

=========================== Initializing pipeline ===========================
[2022-07-25 12:15:05,619] [INFO] Set up nlp object from config
[2022-07-25 12:15:05,629] [INFO] Pipeline: ['transformer', 'ner']
[2022-07-25 12:15:05,632] [INFO] Created vocabulary
[2022-07-25 12:15:05,632] [INFO] Finished initializing nlp object
Some weights of the model checkpoint at neuralmind/bert-base-portuguese-cased were not used when initializing BertModel: ['cls.predictions.transform.dense.weight', 'cls.predictions.transform.LayerNorm.weight', 'cls.seq_relationship.weight', 'cls.predictions.decoder.weight', 'cls.seq_relationship.bias', 'cls.predictions.transform.dense.bias', 'cls.predictions.transform.LayerNorm.bias', 'cls.predictions.bias']
- This IS expected if you are initializing BertModel from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertModel from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Traceback (most recent call last):
  File "/usr/lib/python3.6/runpy.py", line 193, in _run_module_as_main
    "__main__", mod_spec)
  File "/usr/lib/python3.6/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/__main__.py", line 4, in <module>
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/cli/_util.py", line 71, in setup_cli
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/click/core.py", line 1128, in __call__
    return self.main(*args, **kwargs)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/click/core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/click/core.py", line 1659, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/click/core.py", line 1395, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/click/core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/typer/main.py", line 532, in wrapper
    return callback(**use_params)  # type: ignore
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/cli/train.py", line 45, in train_cli
    train(config_path, output_path, use_gpu=use_gpu, overrides=overrides)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/cli/train.py", line 72, in train
    nlp = init_nlp(config, use_gpu=use_gpu)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/training/initialize.py", line 84, in init_nlp
    nlp.initialize(lambda: train_corpus(nlp), sgd=optimizer)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/language.py", line 1317, in initialize
    proc.initialize(get_examples, nlp=self, **p_settings)
  File "spacy/pipeline/transition_parser.pyx", line 575, in spacy.pipeline.transition_parser.Parser.initialize
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/thinc/model.py", line 299, in initialize
    self.init(self, X=X, Y=Y)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/ml/tb_framework.py", line 47, in init
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/thinc/model.py", line 299, in initialize
    self.init(self, X=X, Y=Y)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/ml/_precomputable_affine.py", line 150, in init
    acts1 = predict(ids, tokvecs)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/ml/_precomputable_affine.py", line 131, in predict
    hiddens = model.predict(tokvecs[:-1])  # (nW, f, o, p)
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/thinc/model.py", line 315, in predict
    return self._func(self, X, is_train=False)[0]
  File "/mnt/sdb/data-mwon/TC/env/lib/python3.6/site-packages/spacy/ml/_precomputable_affine.py", line 29, in forward
    Yf[0] = model.get_param("pad")
  File "cupy/_core/core.pyx", line 1418, in cupy._core.core.ndarray.__setitem__
  File "cupy/_core/_routines_indexing.pyx", line 54, in cupy._core._routines_indexing._ndarray_setitem
  File "cupy/_core/_routines_indexing.pyx", line 959, in cupy._core._routines_indexing._scatter_op
  File "cupy/_core/_kernel.pyx", line 1161, in cupy._core._kernel.ufunc.__call__
  File "cupy/_core/_kernel.pyx", line 594, in cupy._core._kernel._get_out_args
ValueError: Out shape is mismatched
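
For readers wondering how an assignment like `Yf[0] = model.get_param("pad")` can raise a shape error: one plausible explanation, which nothing in this thread confirms, is that the right-hand side carries an extra leading axis of length 1 compared with the slot it is written into. NumPy tolerates that, and a stricter array library might not. The sketch below uses NumPy only, with made-up sizes; the shape of `pad` is an assumption, and `np.squeeze` is shown as one general way to make both sides match, not as the actual fix.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
nW, nF, nO, nP = 5, 3, 4, 2

# Output buffer with one extra leading row reserved for padding.
Yf = np.zeros((nW + 1, nF, nO, nP), dtype="float32")

# Assumption: the padding parameter has a leading axis of length 1.
pad = np.ones((1, nF, nO, nP), dtype="float32")

# The target slot is 3-dimensional, the source is 4-dimensional.
print(Yf[0].shape, pad.shape)

# NumPy accepts this: leading length-1 axes of the source are ignored
# when assigning into a lower-dimensional slot.
Yf[0] = pad

# Dropping the extra axis first makes both shapes identical, so the
# assignment no longer depends on that leniency.
Yf[0] = np.squeeze(pad, 0)
```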

danieldk commented 2 years ago

Could you try this PR: https://github.com/explosion/spaCy/pull/11194 ?

miguelwon commented 2 years ago

Thanks. It's working now.
