stanfordnlp / stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
https://stanfordnlp.github.io/stanza/

Unexpected key(s) in state_dict: "SOS_tensor" on Apple M1 with torch #1216

Closed rjalexa closed 1 year ago

rjalexa commented 1 year ago

Describe the bug
When launching my project on Apple M1 hardware I get the following error:

(isagog-ai-py3.10) (base) bob@Roberts-Mac-mini isagog-ai % python src/isagog_api/nlp_api.py
2023-03-14 16:05:29 INFO: Loading these models for language: it (Italian):
========================
| Processor | Package  |
------------------------
| tokenize  | combined |
| mwt       | combined |
| ner       | fbk      |
========================

2023-03-14 16:05:29 INFO: Use device: cpu
2023-03-14 16:05:29 INFO: Loading: tokenize
2023-03-14 16:05:29 INFO: Loading: mwt
Traceback (most recent call last):
  File "/Users/bob/Documents/work/code/isagog-ai/src/isagog_api/nlp_api.py", line 14, in <module>
    from isagog_ai.nlp_it import StanzaLanguageProcessorIt
  File "/Users/bob/Documents/work/code/isagog-ai/src/isagog_ai/nlp_it.py", line 29, in <module>
    "entity": stanza.Pipeline(
  File "/Users/bob/Documents/work/code/isagog-ai/.venv/lib/python3.10/site-packages/stanza/pipeline/core.py", line 278, in __init__
    self.processors[processor_name] = NAME_TO_PROCESSOR_CLASS[processor_name](config=curr_processor_config,
  File "/Users/bob/Documents/work/code/isagog-ai/.venv/lib/python3.10/site-packages/stanza/pipeline/processor.py", line 173, in __init__
    self._set_up_model(config, pipeline, use_gpu)
  File "/Users/bob/Documents/work/code/isagog-ai/.venv/lib/python3.10/site-packages/stanza/pipeline/mwt_processor.py", line 21, in _set_up_model
    self._trainer = Trainer(model_file=config['model_path'], use_cuda=use_gpu)
  File "/Users/bob/Documents/work/code/isagog-ai/.venv/lib/python3.10/site-packages/stanza/models/mwt/trainer.py", line 36, in __init__
    self.load(model_file, use_cuda)
  File "/Users/bob/Documents/work/code/isagog-ai/.venv/lib/python3.10/site-packages/stanza/models/mwt/trainer.py", line 149, in load
    self.model.load_state_dict(checkpoint['model'])
  File "/Users/bob/Documents/work/code/isagog-ai/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1671, in load_state_dict
    raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for Seq2SeqModel:
    Unexpected key(s) in state_dict: "SOS_tensor".

Expected behavior
The service should launch just as it does with the exact same project on Intel hardware.

Environment (please complete the following information):

  • OS: MacOS Ventura 13.2.1
  • Python version: 3.10.9
  • Stanza version: 1.4.2

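For reference, here is a minimal sketch of the failing pipeline construction, reconstructed from the startup log above. The exact call in nlp_it.py is not shown in the traceback, so the argument values below are assumptions taken from the processor/package table in the log:

import stanza

# stanza.download("it")  # fetch the Italian models first if they are not cached yet

# Same processors as in the startup table; "fbk" is the NER package shown in
# the log. The MWT model is the one whose checkpoint fails to load here.
nlp = stanza.Pipeline(
    lang="it",
    processors={"tokenize": "combined", "mwt": "combined", "ner": "fbk"},
    use_gpu=False,  # the log reports "Use device: cpu"
)

doc = nlp("Roma è la capitale d'Italia.")
print(doc.entities)
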
AngledLuffa commented 1 year ago

This is not my experience. I have a Mac, and I can load the Italian pipeline just fine. Of course, there's the question of what versions are being run. I did this on a MacBook Air (M2, 2022), with an Apple M2 (not used because of pytorch mps bugs), OS version 12.6.3. Python 3.9.6, Stanza 1.5.0. So, my suggestion is that I'll try updating my OS, and you try updating Stanza, and we'll see if either of us gets different results.

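A quick way to compare the two setups is to print the relevant versions directly; this uses only standard module attributes, nothing Stanza-specific:

import platform
import stanza
import torch

# Dump the version information relevant to this issue so that the Intel and
# Apple Silicon environments can be compared side by side.
print("Python: ", platform.python_version())
print("Stanza: ", stanza.__version__)
print("PyTorch:", torch.__version__)
print("Machine:", platform.machine())   # "arm64" on Apple Silicon, "x86_64" on Intel
print("macOS:  ", platform.mac_ver()[0])
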

AngledLuffa commented 1 year ago

I can confirm that after updating to Ventura 13.2.1, running an Italian pipeline still works fine for me.


rjalexa commented 1 year ago

Thank you very much. I should have mentioned that I am already running Italian tasks without problems on 1.5.0 (and previously on 1.4.2) as well. I am starting to believe torch is the problem here, since on those two projects I believe I was using only TF. I am traveling now and will report back later.

AngledLuffa commented 1 year ago

Is that backwards? We only use torch in our models, not TF.

rjalexa commented 1 year ago

You're right. In any case, updating to Stanza 1.5.0 apparently fixed the problem, so I'll thank you and close the issue.
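
For anyone hitting the same error, here is a sketch of the upgrade path that resolved it in this thread. Re-downloading the Italian models is an extra step added here as an assumption: cached model files written for a different Stanza version are a plausible source of unexpected state_dict keys such as "SOS_tensor", but that mechanism is not confirmed above.

# First upgrade Stanza inside the project's virtualenv:
#   pip install --upgrade stanza

import stanza

# Refresh the cached Italian models so they match the installed Stanza
# version (assumption: a stale ~/stanza_resources cache could explain the
# unexpected "SOS_tensor" key).
stanza.download("it")

# The pipeline should now load without the state_dict error.
nlp = stanza.Pipeline(lang="it", processors="tokenize,mwt,ner", use_gpu=False)
print(nlp("Prova di caricamento della pipeline."))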