mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
https://mlabonne.github.io/blog/
Apache License 2.0
37.31k stars 3.92k forks

LazyMergeKit ERROR #31

Open JamesKnight0001 opened 8 months ago

JamesKnight0001 commented 8 months ago

mergekit-moe: command not found

mergekit-moe config.yaml merge --copy-tokenizer --allow-crimes --out-shard-size 1B --lazy-unpickle --trust-remote-code
/bin/bash: line 1: mergekit-moe: command not found
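
A "command not found" message from bash usually means the `mergekit-moe` entry point is not on the PATH of the current environment, for example because the install step did not finish or ran in a different environment. A minimal sketch for checking this from Python before invoking the CLI; the helper name `find_cli` is hypothetical and not part of mergekit:

```python
import shutil
from typing import Optional


def find_cli(name: str = "mergekit-moe") -> Optional[str]:
    """Return the absolute path of `name` if it is on PATH, else None.

    `find_cli` is a hypothetical helper for illustration; it only wraps
    the standard-library `shutil.which` lookup.
    """
    return shutil.which(name)


path = find_cli()
if path is None:
    # Same condition that makes bash print "command not found".
    print("mergekit-moe is not on PATH; check that mergekit installed correctly")
else:
    print(f"mergekit-moe found at {path}")
```

If the lookup returns `None`, reinstalling mergekit in the active environment and rerunning the cell is a reasonable first step.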

mlabonne commented 8 months ago

Hi, I don't think mergekit introduced a breaking change. Can you try to run it again and provide more details about the configuration you used?

HelpingAI commented 7 months ago

Checking if build backend supports build_editable ... done
Getting requirements to build editable ... done
Installing backend dependencies ... done
Preparing editable metadata (pyproject.toml) ... done
Building editable for mergekit (pyproject.toml) ... done

mergekit-moe config.yaml merge --copy-tokenizer --cuda --low-cpu-memory

WARNING:root:--cuda is a no-op for mergekit-moe, use "--device cuda" instead
Traceback (most recent call last):
  File "/usr/local/bin/mergekit-moe", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1157, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1078, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 783, in invoke
    return __callback(*args, **kwargs)
  File "/content/mergekit/mergekit/options.py", line 76, in wrapper
    f(*args, **kwargs)
  File "/content/mergekit/mergekit/scripts/mixtral_moe.py", line 452, in main
    config = MistralMOEConfig.model_validate(yaml.safe_load(config_source))
  File "/usr/local/lib/python3.10/dist-packages/pydantic/main.py", line 503, in model_validate
    return cls.__pydantic_validator__.validate_python(
pydantic_core._pydantic_core.ValidationError: 1 validation error for MistralMOEConfig
experts
  Field required [type=missing, input_value={'slices': [{'sources': [...}]}, 'dtype': 'float16'}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.5/v/missing
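
The validation error reports that the required `experts` field is missing, while the input it received contains `slices` and `dtype`. That pattern suggests a standard merge configuration was passed to `mergekit-moe`, which expects a mixture-of-experts configuration. A minimal sketch of a pre-flight check on the loaded config, assuming that slice-based configs belong with `mergekit-yaml`; the helper `suggest_mergekit_command` and the placeholder config contents are hypothetical, not part of mergekit:

```python
from typing import Any, Dict


def suggest_mergekit_command(config: Dict[str, Any]) -> str:
    """Guess which mergekit CLI a parsed YAML config is written for.

    Hypothetical helper: it only inspects top-level keys and does not
    reproduce mergekit's own validation.
    """
    if "experts" in config:
        # MistralMOEConfig requires `experts`, per the traceback above.
        return "mergekit-moe"
    if "slices" in config or "models" in config:
        # Assumption: slice- or model-based configs are standard merges.
        return "mergekit-yaml"
    return "unknown"


# Placeholder with the same top-level keys as the failing input; the real
# `sources` contents are elided in the error message and are not reproduced.
config = {"slices": [{"sources": []}], "dtype": "float16"}
print(suggest_mergekit_command(config))
```

For a config like the one in the error, this check points to `mergekit-yaml` rather than `mergekit-moe`; switching the command, or rewriting the config with an `experts` list, would be the two options to try.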