mistralai / mistral-inference

Official inference library for Mistral models
https://mistral.ai/
Apache License 2.0
9.16k stars 803 forks source link

os X pip install fail #157

Closed edmondja closed 1 month ago

edmondja commented 1 month ago

When i try to execute the pip install I have this error on mac os :

Collecting mistral-inference
  Using cached mistral_inference-1.0.4-py3-none-any.whl.metadata (8.3 kB)
Collecting fire>=0.6.0 (from mistral-inference)
  Using cached fire-0.6.0-py2.py3-none-any.whl
Collecting mistral_common<2.0.0,>=1.0.0 (from mistral-inference)
  Using cached mistral_common-1.0.2-py3-none-any.whl.metadata (3.5 kB)
Requirement already satisfied: safetensors>=0.4.0 in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from mistral-inference) (0.4.3)
Collecting simple-parsing>=0.1.5 (from mistral-inference)
  Using cached simple_parsing-0.1.5-py3-none-any.whl.metadata (7.7 kB)
Collecting xformers>=0.0.25 (from mistral-inference)
  Using cached xformers-0.0.26.post1.tar.gz (4.1 MB)
  Preparing metadata (setup.py) ... done
Requirement already satisfied: six in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from fire>=0.6.0->mistral-inference) (1.16.0)
Requirement already satisfied: termcolor in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from fire>=0.6.0->mistral-inference) (2.2.0)
Collecting jsonschema==4.21.1 (from mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached jsonschema-4.21.1-py3-none-any.whl.metadata (7.8 kB)
Collecting pydantic==2.6.1 (from mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached pydantic-2.6.1-py3-none-any.whl.metadata (83 kB)
Collecting sentencepiece==0.1.99 (from mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached sentencepiece-0.1.99-cp39-cp39-macosx_10_9_x86_64.whl.metadata (7.7 kB)
Collecting typing-extensions<5.0.0,>=4.11.0 (from mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached typing_extensions-4.11.0-py3-none-any.whl.metadata (3.0 kB)
Collecting attrs>=22.2.0 (from jsonschema==4.21.1->mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached attrs-23.2.0-py3-none-any.whl.metadata (9.5 kB)
Collecting jsonschema-specifications>=2023.03.6 (from jsonschema==4.21.1->mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached jsonschema_specifications-2023.12.1-py3-none-any.whl.metadata (3.0 kB)
Collecting referencing>=0.28.4 (from jsonschema==4.21.1->mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached referencing-0.35.1-py3-none-any.whl.metadata (2.8 kB)
Collecting rpds-py>=0.7.1 (from jsonschema==4.21.1->mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached rpds_py-0.18.1-cp39-cp39-macosx_10_12_x86_64.whl.metadata (4.1 kB)
Collecting annotated-types>=0.4.0 (from pydantic==2.6.1->mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached annotated_types-0.7.0-py3-none-any.whl.metadata (15 kB)
Collecting pydantic-core==2.16.2 (from pydantic==2.6.1->mistral_common<2.0.0,>=1.0.0->mistral-inference)
  Using cached pydantic_core-2.16.2-cp39-cp39-macosx_10_12_x86_64.whl.metadata (6.5 kB)
Collecting docstring-parser~=0.15 (from simple-parsing>=0.1.5->mistral-inference)
  Using cached docstring_parser-0.16-py3-none-any.whl.metadata (3.0 kB)
Collecting torch>=2.1 (from xformers>=0.0.25->mistral-inference)
  Using cached torch-2.2.2-cp39-none-macosx_10_9_x86_64.whl.metadata (25 kB)
Requirement already satisfied: numpy in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from xformers>=0.0.25->mistral-inference) (1.22.4)
Requirement already satisfied: filelock in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from torch>=2.1->xformers>=0.0.25->mistral-inference) (3.6.0)
Requirement already satisfied: sympy in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from torch>=2.1->xformers>=0.0.25->mistral-inference) (1.10.1)
Requirement already satisfied: networkx in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from torch>=2.1->xformers>=0.0.25->mistral-inference) (2.7.1)
Requirement already satisfied: jinja2 in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from torch>=2.1->xformers>=0.0.25->mistral-inference) (2.11.3)
Requirement already satisfied: fsspec in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from torch>=2.1->xformers>=0.0.25->mistral-inference) (2024.5.0)
Requirement already satisfied: MarkupSafe>=0.23 in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from jinja2->torch>=2.1->xformers>=0.0.25->mistral-inference) (2.0.1)
Requirement already satisfied: mpmath>=0.19 in /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages (from sympy->torch>=2.1->xformers>=0.0.25->mistral-inference) (1.2.1)
Using cached mistral_inference-1.0.4-py3-none-any.whl (18 kB)
Using cached mistral_common-1.0.2-py3-none-any.whl (702 kB)
Using cached jsonschema-4.21.1-py3-none-any.whl (85 kB)
Using cached pydantic-2.6.1-py3-none-any.whl (394 kB)
Using cached sentencepiece-0.1.99-cp39-cp39-macosx_10_9_x86_64.whl (1.2 MB)
Using cached pydantic_core-2.16.2-cp39-cp39-macosx_10_12_x86_64.whl (1.9 MB)
Using cached simple_parsing-0.1.5-py3-none-any.whl (113 kB)
Using cached docstring_parser-0.16-py3-none-any.whl (36 kB)
Using cached torch-2.2.2-cp39-none-macosx_10_9_x86_64.whl (150.8 MB)
Using cached typing_extensions-4.11.0-py3-none-any.whl (34 kB)
Using cached annotated_types-0.7.0-py3-none-any.whl (13 kB)
Using cached attrs-23.2.0-py3-none-any.whl (60 kB)
Using cached jsonschema_specifications-2023.12.1-py3-none-any.whl (18 kB)
Using cached referencing-0.35.1-py3-none-any.whl (26 kB)
Using cached rpds_py-0.18.1-cp39-cp39-macosx_10_12_x86_64.whl (327 kB)
Building wheels for collected packages: xformers
  Building wheel for xformers (setup.py) ... error
  error: subprocess-exited-with-error

  × python setup.py bdist_wheel did not run successfully.
  │ exit code: 1
  ╰─> [245 lines of output]
      running bdist_wheel
      /Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages/torch/utils/cpp_extension.py:476: UserWarning: Attempted to use ninja as the BuildExtension backend but we could not find ninja.. Falling back to using the slow distutils backend.
        warnings.warn(msg.format('we could not find ninja.'))
      running build
      running build_py
      creating build
      creating build/lib.macosx-10.9-x86_64-3.9
      creating build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/_deprecation_warning.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/attn_bias_utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/checkpoint.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/test.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/_cpp_lib.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      copying xformers/info.py -> build/lib.macosx-10.9-x86_64-3.9/xformers
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/fused_linear_layer.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/vararg_kernel.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/k_activations.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/k_layer_norm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/k_fused_matmul_fw.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/dropout.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/k_dropout.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/softmax.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/layer_norm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/k_fused_matmul_bw.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      copying xformers/triton/k_softmax.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/triton
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/simplicial_embedding.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/residual.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/reversible.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/activations.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/multi_head_dispatch.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/input_projection.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      copying xformers/components/patch_embedding.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_mem_eff_attention.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_indexing.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_mlp.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_blocksparse_transformers.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_transformer.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_revnet.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_swiglu.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_triton_layernorm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_causal_blocksparse.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_triton_fused_linear.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_tiled_matmul.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_triton_blocksparse.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_triton_softmax.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_triton_dropout.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_nystrom_utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_attn_decoding.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_multi_head_dispatch.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_sequence_parallel_fused.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_sddmm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_sp24.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      copying xformers/benchmarks/benchmark_core.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/rmsnorm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/modpar_layers.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/swiglu_op.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/unbind.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/rope_padded.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/seqpar.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/ipc.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/sequence_parallel_fused_ops.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/sp24.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/common.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/differentiable_collectives.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/tiled_matmul.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      copying xformers/ops/indexing.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/device_limits.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/profiler_dcgm_impl.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/profiler_dcgm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/api.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/slow_ops_profiler.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      copying xformers/profiler/profiler.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/profiler
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/sparse
      copying xformers/sparse/_csr_ops.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/sparse
      copying xformers/sparse/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/sparse
      copying xformers/sparse/utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/sparse
      copying xformers/sparse/blocksparse_tensor.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/sparse
      copying xformers/sparse/csr_tensor.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/sparse
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/helpers
      copying xformers/helpers/test_utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/helpers
      copying xformers/helpers/hierarchical_configs.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/helpers
      copying xformers/helpers/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/helpers
      copying xformers/helpers/timm_sparse_attention.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/helpers
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/fused_softmax.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/flash_blocksparse_attn_interface.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/flash_blocksparse_attention.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/bert_padding.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/flash_attn_triton_og.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/flash_attn_triton.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      copying xformers/_flash_attn/flash_attn_interface.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      copying xformers/factory/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      copying xformers/factory/hydra_helper.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      copying xformers/factory/block_factory.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      copying xformers/factory/model_factory.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      copying xformers/factory/block_configs.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      copying xformers/factory/weight_init.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/factory
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/global_tokens.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/ortho.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/blocksparse.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/local.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/compositional.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/pooling.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/_sputnik_sparse.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/core.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/lambda_layer.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/random.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/fourier_mix.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/scaled_dot_product.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/utils.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/attention_mask.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/linformer.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/attention_patterns.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/visual.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/sparsity_config.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/nystrom.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/favor.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      copying xformers/components/attention/base.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      copying xformers/components/feedforward/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      copying xformers/components/feedforward/mixture_of_experts.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      copying xformers/components/feedforward/mlp.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      copying xformers/components/feedforward/conv_mlp.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      copying xformers/components/feedforward/fused_mlp.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      copying xformers/components/feedforward/base.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/feedforward
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      copying xformers/components/positional_embedding/vocab.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      copying xformers/components/positional_embedding/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      copying xformers/components/positional_embedding/param.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      copying xformers/components/positional_embedding/sine.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      copying xformers/components/positional_embedding/rotary.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      copying xformers/components/positional_embedding/base.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/positional_embedding
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention/feature_maps
      copying xformers/components/attention/feature_maps/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention/feature_maps
      copying xformers/components/attention/feature_maps/softmax.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention/feature_maps
      copying xformers/components/attention/feature_maps/base.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/components/attention/feature_maps
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      copying xformers/benchmarks/LRA/batch_submit.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      copying xformers/benchmarks/LRA/batch_fetch_results.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      copying xformers/benchmarks/LRA/run_with_submitit.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      copying xformers/benchmarks/LRA/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      copying xformers/benchmarks/LRA/run_tasks.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      copying xformers/benchmarks/LRA/run_grid_search.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA/code
      copying xformers/benchmarks/LRA/code/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA/code
      copying xformers/benchmarks/LRA/code/model_wrapper.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA/code
      copying xformers/benchmarks/LRA/code/dataset.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/benchmarks/LRA/code
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/k_scaled_index_add.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/rope_padded_kernels.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/sequence_parallel_fused_kernels.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/tiled_matmul_kernels.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/k_index_select_cat.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      copying xformers/ops/_triton/rmsnorm_kernels.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/_triton
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/decoder.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/dispatch.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/attn_bias.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/ck.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/common.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/ck_decoder.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/flash.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/small_k.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/cutlass.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/ck_splitk.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      copying xformers/ops/fmha/triton_splitk.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/ops/fmha
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/losses
      copying xformers/_flash_attn/losses/cross_entropy.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/losses
      copying xformers/_flash_attn/losses/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/losses
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/layers
      copying xformers/_flash_attn/layers/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/layers
      copying xformers/_flash_attn/layers/patch_embed.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/layers
      copying xformers/_flash_attn/layers/rotary.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/layers
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/utils
      copying xformers/_flash_attn/utils/pretrained.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/utils
      copying xformers/_flash_attn/utils/generation.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/utils
      copying xformers/_flash_attn/utils/benchmark.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/utils
      copying xformers/_flash_attn/utils/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/utils
      copying xformers/_flash_attn/utils/distributed.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/utils
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/bigcode.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/gptj.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/opt.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/llama.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/vit.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/btlm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/baichuan.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/bert.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/falcon.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/gpt_neox.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      copying xformers/_flash_attn/models/gpt.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/models
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops
      copying xformers/_flash_attn/ops/activations.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops
      copying xformers/_flash_attn/ops/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops
      copying xformers/_flash_attn/ops/fused_dense.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops
      copying xformers/_flash_attn/ops/rms_norm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops
      copying xformers/_flash_attn/ops/layer_norm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/modules
      copying xformers/_flash_attn/modules/embedding.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/modules
      copying xformers/_flash_attn/modules/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/modules
      copying xformers/_flash_attn/modules/mlp.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/modules
      copying xformers/_flash_attn/modules/block.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/modules
      copying xformers/_flash_attn/modules/mha.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/modules
      creating build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/cross_entropy.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/linear.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/k_activations.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/__init__.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/mlp.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/rotary.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      copying xformers/_flash_attn/ops/triton/layer_norm.py -> build/lib.macosx-10.9-x86_64-3.9/xformers/_flash_attn/ops/triton
      running build_ext
      building 'xformers._C' extension
      creating build/temp.macosx-10.9-x86_64-3.9
      creating build/temp.macosx-10.9-x86_64-3.9/xformers
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/attention
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/attention/autograd
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/attention/cpu
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/sequence_parallel_fused
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/sparse24
      creating build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/swiglu
      clang -Wno-unused-result -Wsign-compare -Wunreachable-code -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /Users/edmondjacoupeau/opt/anaconda3/include -arch x86_64 -I/Users/edmondjacoupeau/opt/anaconda3/include -fPIC -O2 -isystem /Users/edmondjacoupeau/opt/anaconda3/include -arch x86_64 -I/private/var/folders/vq/_stcpmys7n984r7gy0pm4_g40000gn/T/pip-install-5a1st423/xformers_2dfbd8981dda4983b6e310b34d15d5e6/xformers/csrc -I/Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages/torch/include -I/Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages/torch/include/TH -I/Users/edmondjacoupeau/opt/anaconda3/lib/python3.9/site-packages/torch/include/THC -I/Users/edmondjacoupeau/opt/anaconda3/include/python3.9 -c xformers/csrc/attention/attention.cpp -o build/temp.macosx-10.9-x86_64-3.9/xformers/csrc/attention/attention.o -O3 -std=c++17 -fopenmp -DTORCH_API_INCLUDE_EXTENSION_H -DPYBIND11_COMPILER_TYPE=\"_clang\" -DPYBIND11_STDLIB=\"_libcpp\" -DPYBIND11_BUILD_ABI=\"_cxxabi1002\" -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
      clang: error: unsupported option '-fopenmp'
      error: command '/usr/bin/clang' failed with exit code 1
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
  ERROR: Failed building wheel for xformers
  Running setup.py clean for xformers
Failed to build xformers
ERROR: Could not build wheels for xformers, which is required to install pyproject.toml-based projects

Click to add a cell.
patrickvonplaten commented 1 month ago

Hey @edmondja,

mistral-inference requires xformers to be installed at the moment. So you first need to figure out a way to install xformers in you OS -> if this works, then:

pip install mistral-inference

will work as well