unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
https://unsloth.ai
Apache License 2.0
16.12k stars 1.1k forks source link

ERROR: Failed building wheel for xformers #414

Open proto1994 opened 4 months ago

proto1994 commented 4 months ago

I am on Apple M1 Pro, and the python environment is 3.9.13. When I execute the following command, I get an error.

pip install --no-deps "xformers<0.0.26" trl peft accelerate bitsandbytes

Defaulting to user installation because normal site-packages is not writeable Collecting xformers<0.0.26 Using cached xformers-0.0.25.post1.tar.gz (4.1 MB) Preparing metadata (setup.py) ... done Collecting trl Using cached trl-0.8.6-py3-none-any.whl.metadata (11 kB) Collecting peft Using cached peft-0.10.0-py3-none-any.whl.metadata (13 kB) Collecting accelerate Using cached accelerate-0.29.3-py3-none-any.whl.metadata (18 kB) Collecting bitsandbytes Using cached bitsandbytes-0.42.0-py3-none-any.whl.metadata (9.9 kB) Using cached trl-0.8.6-py3-none-any.whl (245 kB) Using cached peft-0.10.0-py3-none-any.whl (199 kB) Using cached accelerate-0.29.3-py3-none-any.whl (297 kB) Using cached bitsandbytes-0.42.0-py3-none-any.whl (105.0 MB) Building wheels for collected packages: xformers Building wheel for xformers (setup.py) ... error error: subprocess-exited-with-error

× python setup.py bdist_wheel did not run successfully. │ exit code: 1 ╰─> [248 lines of output] running bdist_wheel /Users/proto/Library/Python/3.9/lib/python/site-packages/torch/utils/cpp_extension.py:500: UserWarning: Attempted to use ninja as the BuildExtension backend but we could not find ninja.. Falling back to using the slow distutils backend. warnings.warn(msg.format('we could not find ninja.')) running build running build_py creating build creating build/lib.macosx-10.9-universal2-3.9 creating build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/_deprecation_warning.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/attn_bias_utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/checkpoint.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/test.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/_cpp_lib.py -> build/lib.macosx-10.9-universal2-3.9/xformers copying xformers/info.py -> build/lib.macosx-10.9-universal2-3.9/xformers creating build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/fused_linear_layer.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/vararg_kernel.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/k_activations.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/k_layer_norm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/k_fused_matmul_fw.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/dropout.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/k_dropout.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/softmax.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/layer_norm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/k_fused_matmul_bw.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton copying xformers/triton/k_softmax.py -> build/lib.macosx-10.9-universal2-3.9/xformers/triton creating build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/simplicial_embedding.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/residual.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/reversible.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/activations.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/multi_head_dispatch.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/input_projection.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components copying xformers/components/patch_embedding.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components creating build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_mem_eff_attention.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_indexing.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_mlp.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_blocksparse_transformers.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_mem_eff_attn_decoder.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_transformer.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_revnet.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_swiglu.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_triton_layernorm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_causal_blocksparse.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_triton_fused_linear.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_tiled_matmul.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_triton_blocksparse.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_triton_softmax.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_mem_eff_attention_mqa.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_triton_dropout.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_nystrom_utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_attn_decoding.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_multi_head_dispatch.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_sequence_parallel_fused.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_sddmm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_sp24.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks copying xformers/benchmarks/benchmark_core.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks creating build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/rmsnorm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/modpar_layers.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/swiglu_op.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/unbind.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/rope_padded.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/seqpar.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/ipc.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/sequence_parallel_fused_ops.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/sp24.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/common.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/differentiable_collectives.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/tiled_matmul.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops copying xformers/ops/indexing.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops creating build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/device_limits.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/profiler_dcgm_impl.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/profiler_dcgm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/api.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/slow_ops_profiler.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler copying xformers/profiler/profiler.py -> build/lib.macosx-10.9-universal2-3.9/xformers/profiler creating build/lib.macosx-10.9-universal2-3.9/xformers/sparse copying xformers/sparse/_csr_ops.py -> build/lib.macosx-10.9-universal2-3.9/xformers/sparse copying xformers/sparse/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/sparse copying xformers/sparse/utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers/sparse copying xformers/sparse/blocksparse_tensor.py -> build/lib.macosx-10.9-universal2-3.9/xformers/sparse copying xformers/sparse/csr_tensor.py -> build/lib.macosx-10.9-universal2-3.9/xformers/sparse creating build/lib.macosx-10.9-universal2-3.9/xformers/helpers copying xformers/helpers/test_utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers/helpers copying xformers/helpers/hierarchical_configs.py -> build/lib.macosx-10.9-universal2-3.9/xformers/helpers copying xformers/helpers/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/helpers copying xformers/helpers/timm_sparse_attention.py -> build/lib.macosx-10.9-universal2-3.9/xformers/helpers creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/fused_softmax.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/flash_blocksparse_attn_interface.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/flash_blocksparse_attention.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/bert_padding.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/flash_attn_triton_og.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/flash_attn_triton.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn copying xformers/_flash_attn/flash_attn_interface.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn creating build/lib.macosx-10.9-universal2-3.9/xformers/factory copying xformers/factory/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/factory copying xformers/factory/hydra_helper.py -> build/lib.macosx-10.9-universal2-3.9/xformers/factory copying xformers/factory/block_factory.py -> build/lib.macosx-10.9-universal2-3.9/xformers/factory copying xformers/factory/model_factory.py -> build/lib.macosx-10.9-universal2-3.9/xformers/factory copying xformers/factory/block_configs.py -> build/lib.macosx-10.9-universal2-3.9/xformers/factory copying xformers/factory/weight_init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/factory creating build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/global_tokens.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/ortho.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/blocksparse.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/local.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/compositional.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/pooling.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/_sputnik_sparse.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/core.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/lambda_layer.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/random.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/fourier_mix.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/scaled_dot_product.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/utils.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/attention_mask.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/linformer.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/attention_patterns.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/visual.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/sparsity_config.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/nystrom.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/favor.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention copying xformers/components/attention/base.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention creating build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward copying xformers/components/feedforward/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward copying xformers/components/feedforward/mixture_of_experts.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward copying xformers/components/feedforward/mlp.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward copying xformers/components/feedforward/conv_mlp.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward copying xformers/components/feedforward/fused_mlp.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward copying xformers/components/feedforward/base.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/feedforward creating build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding copying xformers/components/positional_embedding/vocab.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding copying xformers/components/positional_embedding/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding copying xformers/components/positional_embedding/param.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding copying xformers/components/positional_embedding/sine.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding copying xformers/components/positional_embedding/rotary.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding copying xformers/components/positional_embedding/base.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/positional_embedding creating build/lib.macosx-10.9-universal2-3.9/xformers/components/attention/feature_maps copying xformers/components/attention/feature_maps/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention/feature_maps copying xformers/components/attention/feature_maps/softmax.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention/feature_maps copying xformers/components/attention/feature_maps/base.py -> build/lib.macosx-10.9-universal2-3.9/xformers/components/attention/feature_maps creating build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA copying xformers/benchmarks/LRA/batch_submit.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA copying xformers/benchmarks/LRA/batch_fetch_results.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA copying xformers/benchmarks/LRA/run_with_submitit.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA copying xformers/benchmarks/LRA/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA copying xformers/benchmarks/LRA/run_tasks.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA copying xformers/benchmarks/LRA/run_grid_search.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA creating build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA/code copying xformers/benchmarks/LRA/code/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA/code copying xformers/benchmarks/LRA/code/model_wrapper.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA/code copying xformers/benchmarks/LRA/code/dataset.py -> build/lib.macosx-10.9-universal2-3.9/xformers/benchmarks/LRA/code creating build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/k_scaled_index_add.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/rope_padded_kernels.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/sequence_parallel_fused_kernels.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/tiled_matmul_kernels.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/k_index_select_cat.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton copying xformers/ops/_triton/rmsnorm_kernels.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/_triton creating build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/decoder.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/dispatch.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/attn_bias.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/ck.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/common.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/ck_decoder.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/flash.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/small_k.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/cutlass.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/ck_splitk.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha copying xformers/ops/fmha/triton_splitk.py -> build/lib.macosx-10.9-universal2-3.9/xformers/ops/fmha creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/losses copying xformers/_flash_attn/losses/cross_entropy.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/losses copying xformers/_flash_attn/losses/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/losses creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/layers copying xformers/_flash_attn/layers/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/layers copying xformers/_flash_attn/layers/patch_embed.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/layers copying xformers/_flash_attn/layers/rotary.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/layers creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/utils copying xformers/_flash_attn/utils/pretrained.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/utils copying xformers/_flash_attn/utils/generation.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/utils copying xformers/_flash_attn/utils/benchmark.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/utils copying xformers/_flash_attn/utils/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/utils copying xformers/_flash_attn/utils/distributed.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/utils creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/bigcode.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/gptj.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/opt.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/llama.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/vit.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/btlm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/baichuan.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/bert.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/falcon.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/gpt_neox.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models copying xformers/_flash_attn/models/gpt.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/models creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops copying xformers/_flash_attn/ops/activations.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops copying xformers/_flash_attn/ops/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops copying xformers/_flash_attn/ops/fused_dense.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops copying xformers/_flash_attn/ops/rms_norm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops copying xformers/_flash_attn/ops/layer_norm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/modules copying xformers/_flash_attn/modules/embedding.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/modules copying xformers/_flash_attn/modules/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/modules copying xformers/_flash_attn/modules/mlp.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/modules copying xformers/_flash_attn/modules/block.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/modules copying xformers/_flash_attn/modules/mha.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/modules creating build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/cross_entropy.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/linear.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/k_activations.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/init.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/mlp.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/rotary.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton copying xformers/_flash_attn/ops/triton/layer_norm.py -> build/lib.macosx-10.9-universal2-3.9/xformers/_flash_attn/ops/triton running build_ext building 'xformers._C' extension creating build/temp.macosx-10.9-universal2-3.9 creating build/temp.macosx-10.9-universal2-3.9/xformers creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc/attention creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc/attention/autograd creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc/attention/cpu creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc/sequence_parallel_fused creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc/sparse24 creating build/temp.macosx-10.9-universal2-3.9/xformers/csrc/swiglu clang -Wno-unused-result -Wsign-compare -Wunreachable-code -fno-common -dynamic -DNDEBUG -g -fwrapv -O3 -Wall -iwithsysroot/System/Library/Frameworks/System.framework/PrivateHeaders -iwithsysroot/Applications/Xcode.app/Contents/Developer/Library/Frameworks/Python3.framework/Versions/3.9/Headers -arch arm64 -arch x86_64 -Werror=implicit-function-declaration -I/private/var/folders/v5/94yjmmq93s1bk1l4nq8q90d40000gr/T/pip-install-6rc03t1i/xformers_584eec7fa0c44a389cb5867f7e08966d/xformers/csrc -I/Users/proto/Library/Python/3.9/lib/python/site-packages/torch/include -I/Users/proto/Library/Python/3.9/lib/python/site-packages/torch/include/torch/csrc/api/include -I/Users/proto/Library/Python/3.9/lib/python/site-packages/torch/include/TH -I/Users/proto/Library/Python/3.9/lib/python/site-packages/torch/include/THC -I/Applications/Xcode.app/Contents/Developer/Library/Frameworks/Python3.framework/Versions/3.9/include/python3.9 -c xformers/csrc/attention/attention.cpp -o build/temp.macosx-10.9-universal2-3.9/xformers/csrc/attention/attention.o -O3 -std=c++17 -fopenmp -DTORCH_API_INCLUDE_EXTENSION_H -DPYBIND11_COMPILER_TYPE="_clang" -DPYBIND11_STDLIB="_libcpp" -DPYBIND11_BUILD_ABI="_cxxabi1002" -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0 clang: error: unsupported option '-fopenmp' clang: error: unsupported option '-fopenmp' error: command '/usr/bin/clang' failed with exit code 1 [end of output]

note: This error originates from a subprocess, and is likely not a problem with pip. ERROR: Failed building wheel for xformers Running setup.py clean for xformers Failed to build xformers ERROR: Could not build wheels for xformers, which is required to install pyproject.toml-based projects

danielhanchen commented 4 months ago

Wait does Apple M1 Pro have a NVIDIA GPU?

proto1994 commented 4 months ago

emm, probably not. Is there any way to train on a mac computer?