Open FedorSymkin opened 1 month ago
Hello. I need CPU inference for Llama 3.2, and I found an open PR for it: https://github.com/meta-llama/llama/pull/202. Do you plan to merge it? I could not find any mention of a CPU mode in main, so I assume this PR is still relevant.