-
您好!
我下载该模型搭配LLamafactory框架,在做api部署的时候,报以下错误
[2024-10-01 00:15:35,483] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[INFO|configuration_utils.py:670]…
-
While our [draft charter](https://www.w3.org/2023/03/proposed-webmachinelearning-charter.html) says that the group:
> priority on building blocks required by well-known model architectures such as re…
-
After getting to the option for what version to run im getting some errors like:
"C:\Users\dylan\OneDrive\Desktop\alinity\AIRIS-VtuberAI\.venv\lib\site-packages\transformers\models\auto\configurati…
-
While the models data can now be accessed through the repo this does not include the classification data. To make it easier for external tools to get to this info we should add a RESTful API to the MO…
-
Hello,
I'm trying to deploy `gte-multilingual-reranker-base` using the Text Embeddings Inference but have encountered issues despite following the guidance provided in [issue #366](https://github.c…
-
### 🚀 The feature, motivation and pitch
For transformer architecture (for example https://github.com/pytorch-labs/gpt-fast/blob/main/model.py#L195-L211) it tends to be most performant to merge the qk…
-
# 🧐 Problem Description
Fast-LLM lacks support for Llama 3.x models due to missing compatibility with Llama-3-style RoPE scaling. This prevents us from effectively training or using Llama 3.x check…
-
OmniGen is a new image generation model that is built by tuning an existing Phi-3 model into a transformer for diffusion task. It appears to have next-level multi-modal capability, like incorporating …
-
### System Info
- CPU Architecture: x86_64
- CPU Type: AMD Epyc 9654
- GPU Type: Nvidia H100
- Nvidia Docker Container: nvcr.io/nvidia/nemo:24.01.gemma
### Who can help?
@Tracin
### Information…
-
After setting up the environment by following the tutorial, I ran `gradio_demo.py` and encountered the following error:
```
> python .\gradio_demo.py
[2024-11-05 18:06:25,042] [ WARNING] easyocr.py:…