transformer-architecture Search Results

1000+ results
for transformer-architecture

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

deepseek-ai/DeepSeek-VL #60

ValueError: The checkpoint you are trying to load has model …

您好！我下载该模型搭配LLamafactory框架，在做api部署的时候，报以下错误 [2024-10-01 00:15:35,483] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda (auto detect) [INFO|configuration_utils.py:670]…

Y-PanC updated 2 weeks ago
1
webmachinelearning/webnn #375

Support for transformers

While our [draft charter](https://www.w3.org/2023/03/proposed-webmachinelearning-charter.html) says that the group: > priority on building blocks required by well-known model architectures such as re…

dontcallmedom updated 1 week ago
35
neurokitti/AIRIS-VtuberAI #2

not fully sure whats happening

After getting to the option for what version to run im getting some errors like: "C:\Users\dylan\OneDrive\Desktop\alinity\AIRIS-VtuberAI\.venv\lib\site-packages\transformers\models\auto\configurati…

sbk-djarman updated 1 month ago
8
lfai/model_openness_tool #39

RESTful API

While the models data can now be accessed through the repo this does not include the classification data. To make it easier for external tools to get to this info we should add a RESTful API to the MO…

lehors updated 1 week ago
3
huggingface/text-embeddings-inference #436

Support `gte-multilingual-reranker-base`

Hello, I'm trying to deploy `gte-multilingual-reranker-base` using the Text Embeddings Inference but have encountered issues despite following the guidance provided in [issue #366](https://github.c…

zhanghx0905 updated 21 hours ago
4
pytorch/pytorch #140069

DTensor support for fused qkv matmul

### 🚀 The feature, motivation and pitch For transformer architecture (for example https://github.com/pytorch-labs/gpt-fast/blob/main/model.py#L195-L211) it tends to be most performant to merge the qk…

HDCharles updated 1 week ago
1
ServiceNow/Fast-LLM #39

[feat] Llama 3.x rope scaling support

# 🧐 Problem Description Fast-LLM lacks support for Llama 3.x models due to missing compatibility with Llama-3-style RoPE scaling. This prevents us from effectively training or using Llama 3.x check…

tscholak updated 6 hours ago
2
huggingface/candle #2593

Model support: OmniGen

OmniGen is a new image generation model that is built by tuning an existing Phi-3 model into a transformer for diffusion task. It appears to have next-level multi-modal capability, like incorporating …

Czxck001 updated 2 weeks ago
1
NVIDIA/TensorRT-LLM #1201

"Fp16 precision has been set for a layer or layer output, bu…

### System Info - CPU Architecture: x86_64 - CPU Type: AMD Epyc 9654 - GPU Type: Nvidia H100 - Nvidia Docker Container: nvcr.io/nvidia/nemo:24.01.gemma ### Who can help? @Tracin ### Information…

giancarlo-metitieri updated 4 days ago
5
microsoft/OmniParser #74

No files matching the pattern were found

After setting up the environment by following the tutorial, I ran `gradio_demo.py` and encountered the following error: ``` > python .\gradio_demo.py [2024-11-05 18:06:25,042] [ WARNING] easyocr.py:…

sby-a-izumi updated 1 week ago
3

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for transformer-architecture

1000+ results
for transformer-architecture