-
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Facing an error when using LangChain-wrapped Hugging Face models**
I am …
-
Hello,
I am not able to load the `sentence-transformers/distiluse-base-multilingual-cased-v2` model. Script example:
```python
from sentence_transformers import SentenceTransformer
model_name = …
-
**Describe the bug**
When I use llm-compressor to quantize a LLaVA model, it fails at the very beginning. (Unrecognized configuration class: 'transformers.models.llava.configuration_llava.LlavaConfig'…
-
In modeling_qwen2_vl.py https://github.com/huggingface/transformers/blob/main/src/transformers/models/qwen2_vl/modeling_qwen2_vl.py#L343
The `attention_mask` is set for each frame; when it is not set, the f…
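To illustrate the per-frame masking this issue refers to, here is a minimal pure-Python sketch (assumed cumulative segment boundaries `cu_seqlens`; this is not the actual modeling_qwen2_vl code) of a block-diagonal attention mask in which a token may only attend to tokens within its own frame:

```python
def per_frame_attention_mask(cu_seqlens):
    """Build a boolean mask (True = may attend) that is block-diagonal:
    token i may attend to token j only if both fall inside the same frame
    segment. cu_seqlens holds cumulative boundaries, e.g. [0, 3, 5] means
    one frame of 3 tokens followed by one frame of 2 tokens."""
    total = cu_seqlens[-1]
    mask = [[False] * total for _ in range(total)]
    for start, end in zip(cu_seqlens[:-1], cu_seqlens[1:]):
        for i in range(start, end):
            for j in range(start, end):
                mask[i][j] = True
    return mask
```

In the real model the same structure is typically expressed as additive `-inf` entries outside each block rather than a boolean matrix.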
-
[Scalable Diffusion Models with Transformers](https://arxiv.org/pdf/2212.09748)
Given the remarkable achievements of Google's AlphaFold 3, which also uses DiT, combining diffusion models and Transformers…
-
> Please provide us with the following information:
> ---------------------------------------------------------------
### This issue is for Phi-3.5-vision-instruct: (mark with an `x`)
```
- [ …
-
### Feature request
The `bias` of the linear layers in the `qwen2` model is hard-coded, as follows:
- https://github.com/huggingface/transformers/blob/85345bb439652d3f03bb4e123cef7a440f2ba95b/src/transformers/…
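A minimal pure-Python sketch of the requested change (stand-in classes; `attention_bias` is a hypothetical config field, and real code would use `torch.nn.Linear` and the actual `Qwen2Config`), showing how a config flag could replace the hard-coded value:

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class Qwen2ConfigSketch:
    # Trimmed-down, hypothetical config; the real Qwen2Config lives in transformers.
    hidden_size: int = 8
    attention_bias: bool = True  # hypothetical flag replacing the hard-coded bias=True

class LinearSketch:
    # Stand-in for torch.nn.Linear that only records whether a bias was requested.
    def __init__(self, in_features: int, out_features: int, bias: bool = True):
        self.in_features = in_features
        self.out_features = out_features
        self.bias: Optional[List[float]] = [0.0] * out_features if bias else None

def build_qkv(config: Qwen2ConfigSketch) -> LinearSketch:
    # Read the flag from the config instead of hard-coding bias=True.
    return LinearSketch(config.hidden_size, config.hidden_size,
                        bias=config.attention_bias)
```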
-
Are there any plans to add decimated Mamba-2? Also, is there a chance we will see implementations based on the `transformers` package's Mamba implementation? https://github.com/huggingface/transformer…
-
I cannot import this from the library:
optimum-1.21.4
transformers-4.43.4
How can I fix this issue? The code I am running is from the quick-start documentation of `Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4…
-
**Describe the bug**
When deploying LLaVA-NeXT-Video-34B-hf, I find that the configuration key passed to transformers is "llava_next_video", while the correct key in transformers is "llava-next-video…
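Until the naming is reconciled, a defensive sketch (hypothetical `normalize_model_type` helper, not a transformers API) can fold both spellings into one canonical key before comparison:

```python
def normalize_model_type(key: str) -> str:
    # Hypothetical helper, not part of transformers: fold hyphen/underscore
    # (and case) variants of a model-type key into one canonical form, so
    # "llava_next_video" and "llava-next-video" compare as equal.
    return key.replace("-", "_").lower()
```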