-
This setup does not use docker-compose.
Step 1. Start the ray_head node
```
docker run -d \
--name ray_head \
--privileged \
--env MODEL_FOLDER=${MODEL_FOLDER} \
--env RAY_NUM_CPUS=8 \
-p 6379:…
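# The command above is truncated; for reference, a complete head-node
# invocation might look like the following (the image name, dashboard
# port, and ray start flags are my assumptions -- adjust to your setup):
docker run -d \
  --name ray_head \
  --privileged \
  --env MODEL_FOLDER=${MODEL_FOLDER} \
  --env RAY_NUM_CPUS=8 \
  -p 6379:6379 \
  -p 8265:8265 \
  rayproject/ray:latest \
  ray start --head --port=6379 --dashboard-host=0.0.0.0 --block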
-
I ran lightrag_hf_demo.py, but it produces no response. Does anyone know what is going on?
My code is as follows:
```
import os
from lightrag import LightRAG, QueryParam
from …
-
**Describe the bug**
When using Flash Attention (--use-flash-attention true) to train a Qwen2VL model with mixed data (both image and text), the code yields the following error:
```
[rank0]: …
-
1. PARE: Part Attention Regressor for 3D Human Body Estimation (2021)
image --> volumetric features (before the global average pooling) --> part branch: estimates attention weights + feature branch: performs S…
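The two-branch aggregation described above can be sketched as attention-weighted spatial pooling: each part's attention map (softmax-normalized over spatial locations) selects which feature-map locations contribute to that part's feature vector. A minimal numpy sketch, with illustrative shapes rather than the paper's exact configuration:

```python
import numpy as np

rng = np.random.default_rng(0)
H, W, C, J = 8, 8, 16, 24  # feature map size, channels, number of body parts

features = rng.standard_normal((H, W, C))     # feature branch output
part_logits = rng.standard_normal((H, W, J))  # part branch output (pre-softmax)

# Softmax over the spatial dimensions, independently per part
flat = part_logits.reshape(-1, J)                      # (H*W, J)
attn = np.exp(flat - flat.max(axis=0, keepdims=True))
attn /= attn.sum(axis=0, keepdims=True)                # each column sums to 1

# Attention-weighted pooling: (J, H*W) @ (H*W, C) -> (J, C)
part_features = attn.T @ features.reshape(-1, C)
print(part_features.shape)  # (24, 16)
```

Each row of `part_features` is then fed to the per-part regressor; the soft attention makes the selection differentiable end to end.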
-
### Feature request
Currently, if fp16 is used with Grounding DINO via https://huggingface.co/docs/transformers/main/en/model_doc/grounding-dino, the following error occurs:
```
...
Fi…
-
This is a "living issue". Editing is appreciated.
### Context:
- Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard
- We can choose to index the pdf dat…
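Whatever embedding model the leaderboard points to, the indexing side reduces to storing one vector per chunk and ranking by cosine similarity at query time. A minimal sketch in plain numpy (the random vectors are stand-ins for a real model's embeddings):

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-in document embeddings: one row per indexed chunk. A real pipeline
# would obtain these from the chosen embedding model.
doc_vecs = rng.standard_normal((100, 384))
doc_vecs /= np.linalg.norm(doc_vecs, axis=1, keepdims=True)

def top_k(query_vec, k=5):
    """Rank indexed chunks by cosine similarity to the query vector."""
    q = query_vec / np.linalg.norm(query_vec)
    scores = doc_vecs @ q          # cosine similarity (rows are unit-norm)
    idx = np.argsort(-scores)[:k]  # indices of the k best-matching chunks
    return idx, scores[idx]

idx, scores = top_k(rng.standard_normal(384))
print(idx)
```

For a corpus of PDF chunks this brute-force scan is usually fine up to a few hundred thousand vectors; beyond that an ANN index would replace the matrix product.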
-
## 🐛 Bug
When training the models 'vicuna-7b-v1.5-16k', 'longchat-13b-16k', 'Mistral-7B-v0.2', 'falcon-180B', 'Llama-3-70B', and 'CodeLlama-34b-hf' with FSDP and FP8, we get KeyError: 'scaling_fwd'. This m…
-
**Describe the bug**
I am running the data preprocessing script using the following command:
```
python tools/preprocess_data.py \
--input ./openwebtext/scraped_100/train_data.json \
--…
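# The remaining flags above are truncated. If this is Megatron-style
# preprocessing, preprocess_data.py typically expects loose-JSON input:
# one JSON object per line with a "text" field (the field name is
# configurable via --json-keys) -- worth verifying against your checkout.
# A minimal valid input file (hypothetical sample) can be created like:
printf '%s\n' '{"text": "first document"}' '{"text": "second document"}' > sample_train_data.json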
-
Great work!
I am trying to run pyramidinfer with a Llama3-8B-Instruct model, but it seems that my version of "transformers" is too old to load the weights of the Llama3-8B model.
I ran this command …
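If the failure is really the transformers version, upgrading inside the pyramidinfer environment may resolve the loading error; Llama 3 support landed around transformers v4.40 (check the release notes rather than taking my word for the exact version), though note that the repo's patches may need re-applying against a newer release:

```shell
# Assumed fix: pull a transformers release recent enough to know the
# Llama 3 architecture (the version bound is my recollection, not verified)
pip install -U "transformers>=4.40"
```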
-
### System Info
- `transformers` version: 4.46.2
- Platform: Linux-5.15.0-120-generic-x86_64-with-glibc2.35
- Python version: 3.10.15
- Huggingface_hub version: 0.26.2
- Safetensors version: 0.…