model-performance Search Results

1000+ results
for model-performance

Best match


microsoft/DeepSpeed #6727

[BUG] any clue for MFU drop?

![Image](https://github.com/user-attachments/assets/459f5917-ac00-449c-8e15-b4bb3d840255) The y-axis is MFU and the x-axis is training step. I'm testing Qwen 72B with the Hugging Face Trainer and whenever I trai…

SeunghyunSEO updated 2 days ago
4
dzjxzyd/FusionESP #1

question about the confidence score and where is the cosine …

Hi, thanks for your work. I am confused by the output, specifically the confidence score. I tested enzyme 2q8m with its substrate fbp (as shown in the figure) and got a confidence score of 0.66. However, th…

Huilin-Li updated 3 days ago
1
microsoft/onnxruntime #22533

[Feature Request] Add Wasm Relaxed SIMD support and integer …

### Describe the feature request Wasm Relaxed SIMD includes integer dot product instructions, which will map to VNNI instructions on X86-64 platforms with AVX-VNNI (on ARM maybe SDOT, but I haven't t…

jing-bao updated 3 weeks ago
3
LILY-QML/LLY-DML-V1 #4

Data Drift Issue

#### Subpoint 4.1: Limited Recognition Based on Training Patterns **Ticket Title**: Address Data Drift from Variability in Training Patterns **Description**: Investigate and implement strategi…

xleonplayz updated 6 days ago
1
fredzzhang/pvic #60

Training error

Hi. I'm trying to train SWIN-L backbone based model on hico-det. ``` # Training DETR=advanced python main.py --backbone swin_large --use-checkpoint \ --drop-path…

YangJae96 updated 2 days ago
2
FlagOpen/FlagEmbedding #1134

Question Regarding bge-m3 Model Loading

I recently attempted to use the BGE-M3 model by loading it with SentenceTransformer. However, I noticed suboptimal performance. I observed that the sample code on Huggingface uses the loading method `…

zhongxifang updated 1 month ago
2
openxla/xla #18611

XLA:CPU performance regression with the min alignment change…

I'm observing performance regressions for BERT and BART model inference with JAX mainline compared to jax-v0.4.34 on both x86 and arm64 CPU platforms. The performance drop is around 50%. I have root-c…

snadampal updated 1 day ago
3
UTSAVS26/PyVerse #1084

[Code Addition Request]: COVID Detection from CXR Using Expl…

### Have you completed your first issue? - [X] I have completed my first issue ### Guidelines - [X] I have read the guidelines - [X] I have the link to my latest merged PR ### Latest Merged PR Lin…

inkerton updated 1 week ago
1
huggingface/optimum #2083

Please don't kill BetterTransformer — 1.88x faster inference…

### Feature request I would like to request that BetterTransformer not be deprecated. ### Motivation I have come to rely on BetterTransformer significantly for accelerating RoBERTa and BERT models.…

umarbutler updated 2 weeks ago
1
CVIU-CSU/HRDecoder #1

Clarification on Class Handling and Background Processing

Hi! I hope you're doing well. I have a couple of questions regarding the class handling in your code. In lesion_dataset.py and hr_idrid_2880x1920-slide.py, the classes are defined as ['bg', 'EX…

ChubbyPear updated 3 days ago
2
