-
Hello,
First of all, thank you for your work on this library. I am using it to integrate a local LLM and I have encountered some strange behavior.
I would like to know if it is necessary to manu…
-
To repro:
Install the latest versions of unsloth and transformers
```
!pip uninstall unsloth -y && pip install --upgrade --no-cache-dir "unsloth[colab-new] @ git+https://github.com/unslothai/unslot…
-
I expected a training configuration with per_device_train_batch_size=1 and gradient_accumulation_steps=32 to yield the same (or a similar) result as per_device_train_batch_size=32 and gradient_accumulat…
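To make the comparison concrete, here is a minimal sketch of the two configurations (assuming plain `transformers` `TrainingArguments` for brevity, even though the actual repro installs unsloth; the output directories are placeholders). Both settings give the same effective batch size of 32, which is why I expected the loss curves to match closely, modulo how the loss is averaged across accumulation steps.
```python
# Sketch only: plain transformers TrainingArguments, placeholder output dirs.
from transformers import TrainingArguments

# Config A: micro-batch of 1, accumulate gradients over 32 steps.
config_accum = TrainingArguments(
    output_dir="out-accum",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=32,
)

# Config B: micro-batch of 32, no accumulation.
config_batch = TrainingArguments(
    output_dir="out-batch",
    per_device_train_batch_size=32,
    gradient_accumulation_steps=1,
)

def effective_batch_size(args: TrainingArguments) -> int:
    return args.per_device_train_batch_size * args.gradient_accumulation_steps

# Both configs process 32 examples per optimizer step.
assert effective_batch_size(config_accum) == effective_batch_size(config_batch) == 32
```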
-
Type: Bug
As we integrate our extension, vscode-jest, with the newly available Testing Coverage API, we've identified some issues that notably impact usability. Below, I provide some video demonstr…
-
Hello, I’ve been following your work recently. Based on the configurations in your repo, it seems that REBEL issues twice as many reward queries as DDPO, since REBEL uses two sampling traces per batc…
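To spell out the arithmetic I have in mind (the batch size here is a placeholder, and I am assuming each sampled trace is scored by the reward model exactly once per update):
```python
# Toy sketch: hypothetical batch size; assumes one reward-model call per sampled trace.
batch_size = 64

ddpo_queries_per_update = 1 * batch_size   # one sampling trace per batch
rebel_queries_per_update = 2 * batch_size  # two sampling traces per batch

# Under these assumptions, REBEL makes twice the reward queries per update.
assert rebel_queries_per_update == 2 * ddpo_queries_per_update
```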
-
How does MultipleNegativesRankingLoss behave when used with gradient accumulation steps?
According to the [docs](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#mult…
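To make my question concrete, here is a minimal sketch of the two settings I am comparing, assuming the sentence-transformers v3 Trainer API (the model name, toy dataset, and output paths are placeholders). My understanding is that the in-batch negatives in each forward pass come only from the per-device micro-batch, which is why I am unsure whether run A can ever match run B.
```python
# Sketch only (assumes sentence-transformers >= 3.0; toy data, placeholder paths).
from datasets import Dataset
from sentence_transformers import (
    SentenceTransformer,
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
    losses,
)

# Toy (anchor, positive) pairs; MultipleNegativesRankingLoss treats the other
# positives in the same micro-batch as negatives for each anchor.
train_dataset = Dataset.from_dict({
    "anchor": [f"question {i}" for i in range(256)],
    "positive": [f"answer {i}" for i in range(256)],
})

# Run A: micro-batch 1, accumulate 32 -> 0 in-batch negatives per anchor.
# Run B: micro-batch 32, no accumulation -> 31 in-batch negatives per anchor.
for run_name, batch_size, accum_steps in [("accum", 1, 32), ("batch", 32, 1)]:
    model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
    args = SentenceTransformerTrainingArguments(
        output_dir=f"out-{run_name}",
        per_device_train_batch_size=batch_size,
        gradient_accumulation_steps=accum_steps,
        num_train_epochs=1,
    )
    trainer = SentenceTransformerTrainer(
        model=model,
        args=args,
        train_dataset=train_dataset,
        loss=losses.MultipleNegativesRankingLoss(model),
    )
    trainer.train()
```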
-
### System Info
- `transformers` version: 4.43.0.dev0
- Platform: Linux-5.4.0-167-generic-x86_64-with-glibc2.35
- Python version: 3.10.14
- Huggingface_hub version: 0.23.4
- Safetensors version: …
-
This issue comes from https://github.com/grafana/loki/pull/13881#pullrequestreview-2237990726:
> I see now what I did wrong. The stats, warnings etc are joined in Downstream [here](https://github.com…
-
This can lead to false negatives because the threshold is overly relaxed.
```diff
diff --git a/tests/cpp/test_gpu_fused_reduction.cpp b/tests/cpp/test_gpu_fused_reduction.cpp
index e67875f4..b3923d6…
-