-
@awaelchli I found that in `pretrain.py`, the gradient accumulation steps are calculated from the global batch size, the number of devices, and the micro batch size.
This works fine in a single-node setting, e.g. glo…
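A minimal sketch of how such a derivation typically looks; the function name and the divisibility check are assumptions for illustration, not code from `pretrain.py`:

```python
def accumulation_steps(global_batch_size: int, num_devices: int, micro_batch_size: int) -> int:
    """Hypothetical sketch: derive gradient accumulation steps from batch sizes.

    Each optimizer step processes num_devices * micro_batch_size samples per
    micro-step, so the accumulation count is the ratio to the global batch size.
    """
    per_micro_step = num_devices * micro_batch_size
    if global_batch_size % per_micro_step != 0:
        raise ValueError("global batch size must be divisible by num_devices * micro_batch_size")
    return global_batch_size // per_micro_step
```

For example, a global batch of 512 on 8 devices with micro batch 4 yields 16 accumulation steps; in a multi-node setting the device count would need to cover all nodes, which is where a single-node assumption breaks.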
-
Hi! While training on multiple GPUs with gradient accumulation steps > 1, there is no substantial speedup relative to a single GPU (there is a speedup when the value equals 1). I found the followin…
dprze updated
2 months ago
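One common cause of this symptom (an assumption about the setup, not something confirmed by the report) is that DDP all-reduces gradients on every micro-step instead of only at the accumulation boundary; wrapping non-boundary steps in `no_sync()` suppresses the intermediate reductions. A self-contained sketch using a stand-in model that only counts synchronizations:

```python
from contextlib import contextmanager, nullcontext

class FakeDDPModel:
    """Stand-in for torch.nn.parallel.DistributedDataParallel that only
    counts gradient synchronizations (all-reduces)."""
    def __init__(self):
        self.sync_enabled = True
        self.syncs = 0

    @contextmanager
    def no_sync(self):
        # Mirrors DDP's no_sync(): disable gradient all-reduce inside the block.
        self.sync_enabled = False
        try:
            yield
        finally:
            self.sync_enabled = True

    def backward_step(self):
        if self.sync_enabled:
            self.syncs += 1  # one all-reduce across ranks

def run_micro_steps(model, num_micro_steps, accumulation_steps, use_no_sync):
    """Run micro-steps, synchronizing only on accumulation boundaries."""
    for i in range(num_micro_steps):
        boundary = (i + 1) % accumulation_steps == 0
        ctx = model.no_sync() if (use_no_sync and not boundary) else nullcontext()
        with ctx:
            model.backward_step()
    return model.syncs
```

With 8 micro-steps and 4 accumulation steps, the `no_sync` path performs 2 all-reduces instead of 8; without it, the communication cost per sample is the same as with accumulation steps = 1, which would explain the missing speedup.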
-
Since Ada GPUs like the 4090 restrict FP8 arithmetic to `fp32` accumulation, it only achieves the same peak `TFLOPs` as `fp16xfp16` with `fp16` accumulation.
Furthermore, according to my test,…
-
Using our launcher and the latest pull of our pretrain repo you can run a Llama3 70B model as follows. Thanks to @AleHD for getting activation recompute and async working.
```
(export DP=1 PP=4 BA…
```
-
Evaluations are being run, _but no validation loss is logged or sent to WandB_
The console shows that eval is running, but displays a table along the lines of:
| eval loss | validation loss |
|…
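A hypothetical sketch of the kind of guard that can produce this symptom: the metric is computed and rendered in the console table, but the logging path silently drops values that are missing or non-finite, so nothing reaches WandB. All names here are illustrative, not from the repo:

```python
import math

def collect_loggable_metrics(eval_loss, val_loss):
    """Illustrative: build the metrics dict that would be sent to a logger.

    A filter like this lets the console table show a row while the logger
    receives nothing for a NaN or missing validation loss.
    """
    metrics = {}
    for name, value in (("eval loss", eval_loss), ("validation loss", val_loss)):
        if value is not None and math.isfinite(value):
            metrics[name] = value
    return metrics
```

If the validation loss is `NaN` or `None` at that point, only the eval loss survives the filter, matching a table that prints but a log entry that never arrives.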
-
Hello, I am trying to persist my IoT data with Cygnus LD, connecting it to PostgreSQL.
I ran the following command to create the subscription:
```
curl -L -X POST 'http://localhost:…
```
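For reference, a minimal NGSI-LD subscription that points notifications at a Cygnus endpoint typically looks like the sketch below. The entity type, broker port, and the Cygnus host and notification port are placeholders (check the Cygnus NGSI-LD agent configuration for the actual port), not values from the setup above:

```shell
curl -L -X POST 'http://localhost:1026/ngsi-ld/v1/subscriptions/' \
  -H 'Content-Type: application/ld+json' \
  -d '{
    "description": "Notify Cygnus of changes to any Device entity",
    "type": "Subscription",
    "entities": [{"type": "Device"}],
    "notification": {
      "endpoint": {
        "uri": "http://cygnus:5055/notify",
        "accept": "application/json"
      }
    },
    "@context": "https://uri.etsi.org/ngsi-ld/v1/ngsi-ld-core-context.jsonld"
  }'
```

Once the broker accepts the subscription, each matching entity update is POSTed to the Cygnus `notify` endpoint, and Cygnus handles the insert into PostgreSQL.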
-
```
What steps will reproduce the problem?
1. run the 6dofhead
2. produce abrupt head motion
3. head angles will accumulate errors
What version of the product are you using? 0.7
On what operating sy…
```
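The drift described above is typical of pure gyro integration: small rate errors accumulate without bound, and abrupt motion amplifies them. A common remedy, sketched here as an assumption about a possible fix rather than the project's actual code, is a complementary filter that blends the integrated gyro angle with an absolute reference such as an accelerometer tilt estimate:

```python
def complementary_filter(angle, gyro_rate, accel_angle, dt, alpha=0.98):
    """Blend gyro integration (accurate short-term) with an absolute
    reference angle (stable long-term) so errors stop accumulating.

    angle:       current fused angle estimate (degrees)
    gyro_rate:   measured angular rate (degrees/second), may be biased
    accel_angle: drift-free reference angle from the accelerometer (degrees)
    """
    return alpha * (angle + gyro_rate * dt) + (1.0 - alpha) * accel_angle
```

With a constant gyro bias, pure integration drifts linearly forever, while the fused estimate settles at a small bounded offset near the reference angle, which is the behavior the accumulated-error report is missing.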