on-device-training Search Results

1000+ results
for on-device-training

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #133549

Dtensor shard uses more gpu memory than raw tensor

### 🐛 Describe the bug Dtensor shard uses more gpu memory than raw tensor. With test, Shard gpu mem: 21890MiB > Replicate gpu mem: 17448MiB > Raw tensor gpu mem: 16804MiB. Confused for a long time…

v4if updated 3 weeks ago
5
scverse/scvi-tools #1801

Posterior sampling with Messenger Pyro guides -> huge memory…

Hi Posterior sampling with Messenger Pyro guides does not remove observed variables leading to huge memory use. https://github.com/scverse/scvi-tools/blob/main/scvi/model/base/_pyromixin.py#L184 …

vitkl updated 2 months ago
5
pytorch/xla #7102

Problem with mesh shape in HybridMesh on TPU

## ❓ Questions and Help I recived error when try create sqmd mesh on kaggle notebook when flow [Huggingface optimum-tpu](https://github.com/huggingface/optimum-tpu/blob/695ee84d657d9ed2761fcf481685af…

manh3152924 updated 3 months ago
12
atkm/avazu-ctr #4

Models don't perform as expected when trained on a large tra…

From Kaggle submissions (private scores): Model 3: - Half of train.csv: 0.4241803 - train_small.csv: 0.4123864 Model 1: - Half of train.csv: 0.4172254 - train_small.csv: 0.4168748 Run an …

atkm updated 5 years ago
2
DIT112-V20/group-06 #54

Network Service Discovery

The mDNS on the SmartCar cannot be picked up by the android app by using `djsmartcar.local`. One of the two following solutions has to be implemented in order for the app to find the SmartCar on the l…

JenniNord updated 4 years ago
1
leaderj1001/Attention-Augmented-Conv2d #29

How long will it cost for training AA-Wide-ResNet on CIFAR10…

How long will it cost for training AA-Wide-ResNet on CIFAR100 dataset? And can you share your training device with me? Thank you!

XYZ-916 updated 2 years ago
1
asumanc/Machine-Learning-Practice #2

ShuffleNet with Keras

This task is about an algorithm is designed for mobile devices which is [ShuffleNet](https://arxiv.org/pdf/1707.01083.pdf). This task asks you to implement ShuffleNet with Keras. Also, you need to sav…

erolrecep updated 5 years ago
10
huggingface/transformers #33086

LR = 0 when using DeepSpeed Config and LORA on Trainer.

### System Info If DeepSpeed Config has optimizer/scheduler/fp16 config,will showing warning and **loss** Not Converges in training: tried to get lr value before scheduler/optimizer started steppi…

youningnihaobang updated 2 weeks ago
3
Lightning-AI/lightning-thunder #756

Segmentation fault for fp8 and thunder_cudnn

## 🐛 Bug For a few models ( Platypus-30B with FSDP zero3, Gemma7b with DDP and vicuna-33b-v1.3 with FSDP zero3) we get segmentation fault error when trying to use fp8 with thunder_cudnn. When usi…

mpatel31415 updated 1 week ago
3
yhenon/pytorch-retinanet #197

Sporadic message : input tensors must be on the same device.…

During training i have a sporadic message : input tensors must be on the same device. Received cpu and cuda:0 At the end it does not have any influence on the training, it works correctly, i'm just w…

wvalcke updated 3 years ago
8

上一页 1...91 92 93 94 95 96 97...100 下一页

1000+ results for on-device-training

1000+ results
for on-device-training