-
The issue started with https://github.com/openxla/xla/pull/18052
Error log:
```
Per train step:
Total TFLOPs: 377.53
split as 86.02% learnable weight flops and 13.98% attention flops
jax.er…
-
We are currently using xgboost 1.6.2 and are trying to upgrade to 2.1.1. Stepping through the intermediate versions, we observed the following average prediction times:
1.6.2: 15ms
1.7.6: 17ms
2.0.3: 43ms
…
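A regression like this is easiest to pin down with the same timing script run under each pinned xgboost version. A minimal sketch of such a harness, assuming the real `predict` callable would come from a loaded `xgboost.Booster` (here any callable stands in for it; the names are illustrative):

```python
import time
import statistics

def average_predict_ms(predict, batch, repeats=100):
    """Time `predict(batch)` `repeats` times and return the mean latency in ms."""
    # Warm-up call so one-time setup cost (thread pools, JIT, caches) is excluded.
    predict(batch)
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        predict(batch)
        samples.append((time.perf_counter() - start) * 1e3)
    return statistics.mean(samples)
```

Running this unchanged under each environment (e.g. `pip install xgboost==1.7.6`, then `==2.0.3`) keeps everything but the library version fixed, so the measured difference can be attributed to the upgrade.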
-
I don't know how to run this.
-
Hi, great work!
I have two questions about the visualization of data distributions from different sources in Fig. 1.
Q1: Is the generated data visualized here learned from the corresponding data source? Fo…
-
### Describe the desired feature
Currently, hard-coded indices are used to pick trainers, so it only really works with the original 6.
Indices are hard-coded to 1 to 6, with special handling of inde…
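One way the fix could look: derive the valid index range from a registry instead of baking in 1..6. A hypothetical sketch (the names `TRAINERS` and `pick_trainer` are illustrative, not from the project):

```python
# Registry of trainers keyed by index; adding an entry here is the only
# change needed to support a new trainer, instead of editing hard-coded 1..6.
TRAINERS = {i: f"trainer_{i}" for i in range(1, 9)}  # e.g. 8 trainers

def pick_trainer(index):
    """Look up a trainer by index, failing loudly on unknown indices."""
    if index not in TRAINERS:
        raise ValueError(
            f"unknown trainer index {index}; valid indices: {sorted(TRAINERS)}"
        )
    return TRAINERS[index]
```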
-
When I run train.py to train on the Davis dataset, I set input_dim_drug in the config file to 212, as instructed by the author. But then a runtime error occurs:
RuntimeError: CUDA error: device-si…
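A "device-side assert triggered" error in a setup like this is frequently an out-of-range index, e.g. a drug token id greater than or equal to input_dim_drug reaching an embedding lookup; rerunning with `CUDA_LAUNCH_BLOCKING=1` surfaces the real failing kernel. A CPU-side pre-check sketch (hypothetical helper, assuming the token ids are plain Python ints):

```python
def check_index_range(indices, vocab_size):
    """Raise a clear Python-side error before an embedding lookup would hit a
    device-side assert (which fires when an index >= num_embeddings or < 0)."""
    bad = [i for i in indices if not (0 <= i < vocab_size)]
    if bad:
        raise ValueError(
            f"{len(bad)} indices out of range for vocab_size={vocab_size}, "
            f"e.g. {bad[0]}"
        )
```

If this check fails on the Davis inputs, the fix is usually to raise the vocabulary dimension in the config rather than to change the data.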
-
### Problem Description
Based on the decision tree used in the Test to select a kernel, I want to see how the features in the YAML file are trained.
### Operating System
Centos
### CPU
AMD
### G…
-
I am still very new to LLMs. I have access to a large number of GPUs, and I would like to train this model across multiple GPUs (though I am not sure whether this is necessary or overkill). Previously…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `Accelerate` version: 0.34.2
- Platform: Linux-6.5.0-35-generic-x86_64-with-glibc2.35
- `accele…
-
Hey @Littleor, I've been trying your training script here but am not getting great results so far.
Are you able to successfully train things into a token?
I've been running with the following comm…