-
Training used to work on my computer, though I hadn't tried it in several months. I've occasionally done some sampling, but no training since July (largely because my CPU was overheating, and CUDA is…
-
### System Info
- NVIDIA A100 80G * 2
- Libraries
- TensorRT-LLM: 0.11.0.dev2024052800
- Driver Version: 525.105.17
- CUDA Version: 12.4
### Who can help?
@byshiue @schetlur-nv
##…
-
Hi,
I can't find the implementation of this model that was added recently to the [results](https://github.com/codekansas/keras-language-modeling/blob/master/results.notes), and that appears to perfor…
-
Following the method described in README.md, I ran:
```shell
#python3 main.py --train=True --clean=True --model_type=bilstm
```
The following problem occurred:
```powershell
Traceback (most recent call last):
  File "main.py", line 228, in <module>
…
```
-
While using `sample.lua` to generate text to inspect how the quality is changing between checkpoints, it would be nice if each character could be printed as it's generated (like in `char-rnn`) so you …
gwern updated
7 years ago
-
@jmvalin, thank you for sharing your code on GitHub.
I have a question about the 8x4 block computation in vec_avx.h.
(My question is based on ```lpcnet_efficiency``` branch after reading the paper, [NEU…
-
Sorry, I'm new to training LSTMs. Would you kindly provide your training parameters? I can't reproduce the results you reported. Or maybe that's because you didn't provide the arithmetic coding block?
-
Running the tinyshakespeare dataset with the default settings, I get timings of around 0.3s/iteration with CPU, but using the OpenCL backend I get more like 2.6s/iteration. These timings seem to be si…
mewo2 updated
6 years ago
-
```
luajit: ...bio/torch/install/share/lua/5.1/nn/ClassNLLCriterion.lua:50: bad argument #1 to 'ClassNLLCriterion_updateOutput' (torch.CudaTensor expected, got nn.ClassNLLCriterion)
stack traceback:
…