-
Hi @beckdaniel,
I would like to try running the model, but I can't do that using the README.md alone. I think I need the complete steps (2 Training models, 3 Decoding, postprocessing and evaluation…
-
### System Info
Hi everyone, when trying to update from Llama 3 8B Instruct to Llama 3.1 8B Instruct, I noticed a crash:
```bash
Args {
model_id: "meta-llama/Meta-Llama-3.1-8B-Instruct",
…
```
-
Hi, thank you for publishing such a nice paper. I just have one question. I do not understand the subtraction of the input in eqn.4. Is it necessary? What will happen if we just do the average pooling…
-
The current sparse support is pretty nice! However, there are some important collation operations that cannot be efficiently composed out of matmul or operations on _values(). One that is in …
-
Post your questions here about: “[Training and Taming Deep Networks](https://docs.google.com/document/d/1gne-oWcJs1p5sEjUumapq6HKeaOet3EHxJ-Ij0LuTro/edit?usp=sharing)” & “[The Expanding Universe of De…
-
**Describe the bug**
I tried converting [epicphotogasm_lastUnicorn](https://civitai.com/models/132632/epicphotogasm) at 768x768 or 1024x1024 and the conversion fails. The model converted successfu…
-
Embedding Guile Scheme
gnu guile snarf
I'm afraid I don't have enough context to fully understand your query "gnu guile snarf". However, let me provide some potentially relevant informatio…
-
Post questions here for one or more of our fundamentals readings:
Fortunato, Santo. 2010. “[Community Detection in Graphs.](https://www.sciencedirect.com/science/article/pii/S0370157309002841)” Phy…
-
Hi
Is there a way to convert pretrained `returnn` networks to `ONNX`, or at least save the network in TensorFlow's `SavedModel` format?
Best
Musharraf