-
as above
-
Has anyone trained from scratch, without using the pre-trained ViT-B weights?
Could the author or anyone else release the training log from a from-scratch run?
The author said it takes more epochs to train fro…
-
I'm trying to train mamba2 130m from scratch.
```
config = Mamba2Config(
vocab_size=len(tokenizer.vocab),
n_positions=10,
n_embd=768,
…
-
I've been trying to train a diffusion model with the Stable Audio 1.0 config. I also trained the autoencoder with the Stable Audio 1.0 VAE for 50k steps [autoencoder result](https://storage.googleapis.com/…
-
Hi @pabloferz!
Following your kind invitation, here's a prototype of what I would like to achieve in [DifferentiationInterfaceJAX.jl](https://github.com/gdalle/DifferentiationInterfaceJAX.jl): call…
-
This would be another section in the Biff docs, underneath "Content Library." Write a series of guides that show you how to make a web app from scratch _without_ using Biff, but generally using the sa…
-
### Update disclaimer
- [X] Yes, I have checked and my request is not related to the game updating and plugins not working correctly.
### Platform
Windows
### Ask your question here
So my old dis…
-
What year are you gonna do that? I'll be excited and rooting for you.
-
# Neural networks from scratch
It's very difficult to understand what a neural network is without coding one. We're gonna do that here.
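
As a rough idea of what "coding one" could look like, here is a minimal sketch of the smallest possible case: a single sigmoid neuron trained with plain gradient descent to learn logical AND. Everything in it (the function names, the AND dataset, the learning rate, and the step count) is an assumption made for illustration and is not taken from the planned content.

```python
import math


def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Forward pass of one neuron: weighted sum of inputs, then sigmoid."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def mean_loss(weights, bias, inputs, targets):
    """Average binary cross-entropy over the dataset."""
    total = 0.0
    for x, t in zip(inputs, targets):
        p = predict(weights, bias, x)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(inputs)


def train(inputs, targets, lr=1.0, steps=5000):
    """Full-batch gradient descent, starting from all-zero parameters."""
    weights = [0.0] * len(inputs[0])
    bias = 0.0
    n = len(inputs)
    for _ in range(steps):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, t in zip(inputs, targets):
            # For sigmoid + cross-entropy, d(loss)/d(pre-activation) = p - t.
            err = predict(weights, bias, x) - t
            for i, xi in enumerate(x):
                grad_w[i] += err * xi / n
            grad_b += err / n
        weights = [w - lr * g for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b
    return weights, bias


# Hypothetical toy dataset: the truth table of logical AND.
AND_INPUTS = [(0, 0), (0, 1), (1, 0), (1, 1)]
AND_TARGETS = [0, 0, 0, 1]

weights, bias = train(AND_INPUTS, AND_TARGETS)
predictions = [round(predict(weights, bias, x)) for x in AND_INPUTS]
print(predictions)
```

A single neuron can only separate classes with a straight line, so this works for AND but would not work for XOR. That limitation is one way to motivate adding a hidden layer later on.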
## Learning objectives
## Content to cover
## Capsto…
-
Hi,
I have gone through the steps to install and load up geoportal server for the first time.
When the site loads, I get an error in red which reads
![image](https://github.com/user-attachments/a…