-
### Model description
https://github.com/noanabeshima/tiny_model
It's a small language model trained on TinyStories for interpretability with sparse autoencoders and transcoders added. It has no…
-
**I am running the following command for testing the pre-trained model on ROAD dataset inside a docker continer.**
python3 main.py /workspace/ /workspace/ /workspace/kinetics-pt/ --MODE=gen_dets -…
-
Thank you very much for your work. I encountered an issue and would like to ask for your assistance. I trained using the configuration file SGG-Benchmark-main/configs/VG150/e2e_relation_yolov8m.yaml, …
-
### System Info
GPU: `A10`
Base Image: `FROM nvidia/cuda:12.1.0-runtime-ubuntu22.04`
Tensorrt-llm:
- `0.12.0` : It's working, but I can't use it because of a version mismatch in TRT and trt-llm-back…
-
Hi there! I have tried the weight of decoder you provided here:
[WM weights of latent decoder](https://dl.fbaipublicfiles.com/ssl_watermarking/sd2_decoder.pth)
and I generate an image using code pro…
-
If you take a look at the weights of the learned positional embedding in THUDM/CogVideoX-5b-I2V, you will find that the mean is close to 0 and standard deviation is very low. This is to say that the w…
-
@kohya-ss @lansing @rockerBOO @akx @tsukimiya With the following configuration, multi-GPU training works properly, and the results are normal.
Does sd-scripts not support DeepSpeed acceleration? Cou…
-
### System Info
GPU: 4090
Tensorrt: 10.3
tensorrt-llm: 0.13.0.dev2024081300
### Who can help?
@Tracin May you please have a look, thank you very much
### Information
- [ ] The official example sc…
-
Hi, thanks for developing such a comprehensive tool for using sequence models.
I was wondering where I could find the set of methods I should include to be able to use your functions on a model th…
-
We want to be able to train a new model from existing model weights. We need code so that we can grab this from the `.ckpt` state dict and start a new model from these weights.
preferably we can pr…