model-weights Search Results

1000+ results
for model-weights

huggingface/transformers #31804

TinyModel addition

### Model description https://github.com/noanabeshima/tiny_model It's a small language model trained on TinyStories for interpretability with sparse autoencoders and transcoders added. It has no…

noanabeshima updated 1 month ago
6
gurkirt/3D-RetinaNet #17

No such file or directory: '/workspace/road/cache/resnet50I3…

**I am running the following command to test the pre-trained model on the ROAD dataset inside a Docker container.** python3 main.py /workspace/ /workspace/ /workspace/kinetics-pt/ --MODE=gen_dets -…

rafayaamirgull updated 1 week ago
2
Maelic/SGG-Benchmark #40

Non-existent config key: MODEL.ROI_RELATION_HEAD.USE_SPATIAL…

Thank you very much for your work. I encountered an issue and would like to ask for your assistance. I trained using the configuration file SGG-Benchmark-main/configs/VG150/e2e_relation_yolov8m.yaml, …

wudibadan updated 4 days ago
3
NVIDIA/TensorRT-LLM #2419

Assertion failed: Must set crossKvCacheFraction for encoder-…

### System Info GPU: `A10` Base Image: `FROM nvidia/cuda:12.1.0-runtime-ubuntu22.04` Tensorrt-llm: - `0.12.0` : It's working, but I can't use it because of a version mismatch in TRT and trt-llm-back…

Saeedmatt3r updated 1 day ago
2
facebookresearch/stable_signature #29

Message decoding problem using weight provided

Hi there! I tried the decoder weights you provided here: [WM weights of latent decoder](https://dl.fbaipublicfiles.com/ssl_watermarking/sd2_decoder.pth) and generated an image using code pro…

LiRunyi2001 updated 3 weeks ago
17
a-r-r-o-w/cogvideox-factory #26

Allowing training without the learned positional embeddings …

If you take a look at the weights of the learned positional embedding in THUDM/CogVideoX-5b-I2V, you will find that the mean is close to 0 and standard deviation is very low. This is to say that the w…

a-r-r-o-w updated 2 weeks ago
13
kohya-ss/sd-scripts #1591

error when use deepspeed for FLUX.1 fine-tuning

@kohya-ss @lansing @rockerBOO @akx @tsukimiya With the following configuration, multi-GPU training works properly, and the results are normal. Does sd-scripts not support DeepSpeed acceleration? Cou…

huxian0402 updated 2 weeks ago
2
NVIDIA/TensorRT-LLM #2392

Qwen2-72B w4a8 empty output

### System Info GPU: 4090 Tensorrt: 10.3 tensorrt-llm: 0.13.0.dev2024081300 ### Who can help? @Tracin May you please have a look, thank you very much ### Information - [ ] The official example sc…

lishicheng1996 updated 1 day ago
4
Genentech/gReLU #47

guidelines to implement custom models

Hi, thanks for developing such a comprehensive tool for using sequence models. I was wondering where I could find the set of methods I should include to be able to use your functions on a model th…

MiqG updated 4 weeks ago
1
AllenCell/allencell-segmenter-ml #367

[1.1]create function to copy state dict for model weights

We want to be able to train a new model from existing model weights. We need code so that we can grab this from the `.ckpt` state dict and start a new model from these weights. preferably we can pr…

yrkim98 updated 1 month ago
1
