-
Sorry to bother you again, but where can I find your model weights? Will you release them?
* Conifer-7B-SFT
* Conifer-7B-DPO
-
For a fair comparison with the baseline in test_transformer_engine.py.
I tried the following briefly and hit big number mismatches.
```patch
diff --git a/tests/cpp/test_multidevice_transformer.cpp…
-
**Feature Overview**
This Feature card is for transitioning our model training infrastructure from DeepSpeed to PyTorch's Fully Sharded Data Parallel (FSDP) to enhance training metrics visibility, bro…
-
Thanks for sharing your code. When I loaded the code weights, I found that the dimensions were wrong, but I strictly followed your code to load.
`
def get. net():
nun_classes = 7
anc…
-
Hello!
we are looking for a pretrained 9x9 model, and it will be used as a baseline in our paper.
Can I get some help from you? Any response will be appreciated :)
-
I run convert.py to convert albert tensorhub model to TF2.0 model with following commands
```shell
MODEL_DIR=albert-base
SIZE=base
# Converting weights to TF 2.0
python converter.py --tf_hub_…
-
**Describe the bug**
I did `ilab model train`, but `ilab model test` failed with
```
OTE: Adapter file does not exist. Testing behavior before training only. - /Users/ahmedazraq/Library/Application…
-
Hi I get this error/warning
```
stable-diffusion-webui-1 | /sd-webui/modules_forge/patch_basic.py:38: FutureWarning: You are using `torch.load` with `weights_only=False` (the current default va…
-
The ``encoder_weights`` parameter in the model initializers is a bit ambiguous/strange, especially given its lax type hint.
At least to me, it looked like you could pass strings to represent other …
-
The `generate.py` script won't run because the weights on hugging face are incompatible with the model architecture in the repository.
Here's a greatly simplified part of the file `generated.py`.
…