-
Dear @dfalbel I have bought a new MacBook Air with the M3 chip which has 8 CPUs, 10 GPUs and 16GB integrated memory. My R `torch` apps are crashing. I have put together a MWE which works on all other …
-
https://arxiv.org/abs/2009.01411
-
I'm trying to merge some embedding models with this config file. the architectures are similar but I think it is erroring out on some names of layers? Would love some suggestions on how to change the …
-
Hi,
Great work on this! Is Mistral supported? Right now I only see GPT-J and Llama 2.
Thank you!
-
### 🐛 Describe the bug
######################################
# #
# Retrieving MEMIT hyperparameters #
# #
################…
-
Running:
```
hf_username="Trelis"
new_model_name="Meta-Llama-3-8B-Instruct-Gaeilge"
if True: model.push_to_hub_merged(f"{hf_username}/{new_model_name}", tokenizer, save_method = "merged_16bit")
`…
-
Hi! Thanks for sharing the great work!
I have some questions about PoolFormer.
If I explain PoolFormer like the following attachments, can I say PoolFormer is just a non-trainable MLP-like model?
…
-
This is a very interesting library and I want to try this for my project. I wanted to know
if it's possible to have a Graph Neural Network example in the tutorials?
-
I tried several different base models based on 1.5. Pasted the following in `Path_to_HuggingFace`, no path or link. `1.5` selected as custom model version:
- darkstorm2150/Protogen_v5.3_Official_Rele…
-
## Instructions To Reproduce the Issue:
I have trained a pretrained ImageNet resnet on a custom dataset with 12 classes.
For training I used following yaml file:
[training_yaml_file](https://git…