-
Hi all,
I was wondering whether it is possible to do selective activation checkpointing with the LayerNormMLP where we only recompute FFN1 and not FFN2, therefore not having to save the ffn1_out an…
-
### Describe the issue
Hi there, thank you for sharing this awesome project.
I have one question about the requried packages for multi-image and multi-prompt generation.
In the following link…
-
### Feature request
transformer js should support video classification as it support image classification so video classification will make this library somewhere complete.
### Motivation
i want to…
-
**Description**
Please consider adding Core ML model package format support to utilize Apple Silicone Nural Engine + GPU.
**Success Criteria**
Utilize both ANE & GPU, not just GPU on Apple Sili…
-
Hey! Awesome work on this project! I know it's not technically vanilla Mamba but I've been trying to convert the new SSM-Transformers Jamba into MLX for more efficient training and usability but am ha…
-
**Description**
Please consider adding Core ML model package format support to utilize Apple Silicone Nural Engine + GPU.
**Success Criteria**
Utilize both ANE & GPU, not just GPU on Apple Sili…
-
When an empty file is referenced, this error is thrown.
```
error: bundling failed: Error: No input specified: provide a file name or a source string to process
at Object.module.exports.rende…
-
Hi,
Thank you for your awesome work! I would like to know if there is any plan on supporting vision-based Transformer? As transformers are becoming popular in vision tasks, I believe this will be a…
-
Hi, this is awesome work. I'm wondering if there is a minimal way to integrate megablocks into transformers codebase for the mixtral architecture?
Would simply replacing the [`MixtralSparseMoeBlock…
-
Hi!
Let's bring the documentation to all the Spanish-speaking community 🌐
Who would want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/transformers/blob/m…