-
### System Info
Environment:
OS: Ubuntu 24.04
Python version: 3.11.8
Transformers version: transformers==4.45.2
Torch version: torch==2.3.0
Model: Meta-Llama-3.1-70B-Q2_K-GGUF - https://hugg…
-
Hello, @YTianZHU. I read the Differential Transformer paper and found it very interesting.
Thank you so much for your work.
I was wondering how you visualized the attention scores in Figure 1:
![Ima…
-
### Describe the bug
I installed text-generation-webui and downloaded the model (TheBloke_Yarn-Mistral-7B-128k-AWQ), but I can't run it. I chose Transformers as the model loader. I tried installing autoawq b…
-
## Describe the bug
I am trying to run the Geneformer example on the provided test data, as explained in the [tutorials](https://chanzuckerberg.github.io/cellxgene-census/notebooks/analysis_demo/comp_bio_geneform…
-
**Is your feature request related to a problem? Please describe.**
GGUF is becoming the mainstream method for large model compression and accelerated inference. Transformers currently supports the lo…
-
Hello,
I am very interested in your research and am currently trying to run some experiments based on it. However, I encountered an issue while running the program from the HuggingFace_EncDec directo…
-
Original Repository: https://github.com/ml-explore/mlx-examples/
This issue lists examples from there that would be nice to have. We don't expect the models to work the moment they are translated to …
-
Currently the exporter does not handle higher order ops. Autocasts are expressed as HOPs:
```python
# Inside the ExportedProgram
...
class submod_1(torch.nn.Module):
    def forward(self, e…
-
PixArt model: https://huggingface.co/PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
I tried using EvilEdit to attack PixArt, but it didn't work well.
It fails with a shape error:
RuntimeError: mat1 and mat2 shapes…
-
### Model description
I have a custom SentenceTransformer model implemented as a custom (and quite nested) class, so at the top level the modules.json file looks like this:
```
[
  {
    "idx": 0,
    "n…