-
### Describe the bug
I have tested PixArt-Sigma with following code, where I load text_encoder separately since I will fine-tune it in later. I found `T5EncoderModel.from_pretrained(torch_dtype=tor…
-
Is there any model based on or supporting Transformer-XL architecture in tensor2tensor? Architecture is referenced with paper: https://arxiv.org/abs/1901.02860
Title: Transformer-XL: Attentive Langua…
-
The recipe for rare power transformer XL is wrong, it now require 2401 maraging steel on top of all the magnet and components.
I believe your recipe says 343.
![RPTXL](https://user-images.githubuser…
-
Hello,
In order to use the open-source oasis model effectively, some necessary information is missing:
1. What is the maximum sequence length that the oasis500m model was trained on?
2. Was the m…
-
Hi every body,
I want to use external language model for ASR. according to documentation found in openseq2deq page, it must be launched in 2 steps:
1. run inference to dump logits to pickle.
2. …
-
## Description:
Hello! I’ve been following the development of this repository and appreciate the efforts to benchmark various efficient Transformer variants. I’d like to propose the implementation of…
-
When I downloaded all the weight files of stable-diffusion-xl-base-1.0 and prepared to run stable_diffusion_xl_test.py, I got the error OSError: ./stable-diffusion-xl-base-1.0 does not appear to have …
-
I want to use [this model](https://huggingface.co/ibm/MoLFormer-XL-both-10pct) as an encoder. As you can see from the description, the model can be uploaded like:
```
model = AutoModel.from_pretrain…
-
When trying the model from
`https://huggingface.co/alibaba-pai/EasyAnimateV2-XL-2-768x768`
gives
`RuntimeError: D:\Tests\EasyAnimate\EasyAnimate\models\Diffusion_Transformer\EasyAnimateV2-XL-2-768x…
-
**Describe the bug**
I have a transformer-xl (Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context) gpt-xl ( 41 layer ), and the code is implemented by myself; After transfer to o…