-
## Question
**English only.** Questions in other languages will not be accepted.
Before asking a question, make sure you have:
- Googled your question.
- Searched open and closed [GitHub issues](https://g…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
While reviewing the design of the "Home" section, I noticed that the overall user experien…
-
Thank you for this amazing work!
I was wondering whether the fp8 implementation of flash attention 3 will be available for public use? My main concern is accuracy (block quant may have alleviated thi…
-
| **Assignment** | **Points** | **Grade** | **Evidence** |
|----------------------------|---------------|-----------|--------------|
| Baseline Grade | 55% | …
-
We need them to know that we code! So, one idea I had is this: we can blur them, just so they draw less attention!
we just need the logic working behind them and animation we will implemen…
-
## Description
A DUW stakeholder (Joshua Gibbs) has discovered some incorrect text on Staging. Please see below for details and steps to reproduce (thanks to Michelle M!).
This change should apply t…
-
Sometimes I get the following exception on the server side at `await jsonRpc.Completion` when jsonRpc is disposed just after the marshallable object is disposed on the client side:
System.InvalidOperationE…
-
Using the latest transformers 4.47.0.dev0.
Remove the import of `_expand_mask` and replace it with a custom definition:
def _expand_mask(mask: torch.Tensor, dtype: torch.dtype, tgt_len: Optional[int] = None):
"""
Expands attention_mask from `[bs…
-
Like.. it's kinda obvious, ain't it? And sure, probably none of you intended this, but like... please don't? I'm sure you don't want to appeal to those kinds of people, so I'd rather think this shoul…
-
Hello,
I hope you're doing well. Thank you for creating this package. I have two questions about creating a reproducible workflow:
1. Is there a way to use RCy3 to generate the same netwo…