-
### Feature request
Improving the Implementation of Phi3SuScaledRotaryEmbedding to Reduce Unnecessary Computation
I'm not entirely sure if there is a deeper meaning to the implementation here. It …
-
The environment is:
------------------------------- -------------------------------------------------------------------------------------------------------------------------------------------------…
-
**Describe the issue**
Hello. I'm testing the first tutorial as-is with around 5,000 text files; some are 1 page long, some are 15 pages long.
When the answer is printed, I get this error:
ER…
-
Hi, I have two issues with the minimax strategy and early stopping:
1. loss1 is to maximize series_association and loss2 is to minimize prior_association. In the original paper, it was minimize, then ma…
-
Hi! I am trying to use Segment Anything, but I constantly get the following error: RuntimeError: CUDA error: an illegal memory access was encountered.
Here is a small piece of my code:
```
# bu…
-
### System Info
ubuntu 20.04
tensorrt 10.0.1
tensorrt-cu12 10.0.1
tensorrt-cu12-bindings 10.0.1
tensorrt-cu12-libs 10.0.1
tensorrt-llm 0.10.…
-
### Description
### Expected behavior with the suggested feature
- [ ] [ContraRec: "Sequential Recommendation with Multiple Contrast Signals" Wang et al., TOIS'2022.](https://github.com/TH…
-
Hi!
In your paper you mention:
```
We do not make any significant change to model architecture other than adjusting the base of RoPE, as in Xiong et al. (2023).
```
However, it appears that th…
-
### Before
```python
import pymc as pm
from pymc.model.transform.optimization import freeze_dims_and_data
with pm.Model() as m:
...
with freeze_dims_and_data(m):
idata_prior = pm.sam…
-
**Describe the bug**
One can run into a crash
```
"/opt/NeMo/nemo/collections/nlp/modules/common/megatron/language_model.py", line 352, in forward
embeddings = words_embeddings + position_em…