-
### Background
For large document understanding or tasks like code completion, it's often beneficial to have a large context length e.g. > 8K. In order for this to be enabled by default, a model wo…
-
**Describe the bug**
What the bug is, and how to reproduce, better with screenshots(描述bug以及复现过程,最好有截图)
When running inference on a fine-tuned model, I get the following error:
` File "/home/ubu…
-
### ⚠️ Please check that this feature request hasn't been suggested before.
- [X] I searched previous [Ideas in Discussions](https://github.com/axolotl-ai-cloud/axolotl/discussions/categories/ideas) …
-
the environment is :
------------------------------- -------------------------------------------------------------------------------------------------------------------------------------------------…
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-5.4.0-173-generic-x86_64-with-glibc2.31
- Python version: 3.10.0
- Huggingface_hub version: 0.23.3
- Safetensors version: 0.4.3
…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
A English to French translator powered by machine learning represents a transformative too…
-
### Feature request
Improving the Implementation of Phi3SuScaledRotaryEmbedding to Reduce Unnecessary Computation
I'm not entirely sure if there is a deeper meaning to the implementation here. It …
-
**Is your feature request related to a problem? Please describe.**
MTK is a fairly large dependency that is really two packages in a trench coat.
Projects like DAECompiler and (I expect) JuliaSimC…
-
```
Traceback (most recent call last):
File "/root/miniconda3/envs/opensora/lib/python3.10/site-packages/gradio/queueing.py", line 541, in process_events
response = await route_utils.call_pro…
-
### Before
```python
import pymc as pm
from pymc.model.transform.optimization import freeze_dims_and_data
with pm.Model() as m:
...
with freeze_dims_and_data(m):
idata_prior = pm.sam…