-
I checked this [issue](https://github.com/EleutherAI/lm-evaluation-harness/issues/714#top) has similar problem I have, however using the latest main branch doesn't solve the problem!
## Model:
- F…
-
### System Info
platform: linux: `ubuntu 22.04`
python version: `3.10.12`
transformers version: `4.44.2`
### Who can help?
_No response_
### Information
- [ ] The official example scr…
-
With limited memory on most of phones, there's community requests on supporting a model with a smaller size like Phi-3 mini. It may be supported out of box, but need to verification, evaluation and pr…
-
`python run.py --train --model ./TinyLlama-1.1B-Chat-v1.0 --data ./data --batch-size 1 --lora-layers 4`
`Loading pretrained model
You are using the default legacy behaviour of the . This is expect…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Willi…
-
### What is the issue?
Background:
Kubernetes 1.31 introduced a new feature: [Read-Only Volumes Based on OCI Artifacts](https://kubernetes.io/blog/2024/08/16/kubernetes-1-31-image-volume-source/).…
-
### Description & Motivation
https://github.com/pytorch/pytorch/pull/104810 adds the recommendation that the `save` APIs should be called in a single node (`shard_group`).
https://github.com/pyt…
-
Hi,
Some of the models in Hugeface shows the support of `create_chat_completion`, but now this plugin seems only support the `Simple inference`, will `Chat Completion` be supported in the future ve…
-
### System Info
- `transformers` version: 4.37.2
- Platform: macOS-14.5-arm64-arm-64bit
- Python version: 3.11.9
- Huggingface_hub version: 0.23.1
- Safetensors version: 0.4.2
- Accelerate ver…
-
NB: I have the US version of the Samsung Note10+, which is different than the european. (See: [GSMArena Comparison, edit: wrong note10 for Europe. Fixed](https://gsmarena.com/compare.php3?idPhone1=973…