-
Since https://github.com/vllm-project/vllm/pull/3065, the eval suite https://github.com/EleutherAI/lm-evaluation-harness is broken.
Repro (this should be run on 2 A100s or H100s to make sure the Mi…
-
### Please make sure this feature request hasn't been suggested before.
- [X] I searched previous Issues and didn't find any similar feature requests.
### Feature description
Please allow the user …
-
I run the official code example in `intro.ipynb`:
```python
import dspy
lm = dspy.LM(model='openai/default', api_key=" ", api_base=" ",temperature=0.9, max_tokens=3000,)
colbertv2_wiki17_abstrac…
-
### Describe the issue:
when ninja is buliding the numpy, it spits out an error saying cython is not found
### Reproduce the code example:
```python
.
```
### Error message:
```shel…
-
Megatron-LM/Megatron-core
Tensor-RT
FasterTransformer
-
Hi guys,
I followed [this guide](https://huggingface.co/docs/accelerate/en/usage_guides/megatron_lm) to pre-train a GPT-2 model using Accelerate with Megatron as backend. The current version of Meg…
-
Hybrid indexing: I/J remains relative and K is absolute
The pattern to solves are as follows.
```fortran
do L=1,LM
do J=1,JM
do I=1,IM
...
Field(i,j,LM)
Field(i,j,0)
…
-
**Describe the bug**
After a model is generated running `big_model_fp8.py`, lm_eval dont not work unless the .py files from the original base model is transferred to the generated model folder. Happe…
-
**Describe the bug**
We're in the process of upgrading Megatron-Core from 0.6 to 0.8 and have noticed some problematic behavior with the new distributed async checkpoint saving introduced in mcore 0.7…
-
We noticed that lm_eval --model vllm did not work when data_parallel_size > 1 and got `Error: No available node types can fulfill resource request` from Ray. After some research, I believe when `tenso…