-
I reinstalled flash-attn with `pip install flash-attn==2.6.1` in the NGC PyTorch Docker image 24.06.
When I run a training job, I get the following error:
```
Traceback (most recent call last):
File "/data1/nfs15/nfs/bigdata/zha…
-
I'm attempting to train LLaMA-3 using Megatron-LM but have encountered an issue: LLaMA-3 utilizes Tiktoken for tokenization and doesn't provide a tokenizer.model file, which is required by Megatron-LM…
-
Congratulations on this great extension!
Do you plan to support LM Studio? When we enter “ip:1234/v1” we get the message “Ollama is running”, but it does not load…
-
**The bug**
If `model.set('name', ['1', '2', '3'])` is used to create a variable, later calls to `gen` can't append to it.
**To Reproduce**
```python
from guidance import models, gen, user,…
-
**Acceptance Criteria**
- When a payment is made on an LM folder, AMANDA sends an email to the applicant as shown in the screenshot below.
- This email needs to be updated as mentioned in the sc…
-
**Acceptance Criteria**
- When a new LM Application is submitted in the portal, AMANDA sends an email to the applicant as shown in the screenshot below.
- In the email that is sent, the text 'Dev…
-
What do you think of using BERT-family language models (TinyBERT, MobileBERT, DistilBERT, ALBERT, etc.) for spelling correction?
This could be a way of significantly improving automatic correction, no? I'm not an…
-
I am currently exploring the internal workings of the LM-Harness library and am considering contributing to its documentation to help clarify some of its mechanisms for future users.
I have a query…
-
## Description
Add reverse functionality for taking models in LM Studio and making them available in Ollama. Very similar to `L`, except it works in the reverse direction.
-
Hi,
I am trying to run some LLMs (currently OpenAI models) on MMLU. My first question is: which configuration is the standard setup (5-shot without CoT)? What does flan mean in some of the c…