-
Hi, It seems that the same code is **working fine with when the Megatron-LM that I git-cloned in April. With the latest Megatron-LM, I've got the following error raised with the pretrain_gpt.py code. …
-
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Hi, thank you for your nice work. I left the question to ask the availability of batch g…
-
**Is your feature request related to a problem? Please describe.**
When using mindnlp to infer GPT2, I found that the inference speed is 10X slower than pytorch.
Here is the torch version implementa…
-
### Describe the feature request
It would be great to have the option to provide pre-optimised TensorRT engine plans to ORT.
### Describe scenario use case
Using TensorRT in standalone, e.g. trtex…
-
- [ ] [README.md · defog/sqlcoder-7b-2 at main](https://huggingface.co/defog/sqlcoder-7b-2/blob/main/README.md?code=true)
# README.md · defog/sqlcoder-7b-2 at main
**DESCRIPTION:**
```yaml
license:…
-
- [ ] [unsloth/README.md at main · unslothai/unsloth](https://github.com/unslothai/unsloth/blob/main/README.md?plain=1)
# unsloth/README.md at main · unslothai/unsloth
…
-
The code is concise and very helpful. Please provide a demo for the client.
-
### Describe the issue
I implemented a program with GPT NEO in python (attached the program) and the equivalent version in C++. To acquire the exported GPT NEO model I made some slight modification…
-
**Describe the bug**
Following the instructions in [`examples/mistral`](https://github.com/microsoft/Olive/tree/main/examples/mistral) does not result in a quantized onnx model. After running the wor…