-
Hi! How's it going? Is there any documentation on using this model? If not, could I write some and ask you to merge it into this repo? Thanks!
-
We have added a `generate()` method to `GPT2CausalLM`, and we need a way to benchmark this API, since performance is key to text generation.
More details will be added soon.
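As a starting point for discussion, here is a minimal sketch of what such a benchmark could look like. It times any text-generation callable and reports simple latency statistics. The names `benchmark_generate` and `fake_generate` are hypothetical and introduced only for illustration. The exact signature of `GPT2CausalLM.generate()` is not specified here, so the sketch assumes a callable that takes a prompt string and returns a string.

```python
import statistics
import time
from typing import Callable, Dict, List


def benchmark_generate(
    generate_fn: Callable[[str], str],
    prompts: List[str],
    warmup: int = 1,
    repeats: int = 3,
) -> Dict[str, float]:
    """Time a text-generation callable and report latency statistics in seconds."""
    # Warm-up passes are not timed, so one-off costs (compilation, cache
    # setup) do not skew the measurements.
    for _ in range(warmup):
        for prompt in prompts:
            generate_fn(prompt)

    # Timed passes: one latency sample per (repeat, prompt) pair.
    latencies = []
    for _ in range(repeats):
        for prompt in prompts:
            start = time.perf_counter()
            generate_fn(prompt)
            latencies.append(time.perf_counter() - start)

    return {
        "calls": float(len(latencies)),
        "mean_s": statistics.mean(latencies),
        "median_s": statistics.median(latencies),
        "max_s": max(latencies),
    }


def fake_generate(prompt: str) -> str:
    # Hypothetical stand-in for a real model call (assumption, not a real API).
    return prompt + " ..."


stats = benchmark_generate(fake_generate, ["hello", "world"])
print(stats)
```

To benchmark a real model, `fake_generate` could be swapped for a small wrapper around the model's own generation call. Keeping the harness independent of any particular model class makes it easier to compare backends or configurations.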
-
## 🐛 Bug
I use the following script:
CUDA_VISIBLE_DEVICES="0, 1, 2, 3" metaseq-train --task streaming_language_modeling \
data/pile-test/ \
--num-workers 4 \
--reset-dataloader \
--vo…
-
### Issue with current documentation:
The [documentation](https://python.langchain.com/docs/use_cases/summarization) describes the different options for summarizing a text. For longer texts, the 'map_…
-
### Is your feature request related to a problem?
gpt2-chatbot
### Describe the solution you'd like.
gpt2-chatbot
### Describe alternatives you've considered. …
-
### System Info
TGI from Docker
text-generation-inference:2.2.0
host: Ubuntu 22.04
NVIDIA T4 (x1)
nvidia-driver-545
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An o…
-
## Description
It seems that the hybridized gpt2 in v0.9.0 generates different results from previous versions (where it was not a HybridBlock).
I compared the results of the `sequence_sampling.py` script in v0.9.0
(…
-
### Discussed in https://github.com/ggerganov/llama.cpp/discussions/9197
Originally posted by **Francis235** August 27, 2024
Hi, I want to know how to add an extra fixed tensor to the token em…
-
I could not find the code for SFT for student models for Qwen.
-
Can we train gpt2-xl with nanoGPT? If so, where are its datasets?