gpt2 Search Results - Githubissues

1000+ results
for gpt2

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

p208p2002/Transformer-QG-on-SQuAD #5

Documentation for gpt2-squad-qg-hl

Hi! How's it going? Is there any documentation on using this model? If not, could I write some, and request you merge it to this repo? Thanks!

rempel1234 updated 4 months ago
2
keras-team/keras-nlp #792

Benchmark GPT2 `generate` method

We have added a `generate()` method to `GPT2CausalLM`, and we need a way to benchmark this API since performance is a key to text generation. More details will be added soon.

chenmoneygithub updated 4 months ago
1
facebookresearch/metaseq #692

Sub-workers exits without messages

## 🐛 Bug I use the script as follow: CUDA_VISIBLE_DEVICES="0, 1, 2, 3" metaseq-train --task streaming_language_modeling \ data/pile-test/ \ --num-workers 4 \ --reset-dataloader \ --vo…

GongZhengLi updated 1 week ago
7
langchain-ai/langchain #15347

DOC: Summarization 'map_reduce' - Can't load tokenizer for '…

### Issue with current documentation: The [documentation](https://python.langchain.com/docs/use_cases/summarization) describes the different options for summarizing a text, for longer texts the 'map_…

analyticsinsights updated 4 days ago
11
ai-shifu/ChatALL #820

[FEAT] gpt2-chatbot ADD asap

### Is your feature request related to a problem? / 你想要的功能和什么问题相关？ gpt2-chatbot ### Describe the solution you'd like. / 你想要的解决方案是什么？ gpt2-chatbot ### Describe alternatives you've considered. / 你考虑…

johnfelipe updated 4 months ago
1
huggingface/text-generation-inference #2456

Running TGI on NVIDIA T4

### System Info TGI from Docker text-generation-inference:2.2.0 host: Ubuntu 22.04 NVIDIA T4 (x1) nvidia-driver-545 ### Information - [X] Docker - [ ] The CLI directly ### Tasks - [X] An o…

ivoras updated 4 days ago
3
dmlc/gluon-nlp #1165

GPT2 HybridBlock

## Description It seems hybridized gpt2 in V0.9.0 generate different results with previous versions (not as a hybridblock). I compared the result between the sequence_sampling.py script in v0.9.0 (…

carter54 updated 4 years ago
4
ggerganov/llama.cpp #9198

how to add an extra fixed tensor to the token embedding in …

### Discussed in https://github.com/ggerganov/llama.cpp/discussions/9197 Originally posted by **Francis235** August 27, 2024 Hi, I want to know how to add an extra fixed tensor to the token em…

Francis235 updated 2 weeks ago
2
songmzhang/DSKD #11

qwen

针对qwen的SFT for student models 代码没看到

zjjznw123 updated 1 month ago
3
karpathy/nanoGPT #138

gpt2-xl

Can we train gpt2-xl on nanoGPT? If possible，where's its datasets?

zscwind updated 1 year ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for gpt2

1000+ results
for gpt2