-
Is it possible to use a TPU for inference?
The team at [NLPCloud.io](https://nlpcloud.io) told me that's what they're doing, but I have no idea how they do it...
First, I don't know how to su…
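For what it's worth, here is a minimal sketch of one common route (not necessarily what NLPCloud.io does): the Flax/JAX port of GPT-J in Hugging Face transformers (`FlaxGPTJForCausalLM`), assuming a TPU VM or runtime where JAX already detects the TPU cores:

```python
# Minimal sketch of TPU inference via the Flax/JAX GPT-J port in
# Hugging Face transformers. Assumes a TPU host where JAX detects
# the devices; not necessarily how NLPCloud.io does it.
import jax
import jax.numpy as jnp
from transformers import AutoTokenizer, FlaxGPTJForCausalLM

print(jax.devices())  # should list TpuDevice entries on a TPU host

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
model = FlaxGPTJForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B", dtype=jnp.bfloat16
)

inputs = tokenizer("Hello, my name is", return_tensors="np")
outputs = model.generate(inputs["input_ids"], max_length=32)
print(tokenizer.decode(outputs.sequences[0]))
```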
-
It would be great to have INT8 support for GPT-J: weight-only INT8 at a minimum, but ideally W8A8 (int8 weights and activations) as well.
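Not the W8A8 path requested here, but for reference, GPT-J already runs with 8-bit weights in the Hugging Face stack via bitsandbytes (LLM.int8()). A minimal sketch, assuming a CUDA GPU with `bitsandbytes` and `accelerate` installed:

```python
# 8-bit GPT-J via bitsandbytes: int8 weight storage with mixed
# int8/fp16 matmuls (LLM.int8()). Requires a CUDA GPU.
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B",
    device_map="auto",
    load_in_8bit=True,  # quantize weights to int8 at load time
)
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=16)[0]))
```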
-
### Description
```shell
branch: main
fastertransformer docker: 22.12
```
### Reproduced Steps
```shell
docker run -it --rm --gpus=all --shm-size=1g --ulimit memlock=-1 -v ${WORKSPACE}:…
-
### Description
Expected behavior:
```shell
>>> from transformers import AutoTokenizer
>>> tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
>>> tokenizer.encode('')
[50256]
``…
-
It looks like EleutherAI/gpt-j-6b is not supported.
Environment (running from Docker):
```
FROM pytorch/pytorch:2.0.1-cuda11.7-cudnn8-devel
RUN apt-get update && apt-get install git -y
RUN pip …
-
Hi, I have a question about a tokenizer mismatch.
When the reference model is fixed to "gpt-j-6B", several scoring models, such as "gpt-neox-20b" and "llama", do not share the same tokenizer. …
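For illustration, a small sketch that makes the mismatch visible by tokenizing the same text with both tokenizers (the llama tokenizer differs in the same way, but its repo is gated, so it is omitted here):

```python
# Tokenize the same text with the reference and a scoring model's
# tokenizer to show they produce different ids and vocab sizes.
from transformers import AutoTokenizer

text = "The quick brown fox jumps over the lazy dog."
for name in ["EleutherAI/gpt-j-6B", "EleutherAI/gpt-neox-20b"]:
    tok = AutoTokenizer.from_pretrained(name)
    ids = tok.encode(text)
    print(f"{name}: vocab={tok.vocab_size}, n_tokens={len(ids)}, ids[:6]={ids[:6]}")
```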
-
The current implementations of GPT-J and BERT carry out prediction sequentially. Could their performance be improved by implementing parallel processing through threads ra…
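A minimal sketch of the threading idea, with a hypothetical `predict` stand-in for the model call; whether this helps in practice depends on whether the backend releases the GIL and on CPU/GPU contention, and batching requests is often the better-supported route:

```python
# Run several predictions concurrently with a thread pool.
# `predict` is a hypothetical stand-in for the model call.
from concurrent.futures import ThreadPoolExecutor

def predict(prompt: str) -> str:
    return prompt[::-1]  # placeholder for actual inference

prompts = ["first prompt", "second prompt", "third prompt"]
with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(predict, prompts))
print(results)
```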
-
I would like to run the `ggml/gpt-j` version on the MLPerf benchmark. Is it possible to use a fine-tuned GPT-J checkpoint listed here: https://github.com/mlcommons/inference/blob/master/language/gpt-j…
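A rough sketch of the pipeline this implies: pull the checkpoint from the Hugging Face Hub, then convert it with ggml's GPT-J conversion script. The script path and arguments below are assumptions; check the ggml repo for the exact invocation:

```python
# Fetch a GPT-J checkpoint locally for conversion to ggml format.
# The base model is used here as a placeholder for the fine-tuned
# MLPerf checkpoint.
from huggingface_hub import snapshot_download

local_dir = snapshot_download("EleutherAI/gpt-j-6B")
print(local_dir)

# Then, from a ggml checkout (hypothetical invocation; verify the
# script name and arguments in the repo):
#   python examples/gpt-j/convert-h5-to-ggml.py <local_dir> 1
```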
-
Has this library been tested with larger models such as GPT-J-6B and GPT-NeoX-20B? Are there plans to support larger models like these? Thanks.
-
Is it possible to have swap-space support? (I heard about "Handling big models for inference" and was wondering whether ggml could support a similar feature, or store part of a large model in swap.)
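For context, the "Handling big models for inference" feature refers to Hugging Face Accelerate's disk offload; a ggml equivalent would presumably behave similarly. A minimal sketch of the Accelerate side:

```python
# Layers that don't fit in VRAM/RAM are spilled to disk and streamed
# back in during the forward pass (Accelerate's disk offload).
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B",
    device_map="auto",         # place layers on GPU/CPU as they fit
    offload_folder="offload",  # spill the remainder to disk
)
```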