-
I have an OP6 and would like to flash images from packages named like `enchilada_22_O.15_180810.ops`, making sure that all images from the `.ops` are written to both the device's A and B partition slots…
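As a sketch of the "both slots" part: on A/B devices each partition exists as `_a` and `_b`, so every image from the package has to be flashed twice. The helper below is hypothetical (the `.ops` unpacking step and the assumption that partition names match the image basenames are mine, not from the package format) and only builds the `fastboot flash` command strings:

```python
# Hypothetical helper: given image files unpacked from the .ops payload,
# emit the fastboot commands that write each image to both the _a and _b
# slot partitions. Assumes partition names match the image basenames.
from pathlib import Path

def slot_flash_commands(image_paths):
    cmds = []
    for p in map(Path, image_paths):
        for slot in ("a", "b"):
            # e.g. "fastboot flash boot_a boot.img"
            cmds.append(f"fastboot flash {p.stem}_{slot} {p}")
    return cmds

print(slot_flash_commands(["boot.img", "system.img"]))
```

Whether every image in the package is actually slot-suffixed on the device is something to verify against `fastboot getvar all` first.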
-
Edge doesn't contain "Server", and the metric should be latency, not Queries/s. Could you correct this?
![image](https://github.com/mlcommons/inference/assets/6924448/02c88f64-2073-495a-bcc2-f6d37b…
-
This feature proposal aims to improve the accuracy of task classification in our project by leveraging GPT-J and ChatGPT, together with a cache. By using GPT-J, a small and o…
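A minimal sketch of the caching idea (the function name and the trivial stand-in classifier are my assumptions, not part of the proposal): identical task descriptions hit the cache instead of triggering another model call.

```python
# Sketch of the proposed cache layer: memoize classification results so
# repeated task descriptions never re-invoke the model.
from functools import lru_cache

@lru_cache(maxsize=1024)
def classify_task(description: str) -> str:
    # Stand-in for a GPT-J / ChatGPT call; here a trivial keyword rule
    # just so the example is self-contained and runnable.
    return "coding" if "code" in description.lower() else "other"

classify_task("write code to sort a list")   # computed
classify_task("write code to sort a list")   # served from cache
```

In practice the cache key would need normalization (case, whitespace) so near-duplicate descriptions also hit.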
-
Hi 👋🏻
Coming from [this](https://github.com/ggerganov/ggml/blob/master/examples/gpt-j/convert-h5-to-ggml.py) GGML conversion script and the issue you commented on in https://github.com/ggerganov/…
-
Dear author,
How can I run T5 the way the gpt-2 or gpt-j examples run?
Thanks
-
**Describe the bug**
Using DeepSpeed Inference (using `deepspeed.init_inference`) gives weird outputs when using batch size > 1 and padding the inputs.
I'll first state the problem with more detai…
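One detail that commonly matters here, shown as a pure-Python sketch with no DeepSpeed involved (`left_pad` is a hypothetical helper of mine): decoder-only generation generally wants *left* padding, so that the last position of every row in the batch holds a real token rather than a pad.

```python
# Sketch: left-pad a batch of token-id sequences to a common length, so
# the final position of each row is a real token (relevant for batched
# decoder-only generation).
def left_pad(batch, pad_id=0):
    width = max(len(row) for row in batch)
    return [[pad_id] * (width - len(row)) + row for row in batch]

print(left_pad([[1, 2], [3]]))
```

With right padding instead, the shorter rows end in pad tokens, which is one known source of garbage outputs at batch size > 1.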
-
The llama.cpp project already has an option to build with `-pg` via `LLAMA_GPROF=1`.
But `llama-cli` crashes when traced with uftrace, as follows.
```
$ git clone https://github.com/gg…
-
### Feature request
Flash Attention 2 is a library that provides attention operation kernels for faster and more memory-efficient inference and training: https://github.com/Dao-AILab/flash-attentio…
-
Hi! I'm working on reproducing your [Argo workflow for fine-tuning GPT-J](https://github.com/coreweave/kubernetes-cloud/tree/master/finetuner-workflow).
I'm able to create a PVC, download the da…
-
```
py", line 70, in set_module_8bit_tensor_to_device
    new_value = bnb.nn.Int8Params(new_value, requires_grad=False, has_fp16_weights=has_fp16_weights).to(device)
  File "/opt/conda/lib/python3.10/si…
```