gpt2-inference-performance Search Results

236 results
for gpt2-inference-performance

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

triton-inference-server/server #7118

error running simple example

**Description** A clear and concise description of what the bug is. ``` triton_python_backend_stub /tensorrt/triton-repos/trtibf-Trendyol-LLM-7b-chat-v1.0/preprocessing/1/model.py triton_python_bac…

geraldstanje updated 5 months ago
15
Lightning-AI/lightning-thunder #1010

Add `apex_ex` to default executor

Currently, `apex_ex` has special implementation for `cross_entropy` and `fused_rms_norm` (with a registered lookaside) from apex. If we find that the `cross_entropy` from apex is faster than curre…

kshitij12345 updated 1 month ago
5
shm007g/LLaMA-Cult-and-More #3

Papers

shm007g updated 1 year ago
7
yoheikikuta/paper-reading #31

[1907.11692] RoBERTa: A Robustly Optimized BERT Pretraining …

## 論文リンク https://arxiv.org/abs/1907.11692 ## 公開日（yyyy/mm/dd） 2019/07/26 ## 概要 BERT の事前学習を様々な観点から検証・実験して original の BERT が undertrained であることを発見し、optimize して学習した結果、XLNet など BERT 以降に提案されたモデルと同等…

yoheikikuta updated 1 month ago
9
huggingface/transformers #14839

Fine-tuning GPT-J-6B in colab: 8-bit weights with low-rank a…

# 🌟 New model addition ## Model description This is a version of EleutherAI's GPT-J with 6 billion parameters that is modified so you can generate and fine-tune the model in colab or equivalent …

dvmazur updated 1 year ago
34
pytorch/pytorch #127383

4 GPT2 can't run into `_scaled_dot_product_flash_attention_f…

### 🐛 Describe the bug ## Description Before https://github.com/pytorch/pytorch/pull/123732, when running with AOTI, the SDPA pattern can be hit and `torch.ops.aten._scaled_dot_product_flash_atten…

chunyuan-w updated 4 months ago
3
huggingface/transformers #4969

Request: pretrained distilgpt2-medium, distilgpt2-large mode…

# Plans for distilgpt2-medium and distilgpt2-large ## Motivation While distilgpt2 is useful, I was wondering if there are any plans to create a distilgpt2-medium and distilgpt2-large. I'm also won…

joeyism updated 2 months ago
12
BerriAI/litellm #361

🎅 I WISH LITELLM HAD...

This is a ticket to track a wishlist of items you wish LiteLLM had. # **COMMENT BELOW 👇** ### With your request 🔥 - if we have any questions, we'll follow up in comments / via DMs Respond …

krrishdholakia updated 1 week ago
195
pytorch/pytorch #77764

General MPS op coverage tracking issue

### This issue is to have a centralized place to list and track work on adding support to new ops for the MPS backend. [**PyTorch MPS Ops Project**](https://github.com/users/kulinseth/projects/1/vi…

albanD updated 8 hours ago
1500
webmachinelearning/webnn #375

Support for transformers

While our [draft charter](https://www.w3.org/2023/03/proposed-webmachinelearning-charter.html) says that the group: > priority on building blocks required by well-known model architectures such as re…

dontcallmedom updated 2 weeks ago
35

上一页 1...5 6 7 8 9 10 11...24 下一页

236 results for gpt2-inference-performance

236 results
for gpt2-inference-performance