-
Hi, thanks for your great work! What is the number of blocks in v1.3? Is it still the same as in v1.2? It seems strange not to double the block number as in STDiT.
-
[Outline] I would like to add a section under General Concepts about optimizing the speed and response times of LLMs. I plan to include the topics below; a quick quantization sketch follows the list:
- Quantization
- Flash attention
- Arch…
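
As a starting point for the quantization topic, here is a minimal sketch assuming the Hugging Face transformers + bitsandbytes stack; the model id and the specific settings are illustrative assumptions, not part of the outline:

```python
# Minimal sketch: 4-bit weight quantization with transformers + bitsandbytes.
# The model id and settings below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # store weights as 4-bit NF4
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,  # dequantize to fp16 for matmuls
)

model_id = "mistralai/Mistral-7B-v0.1"     # placeholder model id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",                     # spread layers across available GPUs
)
```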
-
I have already downloaded Flash-Attention 1.x (actually flash-attn 1.0.8) because I currently only have a GPU with the Turing architecture (TITAN RTX). But for my needs (running a demo of a multimodal LLM)…
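
For context, a quick way to confirm why only flash-attn 1.x applies here is to check the GPU's compute capability with PyTorch: FlashAttention 2.x requires sm_80 (Ampere) or newer, while Turing cards such as the TITAN RTX report sm_75. A minimal sketch:

```python
# Check the GPU's compute capability: flash-attn 2.x needs sm_80+ (Ampere),
# while Turing cards (e.g. TITAN RTX) report sm_75, hence flash-attn 1.x only.
import torch

major, minor = torch.cuda.get_device_capability(0)
print(f"{torch.cuda.get_device_name(0)}: sm_{major}{minor}")
if (major, minor) >= (8, 0):
    print("flash-attn 2.x should be usable")
elif (major, minor) >= (7, 5):
    print("Turing: limited to flash-attn 1.x")
else:
    print("FlashAttention kernels unsupported")
```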
-
* The terminal process "/bin/bash '-c', '/usr/local/cuda-12.4/bin/nvcc -g -G -diag-suppress=177 -lineinfo --std=c++17 -arch=sm_75 '-D CUTE_ARCH_LDSM_SM75_ACTIVATED' -o flash_attention_cutlass_standa…
-
Problem: The model generates repetitive, nonsensical outputs like "Breis" regardless of the input provided. This happens even with different generation settings (e.g., temperature, top_k, top_p).
fro…
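
For reference, a typical sampling setup looks like the sketch below (assuming an HF transformers causal LM; `model` and `tokenizer` are placeholders for whatever was loaded). If outputs stay nonsensical across all of these knobs, the cause usually sits upstream of decoding, e.g. a tokenizer/checkpoint mismatch:

```python
# Illustrative generation call with the sampling knobs named above;
# `model` and `tokenizer` are assumed to be an already-loaded HF causal LM.
inputs = tokenizer("Hello, how are you?", return_tensors="pt").to(model.device)
out = model.generate(
    **inputs,
    max_new_tokens=64,
    do_sample=True,          # sampling must be on for the knobs below to apply
    temperature=0.7,
    top_k=50,
    top_p=0.9,
    repetition_penalty=1.2,  # a common first lever against repetitive loops
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```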
-
Hi, it seems that unsloth currently does not support loading a base model trained with [OLMo](https://github.com/allenai/OLMo). Is it possible to write a custom script to load the model into unsloth? The mo…
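
For comparison, a plain-transformers load works as a baseline; a minimal sketch, assuming an HF-format OLMo checkpoint is available on the Hub (the model id below is an assumption). Note that unsloth's speedups come from architecture-specific patched kernels, so a custom loader alone would not enable its optimizations:

```python
# Baseline sketch: loading OLMo with plain transformers (not unsloth).
# "allenai/OLMo-1B-hf" is an assumed HF-format checkpoint id.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMo-1B-hf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
```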
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
## Overview
The focus of this code review is the Dashboard and Landing pages.
Please pay attention to:
* JavaScript issues
* React components
## Review Branch
[r…
-
This is printed when I call `functional.scaled_dot_product_attention`:
> [W914 13:25:36.000000000 sdp_utils.cpp:555] Warning: 1Torch was not compiled with flash attention. (function operator ())
…
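
One way to see which SDPA backends the installed build actually supports is to restrict SDPA to a single backend; a minimal sketch, assuming PyTorch 2.3+ (where `torch.nn.attention.sdpa_kernel` is available) and a CUDA GPU:

```python
# Sketch: restrict SDPA to one backend to see what this build supports.
# Assumes PyTorch 2.3+ (torch.nn.attention.sdpa_kernel) and a CUDA GPU.
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

q = k = v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

try:
    # Raises at dispatch time if the flash kernel is not compiled in.
    with sdpa_kernel(SDPBackend.FLASH_ATTENTION):
        out = F.scaled_dot_product_attention(q, k, v)
    print("flash attention backend available")
except RuntimeError as e:
    print(f"flash attention backend unavailable: {e}")
```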
-
The **transformer** architecture (https://arxiv.org/pdf/1706.03762) has been instrumental in scaling sequence neural networks.
The transformer architecture is the fundamental building block of all LLMs…
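
To make the building-block claim concrete, below is a minimal sketch of a single transformer block in PyTorch, using the pre-norm layout common in modern LLMs rather than the paper's original post-norm layout; the dimensions are arbitrary, and a causal mask would be added for decoder-style use:

```python
# Minimal pre-norm transformer block: self-attention plus a position-wise MLP,
# each wrapped in a residual connection. Sizes are arbitrary.
import torch
import torch.nn as nn

class Block(nn.Module):
    def __init__(self, d_model=512, n_heads=8):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )

    def forward(self, x):
        h = self.ln1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]  # self-attention
        x = x + self.mlp(self.ln2(x))                      # position-wise MLP
        return x

x = torch.randn(2, 16, 512)   # (batch, seq, d_model)
print(Block()(x).shape)       # torch.Size([2, 16, 512])
```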