rope Search Results - Githubissues

1000+ results for rope

mlc-ai/mlc-llm #1344

[Tracking] RoPE scaling support

## Overview This issue tracks the support of RoPE scaling, one important configurable parameter adopted by many new models, in MLC LLM. ## Action Items - [ ] Support linear RoPE scaling…

MasterJH5574 updated 5 hours ago
1
wiremod/wire #3090

Possible wiremod expression2 constraintcore rope-constraint …

Hello, a project of mine requires a rope constraint that updates its length at runtime. I tried to use a winch for this, but due to the winch's custom physics that make it super unrealistic on its behav…

raven1934 updated 1 week ago
2
NVIDIA/Fuser #2577

alias analysis missing out opportunities on aliasing within …

In full rope example, we have a fusion starting with something like this: ``` Inputs: T0_g[ iS467{1024}, iS468{8} ], __bfloat T1_g[ iS471{1024}, iS472{8} ], __bfloat T2_g[ iS463{2}, iS464…

jjsjann123 updated 1 week ago
5
lucidrains/rotary-embedding-torch #25

RoPE-Mixed: Improvement over Axial for n-D

Hi @lucidrains, These folks talk about improving axial-RoPE performance. Some comparisons to axial-RoPE look nice, but for some, I am not convinced. I wanted to get your thoughts on this. If it mak…

tasansal updated 2 weeks ago
1
unslothai/unsloth #774

Support for model trained by OLMo?

Hi, it seems that unsloth currently does not support loading a base model trained by [OLMo](https://github.com/allenai/OLMo). Is it possible to write a custom script to load the model into unsloth? The mo…

CloudyDory updated 1 day ago
1
lucidrains/rotary-embedding-torch #26

LieRE: Generalizing Rotary Position Encodings. Beats RoPE-mi…

Hi, @lucidrains! Promising research was published this month (vs. RoPE-mixed (#25) in March): the so-called LieRE positional encodings generalize the kv-vector rotation to any number of d…

kabachuha updated 2 weeks ago
3
joeyoravec/stargate #2

Type of rope

Hi, do you remember the type of rope and how many LEDs/m you have? Your link for it is dead. Thanks

mindstorm88 updated 2 months ago
5
unslothai/unsloth #584

unsloth/codegemma-2b-bnb-4bit: Model Error when Setting max_…

**Issue: Model Error when Setting max_seq_length > 8192** **Description:** The `unsloth/codegemma-2b-bnb-4bit` model throws an error when attempting to set `max_seq_length` greater than 8192. …

terraformmachine updated 1 week ago
5
flashinfer-ai/flashinfer #194

Shared-prefix rope issue

![image](https://github.com/flashinfer-ai/flashinfer/assets/32770237/e13571da-9bff-456b-b67b-764aee66b7fd) I found that during shared-prefix calculation, this kernel won't use _qo_indptr_ to split ba…

lkc1997 updated 2 months ago
1
sgl-project/sglang #591

`model_override_args` with server

When using a server, one currently cannot use the `model_overide_args` which could be very useful, e.g. for rope scaling. This is currently the `sglang.launch_server.py`: ```py import argparse…

ValeKnappich updated 1 week ago
1
