-
### Feature request
https://arxiv.org/pdf/2401.01325.pdf
Abstract
This work elicits LLMs’ inherent ability to handle
long contexts without fine-tuning. The limited
length of the training sequen…
-
https://github.com/datamllab/LongLM
-
They have implemented the Self-Extend patch for Llama and Mistral. Would it be possible to port the same into litGPT?
https://github.com/datamllab/LongLM/tree/master
-
- [ ] [HongyeJ on X: "Despite the mixed feelings about Google's latest Gemma model, we're big fans! @GoogleAI Why? Coz we found it pairs incredibly well with our SelfExtend 🤣🤣🤣 - like, perfectly! With…
-
Hi, thank you for the great work!
I'm wondering if you have plans to provide this for the Solar models as well?
-
Hi, while testing the LongLM model, the output on a fill-in-the-blank task is correct. Given:
```
两个多小时后,石峰把四百五十块灵石,花了一个干干净净,而狼城各大药材店中,丹的药材,也被石峰搜罗一空。回到住处,没有片刻停歇,人极境界用到的回气丹,这种丹药,对于石峰来说,没有一炉炉丹药不断出炉,石峰和小黑的脸了花。
```
the correct output is obtained:
```
▁炼石峰和小黑药都得到了什么用处。色都变了,脸上都绽放
```
However, …
-
Hi team, I checked LocalLLaMA and found that Gemma works well with the Self-Extend method. It would be awesome if this technique could be added to gemma.cpp.
References:
- [locallama](http…
-
In the paper [LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning](https://arxiv.org/pdf/2401.01325.pdf), the authors describe a method to extend the context-window of _any rope-based_ mod…
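As I understand the paper, the core trick is a two-regime position mapping: tokens within a neighbor window keep their exact relative positions for RoPE, while more distant tokens fall back to grouped (floor-divided) positions, shifted so the two regimes join smoothly at the window boundary. A minimal sketch of that mapping (function and parameter names are illustrative, not from the LongLM codebase):

```python
def self_extend_rel_pos(q_pos: int, k_pos: int,
                        group_size: int = 4,
                        neighbor_window: int = 512) -> int:
    """Map a query/key position pair to the relative position fed to RoPE.

    Nearby tokens (distance <= neighbor_window) keep their true relative
    position; distant tokens use floor-divided "grouped" positions, shifted
    so the mapping is continuous at the window boundary.
    """
    d = q_pos - k_pos
    if d <= neighbor_window:
        return d
    grouped = q_pos // group_size - k_pos // group_size
    shift = neighbor_window - neighbor_window // group_size
    return grouped + shift

# Nearby pair: exact relative position is preserved.
print(self_extend_rel_pos(600, 500))   # 100
# Distant pair: grouped position, shifted past the neighbor window.
print(self_extend_rel_pos(2000, 0))    # 884
```

Because distant relative positions grow only at rate 1/group_size, a model trained on a fixed context length never sees out-of-distribution positions, which is why no fine-tuning is needed.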
-
When running the gen.sh script an error is raised. The model used is longlm-small, and it points to this command:
```
gen = model.generate(input_ids, do_sample=True, max_length=512, top_k=40, temperature=0.7, decoder_start_token_id=1)
```
If this line is replaced with
`gen = model.generat…