-
I tried to train a LoRA with AutoGPTQ v3.0 and got this error:
Exception in thread Thread-17 (threaded_run):
Traceback (most recent call last):
File "E:\chat\text-generation-webui\conda\lib\threading.py…
-
# find query FFN neurons that activate attention neurons
curfile_ffn_score_dict = {}
for l_h_n_p, increase_score in cur_file_attn_neuron_list_sort[:30]:
    attn_layer, attn_head, attn_neuron, attn_pos = l…
-
Hi, I tried the following code, but my kernel crashed and restarted. Let me know how I should fix this, thanks!
```
from ctransformers import AutoModelForCausalLM
llm = AutoModelForCausalLM.f…
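
# For reference, a minimal ctransformers load usually looks like the sketch
# below; this is an assumption-laden example, not the poster's exact code,
# and the repo id and gpu_layers value are placeholders. A kernel crash at
# this step is most often out-of-memory or a model_type mismatch.
from ctransformers import AutoModelForCausalLM

llm = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Llama-2-7B-GGML",  # hypothetical model repo; substitute your own
    model_type="llama",          # must match the checkpoint's architecture
    gpu_layers=0,                # start fully on CPU, then raise to offload layers
)
print(llm("Hello"))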
-
- [ ] 손기훈
- [ ] 노태엽
- [ ] 백인진
- [ ] 김해원
- [ ] 강민재
-
File "C:\Users\giorgio\OneDrive\Desktop\LLAMA2 MAIN\Conda Environment\Lib\site-packages\fire\core.py", line 141, in Fire
component_trace = _Fire(component, args, parsed_flag_args, context, name)…
-
Hi team,
May I know when the LLAMA2-based mplug-owl will be released?
-
I quantized a custom fine-tuned llama2 70b model like this.
```bash
$ python main.py \
--model /data/finetuned_llama2_70b \
--epochs 20 \
--output_dir /data/finetuned_llama2_70b_output \…
-
### Motivation
LMDeploy's 4-bit quantized prefix cache (along with 4-bit AWQ for weights) allows running ~70B models on 48GB of RAM with good performance for many-user scenarios. The prefix cache c…
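
As a concrete illustration of that setup, a sketch assuming a recent LMDeploy; the model id here is a placeholder:
```python
# Hypothetical configuration combining 4-bit AWQ weights with the 4-bit
# quantized KV/prefix cache described in the motivation above.
from lmdeploy import pipeline, TurbomindEngineConfig

engine = TurbomindEngineConfig(
    model_format="awq",          # 4-bit AWQ-quantized weights
    quant_policy=4,              # 4-bit quantized KV cache
    enable_prefix_caching=True,  # reuse cached prefixes across requests
)
pipe = pipeline("some-org/Llama-2-70B-AWQ", backend_config=engine)  # placeholder id
print(pipe("What does the prefix cache store?"))
```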
-
ChatQnA is one of the GenAI examples. It is a chatbot for question answering through retrieval-augmented generation (RAG). All details about the sample are available at https://github.com/opea-projec…
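
A minimal illustration of the RAG pattern the sample implements; this is a generic sketch, not OPEA's actual code, and every component here is a stand-in:
```python
# Generic retrieval-augmented generation loop; the retriever, corpus, and
# prompt format are illustrative stand-ins, not the ChatQnA implementation.
def retrieve(query, corpus, k=2):
    """Toy retriever: rank documents by word overlap with the query."""
    q_words = set(query.lower().split())
    scored = sorted(corpus, key=lambda d: -len(q_words & set(d.lower().split())))
    return scored[:k]

def build_prompt(query, corpus):
    context = "\n".join(retrieve(query, corpus))
    # In ChatQnA this prompt would be sent to an LLM serving endpoint.
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

corpus = ["OPEA hosts GenAI examples.", "ChatQnA answers questions over documents."]
print(build_prompt("What is ChatQnA?", corpus))
```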
-
# Let's calculate the transfer time theoretically.
## llama3 8B
The original experiment data is [here](https://github.com/b4rtaz/distributed-llama/discussions/41#discussioncomment-9435671).
Since t…
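
As a rough illustration of the kind of estimate this section sets up (all numbers below except the llama3 8B shape are assumptions for illustration, not the discussion's measured data):
```python
# Back-of-the-envelope transfer-time sketch for distributed inference.
# Known llama3 8B shape: 32 layers, hidden size 4096. Everything else
# (node count, link speed, per-layer sync pattern) is assumed.
hidden_size = 4096       # llama3 8B hidden dimension
num_layers = 32          # llama3 8B transformer layers
bytes_per_value = 2      # float16 activations (assumed)
nodes = 4                # assumed number of workers
link_bps = 1e9           # assumed 1 Gbit/s link between nodes

# Assume each layer exchanges one hidden-state vector per token with each peer.
bytes_per_token = hidden_size * bytes_per_value * num_layers * (nodes - 1)
transfer_s = bytes_per_token * 8 / link_bps
print(f"{bytes_per_token / 1024:.0f} KiB/token -> {transfer_s * 1e3:.2f} ms/token")
```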