-
### 🚀 The feature, motivation and pitch
First surfaced in https://github.com/pytorch/torchchat/pull/1057, the `replace_attention_with_custom_sdpa_attention` function, used when exporting models in …
-
### Motivation
This is **not** an important feature, but I figured I'd mention it because it was a small point of friction that I think could be improved in the future. Currently my script does this:…
-
- [ ] Llama2-7b
- [ ] Llama2-13b
- [ ] Llama2-70b
-
Hi,
I was wondering whether you plan to publicly release the sparsified Llama2 models. In particular, I am interested in Llama2-70B with 50% unstructured sparsity.
Thanks!
egeor updated
11 months ago
-
The UMR home page should be inspired by
- https://paperswithcode.com/
- https://mlcommons.org/about-us/
Content outline (to be refined, @sschafft)
- [x] github link somewhere
- [x] Title: Unified M…
-
### System Info
CPU: x86_64
GPU: RTX 4080 16G
OS: fedora39
Deployment:
- based on the tensorrt_llm/devel:latest container; deployed via Kubernetes (with the CRI-O runtime).
- time-slice deployed by gpu-op…
-
- LangChain v0.3.2
- LangChain AWS v0.2.2
We are using a fine-tuned version of Llama 3.1-instruct, uploaded to Bedrock. Since we are using an ARN model ID (which does not contain any information a…
-
Specifically this data:
`Model-Attribution-in-Machine-Generated-Disinformation/data/filtered_llm/gpt-3.5-turbo/coaid/synthetic-gpt-3.5-turbo_coaid_paraphrase_generation_filtered.csv`
The features …
-
Hi Authors, we noticed that all of the attack code is missing chat templates for the models: things like `USER: {instruction} ASSISTANT:` for Vicuna or `[INST] {} [/INST]` for Llama2, which makes the benchm…
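
A minimal sketch of what adding the missing templates could look like, based only on the two formats quoted above; the helper name and the exact per-model templates are assumptions, not the repository's actual API:

```python
# Hypothetical helper wrapping a raw instruction in the chat template
# a model was trained on. The templates below are the two quoted in
# this issue; real models may expect additional system-prompt markup.
def apply_chat_template(instruction: str, model_family: str) -> str:
    if model_family == "vicuna":
        # Vicuna-style single-turn template.
        return f"USER: {instruction} ASSISTANT:"
    if model_family == "llama2":
        # Llama-2-chat expects [INST] ... [/INST] delimiters.
        return f"[INST] {instruction} [/INST]"
    raise ValueError(f"unknown model family: {model_family}")

print(apply_chat_template("Write a short poem.", "vicuna"))
# USER: Write a short poem. ASSISTANT:
```

Without this wrapping, the benchmark feeds instruction-tuned models raw text in a format they never saw during fine-tuning, which can depress their scores.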
-
Hello! How many GPUs are needed?