Nota-NetsPresso / shortened-llm

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]
63 stars 8 forks source link

Add scripts for Gemma #6

Closed bokyeong1015 closed 6 months ago

bokyeong1015 commented 6 months ago

Description

Changes

To Do (not in this PR)

PPL Results

Gemma-2b | Model (Params) | Use `--add_bos_to_every`
for pruning criterion | Metric | PPL↓
WikiText2 | PPL↓
PTB | |---|:---:|:---:|:---:|:---:| | gemma-2b (2.5B) | n/a | ppl | 649.7 | 2969.3 | | **gemma-2b (2.5B)** | **n/a** | **ppl_bos** | **21.1** | **82.3** | | st-gemma-ppl (2B) | X | ppl | 295.2 | 1292.1 | | st-gemma-ppl (2B) | X | ppl_bos | 35.2 | 186.9 | | st-gemma-ppl (2B) | O | ppl | 524.1 | 3075.6 | | **st-gemma-ppl (2B)** | **O** | **ppl_bos** | **30.2** | **133.6** | | st-gemma-taylor (2B) | X | ppl | 38356.8 | 44494.7 | | st-gemma-taylor (2B) | X | ppl_bos | 83.9 | 321.6 | | st-gemma-taylor (2B) | O | ppl | 547.1 | 3458.0 | | **st-gemma-taylor (2B)** | **O** | **ppl_bos** | **29.3** | **149.0** |
Gemma-7b | Model (Params) | Use `--add_bos_to_every`
for pruning criterion | Metric | PPL↓
WikiText2 | PPL↓
PTB | |---|:---:|:---:|:---:|:---:| | gemma-7b (8.5B) | n/a | ppl | 3119318.2 | 19714697.9 | | **gemma-7b (8.5B)** | **n/a** | **ppl_bos** | **17.2** | **62.8** | | st-gemma-ppl (7.1B) | X | ppl | 8680264.0 | 58857215.9 | | st-gemma-ppl (7.1B) | X | ppl_bos | 43.6 | 173.5 | | st-gemma-ppl (7.1B) | O | ppl | 17128426.3 | 124600727.1 | | **st-gemma-ppl (7.1B)** | **O** | **ppl_bos** | **21.8** | **88.6** | | st-gemma-taylor (7.1B) | X | ppl | 448704455.2 | 5135150558.7 | | st-gemma-taylor (7.1B) | X | ppl_bos | 580.1 | 2549.7 | | st-gemma-taylor (7.1B) | O | ppl | 16862874.7 | 147967024.6 | | **st-gemma-taylor (7.1B)** | **O** | **ppl_bos** | **22.4** | **89.0** |