castorini / rank_llm

Repository for prompt-decoding using LLMs (GPT3.5, GPT4, Vicuna, and Zephyr)
http://rankllm.ai
Apache License 2.0
273 stars 35 forks source link

Adds script for AWQ-quantizing model #101

Open ru5h16h opened 4 months ago

ru5h16h commented 4 months ago

Pull Request Checklist

Reference Issue

ref: https://github.com/castorini/ura-projects/issues/4

Checklist Items

Before submitting your pull request, please review these items:

PR Type

What kind of change does this PR introduce?

ru5h16h commented 3 months ago

Here are the details outlining the insights gathered and other experimental information: https://docs.google.com/document/d/1BHpN9lDVGjtjIAFMxjUxNuu1K4KJOgIaOZXkWF1_K8c/edit