-
LoRAs are distributed on Hugging Face as folders containing two files:
```
$ ls kaiokendev_SuperCOT-7b
adapter_config.json
adapter_model.bin
```
How can such a LoRA be loaded using the new pef…
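A minimal sketch of loading such an adapter folder, assuming `peft` and `transformers` are installed and that the base checkpoint is LLaMA-7B (the snippet does not say which base model the adapter was trained on — the IDs below are illustrative):

```python
def load_lora(base_model_id: str, adapter_dir: str):
    """Attach a LoRA adapter (adapter_config.json + adapter_model.bin)
    to a base causal LM. Model IDs passed in are assumptions, not
    taken from the snippet."""
    # Lazy imports so the sketch can be defined without the libraries loaded.
    from transformers import AutoModelForCausalLM
    from peft import PeftModel

    base = AutoModelForCausalLM.from_pretrained(base_model_id)
    # PeftModel reads adapter_config.json and loads adapter_model.bin.
    return PeftModel.from_pretrained(base, adapter_dir)

# Downloads several GB of weights, so commented out here:
# model = load_lora("huggyllama/llama-7b", "kaiokendev/SuperCOT-7b")
```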
-
- [ ] [Qwen-1.5-8x7B : r/LocalLLaMA](https://www.reddit.com/r/LocalLLaMA/comments/1atw4ud/qwen158x7b/)
# TITLE: Qwen-1.5-8x7B : r/LocalLLaMA
**DESCRIPTION:** "Qwen-1.5-8x7B
New Model
Someone creat…
-
Hi!
Thank you for the paper! It is inspiring that you can compress weights to about 1 bit and the model still works better than random.
A practical sub-2-bit quantization algorithm would be a grea…
-
### Check for existing issues
- [X] Completed
### Describe the bug / provide steps to reproduce it
When I configure OpenAI, I only see a field for the API key but no fields for the URL and port. In my …
-
Hi all !
The model is working great! I am trying to use my 8GB 4060 Ti with MODEL_ID = "TheBloke/vicuna-7B-v1.5-GPTQ"
MODEL_BASENAME = "model.safetensors"
I changed the GPU today, the previous one wa…
-
## Issue Description
Hello GitHub community,
I am currently seeking guidance on how to effectively evaluate the MADLAD 400 model, a 7.2B parameter machine translation (MT) model that has been fi…
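A common starting point for evaluating an MT model like MADLAD-400 is corpus BLEU via an established scorer such as `sacrebleu`. As a self-contained illustration of the metric itself (a toy sentence-level BLEU with uniform n-gram weights and a brevity penalty — not a substitute for a real scorer):

```python
import math
from collections import Counter

def toy_bleu(hyp: str, ref: str, max_n: int = 4) -> float:
    """Toy sentence-level BLEU: geometric mean of 1..max_n n-gram
    precisions, times a brevity penalty. For real evaluation use
    sacrebleu, which also handles tokenization and corpus statistics."""
    hyp_t, ref_t = hyp.split(), ref.split()
    precisions = []
    for n in range(1, max_n + 1):
        h = Counter(tuple(hyp_t[i:i + n]) for i in range(len(hyp_t) - n + 1))
        r = Counter(tuple(ref_t[i:i + n]) for i in range(len(ref_t) - n + 1))
        overlap = sum((h & r).values())       # clipped n-gram matches
        precisions.append(overlap / max(sum(h.values()), 1))
    if min(precisions) == 0:
        return 0.0
    # Brevity penalty: punish hypotheses shorter than the reference.
    bp = 1.0 if len(hyp_t) > len(ref_t) else math.exp(
        1 - len(ref_t) / max(len(hyp_t), 1))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)
```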
-
DRY is a modern repetition penalty that is superior to the standard frequency and presence penalties at preventing repetition, while having virtually none of their negative effects on language quality…
p-e-w updated 3 weeks ago
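A toy sketch of the penalty schedule DRY applies: once a token would extend a repeated sequence longer than an allowed length, the penalty grows exponentially with the match length. Parameter names follow p-e-w's proposal (`multiplier`, `base`, `allowed_length`); this illustrates the idea only and is not the actual implementation:

```python
def dry_penalty(match_len: int, multiplier: float = 0.8,
                base: float = 1.75, allowed_length: int = 2) -> float:
    """Logit penalty for a token that would extend a repetition of
    `match_len` context tokens. Short matches are free; beyond
    `allowed_length` the penalty grows as base**(match_len - allowed_length)."""
    if match_len < allowed_length:
        return 0.0
    return multiplier * base ** (match_len - allowed_length)
```

Because the penalty targets tokens that would *continue* a verbatim repeat, it leaves ordinary word reuse alone — which is why it avoids the quality degradation of blanket frequency/presence penalties.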
-
Have you considered using LangChain plus some truly open-source model?
There's also a JS port of LangChain + Alpaca: https://github.com/linonetwo/langchain-alpaca
Can't recommend any use-case specif…
-
Hi,
Thanks to the great work of the authors of AWQ, maintainers at [TGI](https://github.com/huggingface/text-generation-inference), and the open-source community, AWQ is now supported in TGI ([link…
-
This seems like it'll be the most important task to make this more viable for people.
Alternative models will be cheaper, potentially much faster, allow running on someone's own hardware (LLaMa), a…