guanaco Search Results - Githubissues

484 results
for guanaco

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

artidoro/qlora #205

How to pretrain "raw" text?

Hi! I would like to use QLora to "pretrain" a model and wanted to ask if that is possible, in the release time of qlora I've heard something about a 'raw' mode not existing right now For example, l…

SinanAkkoyun updated 1 year ago
4
artidoro/qlora #262

uneven distribution of GPU workload

Hello, Thanks so much for providing such resource so that we can all leverage the latest development of AI on different platforms. I was able to use your example to run a job to fine tune Llama-…

liatamax updated 1 year ago
1
srush/llama2.rs #23

Tensor has shape torch.Size([448, 1024]) ... this looks inco…

Thank you for building this - very interested in trying it. In my hands, when I try to export the model to a .bin I get the following error - is this something simple / user error? (MacOS Ventura …

timfpark updated 11 months ago
9
PromtEngineer/localGPT #251

NameError: name 'autogptq_cuda_256' is not defined

It seems like AutoGPTQ quantization module not being able to access the CUDA extension. The previous ingest problem is solve by pip install git+https://github.com/Keith-Hon/bitsandbytes-windows.g…

CalendulaED updated 1 year ago
10
gururise/AlpacaDataCleaned #8

Any chance we could improve the dataset beyond fixing?

Would that be relevant in the scope of this project? Like adding a couple sorts of task examples could improve its generalized capabilities, for instance: Longer responses GPT-4 Generated Response…

teknium1 updated 1 year ago
41
LAION-AI/Open-Assistant #3144

Curate SFT-9 dataset mixes

Iterate on the SFT-8 dataset mixes to create pretraining and final SFT mixes for SFT-9. This requires investigating the quality and usefulness of the datasets. Community input welcome below. See the `…

olliestanley updated 1 year ago
10
bigcode-project/octopack #17

Reproducing the OctoCoder model

Hello, I have a few questions about OctoCoder. For this part in the paper: > For instruction tuning our models, we select 5,000 random samples from COMMITPACKFT across the 6 programming languages…

mstallone updated 10 months ago
12
LostRuins/koboldcpp #213

AutoGenerate Memory only saves "Certainly!" in memory even t…

# Expected Behavior After clicking "AutoGenerate Memory", the full summary should be saved in memory # Current Behavior After clicking "AutoGenerate Memory", only "[Summary: Certainly!]" is s…

Kaiten10 updated 1 year ago
6
haotian-liu/LLaVA #744

[Question] size mismatch for mm_projector

### Question when i run cmd below, got size mismatch. CUDA_VISIBLE_DEVICES=1 python -m llava.serve.cli --model-path liuhaotian/llava-v1.5-13b-lora --model-base liuhaotian/**vicuna-13b-v1.5** --image…

TonyUSTC updated 6 months ago
9
rmusser01/tldw #29

Improvement: Add Evaluation tests for LLMs

This issue is now to track the implementation of various evaluation methods and workflows for LLMs. Evaluations: - [x] G-Eval - [ ] PingPong - [ ] InfiniteBench - [ ] Ruler - [ ] MMLU - [ ] M…

rmusser01 updated 2 weeks ago
2

上一页 1...6 7 8 9 10 11 12...49 下一页

484 results for guanaco

484 results
for guanaco