-
I used 80 tasks from the file task_info_selected.csv in [this repository](https://github.com/ekinakyurek/marc/blob/main/task_info_selected.csv), and fine-tuned Meta-Llama-3-8B-Instruct using the train…
-
prepare_buckets_latents.py does not seem to work with Flux. Is there any way to generate this file for a full Flux fine-tune? Thanks
-
This might be a silly question, but when using the Llama3.1 base model I can effortlessly pass in tools when running it in Ollama.
```
response = ollama.chat(
model='llama3.1'…
-
**Describe**
I found that after fine-tuning with LoRA, token throughput is significantly reduced. I trained a model on unit test generation and then fused the LoRA adapter.
For my test dat…
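For context, "fusing" a LoRA adapter folds the low-rank update into the base weight, so a fused model's forward pass should cost the same as the base model's. A minimal numpy sketch of the merge (shapes and values are illustrative, not taken from this issue):

```
import numpy as np

# LoRA merge: W_merged = W + (alpha / r) * B @ A
d_out, d_in, r, alpha = 6, 4, 2, 4
rng = np.random.default_rng(0)
W = rng.normal(size=(d_out, d_in))   # frozen base weight
A = rng.normal(size=(r, d_in))       # LoRA "down" projection
B = np.zeros((d_out, r))             # LoRA "up" projection (initialized to 0)
B[:, 0] = 1.0                        # stand-in for trained values

W_merged = W + (alpha / r) * (B @ A)

# After fusing, one matmul replaces the base matmul plus the two adapter
# matmuls, so throughput should match the base model; the merged weight
# produces the same outputs as base-plus-adapter.
x = rng.normal(size=(d_in,))
assert np.allclose(W_merged @ x, W @ x + (alpha / r) * (B @ (A @ x)))
```

If throughput still drops after fusing, the serving path (e.g. the adapter still being applied at runtime) is worth inspecting.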
-
LoRA fine-tuning worked normally, but the following issue arose during full fine-tuning.
I use the following script for full fine-tuning:
```shell
#!/bin/bash
N…
-
Thank you for the great work!
I have a few questions about PEFT; I hope you can answer them. Thank you!
1. Which model is best to fine-tune from? A pre-trained model (llama2-7b) or a supervise…
zsxzs updated 1 month ago
-
Hi,
I am trying to fine-tune a Llama model with a large context size, and I found that to efficiently shard activations across multiple GPUs, I need to use Torchtitan. Here are some questions relat…
-
Hi everyone! First of all, thank you for the amazing work on sktime—it's an incredibly useful library.
I have a question regarding the ForecastGridSearch implementation. Specifically, I'm unsure fr…
-
So when I fine-tuned Llama 3, my configuration file looked like:
```
# Tokenizer
tokenizer:
_component_: torchtune.models.llama3.llama3_tokenizer
path: ~/meta-llama/Meta-Llama-3-8B-In…
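```

The config above is truncated. For context, torchtune configs of this kind typically continue with `model`, `dataset`, and `optimizer` sections; the fragment below is an illustrative sketch modeled on torchtune's stock Llama 3 recipes, not the poster's actual file (component paths and values are assumptions):

```
# Model
model:
  _component_: torchtune.models.llama3.llama3_8b

# Dataset and optimizer (values are placeholders)
dataset:
  _component_: torchtune.datasets.alpaca_dataset
optimizer:
  _component_: torch.optim.AdamW
  lr: 2e-5
```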
-
See section 4.3.1. There could be more than one instantiation of such a model.