-
Hi, could you please provide additional instructions on how to fine-tune a Qwen model with BigDL speed-up?
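For context, this is roughly what I have so far, assuming BigDL-LLM's transformers-style loading and its QLoRA helpers (the model id, low-bit format, and LoRA hyperparameters below are placeholders, not values from any official example):

```python
# Sketch: QLoRA fine-tuning of a Qwen model with BigDL-LLM (assumed API).
from bigdl.llm.transformers import AutoModelForCausalLM  # BigDL drop-in for HF transformers
from bigdl.llm.transformers.qlora import get_peft_model, prepare_model_for_kbit_training
from peft import LoraConfig
from transformers import AutoTokenizer

model_path = "Qwen/Qwen-7B-Chat"  # placeholder model id

# Load the base model in a 4-bit low-bit format for memory savings.
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    load_in_low_bit="nf4",
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Attach LoRA adapters so only a small set of weights is trained.
model = prepare_model_for_kbit_training(model)
config = LoraConfig(
    r=8,
    lora_alpha=32,
    target_modules=["c_attn"],  # placeholder; depends on the Qwen variant
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
# ...then run a standard transformers Trainer loop on the wrapped model.
```

Is this the intended workflow, or is there a recommended recipe?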
-
Hi,
I tried to perform full fine-tuning (FFT) using the notebook `Continued pretraining - Korean + Unsloth.ipynb`.
However, with unsloth/Llama-3.2-1B-bnb-4bit, the model hallucinates after instruction fine-tuning…
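For reference, this is roughly how the notebook loads the model and attaches LoRA adapters (reconstructed from memory, so the exact hyperparameters may differ from the original):

```python
# Sketch of the Unsloth setup from the notebook (hyperparameters are from memory).
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-1B-bnb-4bit",
    max_seq_length=2048,  # assumed value
    load_in_4bit=True,
)

# LoRA adapters used for both the continued-pretraining and instruction stages.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    lora_alpha=16,
)

# Switch to inference mode before generating test outputs.
FastLanguageModel.for_inference(model)
```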
-
If I followed the fine-tuning instructions and added a `query_instruction_for_retrieval`, should I use the same instruction, a different one, or a blank one (`""`) for the document ingestion part?
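To make the question concrete, here is a sketch of the asymmetric setup with FlagEmbedding (the model id and instruction string are just examples; the open question is what, if anything, to pass on the document side):

```python
# Sketch: query vs. document encoding with FlagEmbedding (example model id).
from FlagEmbedding import FlagModel

model = FlagModel(
    "BAAI/bge-large-en-v1.5",
    query_instruction_for_retrieval="Represent this sentence for searching relevant passages:",
)

# Queries get the retrieval instruction prepended automatically.
q_emb = model.encode_queries(["how should ingestion-side instructions be set?"])

# Documents are encoded here without any instruction -- should they instead
# use the same instruction, a different one, or an explicit ""?
d_emb = model.encode(["Passage text to be ingested into the vector store ..."])
```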
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-6.5.0-35-generic-x86_64-with-glibc2.35
- P…
-
# URL
- https://arxiv.org/pdf/2204.07705
# Affiliations
- Yizhong Wang, N/A
- Swaroop Mishra, N/A
- Pegah Alipoormolabashi, N/A
- Yeganeh Kordi, N/A
- Amirreza Mirzaei, N/A
- Anjana Arunku…
-
I have a base model: **model_0**.
I created a LoRA corresponding to instruction tuning: **lora_1**.
Then we merged **model_0** + **lora_1** to create **model_1**.
Then we created a LoRA base…
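For clarity, the merge step we used looks roughly like this (paths are placeholders; this is a sketch using the standard PEFT merge API):

```python
# Sketch: merging lora_1 into model_0 to obtain model_1 (placeholder paths).
from peft import PeftModel
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("path/to/model_0")
model = PeftModel.from_pretrained(base, "path/to/lora_1")

# merge_and_unload() folds the adapter weights into the base weights,
# returning a plain model with no adapter attached.
model_1 = model.merge_and_unload()
model_1.save_pretrained("path/to/model_1")
```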
-
**Overview (aka Goal Summary)**
Implement better management for artifacts (which include data and models), both those we provide and those users will create (fine-tuned models, data generated after SDG)…
-
Dear authors,
Hello! I have a question regarding the two-stage fine-tuning process described in your work. Could you kindly help me understand how the two stages are connected during training? Specif…
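To clarify what I mean by "connected", this is what I currently assume happens between the stages (a sketch; the paths are hypothetical):

```python
# Sketch of my assumption about how the stages connect (hypothetical paths):
# stage 2 initializes from the stage-1 checkpoint instead of the base model.
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("base-model")  # stage-1 init
# ... stage-1 training runs; checkpoint saved to "checkpoints/stage1" ...

stage2 = AutoModelForCausalLM.from_pretrained("checkpoints/stage1")  # stage-2 init
# ... stage-2 training continues from these weights ...
```

Is this correct, or do the two stages interact in some other way?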
-
Hi, I am an engineer from Intel, and I work mostly on the performance optimization of PyTorch on Intel Xeon CPUs (I am also the PyTorch module maintainer for CPU performance). I just came across this…
-
Could you provide a Slurm script to run the fine-tuning code? Apparently there are some issues with DeepSpeed when just following the provided instructions.
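For reference, this is the kind of script I am after (a sketch only; the resource counts, entry-point script, and DeepSpeed config name are placeholders for whatever the repo actually uses):

```bash
#!/bin/bash
#SBATCH --job-name=finetune
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=1
#SBATCH --gres=gpu:8
#SBATCH --cpus-per-task=32
#SBATCH --time=24:00:00
#SBATCH --output=%x-%j.out

# Single-node launch via torchrun; train.py and ds_config.json stand in for
# the repo's real entry point and DeepSpeed config. Multi-node runs would
# also need MASTER_ADDR/MASTER_PORT and --nnodes/--node_rank wiring.
srun torchrun --nproc_per_node=8 train.py --deepspeed ds_config.json
```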