vocabulary-trainer Search Results

997 results
for vocabulary-trainer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

datawhalechina/self-llm #124

peft微调llama3 8b，从第10补开始loss一直都是0

问题描述：使用peft微调llama3 8b，训练代码基本是按照样例稍作修改，在训练的时候前10个steps，loss稍高，后面输出的loss，一直都是0.0了微调代码： ```python import torch from datasets import Dataset import pandas as pd from transformers impo…

ykallan updated 3 months ago
6
numfocus/YouTubeVideoTimestamps #146

Transformers from the Ground Up - Sebastian Raschka | PyData…

0:00 - Introduction 0:42 - Sponsors & Contact information 1:31 - Transformers from the ground up 2:37 - Examples for transformers 4:48 - Outline 6:29 - Disclaimer 7:11 - Augmenting RNNs with at…

9x updated 9 months ago
1
Aidenzich/road-to-master #41

2024-03 Latest Health LLM

- [LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day](https://arxiv.org/abs/2306.00890) - [MEDITRON-70B: Scaling Medical Pretraining for Large Language Models](http…

Aidenzich updated 6 months ago
2
bigscience-workshop/t-zero #20

how to use this t-zero to train my own dataset? did not find…

flyingwaters updated 2 years ago
4
waityousea/xuniren #15

ubuntu下执行 python fay_connect.py 报错

用的云服务器 /root/miniconda3/lib/python3.8/site-packages/pydub/utils.py:170: RuntimeWarning: Couldn't find ffmpeg or avconv - defaulting to ffmpeg, but may not work warn("Couldn't find ffmpeg or avconv…

gemini0524 updated 7 months ago
3
microsoft/unilm #1429

[Kosmos-2] Fine-tune your checkpoint model on my downstream …

Hello everyone, thank you very much for your contribution. I appreciate the effort and consistency in uploading the code for such many models and maintaining this repository. I saw Kosmos-2 and I q…

basteran updated 6 months ago
22
unslothai/unsloth #434

I used a 2060 graphics card and reported an error "Feature '…

==((====))== Unsloth: Fast Llama patching release 2024.4 \\ /| GPU: NVIDIA GeForce RTX 2060 SUPER. Max memory: 7.785 GB. Platform = Linux. O^O/ \_/ \ Pytorch: 2.3.0. CUDA = 7.5. CUDA Too…

yangcecode updated 4 months ago
2
jprante/elasticsearch-analysis-decompound #6

Decompound adds letters

Hi, I just got stuck with some "FetchPhaseExecutionException" when using the highlighting and the decomp filter: InvalidTokenOffsetsException[Token verzinnte exceeds length of provided text sized 83…

marbleman updated 9 years ago
5
yangjianxin1/Firefly #32

torch.distributed.elastic.multiprocessing.errors.ChildFailed…

CUDA SETUP: Highest compute capability among GPUs detected: 8.0 CUDA SETUP: Detected CUDA version 116 CUDA SETUP: Loading binary /opt/conda/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_…

zxy333666 updated 1 year ago
3
jack-and-rozz/vocabulary_adaptation #2

the file whose Directory address is 'tools/llm/llm.py'

Thanks for your great work! I have some questions about the code which you uploaded on github. - where is the file whose Directory address is 'tools/llm/llm.py' ? - I also wonder whether t…

Thovenfish updated 2 years ago
6

上一页 1...7 8 9 10 11 12 13...100 下一页

997 results for vocabulary-trainer

997 results
for vocabulary-trainer