homebrewltd / ichigo

Llama3.1 learns to Listen
150 stars 5 forks source link

epic: llama3-s v0.3: "I cannot hear / understand you" #56

Open 0xSage opened 1 week ago

0xSage commented 1 week ago

Goal

Make v0.3 multilingual, accept longer questions, and other data improvements.

Problem

Methodology

To solve the above mentioned issues, this run is focused on data improvements

Pipeline improvements:

Data Resources

Training Resources

Results

## Eval - Perf: MMLU (instruction), some ASR (transcription), human hieuristics - Hardware: ## Challenges & Learnings - # Tasklist - [x] #60 - [x] #59 - [ ] #38
0xSage commented 1 week ago

From Bach:

Phase 1: Pre-training

Data source: https://github.com/homebrewltd/llama3-s/issues/53

@bachvudinh mind updating results from phase 1 here when you have it? Thanks!

tikikun commented 1 week ago

@0xSage added centralized data source for the epic

bachvudinh commented 1 week ago

Results

Screenshot from 2024-09-09 19-36-32

tikikun commented 1 week ago

Latest run result is not good, we tried to finetune using very low r to avoid degradation, it still happens.

r=4 alpha=4 LR ~ 8e-6 image