issues
search
YuanGongND
/
ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
334
stars
26
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Stage training sh scripts for low resource
#42
yangdongdong2000
opened
4 days ago
1
where to download whisper model?
#41
yangdongdong2000
opened
5 days ago
2
Error when run finetune_toy_low_resource.sh
#40
blue-blue272
opened
6 days ago
0
About the audio-text pair of AudioSet dataset.
#39
blue-blue272
opened
1 week ago
1
Question: Half Float Inference?
#38
IanZ2020
opened
1 week ago
1
train_scripts
#37
yangdongdong2000
opened
1 week ago
3
Modifications to the llama model
#36
peggyxpxu
opened
1 month ago
1
Question: LLaMA-7B LLM
#35
peggyxpxu
opened
1 month ago
2
Question:Why are the prompts for training and inference for audio event classification are different?
#34
peggyxpxu
opened
2 months ago
2
OpenAQA Dataset's audio files
#33
CleyLyChen
opened
2 months ago
2
question on cutoff_len
#32
BenoitWang
opened
2 months ago
1
Eval code error
#31
peggyxpxu
opened
2 months ago
4
LICENSE of AQA datasets and checkpoints
#30
joemzhao
opened
2 months ago
0
Eval_metrics
#29
joemzhao
closed
2 months ago
4
Question about vicuna version
#28
CleyLyChen
closed
3 months ago
2
Question about Finetune exp
#27
ErikIsMel
opened
3 months ago
4
Issue with Loading 13B Model: Size Mismatch Error
#26
EnisBerk
opened
3 months ago
4
Batch Inference Support
#25
EnisBerk
closed
4 months ago
0
Maximux Length for LTU-AS Audio Input
#24
dingdongwang
opened
4 months ago
1
CPU local inference is not working.
#23
vivekupadhyay1
opened
4 months ago
0
Question about model loading in inference
#22
dingdongwang
opened
4 months ago
2
Issue while loading openaqa_5.6M.json
#21
sonalkum
opened
4 months ago
4
Running Issue about Low-Resource Training for LTU-AS
#20
dingdongwang
opened
5 months ago
8
Question about Multi-GPU Training
#19
dingdongwang
opened
5 months ago
1
Question about LTU-AS base model
#18
dingdongwang
opened
5 months ago
1
Question about LTU-AS Downstream Tasks
#17
dingdongwang
opened
5 months ago
3
LTU_AS ASR Task
#16
dingdongwang
opened
5 months ago
4
extract_whisper_feature.py
#15
dingdongwang
opened
5 months ago
3
Missing Checkpoints
#14
Sreyan88
closed
5 months ago
8
Missing Tokenize Audio Info during Fine-tuning/Training
#13
dingdongwang
opened
5 months ago
1
How to process audio that exceeds 10 seconds in length
#12
qisawO3
opened
5 months ago
5
no evaluation script for open-set problem
#11
alexaway
opened
5 months ago
1
whisper-at on cuda:1
#10
alexanderwerning
opened
5 months ago
1
Model Parallelization
#9
BhashaBluff
opened
6 months ago
5
Requirements for more pretrained weights
#8
Ming-er
closed
5 months ago
5
vicuna_ltu model file missing
#7
zengxijuan
opened
6 months ago
1
Which model is 7B (Default) and which is 13B (Beta)?
#6
yl4579
opened
6 months ago
12
Question about the Realism of Simulated Acoustic Event Combinations in Data Generation
#5
haoxiangsnr
opened
6 months ago
2
About the experimental results of the paper LTU-AS
#4
yangyuxiang1996
opened
8 months ago
1
OpenAQA Dataset Access
#3
MBAnslow
opened
9 months ago
2
API access to Gradio demos broken?
#2
jpgard
closed
10 months ago
9
Questions about data construction
#1
zengxijuan
opened
11 months ago
1