QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Other
1.49k stars
107 forks
issues
How to perform the AudioCaption task
#77
gaochangfeng
opened
2 weeks ago
0
Compute Requirements and Execution Time
#76
Sedherthe
opened
1 month ago
1
Are there plans to integrate a TTS module?
#74
CD678
opened
2 months ago
0
Error compiling tokenizers v0.13.3 with rustc 1.80.1
#72
martinzh717
opened
3 months ago
0
Get token in predict ?
#71
CungNguyenHuy
closed
3 months ago
0
CUDA version error
#70
ZHUHF123
opened
4 months ago
0
The WeChat group is full
#69
qgzang
opened
4 months ago
6
Problems for speech translation tasks
#68
ShoutaoGuo
opened
4 months ago
0
Update TUTORIAL.md
#67
pranit-gandhi
opened
4 months ago
0
Input multiple audio file to audio encoder
#66
DevKiHyun
opened
4 months ago
0
Clarification | Datasets used for training.
#65
Iosifts
opened
4 months ago
2
Evaluation script for VSC task seems not correct
#64
mlxu995
opened
4 months ago
0
Is it possible to obtain the hidden representations?
#63
Kristopher-Chen
opened
4 months ago
1
Discussion of questions related to qwen-audio and lauragpt
#62
wwfcnu
opened
5 months ago
0
About the distribution of different languages in the training data
#61
shihuai
opened
5 months ago
0
How much compute is needed for local deployment?
#60
Gpwner
opened
5 months ago
0
The WeChat group is full
#59
adeamoy
opened
6 months ago
1
Chat model: same text question, different audio, but ASR returns the same result every time
#58
LiXuanming
opened
6 months ago
0
Is there a quantized version?
#57
whk6688
opened
6 months ago
1
Is there a model in ONNX format?
#56
whk6688
opened
6 months ago
2
how can i chat in demo
#55
lzl-mt
opened
7 months ago
0
fix: fixed a typo in README.md
#54
m2moiz
opened
7 months ago
0
Are Chinese prompts not supported?
#53
GioGioBond
opened
7 months ago
1
Is API deployment with VLLM or similar supported?
#52
su-zelong
opened
7 months ago
1
Questions about training hyperparameters
#51
jwang1993
opened
7 months ago
1
The WeChat group is full
#49
zhangfan-algo
opened
7 months ago
0
allow_pickle=False
#48
Leejl0011
opened
8 months ago
2
Are local API calls supported?
#47
dfengpo
opened
8 months ago
0
What training speed did Alibaba officially achieve for Qwen-audio?
#46
luboxu
opened
8 months ago
1
About Cantonese support
#45
lq0104
opened
8 months ago
1
Few-shot Examples
#44
aqibsaeed
closed
8 months ago
0
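Running inference with the multi-task eval scripts under the eval_audio directory, the model's performance degrades quickly with batch decoding. Is there a problem with how the attention mask or tokenizer padding was handled during training?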
#43
yangjiabupt
opened
8 months ago
0
Error: requests.exceptions.HTTPError: Response details: 404 page not found, Request id: ab8a478639c847c6bbb41438e4d8606e
#42
wukongbuku
closed
8 months ago
0
Are you sure the provided local model has no problems?
#41
wukongbuku
closed
8 months ago
0
May I ask about the plan for releasing the fine-tuning code? When is it expected to be open-sourced? Thank you very much!
#40
icemoon-creative
opened
8 months ago
2
Fix README.md typo
#39
tianyu-z
opened
9 months ago
0
qwen-audio fine-tuning
#38
wjfwjfwjf
closed
9 months ago
2
Update README.md
#37
huangxu1991
opened
9 months ago
0
wechat full
#36
lixf071213
opened
9 months ago
4
End of sentence id
#35
marcoyang1998
opened
9 months ago
0
Why does qwen-audio output only the text of the first 20 seconds when processing long audio (about five minutes)?
#34
Wolverhampton0
opened
10 months ago
9
use of whisper audio encoder
#33
x75
opened
10 months ago
4
How should the prompt be written to get the information for a single task or for the desired task?
#32
wjyfelicity
closed
8 months ago
2
Question about Output Instruction
#31
jwang1993
opened
10 months ago
1
Would you consider adding support for whisper.cpp?
#30
dyt06
opened
10 months ago
0
Question about the training data
#29
qy-NJU
opened
10 months ago
0
Question about gradio: I have deployed it locally and want to use it on my phone, but it says the microphone cannot be found
#28
cl886699
opened
10 months ago
1
Tokenizer vocab size mismatch model vocab size
#27
yangjiabupt
opened
10 months ago
0
Reproduced experimental results differ from the reported ones
#26
roydcai
opened
10 months ago
2
The number of people in the WeChat group is full. Can you update the WeChat group QR code?
#25
rookie0607
opened
10 months ago
1