THUDM AgentTuning issues

THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

https://thudm.github.io/AgentTuning/

1.36k stars 95 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Clarification on SFT required

#65 emrecanacikgoz opened 2 days ago
0
Does the docker contain the training dataset?

#64 zhiyuanc2001 closed 1 month ago
0
About AgentInstruct dataset

#63 listentomi opened 3 months ago
0
How do you select the prompt for AlfWorld?

#62 HCHCXY opened 3 months ago
0
是否用TGI封装过的模型都可以进行测试？

#61 YinSonglin1997 opened 4 months ago
0
Can you open source the unfiltered dataset

#59 whi497 opened 4 months ago
0
训练数据中指令与模型行为不匹配

#58 haichao592 opened 6 months ago
0
本地模型

#57 lz2021211161 closed 7 months ago
0
请问哪里可以找到工作里对于数据库方面的训练数据

#56 Mucalinda2436 closed 8 months ago
1
魔塔上的 AgentInstruct 数据集的 conversation 都是空值

#55 XianglongTan opened 8 months ago
0
weight decay确定是0.1吗？

#54 Fu-Dayuan closed 8 months ago
1
貌似hotpotqa测试脚本跑不起来？

#53 Fu-Dayuan opened 9 months ago
1
训练数据是如何采样的？

#52 Fu-Dayuan closed 9 months ago
3
if it is possible to conduct RLHF from env

#51 SHITIANYU-hue opened 9 months ago
1
Can you point to the ShareGPT filtered/cleaned data used?

#50 harshraj172 closed 9 months ago
1
Can I run AgentInstruct data on the AgentBench?

#49 harshraj172 opened 10 months ago
1
可以给个简单点的工具调用示例吗

#48 qq594495953 opened 10 months ago
1
期待用 Qwen72B 训练的模型。

#47 milomoon closed 10 months ago
1
基于fastchat部署，推理异常

#46 ruifengma opened 11 months ago
3
请问下agentlm-7b最少需要多少显存可以推理

#45 nicolasNi closed 11 months ago
5
关于TRAJECTORY FILTERING问题

#44 QingChengLineOne closed 11 months ago
3
Finetuning with Mistral or Yi?

#43 jFkd1 closed 11 months ago
1
除了用docker运行，还有其他方式可以运行AgentLM吗？

#42 caizhuoyue77 closed 11 months ago
6
通用数据如何筛选

#41 LuoKaiGSW opened 12 months ago
7
Dataset details 中找不到reward的计算方式

#40 DryPilgrim closed 12 months ago
5
AgentTuning 7b evaluate in HH， not expect as paper result

#39 Dhaizei opened 12 months ago
13
关于dataset statics 和 download

#38 DryPilgrim closed 12 months ago
3
关于数据集

#37 DryPilgrim closed 12 months ago
0
Number of training steps

#36 Mayer123 closed 12 months ago
1
微调显存

#35 Reason-Wang closed 12 months ago
1
agent tuning和toolbench的区别

#34 Connor-Shen closed 12 months ago
1
Start TGI worker

#33 mayilin0714 closed 1 year ago
1
关于reward

#32 DryPilgrim closed 12 months ago
2
requests.exceptions.MissingSchema: Invalid URL '127.0.0.123332/generate': No scheme supplied. Perhaps you meant https://127.0.0.123332/generate?

#31 mayilin0714 closed 1 year ago
1
请教reward分数的各种情况

#30 DryPilgrim closed 1 year ago
1
Inference with `vllm`

#29 yc1999 closed 1 year ago
1
Updated Contributors Section

#28 mohitd404 opened 1 year ago
0
Adding Contributors Section in readme.md file.

#27 mohitd404 closed 11 months ago
0
Fix typos

#26 HKABIG closed 1 year ago
1
Create CONTRIBUTING.md

#25 0Armaan025 opened 1 year ago
0
Add license

#24 dalvishruti14 closed 1 year ago
0
Updated README with Badges

#23 Killer2OP closed 7 months ago
1
Create LICENSE

#22 Killer2OP closed 7 months ago
1
Auto comment

#21 shraddha761 closed 1 year ago
1
Update README.md

#20 shraddha761 closed 1 year ago
1
Grammer mistake in readme

#19 shraddha761 closed 1 year ago
1
什么时候上魔塔社区

#18 QingChengLineOne closed 1 year ago
1
论文中Table 2中的数字的含义和计算方式

#17 DryPilgrim closed 1 year ago
2
论文中的问题

#16 QingChengLineOne closed 1 year ago
1
Updated language of README.md file

#15 rohan37kumar closed 1 year ago
0