THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs
https://thudm.github.io/AgentTuning/
1.36k stars 95 forks source link

关于dataset statics 和 download #38

Closed DryPilgrim closed 12 months ago

DryPilgrim commented 12 months ago
  1. Filt. Traj. 就是huggingface上数据的行数,那# Inst. 和 Avg # Filt.Traj. Turns是怎么计算的呀?举例来说,下面这个例子的# Inst. 和 Avg # Filt.Traj. Turns要怎么算呢?

    
    [
    {
        "from": "human",
        "loss": null,
        "value": "You are web shopping.\xxx"
    },
    {
        "from": "gpt",
        "loss": false,
        "value": "Ok."
    },
    {
        "from": "human",
        "loss": null,
        "value": "WebShop [SEP] Instruction: [SEP] i'm looking for a mid century coffee table for my living room [SEP] Search"
    },
    {
        "from": "gpt",
        "loss": true,
        "value": "Thought:\nI think I should use the search bar to look for a mid-century coffee table for my living room.\n\nAction:\nsearch[mid-century coffee table]"
    },
    {
        "from": "human",
        "loss": null,
        "value": "Instruction: [SEP] i'm looking for a mid century coffee table for my living room [SEP] Back to Search [SEP] Page 1 (Total results: 50) [SEP] Next > [SEP] B09FDCPKRR [SEP] Home Square 2 Piece Living Room Set with Coffee Table and End Table in Walnut [SEP] $757.76 [SEP] B07XQ712TW [SEP] IWELL Mid Century Coffee Table for Living Room, Modern Cocktail Table with Storage Shelf, Couch Table, Sofa Table for Office, Easy to Assemble, Dark Walnut [SEP] $100.0 [SEP] B01DBKYOV6 [SEP] Baxton Studio 424-6898-AMZ Coffee Table, Brown [SEP] $239.99 [SEP] B08WRSWF5X [SEP] Modway Render Mid-Century Modern Coffee Table in Walnut [SEP] $272.12 [SEP] B09333CV7P [SEP] Modway Envision 70\" Mid-Century Modern Low Profile Entertainment TV Stand in Walnut White & Transmit Mid-Century Coffee Table, Walnut White [SEP] $453.5 [SEP] B078KKB8Z9 [SEP] Modway Transmit Mid-Century Coffee Table, Walnut White [SEP] $199.5 [SEP] B077Y987TL [SEP] Furniture of America Regols Rego Mid-Century Coffee Table with Glass Top, Gray [SEP] $100.0 [SEP] B06W52GBKW [SEP] Archian Walnut Wood High Coffee Table [SEP] $88.79 [SEP] B07NKG7DF9 [SEP] Modway Origin 47\" Mid-Century Modern Wood Coffee Table In Walnut White [SEP] $218.34 [SEP] B01HQGZEO2 [SEP] Baxton Studio Pierce Mid-Century Modern Coffee Table Walnut Brown/Contemporary [SEP] $105.53"
    },
    {
        "from": "gpt",
        "loss": true,
        "value": "Thought:\nI think I should click on the 'IWELL Mid Century Coffee Table for Living Room' option as it fits my requirements and has a price of $100.0.\n\nAction:\nclick[B07XQ712TW]"
    },
    {
        "from": "human",
        "loss": null,
        "value": "Instruction: [SEP] i'm looking for a mid century coffee table for my living room [SEP] Back to Search [SEP] < Prev [SEP] IWELL Mid Century Coffee Table for Living Room, Modern Cocktail Table with Storage Shelf, Couch Table, Sofa Table for Office, Easy to Assemble, Dark Walnut [SEP] Price: $100.0 [SEP] Rating: N.A. [SEP] Description [SEP] Features [SEP] Reviews [SEP] Buy Now"
    },
    {
        "from": "gpt",
        "loss": true,
        "value": "Thought:\nI think I should click on the 'Buy Now' button to purchase the IWELL Mid Century Coffee Table.\n\nAction:\nclick[Buy Now]"
    }
    ]```
  2. huggingface不让访问,有推荐的下载方式吗?

lr-tsinghua11 commented 12 months ago
  1. # Inst. 代表该任务下指令数据的总数,例如 webshop 共 1485 条,这部分数据是 webshop 仓库训练集的子集,不在 AgentInstruct 中;Avg # Filt.Traj. Turns 代表 AgentInstruct 对于某个特定任务的平均交互轮数,一问一答算一轮,比如上面那个例子的 turns 记为 4,例如,filter 后 webshop 任务筛选率 23.6%,共 351 条数据,平均轮数为 3.68;
  2. 可以参考国内镜像 https://www.wisemodel.cn/datasets/%E6%99%BA%E8%B0%B1AI/AgentInstruct
DryPilgrim commented 12 months ago

”# Inst. 代表该任务下指令数据的总数“ 意思是human的说话次数吗?比如上面那个例子中指令数就为4(human出现了四次)

DryPilgrim commented 12 months ago

奥,不对,应该是有”Instruction:“字样的才算指令