Question about the graph

We construct the graph based on the dependencies between tasks rather than from ground-truth trajectories. Specifically, if the output of task A can serve as an input for it, we add a directed link from task A to task B.

Example: In HuggingFace, the output of Pose Detection is a textual description of a pose, which can be used as input for Pose-to-Image. Similarly, the output of Summarization can serve as input for Translation. And the code for constructing this dependency-guided graph is available in TaskBench's repository (https://github.com/microsoft/JARVIS/blob/main/taskbench/generate_graph.py, Lines 13-17).

Here are the specific details regarding the task graph construction for each dataset:

HuggingFace and Multimedia (TaskBench) We strictly adhere to the mentioned dependencies.
DailyLife (TaskBench) This features a fully connected graph, with the implementation outlined in https://github.com/microsoft/JARVIS/blob/main/taskbench/generate_graph.py, Lines 37-40.
TMDB (RestBench) The graph is constructed based on the API's type (e.g., whether it belongs to the person or movie category) as well as shared parameters. Implementations are provided in https://github.com/WxxShirley/GNN4TaskPlan/blob/main/data/raw_process_restgpt.py (function format_tool_graph_files starting from Line 103).
UltraTool Given that the original dataset encompasses diverse domains, we consider both ground-truth trajectories and API semantics to create a more realistic and coherent task graph. Implementations are provided in https://github.com/WxxShirley/GNN4TaskPlan/blob/main/data/raw_process_ultratool.py (function construct_task_graph starting from Line 124).

Sorry for the late reply. Should you have any further questions, please feel free to reach out!

WxxShirley / GNN4TaskPlan

Question about the graph #5