Open junxnone opened 11 months ago
flowchart TB
subgraph S1[Eureka iteration]
subgraph S2[Generate Sample]
A(Build Prompt)
C[Build RL Envs]
end
B["LLM (GPT-4)"]
D[Analyze Results]
end
A -->|OpenAI API| B
B -.->|Reward Function| C
subgraph S3[IsaacGym]
S3B[Evaluate]
S3A[Training]
end
C -->|Start Training| S3A
S3A -.->|Return Results| D
D --> |Evaluate the best reward code| S3B
D -.-> A
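
The loop in the flowchart can be sketched in Python roughly as follows. This is a minimal sketch, not the actual Eureka code: `query_llm` and `train` are hypothetical placeholders standing in for the OpenAI API call and the IsaacGym training run, the toy reward function and its score are invented for illustration, and all function names are assumptions.

```python
import random
from typing import List, Tuple


def build_prompt(task: str, feedback: str) -> str:
    """Build Prompt: combine the task with feedback from the last iteration."""
    return f"Task: {task}\nFeedback: {feedback}"


def query_llm(prompt: str, rng: random.Random) -> str:
    """Hypothetical stand-in for the LLM call; returns reward-function source."""
    weight = rng.uniform(0.0, 2.0)
    return f"def reward(error):\n    return -{weight:.3f} * abs(error)\n"


def train(reward_code: str) -> float:
    """Hypothetical stand-in for a training run; returns a toy score."""
    namespace: dict = {}
    exec(reward_code, namespace)  # load the candidate reward function
    # Toy score: how strongly the candidate penalises a unit error.
    return -namespace["reward"](1.0)


def analyze(results: List[Tuple[str, float]]) -> Tuple[str, float, str]:
    """Analyze Results: pick the best candidate and summarise it as feedback."""
    best_code, best_score = max(results, key=lambda r: r[1])
    return best_code, best_score, f"best score so far: {best_score:.3f}"


def eureka_loop(task: str, iterations: int = 3, samples: int = 4,
                seed: int = 0) -> Tuple[str, float]:
    """Run the generate -> train -> analyze loop and keep the best reward code."""
    rng = random.Random(seed)
    feedback = "none"
    best_code, best_score = "", float("-inf")
    for _ in range(iterations):
        prompt = build_prompt(task, feedback)
        # Generate Sample: several reward-function candidates per iteration.
        candidates = [query_llm(prompt, rng) for _ in range(samples)]
        results = [(code, train(code)) for code in candidates]
        code, score, feedback = analyze(results)  # feedback goes into the next prompt
        if score > best_score:
            best_code, best_score = code, score
    # The returned code is what would go to the final Evaluate step.
    return best_code, best_score
```

Calling `eureka_loop("open the drawer")` returns the best reward-function source and its toy score. In a real setup the two placeholders would be replaced by the API call and the simulator training job.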