Open tt-rkim opened 1 month ago
For API: We will need workflow_id
and likely be using only attempt: 1
for the first MVP for data. Luckily, we can do some sort of batching since we can list workflows by status
and created
time: https://docs.github.com/en/rest/actions/workflow-runs?apiVersion=2022-11-28#list-workflow-runs-for-a-repository
Relevant discussion on workflow_run
triggers: https://github.com/orgs/community/discussions/128694
@kyma-tt @vmilosevic @TT-billteng Webhook for jobs only created and sending to our infra, not workflow runs (pipelines) yet . Everything is still POC phase, so this won't be getting us all the data we want quite yet.
Should probably have tighter data schema enforcement with something like Pydantic, but no need for now. MVP mode
Note that JUnit xml test times include both setup and teardown per: https://docs.pytest.org/en/7.0.x/how-to/output.html#creating-junitxml-format-files
There is a way to only record the call time, but will keep default for now
Example JUnit xml: https://gist.github.com/tt-rkim/27cb642ea394903586b0b3e810fde52d
Some good stuff, but not a whole lot.
One of the requirements is we need to know which tests that we ran are associated with which GitHub job. Because we get test data from Junit XMLs, part of this work is we need to know is given a JUnit XML, we need to know with which job it's associated.
Because names can unreliable be the same across jobs, the only way to identify a job is with its job ID. Therefore, we need to know which Job ID is associated with which JUnit XML.
The JUnit XMLs are generated during job/test runtime. The problem is that there is actually no way, during job runtime/test runtime, to associate the unique Job ID of a GitHub job to anything running inside it. In other words, no matter what artifact we're trying to associate with the job, whether it's JUnit XMLs or another artifact, there's no way to uniquely identify the job in which the artifact was generated.
So even though we want to upload JUnit XMLs as an artifact to consume later for uploading, we can't attach the Job ID to it. There's therefore no way to know which jobs any given JUnit XML is from.
Other people have complained. I'm posting to https://github.com/orgs/community/discussions/8945 for help
https://github.com/tenstorrent/tt-metal/pull/9659/files is the PR that enabled benchmarking on falcon t3k demos
As of today:
Will be starting some initial verification and PoC of certain data values at a workflow (pipeline) level.
cc: @TT-billteng @dimitri-tenstorrent
using this to track data science issues