Open dmitshur opened 5 years ago
Change https://golang.org/cl/142362 mentions this issue: cmd/gopherbot: reduce gardening reaction time
When working on other issues, I saw that GitHub introduced a "unified" timeline for events on an issue, the Timeline Api. I understand that it is still in beta (since 2016) and would be a major, but it might help fix this issue by providing a single source of truth on a GitHubIssue
@orthros Thanks for pointing that out. The Timeline API can indeed be helpful for eliminating races between issue comments, events, and PR reviews (for #21086).
Something to be mindful of is that it may not, on its own, be enough to solve the most important race: between the issue state (whether it's open or closed, which labels it has applied) and events. Unless we use the events to deduct the state, rather than querying state separately. (But that can be done independently of using the Timeline API.)
Also, for information, the Timeline API is indeed in preview, and in my experience using it, it had some data gap edge cases where I had to fall back to querying reviews separately (e.g., see here). It may have been resolved by now, but it's worth being aware of. It seems there are 2 Timeline APIs in GitHub API v4 (PullRequestTimelineConnection
and PullRequestTimelineItemsConnection
, the latter being a part of a preview API), in addition to the Timeline API in GitHub API v3 (https://developer.github.com/v3/issues/timeline/).
It’s not just short windows of time. There are some issues that have events missing within the maintner corpus. This makes it impossible to create an accurate milestone burndown chart where you want to query for the state of an issue at a particular time window. (/cc @griesemer).
A few examples of issues in maintner that have incomplete event lists:
=== Issue events for golang.org/issues/28559
labeled milestone: label:Testing
labeled milestone: label:help wanted
labeled milestone: label:OS-OpenBSD
labeled milestone: label:Builders
labeled milestone: label:NeedsInvestigation
milestoned milestone: Go1.12 label:
It does not record the final “closed” event: https://api.github.com/repos/golang/go/issues/28559/events
=== Issue events for golang.org/issues/28306
mentioned milestone: label:
subscribed milestone: label:
mentioned milestone: label:
subscribed milestone: label:
assigned milestone: label:
labeled milestone: label:Documentation
labeled milestone: label:NeedsInvestigation
milestoned milestone: Go1.12 label:
renamed milestone: label:
The above event log is missing a few milestone-related events: https://api.github.com/repos/golang/go/issues/28306/events
@andybons That sounds like a valid issue that is related, but not the same as this one. I see these two issues:
Mind opening a separate issue for it? The reason I suggest that is because I expect the fix for one will not resolve the other, and vice versa. Thanks!
Problem
A program that fetches a
maintner
corpus and tries to use its data to make decisions may make a mistake, because the world view is inconsistent during short windows of time. Even though the windows are short, it's guaranteed to happen for any daemon that loops over doing corpus updates and making decisions immediately after.The most visible high-level example of this is #21312.
Cause
This happens because there are effectively two GitHub data sources that are not synchronized:
To give a concrete example of an inconsistent state that
maintner
can report, consider when an issue has just been unlabeled. The first mutation received and processed by acorpus.Update
call will be that the issue no longer has that label.The mutation reporting that there has been an unlabeled event on the same issue may come in a few seconds later. Until it does, it will appear that the issue does not have said label and it has never been unlabeled (e.g.,
!gi.HasLabel("Documentation") && !gi.HasEvent("unlabeled")
will be true). Which is not the reality (if one considers the reality to be one where the unlabeled event and its effect to happen simultaneously).Details
These are two distinct mutations received and processed by
corpus.Update
method:There is more relevant information in https://github.com/golang/go/issues/21312#issuecomment-430051456.
/cc @bradfitz