cncf / gitdm

πŸ“œFork for tracking CNCF projects
35 stars 383 forks source link

Stats for kubernetes/website and kubernetes/kubernetes are not updated #214

Open pranav-pandey0804 opened 10 months ago

pranav-pandey0804 commented 10 months ago

Hello, @lukaszgryglicki I am a contributor to the Kubernetes/website and Kubernetes/kubernetes repositories. I noticed that my stats for these repositories are not updated since the first week of October 2023. According to the developer activity count, my total contributions are only 48 but after that, I have made 20 more contributions but those aren't being reflected here.

I would appreciate it if you could look into this issue and update the dev stats for the mentioned repositories, Thanks!

lukaszgryglicki commented 10 months ago

Will TAL when I have time - probably Friday or next Monday.

pranav-pandey0804 commented 10 months ago

@lukaszgryglicki thanks!

lukaszgryglicki commented 10 months ago

Should be fixed now.

pranav-pandey0804 commented 10 months ago

Should be fixed now.

Hi @lukaszgryglicki The contributor/developers stats have been updated but my issue still has not been resolved.

lukaszgryglicki commented 10 months ago

I'll taking a look on this today.

pranav-pandey0804 commented 10 months ago

thanks!

lukaszgryglicki commented 10 months ago

Right, so your actor's data and affiliations data is looking good IMHO:

gha=# select * from gha_actors where login = 'pranav-pandey0804';
          id          |       login       |     name      | country_id | sex | sex_prob | tz | tz_offset | country_name | age 
----------------------+-------------------+---------------+------------+-----+----------+----+-----------+--------------+-----
            126320452 | pranav-pandey0804 | Pranav Pandey | in         |     |          |    |           | India        |    
 -6888372656531941121 | pranav-pandey0804 | Pranav Pandey | in         |     |          |    |           | India        |    
(2 rows)

gha=# select * from gha_actors_affiliations where actor_id in (select id from gha_actors where login = 'pranav-pandey0804');
       actor_id       |  company_name   |       dt_from       |        dt_to        | original_company_name | source 
----------------------+-----------------+---------------------+---------------------+-----------------------+--------
            126320452 | NEC Corporation | 1900-01-01 00:00:00 | 2100-01-01 00:00:00 | NEC Corporation       | user
 -6888372656531941121 | NEC Corporation | 1900-01-01 00:00:00 | 2100-01-01 00:00:00 | NEC Corporation       | user
(2 rows)

Now checking the contributions:

gha=# select * from gha_events where created_at >= '2023-08-15 10:00:00' and actor_id in (select id from gha_actors where login = 'pranav-pandey0804') and type in ('PushEvent', 'PullRequestEvent', 'IssuesEvent', 'PullRequestReviewEvent', 'CommitCommentEvent', 'IssueCommentEvent', 'PullRequestReviewCommentEvent') order by created_at desc;
     id      |       type        | actor_id  | repo_id  | public |     created_at      |  org_id  | forkee_id |  dup_actor_login  |     dup_repo_name     
-------------+-------------------+-----------+----------+--------+---------------------+----------+-----------+-------------------+-----------------------
 33468367859 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-11-20 04:06:30 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 33468349440 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-11-20 04:05:07 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32038618059 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 17:20:07 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32031228261 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 12:29:36 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32031124711 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 12:25:35 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32027297711 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 09:43:48 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32025652784 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 08:39:07 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32025520791 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 08:33:45 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32025055903 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 08:15:09 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022888775 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 06:38:00 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022876864 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 06:37:22 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022865552 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 06:36:47 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022399392 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 06:11:55 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022358359 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 06:09:39 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022293821 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 06:05:55 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32022178408 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 05:58:55 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32021468757 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 05:10:42 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32021237931 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 04:53:32 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32021169694 | IssueCommentEvent | 126320452 | 20580498 | t      | 2023-09-22 04:48:03 | 13629408 |           | pranav-pandey0804 | kubernetes/kubernetes
 32020868747 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 04:22:42 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32020857082 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 04:21:46 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32020275526 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 03:30:53 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32020257590 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 03:29:21 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32020217678 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-22 03:25:51 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32009214479 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 16:46:28 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32009197676 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 16:45:45 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32009189375 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 16:45:25 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32009180773 | IssuesEvent       | 126320452 | 51478266 | t      | 2023-09-21 16:45:04 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32001101292 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 12:20:45 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32001094754 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 12:20:31 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 32001087179 | IssuesEvent       | 126320452 | 51478266 | t      | 2023-09-21 12:20:15 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31999523380 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 11:21:48 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31999519402 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 11:21:39 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31999033126 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 11:03:22 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31999007641 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 11:02:25 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31999000648 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 11:02:08 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31998982143 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-21 11:01:26 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31998971477 | IssuesEvent       | 126320452 | 51478266 | t      | 2023-09-21 11:01:02 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31878531811 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-16 04:57:23 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31871063456 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-15 17:45:09 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31729342904 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-11 04:37:20 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31729333036 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-11 04:36:34 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31640316946 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-06 14:34:42 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31640145685 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-06 14:29:18 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31611000488 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-05 14:59:31 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31603025700 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-05 10:25:17 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31601333117 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-05 09:23:30 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31601318026 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-05 09:22:58 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31601268987 | IssueCommentEvent | 126320452 | 51478266 | t      | 2023-09-05 09:21:14 | 13629408 |           | pranav-pandey0804 | kubernetes/website
 31601240623 | IssuesEvent       | 126320452 | 51478266 | t      | 2023-09-05 09:20:14 | 13629408 |           | pranav-pandey0804 | kubernetes/website
(50 rows)

I see 50 but 2 newest are from today 11/20 - so at the time when I regenerate dthis - there were 48 contributions - exactly the same as the dashboard shows. Now I will chekc contributions you mentioned in slack to see if they are present - for now I see that dashboard is OK data-wise (according to low level data that DevStats has).

Note that contributions are counted since 2023-08-15 10:00:00 which is the v1.28 release date, for your user AND for types that are considered contributions (DevStats wide) which are:

'PushEvent', 'PullRequestEvent', 'IssuesEvent', 'PullRequestReviewEvent', 'CommitCommentEvent', 'IssueCommentEvent', 'PullRequestReviewCommentEvent'

Now chekcing your slack mentioned contributions one after another...

lukaszgryglicki commented 10 months ago

So for your PR:

I don't have PullRequestEvent for it, see this is all data I have for that PR:

gha=# select * from gha_events where id in (select event_id from gha_pull_requests where dup_repo_name = 'kubernetes/website' and number = 43548 order by updated_at);
       id        |             type              | actor_id  | repo_id  | public |     created_at      |  org_id  | forkee_id | dup_actor_login |   dup_repo_name    
-----------------+-------------------------------+-----------+----------+--------+---------------------+----------+-----------+-----------------+--------------------
     32666938577 | PullRequestReviewCommentEvent |  12787482 | 51478266 | t      | 2023-10-18 17:56:43 | 13629408 |           | Gauravpadam     | kubernetes/website
     32695550073 | PullRequestReviewCommentEvent |  12787482 | 51478266 | t      | 2023-10-19 15:00:01 | 13629408 |           | Gauravpadam     | kubernetes/website
     32666938512 | PullRequestReviewEvent        |  12787482 | 51478266 | t      | 2023-10-18 17:57:17 | 13629408 |           | Gauravpadam     | kubernetes/website
     32647254047 | PullRequestReviewCommentEvent | 116108975 | 51478266 | t      | 2023-10-18 06:57:51 | 13629408 |           | aj11anuj        | kubernetes/website
     32666938537 | PullRequestReviewEvent        |  12787482 | 51478266 | t      | 2023-10-18 17:57:17 | 13629408 |           | Gauravpadam     | kubernetes/website
 281485737293008 | labeled                       |  20407524 | 51478266 | t      | 2023-10-25 00:18:30 | 13629408 |           | k8s-ci-robot    | kubernetes/website
     32647253941 | PullRequestReviewEvent        | 116108975 | 51478266 | t      | 2023-10-18 06:59:28 | 13629408 |           | aj11anuj        | kubernetes/website
     32647254013 | PullRequestReviewEvent        | 116108975 | 51478266 | t      | 2023-10-18 06:59:28 | 13629408 |           | aj11anuj        | kubernetes/website
     32695549923 | PullRequestReviewEvent        |  12787482 | 51478266 | t      | 2023-10-19 15:00:03 | 13629408 |           | Gauravpadam     | kubernetes/website
     32647254058 | PullRequestReviewCommentEvent | 116108975 | 51478266 | t      | 2023-10-18 06:58:16 | 13629408 |           | aj11anuj        | kubernetes/website
(10 rows)

While for example for my PR on the cncf/landscape repo, I have a PullRequestEvent data:

cncf=# select event_id, created_at, updated_at, dup_actor_login, dup_type, dup_user_login from gha_pull_requests where dup_repo_name = 'cncf/landscape' and number = 3399 order by updated_at;
    event_id     |     created_at      |     updated_at      | dup_actor_login |     dup_type     | dup_user_login  
-----------------+---------------------+---------------------+-----------------+------------------+-----------------
     30891727507 | 2023-08-04 05:56:19 | 2023-08-04 05:56:19 | lukaszgryglicki | PullRequestEvent | lukaszgryglicki
     30892333897 | 2023-08-04 05:56:19 | 2023-08-04 06:33:32 | lukaszgryglicki | PullRequestEvent | lukaszgryglicki
 281485119778896 | 2023-08-04 05:56:19 | 2023-08-04 06:33:33 | llarsson        | referenced       | lukaszgryglicki
(3 rows)

This measn that for your particular PR I only have reviews, comments and labeled events - I will now download JSON with all GitHub events for that hour from GitHub archives and check if that data is there. If data is there - this means a bug in DevStats because I shoudl also have that data in my DB, but if that data is NOT there, then I will report this on GitHub archives projects and add external-bug label here.

Before doing so, I'll chekc your other mentioned contributions.

lukaszgryglicki commented 10 months ago

For issue I also don't have issue-opened event, see:

gha=# select event_id, created_at, updated_at, dup_actor_login, dup_type, dup_user_login from gha_issues where dup_repo_name = 'kubernetes/website' and number = 43410 order by updated_at;

  event_id   |     created_at      |     updated_at      | dup_actor_login |     dup_type      |  dup_user_login   
-------------+---------------------+---------------------+-----------------+-------------------+-------------------
 32449967154 | 2023-10-10 13:01:42 | 2023-10-10 13:17:40 | tengqm          | IssueCommentEvent | pranav-pandey0804
 32467361361 | 2023-10-10 13:01:42 | 2023-10-11 02:12:12 | tengqm          | IssueCommentEvent | pranav-pandey0804
 32467445636 | 2023-10-10 13:01:42 | 2023-10-11 02:18:18 | mengjiao-liu    | IssueCommentEvent | pranav-pandey0804
(3 rows)

Same for k/k issue, I don't have your comments:

gha=# select event_id, created_at, updated_at, dup_actor_login, dup_type, dup_user_login from gha_issues where dup_repo_name = 'kubernetes/kubernetes' and number = 121809 order by updated_at;
  event_id   |     created_at      |     updated_at      | dup_actor_login |     dup_type      | dup_user_login 
-------------+---------------------+---------------------+-----------------+-------------------+----------------
 33193797139 | 2023-11-08 14:28:32 | 2023-11-08 14:28:32 | kannon92        | IssuesEvent       | kannon92
 33193801418 | 2023-11-08 14:28:32 | 2023-11-08 14:28:41 | k8s-ci-robot    | IssueCommentEvent | kannon92
 33193802580 | 2023-11-08 14:28:32 | 2023-11-08 14:28:43 | k8s-ci-robot    | IssueCommentEvent | kannon92
 33193805043 | 2023-11-08 14:28:32 | 2023-11-08 14:28:47 | kannon92        | IssueCommentEvent | kannon92
 33231631492 | 2023-11-08 14:28:32 | 2023-11-09 17:29:31 | kannon92        | IssueCommentEvent | kannon92
(5 rows)

Finally for 121806 k/k issue, I also don't have your comments:

gha=# select event_id, created_at, updated_at, dup_actor_login, dup_type, dup_user_login from gha_issues where dup_repo_name = 'kubernetes/kubernetes' and number = 121869 order by updated_at;
    event_id     |     created_at      |     updated_at      | dup_actor_login |     dup_type      | dup_user_login 
-----------------+---------------------+---------------------+-----------------+-------------------+----------------
     33331045386 | 2023-11-14 10:59:59 | 2023-11-14 10:59:59 | pohly           | IssuesEvent       | pohly
 281485929791171 | 2023-11-14 10:59:59 | 2023-11-14 10:59:59 | pohly           | labeled           | pohly
 281485929791822 | 2023-11-14 10:59:59 | 2023-11-14 11:00:02 | k8s-ci-robot    | labeled           | pohly
 281485929791675 | 2023-11-14 10:59:59 | 2023-11-14 11:00:02 | k8s-ci-robot    | labeled           | pohly
 281485929792532 | 2023-11-14 10:59:59 | 2023-11-14 11:00:06 | k8s-ci-robot    | labeled           | pohly
     33331049192 | 2023-11-14 10:59:59 | 2023-11-14 11:00:08 | k8s-ci-robot    | IssueCommentEvent | pohly
(6 rows)

I now wonder if there was some GH outage - will be now download RAW JSONs for GHA and checking if data is there.

lukaszgryglicki commented 10 months ago

Taking th PR 43548 as an example and checking https://www.gharchive.orgI'm doing (just in case to rule out Timezone problems):

wget https://data.gharchive.org/2023-10-18-4.json.gz
wget https://data.gharchive.org/2023-10-18-6.json.gz

Then I modify those files (apply formatting as they are minimized) and then search for that PR data inside them I'm getting:

{
  "id": "32644350939",
  "type": "IssueCommentEvent",
  "actor": {
    "id": 20407524,
    "login": "k8s-ci-robot",
    "display_login": "k8s-ci-robot",
    "gravatar_id": "",
    "url": "https://api.github.com/users/k8s-ci-robot",
    "avatar_url": "https://avatars.githubusercontent.com/u/20407524?"
  },
  "repo": {
    "id": 51478266,
    "name": "kubernetes/website",
    "url": "https://api.github.com/repos/kubernetes/website"
  },
  "payload": {
    "action": "created",
    "issue": {
      "url": "https://api.github.com/repos/kubernetes/website/issues/43548",
      "repository_url": "https://api.github.com/repos/kubernetes/website",
      "labels_url": "https://api.github.com/repos/kubernetes/website/issues/43548/labels{/name}",
      "comments_url": "https://api.github.com/repos/kubernetes/website/issues/43548/comments",
      "events_url": "https://api.github.com/repos/kubernetes/website/issues/43548/events",
      "html_url": "https://github.com/kubernetes/website/pull/43548",
      "id": 1948771899,
      "node_id": "PR_kwDOAxF--s5dFuLt",
      "number": 43548,
      "title": "updated service page",
      "user": {
        "login": "pranav-pandey0804",
        "id": 126320452,
        "node_id": "U_kgDOB4d_RA",
        "avatar_url": "https://avatars.githubusercontent.com/u/126320452?v=4",
        "gravatar_id": "",
        "url": "https://api.github.com/users/pranav-pandey0804",
        "html_url": "https://github.com/pranav-pandey0804",
        "followers_url": "https://api.github.com/users/pranav-pandey0804/followers",
        "following_url": "https://api.github.com/users/pranav-pandey0804/following{/other_user}",
        "gists_url": "https://api.github.com/users/pranav-pandey0804/gists{/gist_id}",
        "starred_url": "https://api.github.com/users/pranav-pandey0804/starred{/owner}{/repo}",
        "subscriptions_url": "https://api.github.com/users/pranav-pandey0804/subscriptions",
        "organizations_url": "https://api.github.com/users/pranav-pandey0804/orgs",
        "repos_url": "https://api.github.com/users/pranav-pandey0804/repos",
        "events_url": "https://api.github.com/users/pranav-pandey0804/events{/privacy}",
        "received_events_url": "https://api.github.com/users/pranav-pandey0804/received_events",
        "type": "User",
        "site_admin": false
      },
      "labels": [
        {
          "id": 484571291,
          "node_id": "MDU6TGFiZWw0ODQ1NzEyOTE=",
          "url": "https://api.github.com/repos/kubernetes/website/labels/cncf-cla:%20yes",
          "name": "cncf-cla: yes",
          "color": "bfe5bf",
          "default": false,
          "description": "Indicates the PR's author has signed the CNCF CLA."
        },
        {
          "id": 676219457,
          "node_id": "MDU6TGFiZWw2NzYyMTk0NTc=",
          "url": "https://api.github.com/repos/kubernetes/website/labels/size/XS",
          "name": "size/XS",
          "color": "009900",
          "default": false,
          "description": "Denotes a PR that changes 0-9 lines, ignoring generated files."
        },
        {
          "id": 1078396901,
          "node_id": "MDU6TGFiZWwxMDc4Mzk2OTAx",
          "url": "https://api.github.com/repos/kubernetes/website/labels/language/en",
          "name": "language/en",
          "color": "e9b3f9",
          "default": false,
          "description": "Issues or PRs related to English language"
        }
      ],
      "state": "open",
      "locked": false,
      "assignee": null,
      "assignees": [],
      "milestone": null,
      "comments": 1,
      "created_at": "2023-10-18T04:13:45Z",
      "updated_at": "2023-10-18T04:14:10Z",
      "closed_at": null,
      "author_association": "CONTRIBUTOR",
      "active_lock_reason": null,
      "draft": false,
      "pull_request": {
        "url": "https://api.github.com/repos/kubernetes/website/pulls/43548",
        "html_url": "https://github.com/kubernetes/website/pull/43548",
        "diff_url": "https://github.com/kubernetes/website/pull/43548.diff",
        "patch_url": "https://github.com/kubernetes/website/pull/43548.patch",
        "merged_at": null
      },
      "body": "Clarification and Enhancement of Documentation Statement.\r\n\r\n",
      "reactions": {
        "url": "https://api.github.com/repos/kubernetes/website/issues/43548/reactions",
        "total_count": 0,
        "+1": 0,
        "-1": 0,
        "laugh": 0,
        "hooray": 0,
        "confused": 0,
        "heart": 0,
        "rocket": 0,
        "eyes": 0
      },
      "timeline_url": "https://api.github.com/repos/kubernetes/website/issues/43548/timeline",
      "performed_via_github_app": null,
      "state_reason": null
    },
    "comment": {
      "url": "https://api.github.com/repos/kubernetes/website/issues/comments/1767609069",
      "html_url": "https://github.com/kubernetes/website/pull/43548#issuecomment-1767609069",
      "issue_url": "https://api.github.com/repos/kubernetes/website/issues/43548",
      "id": 1767609069,
      "node_id": "IC_kwDOAxF--s5pW5Lt",
      "user": {
        "login": "k8s-ci-robot",
        "id": 20407524,
        "node_id": "MDQ6VXNlcjIwNDA3NTI0",
        "avatar_url": "https://avatars.githubusercontent.com/u/20407524?v=4",
        "gravatar_id": "",
        "url": "https://api.github.com/users/k8s-ci-robot",
        "html_url": "https://github.com/k8s-ci-robot",
        "followers_url": "https://api.github.com/users/k8s-ci-robot/followers",
        "following_url": "https://api.github.com/users/k8s-ci-robot/following{/other_user}",
        "gists_url": "https://api.github.com/users/k8s-ci-robot/gists{/gist_id}",
        "starred_url": "https://api.github.com/users/k8s-ci-robot/starred{/owner}{/repo}",
        "subscriptions_url": "https://api.github.com/users/k8s-ci-robot/subscriptions",
        "organizations_url": "https://api.github.com/users/k8s-ci-robot/orgs",
        "repos_url": "https://api.github.com/users/k8s-ci-robot/repos",
        "events_url": "https://api.github.com/users/k8s-ci-robot/events{/privacy}",
        "received_events_url": "https://api.github.com/users/k8s-ci-robot/received_events",
        "type": "User",
        "site_admin": false
      },
      "created_at": "2023-10-18T04:14:10Z",
      "updated_at": "2023-10-18T04:14:10Z",
      "author_association": "CONTRIBUTOR",
      "body": "[APPROVALNOTIFIER] This PR is **NOT APPROVED**\n\nThis pull-request has been approved by:\n**Once this PR has been reviewed and has the lgtm label**, please assign [tengqm](https://github.com/tengqm) for approval. For more information see [the Kubernetes Code Review Process](https://git.k8s.io/community/contributors/guide/owners.md#the-code-review-process).\n\nThe full list of commands accepted by this bot can be found [here](https://go.k8s.io/bot-commands?repo=kubernetes%2Fwebsite).\n\n<details open>\nNeeds approval from an approver in each of these files:\n\n- **[content/en/OWNERS](https://github.com/kubernetes/website/blob/main/content/en/OWNERS)**\n\nApprovers can indicate their approval by writing `/approve` in a comment\nApprovers can cancel approval by writing `/approve cancel` in a comment\n</details>\n<!-- META={\"approvers\":[\"tengqm\"]} -->",
      "reactions": {
        "url": "https://api.github.com/repos/kubernetes/website/issues/comments/1767609069/reactions",
        "total_count": 0,
        "+1": 0,
        "-1": 0,
        "laugh": 0,
        "hooray": 0,
        "confused": 0,
        "heart": 0,
        "rocket": 0,
        "eyes": 0
      },
      "performed_via_github_app": null
    }
  },
  "public": true,
  "created_at": "2023-10-18T04:14:10Z",
  "org": {
    "id": 13629408,
    "login": "kubernetes",
    "gravatar_id": "",
    "url": "https://api.github.com/orgs/kubernetes",
    "avatar_url": "https://avatars.githubusercontent.com/u/13629408?"
  }
}

This is for this k8s-ci-robot's comment see:

"actor": {
    "id": 20407524,
    "login": "k8s-ci-robot",
    "display_login": "k8s-ci-robot",
    "gravatar_id": "",
    "url": "https://api.github.com/users/k8s-ci-robot",
    "avatar_url": "https://avatars.githubusercontent.com/u/20407524?"
  }

then there is an issue body (which your have create, PRs are also an issue internally), there is a PR object too:

"pull_request": {
        "url": "https://api.github.com/repos/kubernetes/website/pulls/43548",
        "html_url": "https://github.com/kubernetes/website/pull/43548",
        "diff_url": "https://github.com/kubernetes/website/pull/43548.diff",
        "patch_url": "https://github.com/kubernetes/website/pull/43548.patch",
        "merged_at": null
      }

there is k8s-ci-robots (bot)'s comment:

"body": "[APPROVALNOTIFIER] This PR is **NOT APPROVED**\n\nThis pull-request has been approved by:\n**Once this PR has been reviewed and has the lgtm label**, please assign [tengqm](https://github.com/tengqm) for approval. For more information see [the Kubernetes Code Review Process](https://git.k8s.io/community/contributors/guide/owners.md#the-code-review-process).\n\nThe full list of commands accepted by this bot can be found [here](https://go.k8s.io/bot-commands?repo=kubernetes%2Fwebsite).\n\n<details open>\nNeeds approval from an approver in each of these files:\n\n- **[content/en/OWNERS](https://github.com/kubernetes/website/blob/main/content/en/OWNERS)**\n\nApprovers can indicate their approval by writing `/approve` in a comment\nApprovers can cancel approval by writing `/approve cancel` in a comment\n</details>\n<!-- META={\"approvers\":[\"tengqm\"]} -->"

And it is created_at:

"created_at": "2023-10-18T04:14:10Z",

about a minute after you created a PR.

Then there is netify's bots comment at 6:16:

{
  "id": "32644381228",
  "type": "IssueCommentEvent",
  "actor": {
    "id": 40209326,
    "login": "netlify[bot]",
    "display_login": "netlify",
    "gravatar_id": "",
    "url": "https://api.github.com/users/netlify[bot]",
    "avatar_url": "https://avatars.githubusercontent.com/u/40209326?"
  },
  "repo": {
    "id": 51478266,
    "name": "kubernetes/website",
    "url": "https://api.github.com/repos/kubernetes/website"
  },
  "payload": {
    "action": "created",
    "issue": {
      "url": "https://api.github.com/repos/kubernetes/website/issues/43548",
      "repository_url": "https://api.github.com/repos/kubernetes/website",
      "labels_url": "https://api.github.com/repos/kubernetes/website/issues/43548/labels{/name}",
      "comments_url": "https://api.github.com/repos/kubernetes/website/issues/43548/comments",
      "events_url": "https://api.github.com/repos/kubernetes/website/issues/43548/events",
      "html_url": "https://github.com/kubernetes/website/pull/43548",
      "id": 1948771899,
      "node_id": "PR_kwDOAxF--s5dFuLt",
      "number": 43548,
      "title": "updated service page",
      "user": {
        "login": "pranav-pandey0804",
        "id": 126320452,
        "node_id": "U_kgDOB4d_RA",
        "avatar_url": "https://avatars.githubusercontent.com/u/126320452?v=4",
        "gravatar_id": "",
        "url": "https://api.github.com/users/pranav-pandey0804",
        "html_url": "https://github.com/pranav-pandey0804",
        "followers_url": "https://api.github.com/users/pranav-pandey0804/followers",
        "following_url": "https://api.github.com/users/pranav-pandey0804/following{/other_user}",
        "gists_url": "https://api.github.com/users/pranav-pandey0804/gists{/gist_id}",
        "starred_url": "https://api.github.com/users/pranav-pandey0804/starred{/owner}{/repo}",
        "subscriptions_url": "https://api.github.com/users/pranav-pandey0804/subscriptions",
        "organizations_url": "https://api.github.com/users/pranav-pandey0804/orgs",
        "repos_url": "https://api.github.com/users/pranav-pandey0804/repos",
        "events_url": "https://api.github.com/users/pranav-pandey0804/events{/privacy}",
        "received_events_url": "https://api.github.com/users/pranav-pandey0804/received_events",
        "type": "User",
        "site_admin": false
      },
      "labels": [
        {
          "id": 484571291,
          "node_id": "MDU6TGFiZWw0ODQ1NzEyOTE=",
          "url": "https://api.github.com/repos/kubernetes/website/labels/cncf-cla:%20yes",
          "name": "cncf-cla: yes",
          "color": "bfe5bf",
          "default": false,
          "description": "Indicates the PR's author has signed the CNCF CLA."
        },
        {
          "id": 676219457,
          "node_id": "MDU6TGFiZWw2NzYyMTk0NTc=",
          "url": "https://api.github.com/repos/kubernetes/website/labels/size/XS",
          "name": "size/XS",
          "color": "009900",
          "default": false,
          "description": "Denotes a PR that changes 0-9 lines, ignoring generated files."
        },
        {
          "id": 764235355,
          "node_id": "MDU6TGFiZWw3NjQyMzUzNTU=",
          "url": "https://api.github.com/repos/kubernetes/website/labels/sig/docs",
          "name": "sig/docs",
          "color": "d2b48c",
          "default": false,
          "description": "Categorizes an issue or PR as relevant to SIG Docs."
        },
        {
          "id": 1078396901,
          "node_id": "MDU6TGFiZWwxMDc4Mzk2OTAx",
          "url": "https://api.github.com/repos/kubernetes/website/labels/language/en",
          "name": "language/en",
          "color": "e9b3f9",
          "default": false,
          "description": "Issues or PRs related to English language"
        }
      ],
      "state": "open",
      "locked": false,
      "assignee": null,
      "assignees": [],
      "milestone": null,
      "comments": 3,
      "created_at": "2023-10-18T04:13:45Z",
      "updated_at": "2023-10-18T04:16:24Z",
      "closed_at": null,
      "author_association": "CONTRIBUTOR",
      "active_lock_reason": null,
      "draft": false,
      "pull_request": {
        "url": "https://api.github.com/repos/kubernetes/website/pulls/43548",
        "html_url": "https://github.com/kubernetes/website/pull/43548",
        "diff_url": "https://github.com/kubernetes/website/pull/43548.diff",
        "patch_url": "https://github.com/kubernetes/website/pull/43548.patch",
        "merged_at": null
      },
      "body": "Clarification and Enhancement of Documentation Statement.\r\n\r\n",
      "reactions": {
        "url": "https://api.github.com/repos/kubernetes/website/issues/43548/reactions",
        "total_count": 0,
        "+1": 0,
        "-1": 0,
        "laugh": 0,
        "hooray": 0,
        "confused": 0,
        "heart": 0,
        "rocket": 0,
        "eyes": 0
      },
      "timeline_url": "https://api.github.com/repos/kubernetes/website/issues/43548/timeline",
      "performed_via_github_app": null,
      "state_reason": null
    },
    "comment": {
      "url": "https://api.github.com/repos/kubernetes/website/issues/comments/1767610488",
      "html_url": "https://github.com/kubernetes/website/pull/43548#issuecomment-1767610488",
      "issue_url": "https://api.github.com/repos/kubernetes/website/issues/43548",
      "id": 1767610488,
      "node_id": "IC_kwDOAxF--s5pW5h4",
      "user": {
        "login": "netlify[bot]",
        "id": 40209326,
        "node_id": "MDM6Qm90NDAyMDkzMjY=",
        "avatar_url": "https://avatars.githubusercontent.com/in/13473?v=4",
        "gravatar_id": "",
        "url": "https://api.github.com/users/netlify%5Bbot%5D",
        "html_url": "https://github.com/apps/netlify",
        "followers_url": "https://api.github.com/users/netlify%5Bbot%5D/followers",
        "following_url": "https://api.github.com/users/netlify%5Bbot%5D/following{/other_user}",
        "gists_url": "https://api.github.com/users/netlify%5Bbot%5D/gists{/gist_id}",
        "starred_url": "https://api.github.com/users/netlify%5Bbot%5D/starred{/owner}{/repo}",
        "subscriptions_url": "https://api.github.com/users/netlify%5Bbot%5D/subscriptions",
        "organizations_url": "https://api.github.com/users/netlify%5Bbot%5D/orgs",
        "repos_url": "https://api.github.com/users/netlify%5Bbot%5D/repos",
        "events_url": "https://api.github.com/users/netlify%5Bbot%5D/events{/privacy}",
        "received_events_url": "https://api.github.com/users/netlify%5Bbot%5D/received_events",
        "type": "Bot",
        "site_admin": false
      },
      "created_at": "2023-10-18T04:16:23Z",
      "updated_at": "2023-10-18T04:16:23Z",
      "author_association": "NONE",
      "body": "### <span aria-hidden=\"true\">βœ…</span> Pull request preview available for checking\nBuilt [without sensitive environment variables](https://docs.netlify.com/configure-builds/environment-variables/#sensitive-variable-policy)\n\n|  Name | Link |\n|:-:|------------------------|\n|<span aria-hidden=\"true\">πŸ”¨</span> Latest commit | f7aed0929cc83cf25cb63f895ffc7ecdd7582ea2 |\n|<span aria-hidden=\"true\">πŸ”</span> Latest deploy log | https://app.netlify.com/sites/kubernetes-io-main-staging/deploys/652f5b7d321c7a0008b02a64 |\n|<span aria-hidden=\"true\">😎</span> Deploy Preview | [https://deploy-preview-43548--kubernetes-io-main-staging.netlify.app](https://deploy-preview-43548--kubernetes-io-main-staging.netlify.app) |\n|<span aria-hidden=\"true\">πŸ“±</span> Preview on mobile | <details><summary> Toggle QR Code... </summary><br /><br />![QR Code](https://app.netlify.com/qr-code/eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJ1cmwiOiJodHRwczovL2RlcGxveS1wcmV2aWV3LTQzNTQ4LS1rdWJlcm5ldGVzLWlvLW1haW4tc3RhZ2luZy5uZXRsaWZ5LmFwcCJ9.qp6f7QaYBCxGRbmNFAGpfWAirsSSR0Mdi2Dix2ErmZM)<br /><br />_Use your smartphone camera to open QR code link._</details> |\n---\n\n_To edit notification comments on pull requests, go to your [Netlify site configuration](https://app.netlify.com/sites/kubernetes-io-main-staging/configuration/deploys#deploy-notifications)._",
      "reactions": {
        "url": "https://api.github.com/repos/kubernetes/website/issues/comments/1767610488/reactions",
        "total_count": 0,
        "+1": 0,
        "-1": 0,
        "laugh": 0,
        "hooray": 0,
        "confused": 0,
        "heart": 0,
        "rocket": 0,
        "eyes": 0
      },
      "performed_via_github_app": null
    }
  },
  "public": true,
  "created_at": "2023-10-18T04:16:24Z",
  "org": {
    "id": 13629408,
    "login": "kubernetes",
    "gravatar_id": "",
    "url": "https://api.github.com/orgs/kubernetes",
    "avatar_url": "https://avatars.githubusercontent.com/u/13629408?"
  }
}

timestamp is 2023-10-18T04:16:24Z and body of comment is:

"body": "### <span aria-hidden=\"true\">βœ…</span> Pull request preview available for checking\nBuilt [without sensitive environment variables](https://docs.netlify.com/configure-builds/environment-variables/#sensitive-variable-policy)\n\n|  Name | Link |\n|:-:|------------------------|\n|<span aria-hidden=\"true\">πŸ”¨</span> Latest commit | f7aed0929cc83cf25cb63f895ffc7ecdd7582ea2 |\n|<span aria-hidden=\"true\">πŸ”</span> Latest deploy log | https://app.netlify.com/sites/kubernetes-io-main-staging/deploys/652f5b7d321c7a0008b02a64 |\n|<span aria-hidden=\"true\">😎</span> Deploy Preview | [https://deploy-preview-43548--kubernetes-io-main-staging.netlify.app](https://deploy-preview-43548--kubernetes-io-main-staging.netlify.app) |\n|<span aria-hidden=\"true\">πŸ“±</span> Preview on mobile | <details><summary> Toggle QR Code... </summary><br /><br />![QR Code](https://app.netlify.com/qr-code/eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJ1cmwiOiJodHRwczovL2RlcGxveS1wcmV2aWV3LTQzNTQ4LS1rdWJlcm5ldGVzLWlvLW1haW4tc3RhZ2luZy5uZXRsaWZ5LmFwcCJ9.qp6f7QaYBCxGRbmNFAGpfWAirsSSR0Mdi2Dix2ErmZM)<br /><br />_Use your smartphone camera to open QR code link._</details> |\n---\n\n_To edit notification comments on pull requests, go to your [Netlify site configuration](https://app.netlify.com/sites/kubernetes-io-main-staging/configuration/deploys#deploy-notifications)._"

And those are the only two events for that PR in the JSON from 4AM utc, now 6 AM UTC:

aj11anuj's reviews/comments

(...)
    "id": "32647254013",
    "type": "PullRequestReviewEvent",
    "actor": {
      "id": 116108975,
      "login": "aj11anuj",
      "display_login": "aj11anuj",
      "gravatar_id": "",
      "url": "https://api.github.com/users/aj11anuj",
      "avatar_url": "https://avatars.githubusercontent.com/u/116108975?"
    },
(...)
    "type": "PullRequestReviewCommentEvent",
    "actor": {
      "id": 116108975,
      "login": "aj11anuj",
      "display_login": "aj11anuj",
      "gravatar_id": "",
      "url": "https://api.github.com/users/aj11anuj",
      "avatar_url": "https://avatars.githubusercontent.com/u/116108975?"
    },
    "repo": {
      "id": 51478266,
      "name": "kubernetes/website",
      "url": "https://api.github.com/repos/kubernetes/website"
    },
(...)
    "id": "32647254058",
    "type": "PullRequestReviewCommentEvent",
    "actor": {
      "id": 116108975,
      "login": "aj11anuj",
      "display_login": "aj11anuj",
      "gravatar_id": "",
      "url": "https://api.github.com/users/aj11anuj",
      "avatar_url": "https://avatars.githubusercontent.com/u/116108975?"
    },

And there is nothing more, so I'm going to report thsi to GHA archives website. There is nothing else I can do if we miss data.

lukaszgryglicki commented 10 months ago

@pranav-pandey0804 craeted an external issue (bug) for GH Archives: https://github.com/igrigorik/gharchive.org/issues/294. This is blocked until GH archives resolve this.

pranav-pandey0804 commented 10 months ago

@lukaszgryglicki thanks for the help!

lukaszgryglicki commented 10 months ago

Seems like issues on GHA are not fixed, thinking about escalating this... @craigbox @pranav-pandey0804 @caniszczyk ...