opensource-observer / oso

Measuring the impact of open source software
https://opensource.observer
Apache License 2.0
71 stars 16 forks source link

docs: network graph analysis raw data sources #2072

Closed ccerv1 closed 2 months ago

ccerv1 commented 2 months ago

We should create a guide for doing network graph analysis on OSO.

Here is a Loom Video explaining how to query the data.

For now, I will just do a short list of resources that exist:

Farcaster social graph

select distinct
  fid,
  target_fid
from `{YOUR_PROJECT_NAME}.farcaster.links`
where deleted_at is null
limit 10

Github contribution graph

select distinct
  user_artifacts.artifact_name as user,
  repo_artifacts.artifact_name as repo
from `{YOUR_PROJECT_NAME}.oso.timeseries_events_by_artifact_v0` as events
join `{YOUR_PROJECT_NAME}.oso.artifacts_v1` as user_artifacts
  on events.from_artifact_id = user_artifacts.artifact_id
join `{YOUR_PROJECT_NAME}.oso.artifacts_v1` as repo_artifacts
  on events.to_artifact_id = repo_artifacts.artifact_id
where
    events.event_type = 'COMMIT_CODE'
    and events.time > '2024-01-01'

OpenRank data challenge with Byte Explorers

There are a collection of starter notebooks here.

RPGF3 PageRank bounty

There is a playground GitHub dataset and several PageRank implementations (for repos, not developers) here

ryscheng commented 2 months ago

@ccerv1 can you link to where the resources are?

ccerv1 commented 2 months ago

@ccerv1 can you link to where the resources are?

They are already linked above @ryscheng ... did you mean something else?

ccerv1 commented 2 months ago

Add to discussion thread

ccerv1 commented 2 months ago

cc @astralblue

ccerv1 commented 2 months ago

Cataloged in this discussion: https://github.com/opensource-observer/oso/discussions/2120