woc-hack / tutorial

Other
47 stars 20 forks source link

Version T Mappings #17

Closed k----n closed 2 years ago

k----n commented 3 years ago

There seems to be new mappings for Version T.

For example, what is the difference between b2fA and b2fa? The a and A seems to indicate author, but there are different authors?

audrism commented 3 years ago

Yes as per tutorial capital letter indicates curation: in this case de-aliasing via a2A/A2a Similarly, P indicates deforked version of p (via p2P/P2p map)

k----n commented 3 years ago

Here's a list of the mappings and their respective servers for version T:

da2_data/c2p
da3_data/A2fb
da3_data/b2fa
da3_data/f2b
da3_data/f2c
da4_data/a2f
da4_data/b2c
da4_data/b2fa
da4_data/f2a
da5_data/a2fb
da5_data/A2tspan
da5_data/b2fA
da5_data/b2f
da5_data/b2tk
da5_data/c2b
da5_data/c2fbb
da5_data/c2f
da5_data/c2PtabllfPkg
da5_data/fc2BSLP
da5_data/fPnbnfb2P
da5_data/fPnLbnfLb2P
da5_data/fPnSbnfSb2P
da5_data/fPnSLbnfSLb2P
da5_data/obb2cf
da5_data/P2A
da5_data/p2a
da5_data/p2c
da5_data/P2core
da5_data/P2fb
da5_data/P2fP3
da5_data/P2LfP3
da5_data/P2mnc
da5_data/P2SfP3
da5_data/P2SSfP3
da5_data/Pnb
da5_data/tk2b

It was found using ls -d /da?_data/basemaps/gz/*FullT1.s | cut -d/ -f2,5 | sed -e "s/FullT1.s$//"

There are some new key identifying letters (found by splitting on '2'):

Do you mind explaining them? I'm happy to help update the tutorial after to make it more clear.

audrism commented 3 years ago

Per tutorial a - author, b - blob, c -commit, p-project, f-file Capital letter means corrected.

The following list is of less "experimental" maps with explanation where needed

A2P
A2b
A2c
A2f
A2fb - to first blob
A2tPc
P2A
P2L - project blobs corresponding to LICENSE* files
P2Lb - project blobs that are shared among fewer than 100 Projects
P2a
P2b
P2c
P2core - Project to devs who make 80+% of the commits
P2f
P2fb - Project to first blob (originated in that project)
P2g - gender of devs in the project
P2tac - project to time, author, commit
a2P
a2c
a2f
a2fb - first blob (created by the author)
a2p
b2P
b2T  - experimental: defork projects P by root tree
b2c
b2f
b2fA  - blob to time of first author/commit
b2fLICENSE - grep for LICESNSE in b2f
b2fa - blob to time of first author/commit
b2ob - blob to old blob (matching old an new blob in a commit for the same file path)
b2tA - to time author (sort ted by blob/time)
b2ta - to time author
bL2P - license blob to project
bb2cf  - result of diff on a commit: blob old blob, commit, file
c2P
c2b
c2cc - to child commit
c2dat - full commit data in semicolon-separated fields
c2f
c2fbb  - result of diff on a commit: commit file, blob, old blob
c2p
c2pc - to parent commit
dl2Pf - API defined; language; project; file
f2a
f2b
f2c
ob2b -  old blob to blob (matching old an new blob in a commit for the same file path)
obb2cf - see bb2cf but blobs reversed
p2a
p2c
t2P  - root tree to project
t2c - root tree to commit