Yes, as per the tutorial, a capital letter indicates curation: in this case, de-aliasing via a2A/A2a. Similarly, P indicates the deforked version of p (via the p2P/P2p map).
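To make the a versus A distinction concrete, here is a minimal sketch of what de-aliasing through an a2A map amounts to. The author IDs, the dictionary contents, and the helper names `dealias` and `invert` are all made up for illustration; the real maps live on the servers and their storage format is not shown in this thread. Leaving unknown IDs unchanged is also an assumption.

```python
# Hypothetical a2A map: raw author ID (a) -> de-aliased author ID (A).
a2A = {
    "Jane Doe <jane@example.com>": "Jane Doe <jane@example.com>",
    "jdoe <jane@users.noreply.example.com>": "Jane Doe <jane@example.com>",
    "Bob Roe <bob@example.com>": "Bob Roe <bob@example.com>",
}


def dealias(author_ids, a2A):
    """Map raw author IDs to curated IDs; unknown IDs pass through unchanged."""
    return [a2A.get(a, a) for a in author_ids]


def invert(a2A):
    """Build the reverse (A2a) view: curated ID -> sorted list of raw aliases."""
    A2a = {}
    for alias, canonical in a2A.items():
        A2a.setdefault(canonical, []).append(alias)
    return {canonical: sorted(aliases) for canonical, aliases in A2a.items()}


raw_authors = list(a2A)
curated = dealias(raw_authors, a2A)
print(len(set(raw_authors)), "raw IDs ->", len(set(curated)), "curated IDs")
print(invert(a2A)["Jane Doe <jane@example.com>"])
```

Under this reading, a map ending in a (such as b2fa) refers to raw author IDs, while its capitalized counterpart (b2fA) refers to the curated ones.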
Here's a list of the mappings and their respective servers for version T:
da2_data/c2p
da3_data/A2fb
da3_data/b2fa
da3_data/f2b
da3_data/f2c
da4_data/a2f
da4_data/b2c
da4_data/b2fa
da4_data/f2a
da5_data/a2fb
da5_data/A2tspan
da5_data/b2fA
da5_data/b2f
da5_data/b2tk
da5_data/c2b
da5_data/c2fbb
da5_data/c2f
da5_data/c2PtabllfPkg
da5_data/fc2BSLP
da5_data/fPnbnfb2P
da5_data/fPnLbnfLb2P
da5_data/fPnSbnfSb2P
da5_data/fPnSLbnfSLb2P
da5_data/obb2cf
da5_data/P2A
da5_data/p2a
da5_data/p2c
da5_data/P2core
da5_data/P2fb
da5_data/P2fP3
da5_data/P2LfP3
da5_data/P2mnc
da5_data/P2SfP3
da5_data/P2SSfP3
da5_data/Pnb
da5_data/tk2b
The list was generated with: ls -d /da?_data/basemaps/gz/*FullT1.s | cut -d/ -f2,5 | sed -e "s/FullT1.s$//"
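For anyone reading without shell access to the servers, the path-to-name step of that pipeline can be sketched in Python. This only mirrors what the cut and sed stages do to each path; the function name `map_label` is hypothetical, and the two sample paths are reconstructed from entries in the list above rather than copied from a real directory listing.

```python
import re


def map_label(path: str) -> str:
    """Mimic `cut -d/ -f2,5 | sed -e "s/FullT1.s$//"` for a single path."""
    fields = path.split("/")
    # cut counts fields from 1, and the leading "/" makes field 1 empty,
    # so fields 2 and 5 are the server directory and the file name.
    selected = "/".join(fields[i] for i in (1, 4) if i < len(fields))
    # As in the sed expression, "." is left as a regex wildcard.
    return re.sub(r"FullT1.s$", "", selected)


sample_paths = [
    "/da5_data/basemaps/gz/b2fAFullT1.s",
    "/da2_data/basemaps/gz/c2pFullT1.s",
]
labels = [map_label(p) for p in sample_paths]
print(labels)  # ['da5_data/b2fA', 'da2_data/c2p']
```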
There are some new key identifying letters (found by splitting on '2'):
Do you mind explaining them? I'm happy to help update the tutorial after to make it more clear.
Per the tutorial: a - author, b - blob, c - commit, p - project, f - file. A capital letter means corrected.
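The legend above, together with the "split on '2'" approach mentioned earlier, can be sketched as a small decoder. This is an illustrative helper, not part of any World of Code tooling: the names `LEGEND`, `split_map_name`, `describe_side`, and `unexplained_sides` are hypothetical, and it deliberately decodes only single-letter sides, reporting everything else (fb, core, tk, and so on) as needing an explanation.

```python
# The five letters given in the tutorial legend.
LEGEND = {"a": "author", "b": "blob", "c": "commit", "p": "project", "f": "file"}


def split_map_name(name):
    """Split a map name such as 'c2p' into (key, value) at the first '2'."""
    key, sep, value = name.partition("2")
    if not sep or not key or not value:
        raise ValueError(f"not a map name: {name!r}")
    return key, value


def describe_side(side):
    """Return the legend meaning of a single-letter side, or None otherwise."""
    if len(side) != 1 or side.lower() not in LEGEND:
        return None
    meaning = LEGEND[side.lower()]
    # A capital letter marks the corrected (curated) variant.
    return f"corrected {meaning}" if side.isupper() else meaning


def unexplained_sides(names):
    """Collect the sides of the given map names that the legend does not cover."""
    unknown = set()
    for name in names:
        for side in split_map_name(name):
            if describe_side(side) is None:
                unknown.add(side)
    return sorted(unknown)


names = ["c2p", "A2fb", "P2core", "b2tk", "p2P"]
print(unexplained_sides(names))  # ['core', 'fb', 'tk']
```

Running this over the full server listing would give the set of sides that still need a description, which is essentially the question being asked in this thread.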
The following is a list of the less "experimental" maps, with explanations where needed:
A2P
A2b
A2c
A2f
A2fb - to first blob
A2tPc
P2A
P2L - project blobs corresponding to LICENSE* files
P2Lb - project blobs that are shared among fewer than 100 Projects
P2a
P2b
P2c
P2core - Project to devs who make 80+% of the commits
P2f
P2fb - Project to first blob (originated in that project)
P2g - gender of devs in the project
P2tac - project to time, author, commit
a2P
a2c
a2f
a2fb - first blob (created by the author)
a2p
b2P
b2T - experimental: defork projects P by root tree
b2c
b2f
b2fA - blob to time of first author/commit (A = de-aliased author)
b2fLICENSE - grep for LICENSE in b2f
b2fa - blob to time of first author/commit
b2ob - blob to old blob (matching old and new blob in a commit for the same file path)
b2tA - to time author (sorted by blob/time)
b2ta - to time author
bL2P - license blob to project
bb2cf - result of diff on a commit: blob, old blob, commit, file
c2P
c2b
c2cc - to child commit
c2dat - full commit data in semicolon-separated fields
c2f
c2fbb - result of diff on a commit: commit, file, blob, old blob
c2p
c2pc - to parent commit
dl2Pf - API defined; language; project; file
f2a
f2b
f2c
ob2b - old blob to blob (matching old and new blob in a commit for the same file path)
obb2cf - see bb2cf but blobs reversed
p2a
p2c
t2P - root tree to project
t2c - root tree to commit
There seem to be new mappings for version T.
For example, what is the difference between b2fA and b2fa? The a and A seem to indicate author, but are there different authors?