Open satra opened 4 years ago
which dataset are we talking about? ADHD200 seems to not carry participants.tsv and all split per site:
$> datalad ls s3://fcp-indi/data/Projects/ADHD200/RawDataBIDS/
Connecting to bucket: fcp-indi
[INFO ] S3 session: Connecting to the bucket fcp-indi with authentication
Bucket info:
Versioning: S3ResponseError: 403 Forbidden
Website: S3ResponseError: 403 Forbidden
ACL: S3ResponseError: 403 Forbidden
data/Projects/ADHD200/RawDataBIDS/Brown/
data/Projects/ADHD200/RawDataBIDS/KKI/
data/Projects/ADHD200/RawDataBIDS/KKI_1/
data/Projects/ADHD200/RawDataBIDS/KKI_2/
data/Projects/ADHD200/RawDataBIDS/NYU/
data/Projects/ADHD200/RawDataBIDS/NeuroIMAGE/
data/Projects/ADHD200/RawDataBIDS/OHSU/
data/Projects/ADHD200/RawDataBIDS/Peking_1/
data/Projects/ADHD200/RawDataBIDS/Peking_2/
data/Projects/ADHD200/RawDataBIDS/Peking_3/
data/Projects/ADHD200/RawDataBIDS/Pittsburgh/
data/Projects/ADHD200/RawDataBIDS/Pittsburgh_Test/
data/Projects/ADHD200/RawDataBIDS/WashU/
data/Projects/ADHD200/RawDataBIDS/du_1/
data/Projects/ADHD200/RawDataBIDS/mta_1/
data/Projects/ADHD200/RawDataBIDS/nyu_1/
there is a participants.tsv
per site.
d'oh - didn't spot that there was find .
for those files.
FWIW, recrawled those subdatasets -- no changes in the bucket
in ADHD200 this is the only mprage file: ['RawData/Peking_3/1404738/session_1/anat_1/mprage.nii.gz'] that doesn't have a correspondence in RawDataBIDS.
the good news is that all participant ids match between BIDS and RawData. so the participants.tsv is simply missing a lot of info. we are going to pull the info the RawData phenotype files.
these phenotypic csv's for rawdata are missing from the s3 bucket
Peking_2_phenotypic.csv
Peking_3_phenotypic.csv
and if i get the participants.tsv and simply concatenate them
suggesting that about 400 participants are not indexed in the participants.tsv.
i'm using datalad to get these files
cc/ @yarikoptic @dbkeator
@ccraddock - let us know who could fix these things assuming they are an issue.