neurocausal / neurocausal_data

The data used by neurocausal

NeuroCausal: data-storage

This repository will list the parametric files that are required for the analysis pipeline to run. We will also share the OSF link to the downloaded open-access clinical papers that will be used in the NeuroCausal pipeline.

The dataset will be updated periodically as new batches of data are added and new features are added to the tool.

The current data resides in the NeuroCausal OSF repository and covers the specific queries (aphasia + neurodegeneration and aphasia + fMRI) that we ran with the NeuroQuery pipeline.

The shared data has the following folder structure:

query-aphasia_neurodegenerative.zip
  └── query-aphasia_neurodegenerative
      ├── articlesets
      │   ├── articleset_00000.xml
      │   └── info.json
      └── articles
          ├── 000
          │   └── pmcid_4382926.xml
          ├── 00a
          │   └── pmcid_8317687.xml
          ├── 00b
          │   └── pmcid_6625472.xml
          ├── ...
          ├── e12
          │   └── pmcid_8832765.xml
          ├── ...
          │
          ├── info.json
          └── subset_extractedData
                 ├── authors.csv
                 ├── coordinates.csv
                 ├── info.json
                 ├── metadata.csv
                 └── text.csv
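
As a rough illustration of how this layout could be navigated, the sketch below indexes the per-paper XML files by their PMCID. It assumes the archive has already been unzipped and that every article file follows the `articles/<bucket>/pmcid_<id>.xml` pattern shown in the tree; the function name `list_article_pmcids` is hypothetical and not part of the NeuroCausal tooling.

```python
import re
from pathlib import Path

# File-name pattern taken from the tree above, e.g. pmcid_4382926.xml
PMCID_PATTERN = re.compile(r"^pmcid_(\d+)\.xml$")


def list_article_pmcids(query_dir):
    """Return {pmcid: path} for every articles/<bucket>/pmcid_<id>.xml file.

    Hypothetical helper: assumes the unzipped query folder layout shown above.
    """
    articles_dir = Path(query_dir) / "articles"
    found = {}
    # One level of bucket folders (000, 00a, ...), then the article files.
    for xml_path in sorted(articles_dir.glob("*/pmcid_*.xml")):
        match = PMCID_PATTERN.match(xml_path.name)
        if match:
            found[int(match.group(1))] = xml_path
    return found
```

Calling `list_article_pmcids("query-aphasia_neurodegenerative")` on an unzipped copy would then give a dictionary mapping each PMCID to the path of its XML file.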

The articlesets folder contains the raw XML file holding the bulk of the papers downloaded with the query terms. The articles folder contains one XML file for each downloaded paper. The subset_allArticles_extractedData folder (shown as subset_extractedData in the tree above) contains the extracted data, separated into individual CSV files: authors, coordinates, metadata, and text.
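
The extracted CSV files can be read with the Python standard library alone. The following is a minimal sketch, not part of the NeuroCausal pipeline: the function name `load_extracted_tables` is hypothetical, and because the extracted-data folder name differs between the tree and the description, the code searches for any folder matching `subset_*extractedData`. No column names are assumed; each row is returned as a dictionary keyed by whatever header the CSV file provides.

```python
import csv
from pathlib import Path


def load_extracted_tables(query_dir):
    """Load every CSV in the extracted-data folder into lists of row dicts.

    Hypothetical helper: returns {"authors": [...], "coordinates": [...], ...},
    keyed by file name without the .csv extension.
    """
    query_dir = Path(query_dir)
    # Matches both subset_extractedData and subset_allArticles_extractedData.
    candidates = sorted(
        p for p in query_dir.rglob("subset_*extractedData") if p.is_dir()
    )
    if not candidates:
        raise FileNotFoundError(f"no extracted-data folder under {query_dir}")

    tables = {}
    for csv_path in sorted(candidates[0].glob("*.csv")):
        with csv_path.open(newline="", encoding="utf-8") as handle:
            tables[csv_path.stem] = list(csv.DictReader(handle))
    return tables
```

For example, `load_extracted_tables("query-aphasia_neurodegenerative")["coordinates"]` would return the rows of coordinates.csv, assuming that file is present in the unzipped folder.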