This repository contains the code for the analysis on the expression profile of the transportome in Cancer based on the MTP-DB.
[!NOTE] Read the preprint here: Profiling the Expression of Transportome Genes in cancer: A systematic approach
It's potentially out of date.
The project follows the Kerblam! standard.
You can run the analysis pipelines with Kerblam! and docker
:
# Clone the repo
git clone git@github.com:TCP-Lab/transportome_profiler.git
cd ./transportome_profiler
kerblam data fetch # Fetch the input data not present in the repository
kerblam run <pipeline>
Kerblam! will build docker containers and run the analysis locally. To run without docker, read below.
The project currently encompasses the following pipelines:
heatmaps
: Create large heatmaps from the expression matrices by using GSEA
on computed gene rankings, testing all possible gene lists that can be made
from the MTP-DB
.
test
profile makes this pipeline much faster by running on smaller
(i.e. sampled) input data (~75% reduction in sample number, only 5000 random genes).You need some requirements to be installed before you can run the analysis locally:
R
version 4.3.0+
.
./src/helper_scripts/install_r_pkgs.R
.Python
version 3.11+
.
pip install -r ./src/requirements.txt
.jq
utility (that you can find here).xsv
program, required by metasplit
(sudo pacman -Syu xsv
on Arch, not packaged by Debian, but this guide might be useful.
If you have cargo
installed, you can simply run cargo install xsv
).generanker
(namely installing fast-cohen)xls2csv
utility (on arch yay -Syu perl-xls2csv
)Rscript ./src/helper_scripts/install_R_pkgs.R
N_THREADS
(with export N_THREADS=...
) to run with less threads.If you have all the requirements, you can simply:
kerblam run <pipeline> --local
[!IMPORTANT] The manuscript for this project is also available online in this repository.