Open 223050025 opened 7 months ago
Public cfDNA whole genome sequencing datasets: GSE71378, GSE124686, GSE81314
- WGBS dataset: CRA001537
pls specify the save path of your data
Try to use SRA ToolKit to download fastq files. As example, GSE71378's raw file id is SRR061633, comment "fastq-dump --split-files SRR061633" will split two original paired-end reads (paired sequencing sequences) into two files, with the first and second sequences of each paired reads stored separately.
The data is saved by /share/home/grp-sunhao/liyixiao/SRR061633_1.fastq
@Milokita 师兄,请问如何下载EGA数据库里的数据
In short, if it's controlled dataset, you need to write an application otherwise you can simply download it
The request access for EGAD00001000856 is submitted, I am waiting the reply from CUHK Circulating Nucleic Acids Research Group. And then I will submit the Signed Policy for Plasma DNA data sharing by following the reply email.
Modify of Tamplatet-Latex, add background part from proposal with reference as example of create a bibliography. This project is synced with the GitHub repository at 223050025/LiYixiao-template
I meet the error with message: I tensorflow/core/common_runtime/executor.cc:1197] [/device:CPU:0] (DEBUG INFO) Executor start aborting (this does not indicate an error and you can ignore this message): INVALID_ARGUMENT: You must feed a value for placeholder tensor 'gradients/split_2_grad/concat/split_2/split_dim' with dtype int32 [[{{node gradients/split_2_grad/concat/split_2/split_dim}}]] How do I solve this problem. @Milokita
I run the code in an online platform Kaggle, the problem is disappear. But when I run the code on Jupyter notebook in my own computer, will meet the error above.
not to use this to submit file, it may lost during transit
when use bowtie2-build on HPC: error: /usr/bin/env: python3: No such file or directory
@Milokita
pls open new issue for your problem and append detail info realated to your question. This issue is for progress report ONLY
Talking with Dr.Sun to ensure the research topic, read the paper "DNA methylation analysis explores the molecular basis of plasma cell-free DNA fragmentation. Nature Communications, 14(287), https://doi.org/10.1038/s41467-023-35959-6".
Run the code with public dataset first. The final goal is to try to use deep learning model to classify and analyse the cell-free DNA dataset, and build the docker.