linfanxiao closed this issue 6 months ago
Have you checked whether you have sufficient disk space to save the indexes? Either way, this thrashing behavior isn't good, but I expect that something is preventing the files from being saved.
It might help me diagnose the issue if you included the full error output instead of just the end of it.
@jltsiren I think we might be hitting a failure case in the robustness code I implemented for the GBWT buffer size (e.g. https://github.com/vgteam/vg/blob/master/src/index_registry.cpp#L2686-L2733). I might need to add a distinct error code like I did in GCSA2.
Hi jeizenga, this is the complete error output of the command. We have allocated 2T of storage space for temporary files, but the process has not ended yet, and the output shows no significant error messages. It's quite unusual. index_473734.txt
With this behavior, I don't think it will finish. It looks to me like it's stuck in some kind of thrashing behavior, so you will probably have to kill the job. Before you do that though, can you check the size of the intermediate files in the working directory (supplied with `-T`) using `du -h`? And also the size of the outputs (starting with `--prefix`)? That should clarify whether it's a disk use issue. When you kill the process, the temporary files will be deleted, so be sure to check the disk usage first.
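For reference, the checks above could be bundled into something like the following. This is only a rough sketch: `report_usage` is a made-up helper name, the two arguments stand in for whatever was passed to `-T` and `--prefix`, and `sort -h` assumes GNU coreutils (which a Linux cluster should have).

```shell
#!/usr/bin/env bash
# Sketch: report disk usage for a vg temporary directory and an output prefix.
# report_usage is a hypothetical helper, not part of vg.
# $1 = directory given to -T, $2 = value given to --prefix.

report_usage() {
    local tmp_dir="$1" out_prefix="$2"

    # Total size of the intermediate files.
    du -sh "$tmp_dir"

    # Largest entries first (sort -h understands the K/M/G suffixes).
    du -h "$tmp_dir" | sort -rh | head -n 5

    # Size of any outputs written so far; the glob may match nothing.
    ls -lh "$out_prefix"* 2>/dev/null || echo "no files starting with $out_prefix"

    # Free space on the filesystem that holds the temporary directory.
    df -h "$tmp_dir"
}
```

Calling it as `report_usage tmp/ my_prefix` (with your own values) while the job is still running should show both how much the intermediates have grown and how much room is left on that filesystem.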
[yc07671@login-0-0 VG]$ du -h tmp/
1.2G tmp/vg-jGUbiA/dir-zf5FfW
469M tmp/vg-jGUbiA/dir-1PYTcu
63M tmp/vg-jGUbiA/dir-Z932da
46M tmp/vg-jGUbiA/dir-EUHmY9
552K tmp/vg-jGUbiA/dir-oDPhh9
312K tmp/vg-jGUbiA/dir-sr8J43
99M tmp/vg-jGUbiA/dir-pmlo2R
440K tmp/vg-jGUbiA/dir-raMbpu
332K tmp/vg-jGUbiA/dir-51SQfM
130M tmp/vg-jGUbiA/dir-bw3612
312K tmp/vg-jGUbiA/dir-UYyUuv
432K tmp/vg-jGUbiA/dir-87im8g
260K tmp/vg-jGUbiA/dir-7OxT9r
308K tmp/vg-jGUbiA/dir-8kFZy1
532K tmp/vg-jGUbiA/dir-FMUkdY
166M tmp/vg-jGUbiA/dir-2LAuku
352K tmp/vg-jGUbiA/dir-ne4BLa
112M tmp/vg-jGUbiA/dir-ED1QaA
144K tmp/vg-jGUbiA/dir-ZXBGkv
155M tmp/vg-jGUbiA/dir-AkPSXY
116K tmp/vg-jGUbiA/dir-N3tepj
131M tmp/vg-jGUbiA/dir-0h903d
132K tmp/vg-jGUbiA/dir-MlSwE8
168K tmp/vg-jGUbiA/dir-dX4yN3
216K tmp/vg-jGUbiA/dir-P1DVVo
168K tmp/vg-jGUbiA/dir-ULhEQm
160K tmp/vg-jGUbiA/dir-8x419R
144K tmp/vg-jGUbiA/dir-MLKyis
164K tmp/vg-jGUbiA/dir-bUEyyC
152K tmp/vg-jGUbiA/dir-ruq8kl
50M tmp/vg-jGUbiA/dir-LerJ35
172K tmp/vg-jGUbiA/dir-n2kSzE
188K tmp/vg-jGUbiA/dir-VTVKnk
135M tmp/vg-jGUbiA/dir-qPbIp3
208K tmp/vg-jGUbiA/dir-SYScTa
1.9M tmp/vg-jGUbiA/dir-45HqzM
185M tmp/vg-jGUbiA/dir-LpFHIq
57M tmp/vg-jGUbiA/dir-xsqhxd
76G tmp/vg-jGUbiA/dir-5hpCIS
86G tmp/vg-jGUbiA
88G tmp/
Hi jeizenga, all the files in the 'tmp' directory are temporary files, and there are no files with the 'output' prefix. I have deleted all of the tmp files. I will probably try reindexing them one by one to investigate the cause of the issue. If you could provide a pre-indexed reference pangenome or pantranscriptome, for example for hg38, that would help me a lot. Thanks for your consideration and time!
I just merged a PR that should help us determine what's happening. If you rebuild with the current master branch, you can try again to see if the issue is fixed. At a minimum, I expect that the thrashing behavior where it repeatedly tries to re-make the GBWT should be gone. The indexing may still fail, but we should at least get a more informative error message.
1. What were you trying to do?
2. What did you want to happen?
Make the index files, including GBWT, XG and so on.
3. What actually happened?
It ran for three days without producing any new output.
![image](https://github.com/vgteam/vg/assets/62028275/20db6a81-c6d3-4aad-8fc4-2cb8617d21f4)
4. If you got a line like `Stack trace path: /somewhere/on/your/computer/stacktrace.txt`, please copy-paste the contents of that file here:
5. What data and command can the vg dev team use to make the problem happen?
vcf: https://ftp.ensembl.org/pub/release-110/variation/vcf/homo_sapiens/1000GENOMES-phase_3.vcf.gz
reference fasta: https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_44/GRCh38.primary_assembly.genome.fa.gz
gtf: https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_44/gencode.v44.annotation.gtf.gz
6. What does running `vg version` say?