davidebolo1993 / VISOR

VarIant SimulatOR for short, long and linked reads
GNU Lesser General Public License v3.0

Major update coming #13

Closed davidebolo1993 closed 3 years ago

davidebolo1993 commented 3 years ago

This is just to let VISOR's users know that a major update is coming, ideally within one month. In its current form, VISOR works just fine, but I feel that part of its code has to be refined (e.g. by avoiding most calls to subprocesses) and new features have to be added. I will also try to implement different read simulators in VISOR LASeR, so that users can choose which one to use. Feel free to add your suggestions.

davidebolo1993 commented 3 years ago

I have now reworked HACk and SHORtS, which are faster and include new features (SHORtS can simulate multiple single cells, for instance some derived from a reference clone). It may be worth adding a read group (RG) ID to the simulated BAM, as it seems a few SV callers require this (https://github.com/tjiangHIT/cuteSV/issues/20#issuecomment-743654177).
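For readers unsure what "adding RG ID" involves at the file level, here is a minimal sketch working on SAM text rather than on VISOR's own code. It assumes only what the SAM format defines: an `@RG` header line with an `ID` field, and an `RG:Z:<id>` optional tag on each alignment record. The function name `add_read_group` and the example values `sim1` and `clone1` are hypothetical, chosen for illustration.

```python
def add_read_group(sam_lines, rg_id, sample):
    """Return SAM lines carrying one @RG header line and an RG:Z tag per record.

    Hypothetical helper for illustration; this is not VISOR's implementation.
    """
    rg_line = f"@RG\tID:{rg_id}\tSM:{sample}"
    header, records = [], []
    for line in sam_lines:
        line = line.rstrip("\n")
        if not line:
            continue
        if line.startswith("@"):
            # Drop any pre-existing @RG line so the header holds a single group.
            if not line.startswith("@RG\t"):
                header.append(line)
        else:
            fields = line.split("\t")
            mandatory, optional = fields[:11], fields[11:]
            # Remove a stale RG tag before appending the new one.
            optional = [tag for tag in optional if not tag.startswith("RG:Z:")]
            records.append("\t".join(mandatory + optional + [f"RG:Z:{rg_id}"]))
    return header + [rg_line] + records


sam = [
    "@HD\tVN:1.6\tSO:coordinate",
    "@SQ\tSN:chr1\tLN:1000",
    "read1\t0\tchr1\t1\t60\t4M\t*\t0\t0\tACGT\tIIII\tNM:i:0",
]
tagged = add_read_group(sam, "sim1", "clone1")
print("\n".join(tagged))
```

In practice the same result is usually obtained at alignment time or with a dedicated BAM tool; the sketch only shows which two places in the file a caller that requires read groups will look at.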

biozzq commented 3 years ago

Hi @davidebolo1993

If possible, could you add this feature (https://github.com/davidebolo1993/VISOR/issues/11) to the upcoming version?

davidebolo1993 commented 3 years ago

I am also adding -Y to the minimap2 alignment in VISOR LASeR (https://github.com/tjiangHIT/cuteSV/issues/20#issuecomment-744185516).
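As context, minimap2's `-Y` option makes it use soft clipping for supplementary alignments in SAM output, so the full read sequence is kept on those records. The sketch below only assembles such a command line and does not run it. The function name `minimap2_command`, the file names, the `map-ont` preset and the thread count are assumptions for illustration and are not taken from VISOR's code.

```python
import shlex


def minimap2_command(reference, reads, preset="map-ont", threads=4,
                     soft_clip_supplementary=True):
    """Build (but do not run) a minimap2 argument list.

    Hypothetical helper: preset and thread count are illustrative defaults.
    """
    # -a: SAM output, -x: preset, -t: number of threads
    cmd = ["minimap2", "-a", "-x", preset, "-t", str(threads)]
    if soft_clip_supplementary:
        # -Y: use soft clipping for supplementary alignments
        cmd.append("-Y")
    cmd += [reference, reads]
    return cmd


cmd = minimap2_command("ref.fa", "reads.fq")
print(shlex.join(cmd))
```

Passing the resulting list to `subprocess.run(cmd, check=True)` would execute it, provided minimap2 is installed and the input files exist.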

davidebolo1993 commented 3 years ago

I'm done with the major changes to all the modules. @biozzq, long-read simulations are now performed using Badread. This solves #11, as Badread stores the regions used for simulation in the FASTQ header (and you will find this information in the read name in the BAM file generated by LASeR). Long-read simulation is slower overall with Badread (although VISOR implements a parallel version that should help save time), but it gives users control over a series of useful parameters (users can also train their own error and quality score models, and I think this can be adapted for #8 as well). BWA is no longer supported in VISOR, which now uses minimap2 for paired-end short-read alignments as well (it is tens of times faster and has similar accuracy, from what I've seen in preliminary tests). All the code has been reworked and calls to external bash scripts have been removed. If you notice any bugs, please get in touch!
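To illustrate how the region of origin could be recovered from such a header, here is a minimal sketch. It assumes a header of the form `<read id> <reference>,<+ or ->strand,<start>-<end> length=...`, which is how I understand Badread to label ordinary reads; the exact form of the read name in LASeR's BAM may differ, so check it against your own output first. The function name `parse_origin` and the example header are hypothetical, and the coordinate convention (0-based or 1-based) is not assumed here.

```python
import re

# Assumed layout of the second header token: "<ref>,<+|->strand,<start>-<end>"
_ORIGIN = re.compile(
    r"^(?P<ref>.+),(?P<strand>[+-])strand,(?P<start>\d+)-(?P<end>\d+)$"
)


def parse_origin(header):
    """Return the simulated region encoded in a header, or None if absent.

    Hypothetical helper; headers that do not match the assumed layout
    (for example junk, random or chimeric reads) yield None.
    """
    parts = header.lstrip("@").split()
    if len(parts) < 2:
        return None
    match = _ORIGIN.match(parts[1])
    if match is None:
        return None
    return {
        "read_id": parts[0],
        "ref": match.group("ref"),
        "strand": match.group("strand"),
        "start": int(match.group("start")),
        "end": int(match.group("end")),
    }


example = "@read-0001 chr1,+strand,1000-6000 length=5012 error-free_length=5000 read_identity=90.00%"
origin = parse_origin(example)
print(origin)
```

Comparing `origin["start"]` and `origin["end"]` with the aligned position of the same read gives a quick per-read check of where the aligner placed it relative to where it was simulated.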

davidebolo1993 commented 3 years ago

Of course, have a look at the documentation before running VISOR. Some parameters have changed since the previous version.