-
**Is your feature request related to a problem? Please describe.**
I am currently working with a dataset consisting of 20,000 sequences, and I need to perform distance-based clustering on these seq…
-
Hi developer,
Thank for developing the phammseqs!
I noticed that only mmseqs2=13.45111 is required for running phammseqs. I am encountered error associated with mmseqs2 during the clustering of …
-
Hi,
I tried to run TEtrimmer with the test set, using the command:
**TEtrimmer --input_file test_input.fa --genome_file test_genome.fasta --output_dir test_output --num_threads 20 --classify_all…
-
Hi,
I was checking ESM-3 structure embedding, and ESM-3 sequence embedding, and found that the distance between embeddings is very less ( 0.0001) . I am curious how ESM-3 model is pre-trained with …
-
Hello,
I've got a question regarding the argument `--maxaccepts` of the `vsearch` command `--cluster_fast`:
The manpage states the following about `maxaccepts`:
"_The search process sorts tar…
-
Hello,
Firstly, thank you for the package! We are currently trying the tool for our transcriptomic data set.
The tool works perfectly fine in clustering mode but we encountered an error when extract…
-
Hi,
I am not sure if this is the right place to ask this, but is there a set of options to recreate the same clustering as the ones in the weekly sequence clustering of PDB. As in this link for exam…
-
Dear Edgardo, Ive noticed that when adding reference sequences to alignments, prior to informed paralog filtering, in some cases these are added in the reverse direction to the extracted sequences in …
-
Hi, I ran this command to perform Isoquant quantification with the aim of obtaining transcript quantification results for all samples: isoquant.py --reference genome.fa --genedb ./merge.combined.gtf -…
-
How does VSEARCH scores alignments on masked regions? I would expect it to discard hits between masked regions but, when clustering the sequences:
```
>HWI-ST397:349:C2YYMACXX:3:1102:20033:76341…