-
First time using this package. I have run an alignment with linclust and am wondering if there is a way of extracting the positional information of the clusters and the distance between them, with the…
-
## Expected Behavior
I am clustering billions of protein sequences. I already built the database. I was expecting linclust to run fairly quickly, but it seems to get stuck on the initial `kmermatcher…
-
Dear MMseqs2 team,
Can you please give some advice on how to proceed with clustering a large set of short k-mers, something between 7-mer and 9-mers? I tried playing around with linclust but it's…
-
Hi Edgardo,
When running `captus_assembly extract -a 02_assemblies -o 03_extractions -n target_file.fasta -p SeedPlantsPTD -m SeedPlantsMIT -c` on a hpc with Captus version: v1.0.1, the run termina…
-
As the data that can be processed with mmseqs is generally quite large. The tsv output is usually very large. I wonder if it would be difficult to create hdf files or other more efficient file formats…
-
## Expected Behavior
Unsure
## Current Behavior
Clustering of a large fasta file has been stuck at the prefilter stage for multiple days (>5 days).
## Steps to Reproduce (for bugs)
Please make su…
-
If I use the command:
```
mmseqs easy-cluster {input} {params.outprefix} tmp/ --min-seq-id 0.9
```
How will mmseqs determine the representative cluster? I've tried reading through the documentat…
-
Dear all,
thank you very much for making the code and weights available as well as providing installation instructions etc.
I tried to fetch the databases, but hit a bump in between.
As you can…
-
## Expected Behavior
I'm using gene predictions form Refseq. They unified the gene names, so that the same gene in different genomes has the same name, e.g. `WP_012419350.1`
Now I can pass this …
-
## Expected Behavior
When setting --split-memory-limit to 250G, got error said "Error in argument --split-memory-limit".
## Current Behavior
If setting it to 250, the error was gone. But got "Not …