-
## Expected Behavior
After using “mmseqs easy-linclust” clustering, the retained sequence is non-redundant
## Current Behavior
After using “mmseqs easy-linclust” clustering, he retained seque…
-
Hi,
I want to cluster a large dataset of DNA sequences. Must I first convert my fasta file into a DB format file? As is written here: https://github.com/soedinglab/MMseqs2/wiki#linclust, or can I u…
-
Hi all, thanks for this MMseqs2 that seems very efficient.
Unfortunately it seems to not be willing to run on my machine :
mmseqs easy-cluster /Users/s/Documents/Albatros/protein//short_name-Gr…
-
## Expected Behavior
[dataset.zip](https://github.com/user-attachments/files/16023829/dataset.zip)
I have a group of sequences which is properly aligned with almost full length and >95% identity usi…
-
Version: `commit 3eabfaff83bb77eac5ef342e8905cc4f7d378cb7`
Command:
```
./conterminator/build/bin/conterminator protein GTDB97.faa taxonomy/mapping_new conterminator_phylum_level conterminator_tm…
-
Hi,
I couldn’t find any information in the documentation about the differences between the `cluster`, `linclust`, and `deepclust` commands. Based on the paper’s description, I believe that the resu…
-
Hello,
## Expected Behavior
Output clustering results.
## Current Behavior
Segmentation in linclust.sh
## Steps to Reproduce (for bugs)
```
mmseqs createdb seq.fa db/dbclust
mmseqs linclust …
-
Hi,
I'm running `linclust` on a cloud instance with network file systems. I'm wondering which `--db-load-mode` should I use to alleviate the I/O bottleneck of NFS.
```
--db-load-mode INT …
-
Hi,
I really like Linclust, which makes it possible to cluster genes within linear time. For my dataset with 1.1G genes, it seems impossible to get it done by using CD-HIT. Linclust opens a door to…
-
## Expected Behavior
mmseqs easy-cluster should finish without errors.
## Current Behavior
```
Query database size: 19552 type: Nucleotide
Estimated memory consumption: 8G
Target database …