kalininalab / DataSAIL

DataSAIL is a tool to split datasets while reducing information leakage.
https://datasail.readthedocs.io
MIT License
18 stars 1 forks source link

Accept Similarity Matrices as Input without Separate Naming #21

Open Old-Shatterhand opened 8 months ago

Old-Shatterhand commented 8 months ago

Is your feature request related to a problem? Please describe. It's just for more convenience and removal of duplicate code or to circumvent inaccessible data.

Describe the solution you'd like It would be cool if one could provide the similarity/distance matrix as an argument without giving the actual data as another argument because if the matrix is user-provided, the actual data is unnecessary to provide.

Describe alternatives you've considered So far, one has to provide the actual data, e. g., FASTA sequences or whole genomes, to DataSAIL, which can be unnecessarily memory consuming.

Additional context None