Closed amdqiao closed 1 year ago
Ping @adamgayoso who might have a thing or two to say about scvi-tools and MuData support.
@amdqiao Thanks for the question! Are there particular applications or scenarios that would benefit from supporting MuData the most, in your opinion?
As @Zethson mentioned, there's MuData support for some multimodal models in scvi-tools. Newly developed models for single-cell multi-omics are hopefully supporting MuData straightaway — and we're happy to see what we can do from our side to make MuData support more abundant.
Yes, I've checked the scvi-tools and found it a useful tool. However, it will be even better if MuData can be supported as the input format for popular ML packages such as scikit-learn.
As scikit-learn provides a generic collection of machine learning methods in Python, I don't think it will or should support more domain-specific data structures. However as it operates on numpy arrays and pandas data frames as well as AnnData/MuData, it is possible to write lean interfaces for the methods implemented in scikit-learn.
If there are methods that are particularly suited for single-cell omics, feel free to suggest and/or contribute them to scanpy, muon or other frameworks!
Since many ML packages support AnnData format already, will MuData developers consider reaching out to ML package developers to make more ML packages load MuData as input? Thanks!