giotto-ai / giotto-tda

A high-performance topological machine learning toolbox in Python
https://giotto-ai.github.io/gtda-docs
Other
845 stars 173 forks source link

Working with large datasets. #624

Closed OmarRafique closed 2 years ago

OmarRafique commented 2 years ago

Hi.

I have a datasets of one 1 million samples and each sample has 50000 features. My machine is Intel i9 with 64GB DDR4 RAM. Will the giotto-tda Mapper scale up to this dataset on the specified machine?

In general, what are the scalability issues that I should expect while using giotto-tda Mapper?