HazyResearch / HypHC

Hyperbolic Hierarchical Clustering.
194 stars 26 forks source link

Support for Large Datasets #5

Open Tanvi141 opened 3 years ago

Tanvi141 commented 3 years ago

Our paper was recently accepted at WI-IAT and will be published soon, here is the arxiv version:(https://arxiv.org/abs/2110.15923)

We leverage HypHC in our work to reduce the dimensions. Our dataset had 59260 data points, each of dimension 600. The current version of the code is giving out of memory errors in the pre-training stage itself. By moving some lines around, and rewriting sections of the code, we were able to keep the same functionality of the code and train the model for our dataset.

I have included two additional arguments in the config file: