Clay-foundation / model

The Clay Foundation Model (in development)
https://clay-foundation.github.io/model/
Apache License 2.0

Explore embeddings using tsne #132

Closed brunosan closed 1 month ago

brunosan commented 6 months ago

This PR adds a sample notebook for exploring the embedding space locally with openTSNE. Depending on your compute resources, it can scale up to the full v0 training set.

It uses Mapbox to pull RGB context imagery, and it also uses (and documents) a few tricks for running t-SNE on such a large corpus with so many dimensions.
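As a rough illustration of the kind of tricks that make t-SNE tractable on a large, high-dimensional corpus, here is a minimal sketch. It is not taken from the notebook: the two tricks shown (PCA reduction before t-SNE, then fitting on a random subsample and placing the remaining points with `transform`) are common choices assumed here, and the array sizes, the helper names `pca_reduce` and `split_sample`, and the synthetic data are all hypothetical. The openTSNE step is skipped if the library is not installed.

```python
import numpy as np


def pca_reduce(x, n_components):
    """Project the rows of x onto the top principal components (plain SVD)."""
    centered = x - x.mean(axis=0)
    # Rows of vt are principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def split_sample(n_rows, n_sample, seed=0):
    """Return disjoint index arrays: a random sample and the remainder."""
    perm = np.random.default_rng(seed).permutation(n_rows)
    return perm[:n_sample], perm[n_sample:]


# Synthetic stand-in for real embeddings (sizes are arbitrary assumptions).
rng = np.random.default_rng(42)
embeddings = rng.normal(size=(1000, 128)).astype(np.float32)

# Trick 1 (assumed): cut dimensionality with PCA before running t-SNE.
reduced = pca_reduce(embeddings, n_components=20)

# Trick 2 (assumed): fit on a subsample, then place the rest with transform().
sample_idx, rest_idx = split_sample(len(reduced), n_sample=300)

layout = None
try:
    from openTSNE import TSNE
except ImportError:
    TSNE = None  # openTSNE not installed: only the preprocessing runs

if TSNE is not None:
    tsne = TSNE(n_components=2, perplexity=30, initialization="pca",
                n_jobs=-1, random_state=42)
    sample_2d = tsne.fit(reduced[sample_idx])          # full optimization
    rest_2d = sample_2d.transform(reduced[rest_idx])   # cheap placement
    layout = np.empty((len(reduced), 2))
    layout[sample_idx] = np.asarray(sample_2d)
    layout[rest_idx] = np.asarray(rest_2d)
```

With real embeddings, the sample size and the number of PCA components are the main knobs for trading layout quality against memory and run time.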

This is an example of the output.

[Image: Screenshot 2024-01-23 at 10 48 32]

yellowcap commented 1 month ago

@brunosan I think we should update this to v1 or close. What do you think?

brunosan commented 1 month ago

We can redo this later. Closing for now.