jdonaldson / rtsne

An R package for t-SNE (t-Distributed Stochastic Neighbor Embedding)
58 stars 24 forks

Parametric t-SNE #3

Closed mahdeto closed 7 years ago

mahdeto commented 7 years ago

Basically, the ability to embed new points (ones that I did not have at the time of training) into the map without retraining on the entire data set. A potential approach would be to train a multivariate regressor to predict the map location from the input data. Alternatively, you could make such a regressor minimize the t-SNE loss directly, which is what Laurens did in this paper.
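A rough sketch of the first suggestion, using a multivariate linear regression as the regressor (the model choice and the iris-based train/new split are assumptions for illustration; Laurens' paper instead trains a neural network on the t-SNE loss itself):

```r
library(tsne)
set.seed(1)

# Train a map on the "old" data.
train_x   <- as.matrix(iris[1:100, 1:4])
train_emb <- tsne(train_x, k = 2, perplexity = 20, max_iter = 300)

# Fit a multivariate regression from inputs to map coordinates
# (lm supports a matrix response; one column per map dimension).
fit <- lm(train_emb ~ train_x)

# Embed unseen points without retraining the whole map.
new_x   <- as.matrix(iris[101:110, 1:4])
new_emb <- cbind(1, new_x) %*% coef(fit)  # intercept column + coefficients
```

A linear map is a crude approximation of the t-SNE embedding; it only demonstrates the out-of-sample mechanics, not the quality Laurens' parametric approach achieves.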

jdonaldson commented 7 years ago

One way to do this is to use the initial_config parameter. This parameter accepts a pre-calculated embedding as a bootstrap.

The process would involve:

  1. Train an embedding on an initial set of data, and save it.
  2. Add new observations to the training data, and set their locations in the embedding to some initial coordinates (e.g. perhaps the origin, or a median of some sort).
  3. Provide the modified training data to the tsne function, along with the modified embedding (as initial_config).
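The steps above can be sketched as follows (the iris-based old/new split, the median seeding, and the second-pass settings are illustrative assumptions):

```r
library(tsne)
set.seed(42)

# Hypothetical split: 90 "old" rows and 10 "new" rows.
old_x <- as.matrix(iris[1:90, 1:4])
new_x <- as.matrix(iris[91:100, 1:4])

# 1. Train an embedding on the initial set of data, and save it.
old_emb <- tsne(old_x, k = 2, perplexity = 20, max_iter = 300)

# 2. Add new observations and start them at the column-wise median
#    of the existing embedding.
med       <- apply(old_emb, 2, median)
new_start <- matrix(med, nrow = nrow(new_x), ncol = 2, byrow = TRUE)

# 3. Re-run tsne on the combined data, passing the modified embedding
#    as initial_config (min_cost here is an illustrative value).
comb_emb <- tsne(rbind(old_x, new_x),
                 initial_config = rbind(old_emb, new_start),
                 perplexity = 20, min_cost = 0.1, max_iter = 300)
```

Because `initial_config` is supplied, the second call skips the PCA layout and refines the existing map around the seeded points.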

This carries some caveats, so here are my thoughts on those:

  1. If an initial_config is provided, tsne skips the initial embedding phase (a PCA layout). Normally this is what you want, but if you add a large amount of new data, it's probably better to re-do the initial layout from scratch.
  2. It's a good idea to decrease the min_cost parameter on the second pass, to give the new data a fair chance to find an optimal embedding.

Keep in mind you can visualize the progress of the tsne algorithm using the epoch_callback parameter. You could flag the new points, watch them settle, and get a better idea of how to tune the parameters.
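For example, the callback below records the layout at each epoch (it could just as easily plot it, coloring the flagged new points); the `snapshots` name is hypothetical:

```r
library(tsne)
set.seed(7)
x <- as.matrix(iris[sample(nrow(iris), 60), 1:4])

snapshots <- list()
record <- function(y) {
  # Called every `epoch` iterations with the current embedding;
  # e.g. plot(y, col = point_flags) would show new points settling.
  snapshots[[length(snapshots) + 1]] <<- y
}

emb <- tsne(x, epoch_callback = record, epoch = 100, max_iter = 300)
```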