Closed wagpa closed 1 year ago
The split provides an insight on how the decoder would perform on different embeddings generated by the same embedder. But it could be a problem, if the decoder only sees a part of the embedding.
This could be tested with more complex embeddings. Otherwise we could provide a simple setting that disables splitting.
Do we want to split the dataset? What information would we get?