Closed davidfstein closed 1 month ago
Can you check the model definition (i.e., the size of the embedding layer) and the batch that contains the indices passed to the embedding layer? I guess there's a new category at val/test time that isn't included in the col_stats, but I'm not 100% sure until I reproduce it on my side. If you could share your code and the full error message so we can reproduce it, one of us will take a look :)
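For reference, the check suggested above could look roughly like this. It is only a sketch in plain Python, assuming the categorical column has already been encoded as integer indices and that the embedding table accepts indices from 0 to its size minus 1. `find_out_of_range` and `num_embeddings` are hypothetical names for illustration, not part of the PyTorch Frame API.

```python
def find_out_of_range(indices, num_embeddings):
    """Return (position, index) pairs that a table with `num_embeddings` rows cannot look up."""
    return [
        (pos, idx)
        for pos, idx in enumerate(indices)
        if not 0 <= idx < num_embeddings
    ]


# Hypothetical batch of category indices and an embedding table with 5 rows.
batch = [0, 3, 1, 5, 2]
bad = find_out_of_range(batch, num_embeddings=5)
print(bad)  # [(3, 5)]: position 3 holds index 5, which is outside 0..4
```

Running this on the failing batch would show whether an unseen category produced an index the embedding layer has no row for.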
It seems it was actually an unrelated error, but shouldn't the library be able to handle new categories at val/test time? I worked around this by using the "split_col" of the dataset, but what if I want my train, test, and val data to be separate datasets?
PyTorch Frame could have some default handling, e.g., treating new categories as N/A and mapping N/A to the most frequent category. I'm closing this issue since your original problem has already been resolved, but feel free to open a new issue to discuss this further.
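To make the idea concrete, here is a minimal sketch of that fallback in plain Python. It is not how PyTorch Frame behaves today; `fit_category_index` and `encode_with_fallback` are hypothetical helpers, and it assumes categories are indexed by descending training frequency so that index 0 is the most frequent one.

```python
from collections import Counter


def fit_category_index(train_values):
    """Map each training category to an index, most frequent category first."""
    counts = Counter(v for v in train_values if v is not None)
    # most_common() sorts by count, descending; ties keep first-seen order.
    ordered = [cat for cat, _ in counts.most_common()]
    return {cat: i for i, cat in enumerate(ordered)}


def encode_with_fallback(values, category_index):
    """Encode values; unseen categories and None fall back to index 0."""
    fallback = 0  # index of the most frequent training category
    return [category_index.get(v, fallback) for v in values]


train = ["red", "blue", "red", "green", "red", "blue"]
index = fit_category_index(train)

# "purple" never appeared in training and None stands in for N/A.
encoded = encode_with_fallback(["blue", "purple", None, "red"], index)
print(encoded)  # [1, 0, 0, 0]
```

Because the mapping is fitted on the training values only, the same `index` can be reused to encode separate val and test datasets without producing indices the embedding layer has never seen.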
I'm trying to run the ExampleTransformer from the documentation on a custom dataset. Training proceeds fine, but each time, at a particular batch, inference fails. Does anyone know what would cause this?