stevie1023 / cspn-gpspn

Benchmark datasets #4

Open trappmartin opened 3 years ago

trappmartin commented 3 years ago

It would be good to additionally benchmark on the datasets used in: https://papers.nips.cc/paper/7905-heterogeneous-multi-output-gaussian-process-prediction.pdf

Those are heterogeneous datasets, so even more interesting. I'll open an additional issue explaining how to work with heterogeneous data.

trappmartin commented 3 years ago

Maybe also check the UCI repository for data: https://archive.ics.uci.edu/ml/datasets.php?format=&task=reg&att=num&area=&numAtt=&numIns=greater1000&type=mvar&sort=nameUp&view=table

minimrbanana commented 3 years ago

The following Multitask_Exact_GPs examples are the ones I mentioned, though I'm not sure whether they fit well: https://github.com/cornellius-gp/gpytorch/tree/master/examples/03_Multitask_Exact_GPs

trappmartin commented 3 years ago

Yes, that looks like a good reference. They also describe two datasets that we can use in the paper.

stevie1023 commented 3 years ago

Hi Martin! I hope you're safe and sound in the current corona situation. I have another question, though: after training the model with 4 children on the usflight dataset, I ran into the out-of-memory problem again during testing, even though I tried using batches. Do you think it's because of the model structure, or did I simply do something wrong in testing?