Closed davidshumway closed 1 year ago
Our library has no strict requirements for datasets (such as data format). So I think the problem here is how to define a dataset in PyTorch.
For transfer learning, I recommend the implementation of Wilds, such as https://github.com/p-lambda/wilds/blob/main/wilds/datasets/amazon_dataset.py
For tabular data, I think it will be helpful to refer to the open source code of related research. And I'm not very familiar with this field.
Thank you, @thucbx99! I will look at Wilds and consider related research.
I implemented one method on non-image data. I used 2D points. please check it out on: https://github.com/mashaan14/ADDA-toy
Similar to this question about adding a custom dataset: (https://github.com/thuml/Transfer-Learning-Library/issues/188).
Are there any examples showing use of this library with non-image data, or could help by showing how to setup a simple non-image example? For example, I'd like to use the library in the SDA, UDA, or SSL scenarios, with simple tabular data such as:
Labeled data Source 1 Xs1 = Feature1, Feature2, Feature3 ys1 = Feature4
Source 2 Xs2 = Feature1, Feature2, Feature3 ys2 = Feature4
Target 1 Xt1 = Feature1, Feature2, Feature3 yt1 = Feature4
Unlabeled target data Target 1 Xt1_unlabeled = Feature1, Feature2, Feature3
For example, starting from pandas, and assuming a two-class classification problem:
Perhaps using regression is also possible? For example:
Thanks!