Is the pretrain stage of Deepjoin only designed for Opendata?

Dear all,

Thank you for the huge effort you have put into this project!

I have a question about the implementation of the pretrain stage for Deepjoin. In multi_process_csv.py, the function process_before_train calls process_task4, which looks for opendata tables regardless of which file is passed (either sato_opendata_new.csv or sato_webtable_new.csv) as the --tain_csv_file argument of deepjoin_train.py.

If these functions behave as intended, could you tell me how to run the pretrain stage on the webtable dataset? If they do not, I would really appreciate it if you could update them to work with the webtable dataset.
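To illustrate the behavior I expected, here is a minimal sketch in which the dataset is inferred from the name of the training CSV rather than fixed to opendata. The names infer_dataset and KNOWN_DATASETS are hypothetical and not taken from the LakeBench code, and I am only assuming that the dataset name appears in the file name, as it does in the two files above.

```python
from pathlib import Path

# Hypothetical list of dataset names; the real names used by LakeBench may differ.
KNOWN_DATASETS = ("opendata", "webtable")


def infer_dataset(train_csv_file: str) -> str:
    """Return the dataset name embedded in the training CSV file name."""
    stem = Path(train_csv_file).stem.lower()
    for name in KNOWN_DATASETS:
        if name in stem:
            return name
    # Fail loudly instead of silently falling back to opendata.
    raise ValueError(f"Cannot infer dataset from {train_csv_file!r}")
```

With something like this, process_before_train could pass the inferred name on to the table lookup, so that sato_webtable_new.csv would lead to the webtable tables.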

Thank you in advance and best regards, Ryosuke

RLGen / LakeBench, issue #6