Open haymant opened 2 years ago
I met the same problem, could you please tell me how to fix it if you solved it . Thx!
I use single thread in prepare_data.py main() to fix it.
def main(_):
if not os.path.exists(FLAGS.output_data_path):
os.makedirs(FLAGS.output_data_path)
# args_list = []
# vertical_to_websites_map = constants.VERTICAL_WEBSITES
# verticals = vertical_to_websites_map.keys()
# for vertical in verticals:
# websites = vertical_to_websites_map[vertical]
# for website in websites:
# args_list.append((vertical, website))
# num_cores = int(mp.cpu_count()/2)
# with mp.Pool(num_cores) as pool, tqdm(total=len(args_list), desc="Processing swde-data") as t:
# for res in pool.imap_unordered(generate_nodes_seq_and_write_to_file, args_list):
# t.update()
# use single thread
vertical_to_websites_map = constants.VERTICAL_WEBSITES
verticals = vertical_to_websites_map.keys()
for vertical in verticals:
websites = vertical_to_websites_map[vertical]
for website in websites:
print(f"start process main().generate_nodes_seq_and_write_to_file({vertical},{website})")
generate_nodes_seq_and_write_to_file((vertical, website))
Describe the bug Model I am using (UniLM, MiniLM, LayoutLM ...): markuplm/SWDE
The problem arises when using:
A clear and concise description of what the bug is. prepare_data.py stuck at 1% after writng university-usnews-2000.pickle
To Reproduce Steps to reproduce the behavior:
Expected behavior A clear and concise description of what you expected to happen.