Where are the files in 'data/b3g'

Hi, thanks for your time!

The README says, 'We provide an example corpus in data/b3g to demonstrate our pipeline.' Where can I find the files in 'data/b3g'?

In addition, step 4, "Run the search distributed job", lists two commands.

Command 1:
python run.py --command search --config configs/config_test.yaml --xb ccnet_new --cluster_run --partition learnlab

Command 2:
python run.py --command search --config configs/config_test.yaml --xb ccnet_new --xq edouard_val

I am confused by the remark for this step. For a single database that contains multiple documents, should I run both commands in sequence, or only Command 1?

swj0419 / in-context-pretraining