Question about storage path

Hi, @jxmorris12. Thanks for the great work and for sharing the code.

I wonder where the outputs of your code (data, tokenized results, embedding values, ...) are stored.
And if I want to change the storage path, which line in the code should I modify?

While reproducing the results from the paper, I got a

No space left on device

error at experiments.py line 403 (def _load_train_dataset_uncached):

for key in raw_datasets:
    raw_datasets[key] = dataset_map_multi_worker(
        dataset=raw_datasets[key],
        map_fn=tokenize_fn(
            tokenizer,
            embedder_tokenizer,
...
    )

So I tried to change 'DATASET_CACHE_PATH' in utils.py and experiments.py as below.

DATASET_CACHE_PATH = os.environ.get(
    # original: "VEC2TEXT_CACHE", os.path.expanduser("~/.cache/inversion") 
    "VEC2TEXT_CACHE", os.path.expanduser("target_path")
)
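For reference, here is a minimal sketch of how I understand this lookup pattern to behave. The helper name `resolve_cache_path` and the path `/data/vec2text_cache` are hypothetical and only for illustration; the sketch assumes nothing about vec2text beyond the `os.environ.get` call shown above. It shows two things: if the `VEC2TEXT_CACHE` environment variable is set, it takes precedence and the edited default is ignored, and `os.path.expanduser` only rewrites a leading `~`, so a bare name like "target_path" stays a relative path.

```python
import os


def resolve_cache_path(default: str, env_var: str = "VEC2TEXT_CACHE") -> str:
    """Hypothetical helper mirroring the snippet: env var first, default second."""
    return os.environ.get(env_var, os.path.expanduser(default))


# Case 1: the variable is unset, so the default is used.
# expanduser() leaves "target_path" unchanged because it has no leading "~",
# which means it is resolved relative to the current working directory.
os.environ.pop("VEC2TEXT_CACHE", None)
unset_result = resolve_cache_path("target_path")

# Case 2: the variable is set, so the default argument is never consulted.
os.environ["VEC2TEXT_CACHE"] = "/data/vec2text_cache"  # hypothetical path
set_result = resolve_cache_path("target_path")

# Leave the environment as we found it.
os.environ.pop("VEC2TEXT_CACHE", None)

print(unset_result)
print(set_result)
```

If this reading is right, exporting `VEC2TEXT_CACHE` with an absolute path before launching the experiment might be a simpler way to test the setting than editing the default in two files.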

However, for some reason, the tokenized results are not stored in 'target_path'; they still accumulate in '.cache/inversion'.

Is it correct that all the data and embedding values... are stored in the .cache/inversion folder? If so, are there any additional modifications that need to be made in order to store them in the path I specified?
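In case it helps narrow this down, below is a sketch of what I am considering trying next. It rests on assumptions I have not confirmed: that the cache path is read once at import time, and that some of the tokenized output may go through the Hugging Face `datasets` cache, which is controlled by the `HF_HOME` and `HF_DATASETS_CACHE` environment variables. The helper names `point_caches_at` and `free_gib` and the subdirectory layout are hypothetical. The sketch uses a temporary directory as a stand-in for a larger disk.

```python
import os
import shutil
import tempfile


def point_caches_at(base_dir: str) -> dict:
    """Hypothetical helper: create cache dirs under base_dir and export them."""
    base_dir = os.path.abspath(os.path.expanduser(base_dir))
    targets = {
        # Variable read by the DATASET_CACHE_PATH snippet above.
        "VEC2TEXT_CACHE": os.path.join(base_dir, "inversion"),
        # Hugging Face cache variables (assumed relevant, not confirmed).
        "HF_HOME": os.path.join(base_dir, "huggingface"),
        "HF_DATASETS_CACHE": os.path.join(base_dir, "huggingface", "datasets"),
    }
    for name, path in targets.items():
        os.makedirs(path, exist_ok=True)
        os.environ[name] = path
    return targets


def free_gib(path: str) -> float:
    """Free space, in GiB, on the filesystem that contains path."""
    return shutil.disk_usage(path).free / 2**30


# Stand-in for a directory on a larger disk.
demo_base = tempfile.mkdtemp()
targets = point_caches_at(demo_base)

for name, path in targets.items():
    print(f"{name} -> {path} ({free_gib(path):.1f} GiB free)")
```

These variables would have to be set before the libraries are imported, since any module-level path constant is evaluated only once at import.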

Thanks again!

jxmorris12 / vec2text

Question about storage path #45