jordanbehrendt / good_practices_ml

File Naming Between Embedding Generation and Dataset Creation #16

Open felixbusch1 opened 2 days ago

felixbusch1 commented 2 days ago

When running generate_embeddings.py, the script writes its output in batches, one CSV file per batch, with the batch index appended to the file name, e.g., geoguessr_embeddings_0.csv.

The create_datasets_from_embeddings.py script, however, expects a single CSV file whose name has no index suffix, such as geoguessr_embeddings.csv.
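
Until the two scripts agree on a naming scheme, one possible workaround is to merge the batch files into the single file the second script looks for. The sketch below is only an illustration, not code from the repository: the helper name `merge_batch_csvs` is hypothetical, and it assumes that all batch files sit in one directory, that each one starts with the same header row, and that the suffix is a plain integer.

```python
import csv
import re
from pathlib import Path


def merge_batch_csvs(directory, stem="geoguessr_embeddings"):
    """Concatenate <stem>_<index>.csv files into <stem>.csv, keeping one header.

    Hypothetical helper; not part of the repository.
    """
    directory = Path(directory)
    pattern = re.compile(rf"^{re.escape(stem)}_(\d+)\.csv$")

    # Collect (index, path) pairs and sort numerically so batch 10 follows batch 9.
    batches = sorted(
        (int(m.group(1)), p)
        for p in directory.iterdir()
        if (m := pattern.match(p.name))
    )
    if not batches:
        raise FileNotFoundError(f"no {stem}_<index>.csv files in {directory}")

    target = directory / f"{stem}.csv"
    with target.open("w", newline="") as out:
        writer = csv.writer(out)
        for position, (_, path) in enumerate(batches):
            with path.open(newline="") as src:
                reader = csv.reader(src)
                # Assumption: every batch file starts with the same header row.
                header = next(reader, None)
                if position == 0 and header is not None:
                    writer.writerow(header)
                # Copy the remaining data rows of this batch.
                writer.writerows(reader)
    return target
```

Calling `merge_batch_csvs("path/to/output")` (the path is a placeholder) would write geoguessr_embeddings.csv next to the batch files. If the batch files turn out not to carry a header row, the header handling would need to be dropped.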