the routine for importing data and collapsing data to one row per text is not abstract enough (essentially the same code in ETCSL-distrance-clustering/Distances and Clusters.ipynb and in Topic_model_saao/import_saa_letters.ipynb). This code should be consolidated in one or two functions.
the routine for importing data and collapsing data to one row per text is not abstract enough (essentially the same code in ETCSL-distrance-clustering/Distances and Clusters.ipynb and in Topic_model_saao/import_saa_letters.ipynb). This code should be consolidated in one or two functions.