Closed: afermg closed this 2 months ago
Not all columns can be removed (namely the id one, see here), but I removed as many unnecessary columns as possible by adding '?_nocol=COLNAME' to the URL. The main downside is that the Broad's shortlinker doesn't work if the URL contains more than one '?', so I had to let this go. If we move to a different shortlinking system in the future, this can be reopened.
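For reference, here is a minimal sketch of how such a URL could be assembled. It assumes the table page accepts a repeated `_nocol` query parameter (as Datasette does); the base URL, the column names, and the `hide_columns` helper are all hypothetical placeholders, not what our scripts actually use.

```python
from urllib.parse import urlencode


def hide_columns(table_url, columns):
    """Append one _nocol parameter per column that should be hidden.

    Hypothetical helper: table_url and columns are illustrative only.
    """
    # urlencode accepts a list of (key, value) pairs, so the key can repeat.
    query = urlencode([("_nocol", col) for col in columns])
    # Reuse an existing query string if the URL already has one.
    separator = "&" if "?" in table_url else "?"
    return f"{table_url}{separator}{query}"


# Placeholder URL and column names, for illustration only.
url = hide_columns("https://example.org/data/table", ["col_a", "col_b"])
print(url)
```

When the base URL has no query string of its own, this joins the parameters with '&', so the result contains a single '?'. Whether that is enough to satisfy the shortlinker depends on how the rest of our URLs are built, so treat it as a starting point rather than a fix.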
Now that we can use the metadata to modify visual elements of the web interfaces, we can remove the unnecessary columns more easily (namely, the ones that appear because the data source is a parquet file).
It renders the sorting step in the dataset generation scripts unnecessary.