Things like type of columns should be fixed to best possible type for that column. The files should be cached so that a data engineered DataFrame object is loaded into memory.
Especially convert categorical columns, which should utilise faster methods written for CategoricalArray types.
Things like type of columns should be fixed to best possible type for that column. The files should be cached so that a data engineered DataFrame object is loaded into memory.
Especially convert categorical columns, which should utilise faster methods written for
CategoricalArray
types.