dlab-berkeley / Data-Science-Social-Justice

Materials for D-Lab / UC Berkeley Graduate Division's Data Science for Social Justice summer workshop. These materials provide an introduction to Python, natural language processing, text analysis, word embeddings, and network analysis. They also include discussions on critical approaches to data science to promote social justice.
11 stars 7 forks source link

Module 3: took long time to run preprocess function #22

Open mingyu-yuan opened 3 months ago

mingyu-yuan commented 3 months ago

It took a long time to run the preprocess function both locally and on datahub. Could we export a pickle file of the preprocessed dataframe so folks can load it if the function turns out to take a long time to run?

jellomoat commented 2 months ago

Great suggestion Mingyu! I'm reviewing the lesson notebook again now. I also notice the lag, especially on DataHub.

Since I'll be covering that section next week anyway, I'll go ahead and export the pickle file to share via Slack the day of the lesson.

mingyu-yuan commented 2 months ago

Awesome!