data-derp / small-exercises

6 stars 9 forks source link

Create an Exercise that covers remaining Spark Functions Topics #39

Open kelseymok opened 1 year ago

kelseymok commented 1 year ago

With the new exercises, we're not covering some of the more interesting Spark functions.

We'll create a new exercise for "Additional Spark Functions" (in the small-exercises repo) to cover the following:

DataFrame Cleaning

DataFrame Queries

Aggregations

String Operations

For everything else

CFRs

Notes

We might be able to reuse some of the examples we had in the Wrangling with Spark exercise, but do it better. If there's an opportunity to use our domain data, that would be best but we might need to dirty up some data and save it as a CSV or something in the repo in order to pull it in

Open Questions

Are these valuable?

Should all functions have a test? Perhaps we can do it later?