Open lmeyerov opened 4 years ago
Just to make things clear, the project has transformation functions, and you want to have tests that can clarify if those are working as intended?
A few pieces:
Just having pytest, and hooked up to a CI system, would already be a step up and provide a foundation for others :)
For each of those, we currently have methods in https://github.com/TheDataRideAlongs/ProjectDomino/blob/master/modules/FirehoseJob.py that do the above conversions, for Twitter in particular (cc @bechbd).
We're starting to have other notebooks too, such as for extracting URLs and blockchain addresses, that would benefit from this. My guess is we'd find bugs and get the code cleaner and more modular anyway as part of moving from notebook prototypes to Python modules that get plugged into Prefect.io pipelines.
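To make the notebook-to-module idea concrete, here is a rough sketch of what a small extracted helper plus its pytest tests could look like. The function name `extract_urls`, the regex, and the sample strings are all assumptions for illustration; none of this is taken from the existing notebooks or from FirehoseJob.py.

```python
import re

# Hypothetical helper: NOT the project's actual implementation.
# A deliberately simple pattern; real tweets may need something stricter.
URL_PATTERN = re.compile(r"https?://[^\s<>]+")


def extract_urls(text):
    """Return the http(s) URLs found in text, in order of appearance."""
    # Trailing punctuation usually belongs to the sentence, not the URL.
    return [match.rstrip(".,;:!?)") for match in URL_PATTERN.findall(text)]


# pytest collects any function named test_* in a file named test_*.py.
def test_extract_urls_finds_multiple_links():
    text = "See https://example.com/a, then http://example.org/b?x=1."
    assert extract_urls(text) == [
        "https://example.com/a",
        "http://example.org/b?x=1",
    ]


def test_extract_urls_handles_no_links():
    assert extract_urls("no links here") == []
```

Saved as something like `test_extract_urls.py`, this runs with a plain `pytest` call, which is also what a CI job would invoke. Keeping the helper as a pure function (string in, list out) is what makes it easy to test outside a notebook.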
Do you have any CI system in mind?
For some sample tweets, test especially:
Less clear: diff search jobs