abarciauskas-bgse closed this issue 2 years ago
I took these rough notes from our meeting: Notes meeting with Eli Orland 08/25
FWIW, here is a synthesis of Eli's workflow from a technical perspective.
High-level flow: Use MTBS as labels for generating training features from EVT data.
In order to run Eli's workflow in a cloud environment, we need to support the following steps:
I am not sure what exactly the model is supposed to predict or classify, or whether the result should be published anywhere. Either way, the key job of this workflow is to generate the training features by combining and selecting input data.
The input data sources are described here: https://docs.google.com/spreadsheets/d/1uCDLYUUkSBhKBAmkkaPIxmB65W2HjaDLHRStNSmQTbQ/edit?usp=sharing
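To make the "combine and select" step concrete, here is a minimal sketch of how labeled pixels could be paired with feature values. This is not Eli's actual code: the function name `build_training_samples`, the `label_nodata` convention, and the toy arrays are all hypothetical, and it assumes the label raster (e.g. derived from MTBS) and the feature rasters (e.g. EVT) have already been resampled onto the same grid.

```python
import numpy as np


def build_training_samples(label_grid, feature_grids, label_nodata=0):
    """Pair each labeled pixel with the feature values at the same location.

    label_grid: 2D integer array (hypothetical stand-in for an MTBS-derived raster).
    feature_grids: dict of name -> 2D array on the same grid (stand-in for EVT etc.).
    Returns (X, y, names): one row of X per labeled pixel, one column per feature.
    """
    names = sorted(feature_grids)
    for name in names:
        if feature_grids[name].shape != label_grid.shape:
            raise ValueError(f"{name} is not on the same grid as the labels")

    # Selection: keep only the pixels that carry a label (assumed nodata value).
    mask = label_grid != label_nodata

    # Combination: stack the per-pixel values of every feature layer as columns.
    X = np.column_stack([feature_grids[name][mask] for name in names])
    y = label_grid[mask]
    return X, y, names


# Toy 2x2 grids, purely illustrative (not real MTBS or EVT values).
labels = np.array([[0, 2], [1, 0]])
evt = np.array([[7, 8], [9, 7]])
X, y, names = build_training_samples(labels, {"evt": evt})
```

In a real run the arrays would come from raster files rather than literals, and the reprojection and alignment that this sketch takes for granted would likely be the bulk of the work.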
Nice, thanks for doing this evaluation @j08lue
Next step: A PoC JupyterLab environment where Eli's workflow can run, perhaps with some modifications to load data from the VEDA data store instead of from local disk.