Closed peterdudfield closed 2 years ago
How slow is it? :slightly_smiling_face:
I'm always a little nervous about using parallelization in production systems, because it's so much harder to debug :slightly_smiling_face:
currently takes ~ 2mins to check all 1400 sites, there might be some easier speed ups
cool beans, sounds good! And, just to check: Does the code only check a PV system if that PV system was checked more than pv_sample_period
minutes ago? (Which might speed things up by only checking a subset of the PV systems)
yep, only checks the system if {now} - {last data} > {sample period }
Awesome, thank you!
One way to make debuggin easier when paralization, we just make sure all logs go to cloudwatch and also reference what thread its on and then its a bit easier to debug
Currently takes ~1 minute 30
POssible now an issue as consumer was taking more then 5 mins to run - https://github.com/openclimatefix/nowcasting_infrastructure/issues/63
Start: 08:31:35
Get last datetime end / start pulling data: 08:41:23
End: 08:42:09
Problem probably is pulling the latest datetime from the database, is taking too long
Deployed anew version using datamodel=0.0.11 which loads all 'last pv yields' at once. The new times are
It took only 5 seconds to get all the pv yields now
Its down to about 1 min now, so I think ill close this for the moment
Detailed Description
Would be good to use some sort of parralization in the code
Context
good to speed up