Closed cdolfi closed 1 year ago
The tl;dr is this approach will :
@jasonbrooks : I should also mention that we are closing in on an alpha release of a new veresion of Augur that incorporates Josh B's generous feedback. As of right now, the CPU utilization on that new version is a fraction of the old version. Instead of pinning serveral CPU's, we are rarely exceeding 10% utilization on any one process.
As soon as we hit alpha, I’ll create a parallel instance of your data on my server. The punch list we are working through include:
The speed increase is a byproduct of refactoring data collection, implementing celery/Redis instead of our job scheduler, and the use of SQLAlchemy’s “upsert” function when values like “opened”/“closed” are changed.
An update on this, we haven't deployed the second blade yet because we're having some issues with the blade chassis that will require someone at the data center to put their hands on it. We've got a ticket in for that.
@jasonbrooks : how is this process going? We are ready to get this setup. It will be difficult to take Aspen to the next level without getting the infrastructure aligned. Regarding Augur, I can confirm that that our parallelization is enabling linear scaling. More power, more better.
Hey Sean, we had some hardware issues that were slowing us down, but we provisioned another blade for use by augur. I'll msg you on slack about it.
@jasonbrooks : Looking to setup some time to talk this week.
Tracking deployment of Augur elsewhere- blade has been deployed, we have access to it.
To be able to help with collection of data and data access, an additional blade is going to be integrated into the system. With that, we are going to be created monitoring tools to be able to understand the collection better. Others can add context to this as well