Open SverreNystad opened 4 months ago
It could be of use to use the Chain of Responsibility to handle the failover: Chain of Responsibility is behavioral design pattern that allows passing request along the chain of potential handlers until one of them handles request. Read more here
Implement Failover Mechanism for Critical Dependencies to Ensure 99% Uptime
Description
To meet the Availability quality requirement A1, which states: "System uptime must be 99%, with capabilities to handle critical operations around the clock," we need to address the uptime dependencies of TutorAI on our commercial off-the-shelf (COTS) solutions, specifically OpenAI and MongoDB.
Current Issue
OpenAI:
MongoDB:
Proposed Solution
To ensure TutorAI meets its uptime requirement, we must implement a failover mechanism for both OpenAI and MongoDB:
For OpenAI:
For MongoDB:
Action Items
Conclusion
Implementing these failover mechanisms is crucial to ensuring that TutorAI can achieve the required 99% uptime, thus maintaining reliable operations around the clock despite potential downtime from our COTS dependencies.