Open puchengy opened 1 year ago
cc @sopel39
@puchengy take a look at fault tolerant execution (FTE, code name Tardigrade) mode of Trino. When that mode is enabled, you could use spot instance.
cc @arhimondr @losipiuk
Yeah, however coordinator is expected to be run on a stable instance
Even with FTE there could be some benefit to preferring the volatile nodes for scheduling shorter running tasks as there will be lesser wasted work when the spot node is reclaimed. I don't know if it is possible to detect "short running" task before scheduling though. Maybe the size of input data to the task is good enough proxy to determine that.
Spot instances are cheap but usually short-lived.
Imagine we have a Trino cluster mixed with spot instances and regular instances. Can we place certain type of tasks (historically short running tasks) on spot instances?
Questions: