Support thread group - Githubissues

BusyJay commented 4 years ago

Is your feature request related to a problem? Please describe.

fail-rs utilizes global registry to expose simple APIs and convenient FailPoint definition. But it also means all parallel tests have to be run one by one and do cleanup between each run to avoid configurations affect each other.

Describe the solution you'd like

This issue proposes to utilize thread group. Each test case defines a unique thread group, all configuration will be bound to exact one thread group. Every time a new thread is spawn, it needs to be registered to one thread group to make FailPoint reads configurations. If a thread is not registered to any group, it belongs to a default global group.

New public APIs include:

pub fn current_fail_group() -> FailGroup;

impl FailGroup {
    pub fn register_current(&self) -> Result<()>;
    pub fn deregister_current(&self);
}

Note that it doesn't require users have the ability to spawn a thread, register the thread before using FailPoint is enough.

Describe alternatives you've considered

One solution to this is pass the global registry to struct constructor, but it will interfere the general code heavily, it needs to be passed to anywhere FailPoints are defined.

Another solution is #24, but it lacks threaded cases support.

BusyJay commented 4 years ago

/cc @brson @Little-Wallace @lucab

lucab commented 4 years ago

@BusyJay nice! A bunch of high-level questions:

can the thread-group logic be folded into FailScenario? register_current doesn't seem to be very different from threadgroup-bound setup.
would it be better for the API to take a ThreadId argument instead?
I have no idea how this would actually work at the low level; how do you intend to plug this into the eval logic?

BusyJay commented 4 years ago

can the thread-group logic be folded into FailScenario?

FailGroup can be accessed via a global fuction, so it may not be appropriate to use FailScenario directly. But I think it can be defined as a field of FailScenario if necessary.

would it be better for the API to take a ThreadId argument instead?

I'm not sure. I find it more convenient to just register current thread instead of others. For example, thread pool generally provides after_start, before_stop hooks to allow do work in a thread, but generally won't provide thread ID directly.

plug this into the eval logic?

When evaluating, it uses current thread ID to find out which thread group it belongs to, and then check the rules of the group.

Little-Wallace commented 4 years ago

It seems complex. Why can not we launch multiple process and each process runs some test with a single thread?

BusyJay commented 4 years ago

Process isn't compatible with the default test runner.

tikv / fail-rs

Support thread group #51