Celluloid::IO behaves like a Celluloid Actor failures

ioquatix commented 8 years ago

  2) Celluloid::IO behaves like a Celluloid Actor detects recursion
     Failure/Error:
       fail Celluloid::ThreadLeak, "Aborted due to runaway threads (#{location})\n"\
         "List: (#{loose.map(&:inspect)})\n:#{backtraces.join("\n")}"

     Celluloid::ThreadLeak:
       Aborted due to runaway threads (after example: detects recursion)
       List: (["#<Celluloid::Thread:0x007f965dcaab70@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>", "#<Celluloid::Thread:0x007f965dca8488@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>", "#<Celluloid::Thread:0x007f965dc50cb0@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>", "#<Celluloid::Thread:0x007f965dc318d8@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>"])
       :Runaway thread: ================ #<Celluloid::Thread:0x007f965dcaab70@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>
       Backtrace: 
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `sleep'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `block in check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/wait.rb:14:in `for'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:58:in `check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:155:in `block in run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/group.rb:66:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:152:in `run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:131:in `block in start'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-essentials-f0545ce47ed9/lib/celluloid/internals/thread_handle.rb:14:in `block in initialize'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor/system.rb:78:in `block in get_thread'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:50:in `block in instantiate'

       Runaway thread: ================ #<Celluloid::Thread:0x007f965dca8488@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>
       Backtrace: 
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `sleep'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `block in check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/wait.rb:14:in `for'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:58:in `check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:155:in `block in run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/group.rb:66:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:152:in `run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:131:in `block in start'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-essentials-f0545ce47ed9/lib/celluloid/internals/thread_handle.rb:14:in `block in initialize'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor/system.rb:78:in `block in get_thread'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:50:in `block in instantiate'

       Runaway thread: ================ #<Celluloid::Thread:0x007f965dc50cb0@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>
       Backtrace: 
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `sleep'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `block in check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/wait.rb:14:in `for'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:58:in `check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:155:in `block in run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/group.rb:66:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:152:in `run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:131:in `block in start'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-essentials-f0545ce47ed9/lib/celluloid/internals/thread_handle.rb:14:in `block in initialize'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor/system.rb:78:in `block in get_thread'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:50:in `block in instantiate'

       Runaway thread: ================ #<Celluloid::Thread:0x007f965dc318d8@/Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:47 sleep>
       Backtrace: 
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `sleep'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `block in check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/wait.rb:14:in `for'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:58:in `check'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:155:in `block in run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/timers-41145ed260e4/lib/timers/group.rb:66:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:152:in `run'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor.rb:131:in `block in start'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-essentials-f0545ce47ed9/lib/celluloid/internals/thread_handle.rb:14:in `block in initialize'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/actor/system.rb:78:in `block in get_thread'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/group/spawner.rb:50:in `block in instantiate'
     Shared Example Group: "a Celluloid Actor" called from ./spec/celluloid/io/actor_spec.rb:4
     # /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/spec/support/loose_threads.rb:72:in `assert_no_loose_threads!'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/spec/support/configure_rspec.rb:48:in `block (2 levels) in <top (required)>'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:424:in `instance_exec'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:424:in `instance_exec'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/hooks.rb:389:in `execute_with'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/hooks.rb:620:in `block (2 levels) in run_around_example_hooks_for'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:319:in `call'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-retry-0.4.5/lib/rspec/retry.rb:98:in `block in run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-retry-0.4.5/lib/rspec/retry.rb:88:in `loop'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-retry-0.4.5/lib/rspec/retry.rb:88:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-retry-0.4.5/lib/rspec_ext/rspec_ext.rb:12:in `run_with_retry'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-retry-0.4.5/lib/rspec/retry.rb:22:in `block (2 levels) in setup'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:424:in `instance_exec'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:424:in `instance_exec'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/hooks.rb:389:in `execute_with'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/hooks.rb:620:in `block (2 levels) in run_around_example_hooks_for'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:319:in `call'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/hooks.rb:621:in `run_around_example_hooks_for'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/hooks.rb:478:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:434:in `with_around_example_hooks'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:477:in `with_around_and_singleton_context_hooks'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example.rb:233:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:581:in `block in run_examples'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:577:in `map'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:577:in `run_examples'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:543:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:544:in `block in run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:544:in `map'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/example_group.rb:544:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:115:in `block (3 levels) in run_specs'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:115:in `map'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:115:in `block (2 levels) in run_specs'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/configuration.rb:1680:in `with_suite_hooks'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:114:in `block in run_specs'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/reporter.rb:77:in `report'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:113:in `run_specs'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:89:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:73:in `run'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/lib/rspec/core/runner.rb:41:in `invoke'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/gems/rspec-core-3.4.1/exe/rspec:4:in `<top (required)>'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/bin/rspec:23:in `load'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/bin/rspec:23:in `<main>'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/bin/ruby_executable_hooks:15:in `eval'
     # /Users/samuel/.rvm/gems/ruby-2.3.0/bin/ruby_executable_hooks:15:in `<main>'

All "it behaviour like a celluloid actor" crashing like this. Any ideas?

tarcieri commented 8 years ago

Looks related to the GSoC work? /cc @digitalextremist

ioquatix commented 8 years ago

I'm getting literally hundreds of these errors from all "it behaves like an actor" specs.

ioquatix commented 8 years ago

I feel there is something wrong with 4448543 or thereabouts. It's a bit hard for me to understand this new code because the documentation is non-existent. Can people please document classes and functions with at least one line explaining what they do and how they fit in with the rest of the codebase? Thanks.

ioquatix commented 8 years ago

Okay so I ran git bisect and found that the first commit to cause this problem is d8fd5b2f5e13fe93392d2d170a9377ad2337242d

ioquatix commented 8 years ago

I think this problem comes from

+    Specs.assert_no_loose_threads(ex.description) do

which is now wrapping all specs. So the question is, is this the correct behaviour?

ioquatix commented 8 years ago

I found this line in celluloid-io

  ALLOW_SLOW_MAILBOXES = true # TODO: Remove hax.

It's not clear why this is needed but it appears the functionality is broken because this appears to suppress the loose threads but the backtrace is different (off by one):

 ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `sleep'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `wait'
        ** /Users/samuel/.rvm/gems/ruby-2.3.0/bundler/gems/celluloid-85f850a41bb5/lib/celluloid/mailbox.rb:63:in `block in check'

IS not matched by:

      if Specs::ALLOW_SLOW_MAILBOXES
        next if thread.backtrace[1] =~ /mailbox\.rb/ && thread.backtrace[1] =~ /sleep/
      end

ioquatix commented 8 years ago

So, I've fixed the hack which allows tests to pass, simply by ignoring the sleeping mailboxes. Um, can we have a discussion about this? What is going on here and why is the hack necessary?

tarcieri commented 8 years ago

No idea what's going on here

ioquatix commented 8 years ago

Yeah it looks like something @digitalextremist but it's not well documented or explained anywhere.

digitalextremist commented 8 years ago

Sorry for the delay in replying. No, it was not GSoC related, and no, I didn't introduce it -- I just consolidated its many different forms in different gems. This was introduced by @e2 during test cleaning, and I too have problems with it/am refactoring it when possible

ioquatix commented 8 years ago

Could you explain what the loose thread is and why it needs to be ignored for celluloid-io?

e2 commented 8 years ago

Loose threads means threads leaking between tests.

It means things weren't cleaned up properly.

This caused hideously hard to debug cases (where some tests failed because another random test failed to kill all its threads).

It wasn't just "test cleaning", it was about getting tests to stop failing randomly.

Here, it looks like the actor wasn't killed properly - that's why it was detected as sleeping on the mailbox.

I wouldn't ignore it - I'd make sure the actor is killed during test teardown.

HoneyryderChuck commented 8 years ago

Apparently some require is where it naught to be. I see the Fanout and IncidentReporter actors there if I boot the system.

HoneyryderChuck commented 8 years ago

Apparently the loose_threads method is not ruling out the root services actors (fanout, incident reporter, etc), hence the slow mailboxes hack (it rules them out). But the condition seems not to work properly for ruby 2.0.0:

next if thread.backtrace[0] =~ /mailbox\.rb/ && thread.backtrace[0] =~ /sleep/

# backtrace
# 2.0.0
/opt/rubies/ruby-2.0.0-p645/lib/ruby/2.0.0/thread.rb:72:in `sleep'
/opt/rubies/ruby-2.0.0-p645/lib/ruby/2.0.0/thread.rb:72:in `block (2 levels) in wait'
/opt/rubies/ruby-2.0.0-p645/lib/ruby/2.0.0/thread.rb:68:in `handle_interrupt'
/opt/rubies/ruby-2.0.0-p645/lib/ruby/2.0.0/thread.rb:68:in `block in wait'
/opt/rubies/ruby-2.0.0-p645/lib/ruby/2.0.0/thread.rb:66:in `handle_interrupt'
/opt/rubies/ruby-2.0.0-p645/lib/ruby/2.0.0/thread.rb:66:in `wait'
/home/taacati1/celluloid/lib/celluloid/mailbox.rb:63:in `block in check'
...

#ruby 2.1.6
/home/taacati1/celluloid/lib/celluloid/mailbox.rb:63:in `sleep'
/home/taacati1/celluloid/lib/celluloid/mailbox.rb:63:in `wait'
/home/taacati1/celluloid/lib/celluloid/mailbox.rb:63:in `block in check'
/home/taacati1/celluloid-io/.bundle/ruby/2.1.0/bundler/gems/timers-41145ed260e4/lib/timers/wait.rb:14:in `for'
/home/taacati1/celluloid/lib/celluloid/mailbox.rb:58:in `check'
...

HoneyryderChuck commented 8 years ago

I think that the "slow mailbox" designation is quite wrong. The whole concept is that one doesn't count service threads as loose threads, right?

Or maybe the fix would be to remove them from the booting altogether. If the goal is to isolate the core functionality from the supervision, It doesn't make sense to be using supervision around here. @digitalextremist , what's the plan exactly?

e2 commented 8 years ago

@TiagoCardoso1983 - the idea was to prevent Celluloid threads from leaking between specs.

The only threads ALLOWED to survive between specs should be core RBX and JRuby threads ONLY.

All Celluloid should be killed before the next spec starts, or debugging failed tests can be hard.

Celluloid should be COMPLETELY shut down between specs - so leaving Celluloid-owned threads is a bad practice, ESPECIALLY with persistent things like fanout, probing, etc.

NOTE: I didn't introduce the Specs::ALLOW_SLOW_MAILBOXES hack. I don't see a reason for it other than as a workaround for "forcing failing specs to pass". Maybe there was a reason due to rushing for GSoC, but it's best to remove the "mail thread hack" and fix the cause.

If there are threads stuck on the mailbox AFTER a spec, that's a design issue or a bug.

e2 commented 8 years ago

@TiagoCardoso1983 - I didn't bother to check the affected specs here, but when I worked on the tests months ago, I made sure all the tests passed without any Celluloid-created threads still running afterwards ("loose threads").

RBX uses extra threads for timeouts (that's just how they are implemented), while JRuby runs some extra core threads (e.g. in the way it implements a futex, which is managed internally+globally in JRuby).

MRI runs without extra threads, so the thread count after every spec should be 1 - (just the main thread).

What you mentioned about other services (supervision) - removing those from a spec is just a for optimizing how long the tests run. Tests should be green with or without them.

(Unless they're testing e.g. supervision, etc..)

HoneyryderChuck commented 8 years ago

@e2 it's not only for optimizing, they are the reason why the tests were failing. In celluloid-io (not celluloid) all specs are wrapped inside an around hook which calls boot and shutdown. Shutdown is kinda asynchronous, i.e. killing the thread is dependent of the current message being processed and wrapped inside a ruby timeout (the most dangerous weapon yada yada).

Celluloid bootstrapping is starting those root services by default, IMO needlessly, this has been subject of debate before, why are they being started is beyond me, that's what the autostart file was here for, I think. Booting the system shouldn't start silent actors by default.

To sum it up, i'm from your opinion, one must not leak threads in between specs, but more important, one should try really hard not to allocate resources which are out of the scope of the specs.

My proposal:

Move back the initialization of the root services to the autostart file.
Remove the ALLOW_SLOW_MAILBOXES hack altogether.
Remove the autostart file and the dependency from celluloid-supervision (breaking change, possibly 1.0.0).

HoneyryderChuck commented 8 years ago

just removed the hack. one does not need to boot the system every time. @digitalextremist , I saw that the commit came from you. I left the one for Celluloid::ZMQ. Any specific reason why you call boot instead of init?

e2 commented 8 years ago

@TiagoCardoso1983 - I agree 100% with you on setting up at little as possible for the tests.

e2 commented 8 years ago

@TiagoCardoso1983 - months ago I started implementing "graceful and complete shutdown", but there was too much integration work in Celluloid at that time.

(I spent more time rebasing that doing anything useful, so I gave up).

Just saying a "clean shutdown" is feasible (though complex), but no one needs it in production code.

That's because no one is interested beyond Celluloid specs (because in production services are expected to run forever, and apps kill threads when they die anyway).

tarcieri commented 8 years ago

@e2 FWIW, I suggested removing the default at_exit handler in celluloid/celluloid#698

I think in general the whole lifecycle should use some re-evaluation for a subsequent release. Most people are using Celluloid as a library, not a framework, and we generally have not provided good tooling for using it as the latter.

HoneyryderChuck commented 8 years ago

@tarcieri it seems that removing the at_exit handler would make people opt in and eventually build their own clean shutdown a la what @e2 suggested. I'm more than ok with that.

I'd really like to push for supervision and the essentials bit non-inclusion in basic celluloid require. They are opt-in services which could be "plugged-in" a la what sequel does. Hell, they don't even need to be in a separate repo.

Most people don't use them anyways. As you said, most celluloid usages don't require a 'system', just a set of tools. Celluloid Actors end up being used as isolated workers (as sidekiq used to do), and why not? One should therefore try to get out of the way and provide the lowest common denominator. Specs will eventually solve themselves.

If there's fear of breaking current clients, I'd like to remember that without sidekiq, celluloid loses a lot of weight in the deparment of breaking people's production environment. And it's still not even 1.0.0. Aim for stability then?

To change subject and cut the conversation short: the build passed. Can someone merge it to master? :)

ioquatix commented 8 years ago

I agree, I'd like to see celluloid come not as a global system but as something that can be integrated in a modular fashion.

celluloid / celluloid-io

Celluloid::IO behaves like a Celluloid Actor failures #165