n0-computer / iroh

A toolkit for building distributed applications
https://iroh.computer
Apache License 2.0
2.3k stars 147 forks source link

flaky tests: iroh-ffi kotlin #2526

Open dignifiedquire opened 1 month ago

dignifiedquire commented 1 month ago

it seems the node is randomly shutting down Oo

2024-07-22T12:44:30.591711Z DEBUG gossip{me=s5lh6xipp7eyyq4z}: iroh_gossip::net: handle in_event  UpdatePeerData(PeerData(46b))
2024-07-22T12:44:30.592279Z DEBUG iroh_ffi::doc: docs    
2024-07-22T12:44:30.593440Z DEBUG iroh_ffi::doc: create    
2024-07-22T12:44:30.593958Z DEBUG iroh::node::rpc: handling rpc request: Docs
2024-07-22T12:44:30.594465Z DEBUG sync{me=ya6qwj576ytknv2r}: iroh_docs::actor: open namespace=szw76mj5pt2h5ane
2024-07-22T12:44:30.602351Z ERROR sync{me=ya6qwj576ytknv2r}:sync{me=ya6qwj576ytknv2r}: iroh_docs::engine::live: gossip recv actor failed: to_actor closed
2024-07-22T12:44:30.602411Z DEBUG downloader{me=ya6qwj576ytknv2r}: iroh_blobs::downloader: shutting down
2024-07-22T12:44:30.607777Z DEBUG iroh_ffi::doc: subscribe    
2024-07-22T12:44:30.608820Z DEBUG sync{me=ya6qwj576ytknv2r}: iroh_docs::actor: docs actor shutdown
2024-07-22T12:44:30.609342Z DEBUG iroh_ffi::doc: share    
iroh.IrohException
    at iroh.FfiConverterTypeIrohError.lift(iroh_ffi.kt:13586)
    at iroh.FfiConverterTypeIrohError.read(iroh_ffi.kt:13591)
    at iroh.IrohException$ErrorHandler.lift(iroh_ffi.kt:13578)
    at iroh.IrohException$ErrorHandler.lift(iroh_ffi.kt:13571)
    at iroh.Iroh_ffiKt.uniffiCheckCallStatus(iroh_ffi.kt:298)
    at iroh.Iroh_ffiKt.access$uniffiCheckCallStatus(iroh_ffi.kt:1)
    at iroh.Iroh_ffiKt.uniffiRustCallAsync(iroh_ffi.kt:21246)
    at iroh.Doc.share$suspendImpl(iroh_ffi.kt:8671)
    at iroh.Doc.share(iroh_ffi.kt)
    at Node_test$1.invokeSuspend(node_test.kts:43)
    at kotlin.coroutines.jvm.internal.BaseContinuationImpl.resumeWith(ContinuationImpl.kt:33)
    at kotlinx.coroutines.DispatchedTask.run(DispatchedTask.kt:106)
    at kotlinx.coroutines.EventLoopImplBase.processNextEvent(EventLoop.common.kt:284)
    at kotlinx.coroutines.BlockingCoroutine.joinBlocking(Builders.kt:85)
    at kotlinx.coroutines.BuildersKt__BuildersKt.runBlocking(Builders.kt:59)
    at kotlinx.coroutines.BuildersKt.runBlocking(Unknown Source)
    at kotlinx.coroutines.BuildersKt__BuildersKt.runBlocking$default(Builders.kt:38)
    at kotlinx.coroutines.BuildersKt.runBlocking$default(Unknown Source)
    at Node_test.<init>(node_test.kts:23)

https://github.com/n0-computer/iroh-ffi/actions/runs/10040850241/job/27747717016?pr=167

divagant-martian commented 1 month ago

random thought: could it be that something running inside the local pool is panicking and since the panic is swallowed this somehow triggers a shutdown? don't really a particular reason to believe this, just a rand idea never mind, this still produces an error log, and the error log is not present, so it's something else