cloudius-systems / osv

OSv, a new operating system for the cloud.
osv.io

ZFS dmu.c/dmu_write() function fails with assertion #911

Open wkozaczuk opened 6 years ago

wkozaczuk commented 6 years ago

The following assertion failed in dmu.c:

VERIFY(0 == dmu_buf_hold_array(os, object, offset, size, FALSE, FTAG, &numbufs, &dbp));

I am attaching a screenshot of standard output:

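[image: zfs_crash] https://user-images.githubusercontent.com/5296673/30887078-d0be6b5a-a2e7-11e7-921c-091f4ef93fce.PNG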

raphaelsc commented 6 years ago

How did you reproduce it?


wkozaczuk commented 6 years ago

I am not sure I can reproduce it. It happened on a live OSv instance running on a VMware ESXi 6.0 cluster. It was a node.js app that does not even write to the file system. Truth be told, this is a test VMware cluster on which we have been experiencing networking issues, with hosts repeatedly disconnecting. So who knows whether there was some underlying host/hardware issue.

I found this issue reported on zfsonlinux that seems related: https://github.com/zfsonlinux/zfs/issues/1440. It seems to be related to a corrupted ZFS pool.

They reference a patch that improves reporting of this error by changing VERIFY to VERIFY0.
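To illustrate why that change could help, here is a minimal sketch of the difference between the two styles of check. This is not the ZFS source: `SKETCH_VERIFY`, `SKETCH_VERIFY0`, `sketch_verify0` and `fake_hold_array` are hypothetical names, the message formats are made up, and the sketch writes a message into a buffer where the real macros would panic. The assumption is that a VERIFY0-style check can report the value the call returned, while `VERIFY(0 == ...)` can only report the text of the failed expression.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for a call that returns an errno-style code.
 * The real dmu_buf_hold_array() is not reproduced here. */
static int fake_hold_array(void)
{
    return 5; /* any non-zero value means failure in this sketch */
}

/* VERIFY-style check: the condition is already collapsed to true/false,
 * so on failure only the expression text can be reported.
 * Evaluates to 1 if the check failed, 0 otherwise. */
#define SKETCH_VERIFY(msg, len, cond) \
    ((cond) ? 0 : (snprintf((msg), (len), "VERIFY(%s) failed", #cond), 1))

/* Helper for the VERIFY0-style check: it receives the evaluated value,
 * so the failure message can include the actual return code. */
static int sketch_verify0(char *msg, size_t len, const char *text,
                          int64_t value)
{
    assert(msg != NULL && len > 0);
    if (value == 0)
        return 0;
    snprintf(msg, len, "VERIFY0(%s) failed (%lld)", text, (long long)value);
    return 1;
}

/* VERIFY0-style check: evaluates the expression once and passes both
 * its text and its value to the helper. */
#define SKETCH_VERIFY0(msg, len, expr) \
    sketch_verify0((msg), (len), #expr, (int64_t)(expr))
```

With a check written as `SKETCH_VERIFY0(msg, sizeof msg, fake_hold_array())`, the message carries the error code, which is the piece of information missing from the assertion output in this report.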
