openvstorage / framework

The Framework is a set of components and tools that gives the user an interface (GUI / API) to set up, extend and manage an Open vStorage platform.

Missing/broken disk used by vPool causes error during rescan #832

Closed by JeffreyDevloo 7 years ago

JeffreyDevloo commented 8 years ago

Problem description

I added disks to my node and, when rescanning, got the error 'An error occurred while rescanning.'

Relevant entry from /var/log/upstart/ovs-workers.log:

2016-08-22 13:55:40 05300 +0200 - ovs-node1 - 15206/140602219763520 - celery/celery.worker.job - 104 - ERROR - Task ovs.disk.sync_with_reality[861492e5-28cc-4f34-8248-37769d9c487f] raised unexpected: LinkedObjectException('There are 3 items left in self.storagedrivers',)
Traceback (most recent call last):
  File "/usr/lib/python2.7/dist-packages/celery/app/trace.py", line 240, in trace_task
    R = retval = fun(*args, **kwargs)
  File "/usr/lib/python2.7/dist-packages/celery/app/trace.py", line 438, in __protected_call__
    return self.run(*args, **kwargs)
  File "/opt/OpenvStorage/ovs/lib/helpers/decorators.py", line 301, in new_function
    output = function(*args, **kwargs)
  File "/opt/OpenvStorage/ovs/lib/disk.py", line 211, in sync_with_reality
    partition.delete()
  File "/opt/OpenvStorage/ovs/dal/dataobject.py", line 700, in delete
    raise LinkedObjectException('There {0} left in self.{1}'.format(multi, key))
LinkedObjectException: There are 3 items left in self.storagedrivers

Possible root of the problem

A disk or partition seems to be missing. OVS attempts to fix this in the model by removing the corresponding information, but the removal fails because the partition is still linked to storagedrivers (most likely for the used/reserved size).
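To illustrate the suspected failure mode, here is a minimal sketch, not the actual OVS code. The `Partition` class, its `storagedrivers` list, and the simplified `sync_with_reality` function are hypothetical stand-ins. Only the exception name and the message format are taken from the traceback above. The way `multi` is built is an assumption.

```python
class LinkedObjectException(Exception):
    """Raised when an object that still has linked objects is deleted."""


class Partition(object):
    """Hypothetical stand-in for a modelled disk partition."""

    def __init__(self, name):
        self.name = name
        # Assumed reverse relation: storagedrivers that still use this partition.
        self.storagedrivers = []
        self.deleted = False

    def delete(self):
        # Refuse to delete while any reverse relation still holds items.
        for key in ('storagedrivers',):
            count = len(getattr(self, key))
            if count > 0:
                # Assumption: wording chosen to match the logged message.
                if count == 1:
                    multi = 'is 1 item'
                else:
                    multi = 'are {0} items'.format(count)
                raise LinkedObjectException(
                    'There {0} left in self.{1}'.format(multi, key))
        self.deleted = True


def sync_with_reality(modelled_partitions, present_names):
    """Simplified sketch: drop modelled partitions that are no longer present.

    Any LinkedObjectException propagates to the caller, as in the traceback.
    """
    for partition in modelled_partitions:
        if partition.name not in present_names:
            partition.delete()


if __name__ == '__main__':
    missing = Partition('sdb1')
    missing.storagedrivers = ['sd_a', 'sd_b', 'sd_c']
    try:
        sync_with_reality([missing], present_names=set())
    except LinkedObjectException as exc:
        print(exc)
```

In this sketch, a missing partition with no linked storagedrivers is removed cleanly, while one that is still in use by a vPool aborts the whole rescan.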

Additional information

Setup

Hyperconverged setup

JeffreyDevloo commented 8 years ago

Related: https://github.com/openvstorage/framework-alba-plugin/issues/169

khenderick commented 8 years ago

Actually, I'm not sure whether this can or should be resolved as such. If a disk is in use by a vPool and that disk breaks for some reason, we currently have no functionality in place that can cope with the failed disk (we don't have disk replacement on a StorageRouter yet). Any thoughts?

wimpers commented 8 years ago

After discussion with @khenderick: this should be tackled in a broader scope, where we monitor the disks used by StorageRouters: https://github.com/openvstorage/framework/issues/775

JeffreyDevloo commented 7 years ago

Closing because the broader scope has been tackled.