openvstorage / alba

Open vStorage ALBA (alternate backend) creates a replicated or flexible network raid’ed object storage backend out of Seagate Kinetic drives and local disk supporting compression, encryption.
Other
28 stars 10 forks source link

When restarting the maintenance agent, asd add Maintenance.NotMyTask to the recently errors #696

Closed jeroenmaelbrancke closed 7 years ago

jeroenmaelbrancke commented 7 years ago

Due to some problems in the maintenance agent we restart the agent every hour with a cronjob.

What we see in the osd information that we receive an error with Maintenance.NotMyTask. Framework checks those stats to calculate when they change the status in the GUI of the asd from OK to Warning. len(errors) > 0 and (len(reads + writes) == 0 or max(min(reads), min(writes)) <= max(errors) + 5min)

If the asd is less used we will receive each hour a warning in the ovs portal with recently errors.

22 : { Nsm_model.OsdInfo.kind =
  (Nsm_model.OsdInfo.Asd ((["172.20.20.5"], 8614, false, false),
     "mo80PmLGTty94mc0AkiL3epgcCoI9a1v"));
  decommissioned = false; node_id = "r3awp4l52PK2Hg7zfKnDONBkGQgeIMhP";
  other =
  "{ \"id\" : \"mo80PmLGTty94mc0AkiL3epgcCoI9a1v\", \"node_id\" : \"r3awp4l52PK2Hg7zfKnDONBkGQgeIMhP\", \"port\" : 8614, \"used_bytes\" : \"15333602792\", \"total_bytes\" : \"66647394304\", \"version\" : \"AsdV1\", \"network_interfaces\":[{\"ipv4_addr\": \"172.20.20.5\"} ]}";
  total = 66647394304L; used = 15333602792L;
  seen =
  [2017/04/03_16:26:28.4629; 2017/04/03_16:26:18.4625;
    2017/04/03_16:26:8.4626; 2017/04/03_16:25:58.4624;
    2017/04/03_16:25:48.4607; 2017/04/03_16:25:38.4622;
    2017/04/03_16:25:28.4604; 2017/04/03_16:25:18.4600;
    2017/04/03_16:25:8.4588; 2017/04/03_16:24:58.4588];
  read =
  [2017/04/03_16:22:22.1170; 2017/04/03_16:14:22.3054;
    2017/04/03_16:06:12.6419; 2017/04/03_15:58:3.4716;
    2017/04/03_15:50:5.6378; 2017/04/03_15:42:9.9903;
    2017/04/03_15:34:18.3160; 2017/04/03_15:26:25.9559;
    2017/04/03_15:18:8.5363; 2017/04/03_15:09:56.5125];
  write =
  [2017/04/03_16:22:22.1154; 2017/04/03_16:14:22.3041;
    2017/04/03_16:06:12.6383; 2017/04/03_15:58:3.4683;
    2017/04/03_15:50:5.6361; 2017/04/03_15:42:9.9880;
    2017/04/03_15:34:18.3311; 2017/04/03_15:26:25.9538;
    2017/04/03_15:18:8.5339; 2017/04/03_15:09:56.5089];
  errors =
  [(2017/04/03_15:34:18.2974, "End_of_file");
    (2017/04/03_15:00:12.3470, "Maintenance.NotMyTask");
    (2017/04/03_15:00:10.0730, "Maintenance.NotMyTask");
    (2017/04/03_10:03:12.1687, "End_of_file");
    (2017/04/02_20:00:2.2668, "Maintenance.NotMyTask");
    (2017/03/31_23:00:25.3194, "Maintenance.NotMyTask");
    (2017/03/30_21:00:22.6330, "Maintenance.NotMyTask");
    (2017/03/30_20:00:25.6275, "Maintenance.NotMyTask");
    (2017/03/30_20:00:23.8443, "Maintenance.NotMyTask");
    (2017/03/30_19:24:44.4750, "End_of_file")];

Can you guys remove this error from the asds when a maintenance agent restarts?

toolslive commented 7 years ago

312 #441 #547