[Bug 1728527] Re: hostmonitor can not monitor pacemaker_remote node via cibadmin query

OpenStack Infra 1728527 at bugs.launchpad.net
Thu Oct 10 11:43:02 UTC 2019


Reviewed:  https://review.opendev.org/647756
Committed: https://git.openstack.org/cgit/openstack/masakari-monitors/commit/?id=dc9b77772417c99368a4bbe243bb8e0e7c0bca47
Submitter: Zuul
Branch:    master

commit dc9b77772417c99368a4bbe243bb8e0e7c0bca47
Author: Liam Young <liam.young at canonical.com>
Date:   Tue Mar 19 20:05:22 2019 +0000

    Use crm_mon for pacemaker-remote deployments
    
    As described in bug #1728527 cibadmin does not expose the state of
    the pacemaker-remote nodes which means hostmonitor cannot track
    them. This change switches to use crm_mon to check the status of
    remote nodes if the new config option host.restrict_to_remotes
    to set to True. This will trigger host monitor to use crm_mon
    to monitor nodes and will only monitor nodes that are marked
    as remotes (not members).
    
    Change-Id: I3f2026805413504c875ea5f39eb036d44b26dd43
    Depends-On: Iaa2251708616e9c69817bf5b346d795ea7a4d21b
    Closes-Bug: #1728527


** Changed in: masakari-monitors
       Status: In Progress => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to Ubuntu Cloud Archive.
https://bugs.launchpad.net/bugs/1728527

Title:
  hostmonitor can not monitor pacemaker_remote node via cibadmin query

Status in Ubuntu Cloud Archive:
  Fix Released
Status in Ubuntu Cloud Archive rocky series:
  New
Status in Ubuntu Cloud Archive stein series:
  Fix Released
Status in masakari-monitors:
  Fix Released
Status in masakari-monitors package in Ubuntu:
  Fix Released
Status in masakari-monitors source package in Disco:
  Fix Released

Bug description:
  Currently Masakari host-monitor only grep the `crmd` status of real node via `cibadmin -Q` command.
  In case of pacemaker_remote, the `crmd` attribute is not existed, so remote node always marked in `None` state.

  Below is an example xml status of remote node:
  <node_state remote_node="true" id="cpu1" uname="cpu1" crm-debug-origin="remote_node_init_status" node_fenced="0">
        <transient_attributes id="cpu1">
          <instance_attributes id="status-cpu1"/>
        </transient_attributes>
      </node_state>
      <node_state remote_node="true" id="cpu2" uname="cpu2" crm-debug-origin="remote_node_init_status" node_fenced="0"/>

  And the log from masakari hostmonitor:
  2017-10-30 14:15:44.679 1813 INFO masakarimonitors.hostmonitor.host_handler.handle_host [-] Recognized 'cpu1' as a new member of cluster. Host status is 'None'.

To manage notifications about this bug go to:
https://bugs.launchpad.net/cloud-archive/+bug/1728527/+subscriptions



More information about the Ubuntu-openstack-bugs mailing list