Friday, 2019-09-20

*** goldyfruit___ has joined #openstack-masakari00:01
*** openstackgerrit has joined #openstack-masakari02:50
openstackgerritpengyuesheng proposed openstack/masakari master: Blacklist requests-mock 1.7.0  https://review.opendev.org/68326902:50
openstackgerritArthur Dayne proposed openstack/masakari master: Fix the bug #1836354 that masakari cannot funtion well with noauth2 strategy  https://review.opendev.org/67068003:36
openstackbug 1836354 in masakari "Masakari service cannot function well when set auth_strategy=noauth2" [Undecided,In progress] https://launchpad.net/bugs/1836354 - Assigned to Arthur Dayne (palagend)03:36
*** jawad_axd has joined #openstack-masakari05:58
openstackgerritShilpa Devharakar proposed openstack/masakari-monitors master: Add operator guide documentation  https://review.opendev.org/48909506:01
*** jawad_ax_ has joined #openstack-masakari06:02
*** jawad_axd has quit IRC06:03
openstackgerritShilpa Devharakar proposed openstack/python-masakariclient master: Update operator guide documentation  https://review.opendev.org/68330607:21
*** tpatil has joined #openstack-masakari08:04
tpatiluneek: Ping?08:09
uneekpong08:09
tpatiluneek: You want to set up hostmonitor right?08:10
uneekI want to setup, whats neccessary ;)08:10
tpatiluneek: we are updating operator's guide for masakari-monitors08:11
uneekso as I assume, hostmonitor is the least that I need08:11
tpatil#link : https://review.opendev.org/#/c/489095/1108:11
tpatilyes, if you want to evacuate VMs running on the compute host, you need masakari-hostmonitor08:11
uneekyeah, I'm following the commit logs08:11
tpatilplease download https://review.opendev.org/#/c/489095/11/doc/source/_static/images/masakari-monitors.jpg,unified and see the architecture diagram.08:12
tpatilit will help you to understand the components needs in your setup08:12
tpatils/needs/needed08:13
tpatilthe same architecture diagram is also available on masakari wiki08:14
tpatil#link : https://wiki.openstack.org/wiki/Masakari08:14
uneekI'm sorry, but somehow there is still a twisted knot in my head: How is this supposed to work? How should the engine realize that the compute is down, if the hostmonitor is running on the computenode? Is the hostmonitor permanently talking to the local pacemaker? I don't assume that. I'm afraid that diagram doesn't answer my question.08:20
uneekor is the hostmonitor just an process, that connects to pacemaker and monitors the availability of the compute resource?08:22
uneekI'm happy to contribute to a better documentation, that would answer this, but I would need to understand it before08:23
tpatilThat's correct, hostmonitor is a process which get the information from crm if any of the compute node is down and then it send the notification to masakari08:23
tpatilsay you have compute nodes A and B and you have created a pacemaker cluster08:26
tpatilNow, if compute A goes down, masakari-hostmonitor running on compute B will read the information from CRM and it will see that compute A is down, then it will send the notification to masakari-api service that compute A is down, and then masakari-engine will execute the workflow to evacuate VMs running on compute host A08:27
uneekAh, so essentially it could also run on the control nodes08:28
tpatilone should run masakari-api and masakari-engine services on the controller nodes08:28
uneekcan the hostmonitor also connect to a pacemaker-remote process?08:28
tpatilmasakari-monitor process should run only on compute nodes08:28
openstackgerritShilpa Devharakar proposed openstack/python-masakariclient master: Update operator guide documentation  https://review.opendev.org/68330608:29
uneekso can you please elaborate, what the tasks of host hostmonitor are, so that they must be running on the computes? If its only reading from the CRM then this can clearly also be done on the controllers.08:31
uneekI can see, why instance and process monitor needs running on compute. But does hostmonitor do anything beside that?08:32
tpatilI agree, it's possible to run hostmonitor on controller node as well08:33
tpatilbut the other three masakari-monitor services should run on the compute node i..e processmonitor, instancemonitor and introspectivemonitor08:34
tpatilhostmonitor internally calls cibadmin --query to check the status of all hosts and if any of the hosts are offline, notification will be sent08:36
tpatilyou can see the code here: https://github.com/openstack/masakari-monitors/blob/master/masakarimonitors/hostmonitor/host_handler/handle_host.py#L33208:38
tpatiland https://github.com/openstack/masakari-monitors/blob/master/masakarimonitors/hostmonitor/host_handler/handle_host.py#L30108:38
uneekmom, afk08:38
uneekre08:46
uneekso, on the pacemaker-remote nodes the "cibadmin --query" command fails, thats one of the reasons, I want to understand whats happening08:47
uneekAnd I assume the matching of the resource/node-names in pacemaker is done against the name of the compute service that is registered in openstack-nova - so best stick with fqdn there08:49
tpatilyes that's correct08:50
tpatilhostmonitor internally maintain the status of the hosts, after it run the command "cibadmin --query", it will get the current status of the hosts08:51
tpatilit will compare it with the old status and according send notification to masakari08:51
uneekthe other monitor processes don't need access to the pacemaker, right? they only gather local information and talk directly to the masakari-api, right?08:51
tpatilyes, that' correct. no need of pacemaker for other monitors08:52
uneekok - I think I got it now a bit more clear.08:52
tpatilGreat, if you have any further questions please contact me on mailing list or IRC08:53
uneekI would suggest for the architecture diagram, to split it completely between the different monitor types/scenarios so you don't get overwhelmed of all the possible communications08:54
uneekyeah, I will, I'll stick here in IRC ;)08:54
tpatilI think that's a good suggestion, will try to do it at our end08:55
tpatilI will bring up this point in the bi-weekly meeting08:55
tpatilthe next meeting with be held on Tuesday08:56
uneekJust FIY: One thing that makes it a bit harder for me to disclutter all the information is, that we have deployment policy here, that we build minimal docker containers for every service and only run the needed parts, where they should. So thats why I'm a) trying to get this straight in my head, b) trying to explicitly describe the interfaces of the components between each other. And from my current view, I08:58
uneekreally think, that the hostmonitor-process is better placed on some control nodes rather then the compute nodes08:58
uneekbecause by simple trial I found, that the cibadmin -query fails on the pacemaker-remote nodes08:59
uneekwhich will be installed on all the compute-nodes08:59
tpatiluneek: I will discuss about this point with Sampath who is PTL in the next meeting and answer to this question09:02
uneek+109:02
openstackgerritpengyuesheng proposed openstack/masakari master: Update the constraints url  https://review.opendev.org/68333309:22
*** tpatil has quit IRC09:34
*** brinzhang has quit IRC10:14
*** goldyfruit___ has quit IRC12:12
*** jawad_ax_ has quit IRC13:16
*** jawad_axd has joined #openstack-masakari13:17
*** jawad_axd has quit IRC13:22
*** goldyfruit___ has joined #openstack-masakari13:28
*** goldyfruit_ has joined #openstack-masakari13:35
*** goldyfruit___ has quit IRC13:35
*** goldyfruit___ has joined #openstack-masakari13:36
*** goldyfruit_ has quit IRC13:39
*** openstackgerrit has quit IRC14:06
*** goldyfruit___ has quit IRC15:21
*** goldyfruit has joined #openstack-masakari15:26
*** jawad_axd has joined #openstack-masakari15:55
*** jawad_axd has quit IRC15:59
*** goldyfruit has quit IRC17:01
*** goldyfruit has joined #openstack-masakari17:01
*** goldyfruit has quit IRC17:31
*** openstackgerrit has joined #openstack-masakari17:41
openstackgerritOpenStack Release Bot proposed openstack/python-masakariclient master: Update master for stable/train  https://review.opendev.org/68361517:41
*** goldyfruit has joined #openstack-masakari18:07
*** goldyfruit_ has joined #openstack-masakari18:14
*** goldyfruit has quit IRC18:17
*** goldyfruit_ has quit IRC22:42
*** jawad_axd has joined #openstack-masakari23:24
*** jawad_axd has quit IRC23:28

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!