Wednesday, 2015-12-23

openstackgerritLingxian Kong proposed openstack/python-mistralclient: Support ID for workflow operations in CLI  https://review.openstack.org/26075500:46
*** gyee has quit IRC01:28
*** lkannan has quit IRC01:59
*** lkannan has joined #openstack-mistral02:01
*** bobh has joined #openstack-mistral04:44
*** bobh has quit IRC05:45
*** bobh has joined #openstack-mistral05:49
openstackgerritLingxian Kong proposed openstack/python-mistralclient: Support ID for workflow operations in CLI  https://review.openstack.org/26075505:54
*** bobh has quit IRC06:14
*** melisha has quit IRC06:22
*** melisha has joined #openstack-mistral06:22
*** [1]melisha has joined #openstack-mistral07:26
*** melisha has quit IRC07:26
*** [1]melisha is now known as melisha07:26
openstackgerritLiat Fried proposed openstack/mistral-dashboard: Mistral-dashboard:  Actions screen * Added “Run Action” screen to each row which allows of a specific action execution run.  https://review.openstack.org/25997107:28
*** [1]melisha has joined #openstack-mistral07:33
*** [2]melisha has joined #openstack-mistral07:37
*** melisha has quit IRC07:37
*** [2]melisha is now known as melisha07:37
*** [1]melisha has quit IRC07:40
openstackgerritLiat Fried proposed openstack/mistral-dashboard: Mistral-dashboard:  Actions screen  https://review.openstack.org/25997107:55
*** openstackgerrit has quit IRC08:47
*** openstackgerrit has joined #openstack-mistral08:47
LimorStotlandHi guess i have a question... anyone of you run ansible playbook from mistral?09:52
*** zhenguo has quit IRC10:01
openstackgerritChaozhe Chen proposed openstack/mistral: devstack/plugin.sh: stop using deprecated option group for rabbit  https://review.openstack.org/26093810:21
*** [1]melisha has joined #openstack-mistral11:02
*** melisha has quit IRC11:03
*** [1]melisha is now known as melisha11:03
openstackgerritLiat Fried proposed openstack/mistral-dashboard: Mistral-dashboard:  Actions screen * Added “Run Action” screen to each row which allows of a specific action execution run.  https://review.openstack.org/25997112:07
openstackgerritGal Margalit proposed openstack/mistral-dashboard: Mistral-dashboard:  Actions screen * Added “Run Action” screen to each row   which allows of a specific action execution run.  https://review.openstack.org/25997112:14
openstackgerritGal Margalit proposed openstack/mistral-dashboard: Mistral-dashboard:  Actions screen * Added “Run Action” screen to each row   which allows of a specific action execution run.  https://review.openstack.org/25997112:16
*** openstackgerrit has quit IRC12:17
*** openstackgerrit has joined #openstack-mistral12:18
openstackgerritLiat Fried proposed openstack/mistral-dashboard: Mistral-dashboard:  Actions screen  https://review.openstack.org/25997112:18
*** zhenguo has joined #openstack-mistral12:24
nmakhotkinhi, LimorStotland!12:31
LimorStotlandHi nmakhotkin12:31
nmakhotkinno, I haven't tried that12:31
*** tdurakov has joined #openstack-mistral12:37
tdurakovhi folks12:37
_gryfhi tdurakov12:37
rakhmerovtdurakov, _gryf12:38
rakhmerovyes12:38
rakhmerovso, Roman, we just talked to Timofei about whether he needs to use Mistral for VM evacuation and why12:38
rakhmerovand we'd like to know your perspective on that12:38
rakhmerovwhat are the reasons you decided to use Mistral?12:39
tdurakov_gryf,yep, could you provide some usecases where we need mistral12:39
_gryfOk12:39
_gryffirst of all there was at least 3 different approaches, that I've been involved regarding automatic evacuation12:40
_gryfmaybe let me start from the beginning12:40
tdurakovyes, please12:40
_gryfas you probably know, there is an enterprise gap in openstack regarding instance availability, which we are attempt to fill12:42
rakhmerovyep12:42
rakhmerovmaybe we should sync first on what you VM evacuation is12:42
rakhmerovso that we understand it the same way12:42
_gryfoh12:42
_gryfok12:42
_gryfso for me, automatic evacuation is a way, to detect and perform instance (vm - if you please) rebuild on other compute node (a physical host)12:43
_gryfI intentionally use the "rebuild" word12:44
_gryfsince this is a post mortem process12:44
_gryfso it would be performed in situation, where there is no way to reach the instance, and we have to assume that it is dead12:45
rakhmerovok12:45
_gryfthis one is a bit tricky, since in some circumstances it is not easy to state that instance is dead or not12:46
_gryffor example - there might be situations, that nova compute is not responding12:46
tdurakov_gryf, what about cases when load on host is increased, so it's possible to move instance without rebuild?12:47
_gryfwhich might happen by network issues, software malfunction, and so on12:47
_gryftdurakov, yup12:47
_gryfit is called live migration12:47
tdurakovyep12:47
_gryfbut it's completely different story :)12:47
tdurakovok, let's focus first on rebuild then12:48
_gryfok12:48
_gryfthere couple of known approaches reagrding automatic evac12:49
_gryfyou can find kind of summary there: https://etherpad.openstack.org/p/automatic-evacuation12:49
rakhmerovok12:51
_gryfthere is an initiative from #openstack-ha people to bring all of the together and choose one as a reference solution for instance ha12:51
* tdurakov stated to read etherpad/masakari/etc. 12:52
_gryfas ypu can see, lots of the solution is based on the cluster manager - namely pacemaker12:52
_gryfwhich would be the source of truth, regarding the state of the hosts, on which instances are run12:53
rakhmerovyep, so as far as I understand12:54
_gryfbecasue of pacemaker abilities, we can say in 100% sure, that host, from which we are doing evacuation is really down.12:54
rakhmerovyes12:55
rakhmerovas far as I understand the question is the following12:55
rakhmerovis evacuation really a long running (heavy) process?12:55
rakhmerovso that we need to automate it with Mistral12:55
_gryflet me put it that way12:55
rakhmerovbecause if it's just a couple of API calls then it doesn't make sense probably to use Mistral12:56
rakhmerovok12:56
_gryfevacuation of single vm is dependent of what kind of vm are we going to resurrect12:56
rakhmerovtdurakov just expressed an opinion that using Mistral might be overengineering12:56
_gryfthe bigger vm, it might take couple of minutes12:56
rakhmerovok12:57
_gryfright12:57
_gryfwe can just fire nova evacuate --on-shared-host vm_name12:57
_gryfand forget12:57
_gryfthe truth is12:58
_gryfwe cannot12:58
_gryfsince if we really like our vm, we have to be sure, that certain vm will be up and running12:58
rakhmerovhm.. not sure I understand12:59
_gryfnova client doesn't gives us such certainty12:59
rakhmerovtdurakov: does it make sense to you?12:59
rakhmerovok, how does that command work?12:59
rakhmerovcould you explain very briefly12:59
rakhmerov?12:59
_gryfyup12:59
tdurakovyep, this make sence13:00
rakhmerovgood :)13:00
_gryfyou can use it just like I already described (evacuate a single vm). command will gives you only the ack, that nova client just sent the request to the nova api13:00
_gryfand return an ack13:00
rakhmerovooh, I see what you mean now13:01
rakhmerovok13:01
_gryfthat's it. it will not monitor anything regarding the rebuilding process13:01
_gryfmoreover13:01
_gryfyou can just execute the command within all the host, which you want to evacuate machines from13:01
_gryfand this is tricky13:02
_gryfsince the functionality is currently implemented on… client side13:02
tdurakovso, it's gonna be not fire and forget, but track the process, right?13:02
_gryfso, nova client will sequentailly send a req to the nova api13:02
_gryfactually - no :)13:02
_gryfthis is all the same13:03
rakhmerovand nova client is not HA13:03
_gryfbut another threat it bring up - what if nova client dies in that process?13:03
rakhmerovyeah-yeah, this makes sense13:04
rakhmerovI actually didn't realize so far that it works this way13:04
rakhmerovthat it's implemented on a client13:04
rakhmerov113:04
rakhmerov(sorry)13:04
_gryfright13:04
LimorStotlandHi can anyone pleas review ? https://review.openstack.org/#/c/259762/ Thanks13:05
rakhmerovLimorStotland: ok13:05
_gryfthere might be another scenario, like what if compute dies during the rebuilding - even in single vm evac nova client will be unaware of this13:05
tdurakov_gryf, one point here13:06
tdurakovif compute dies during rebuild, what should be done next?13:06
_gryfthat's why another HA enabled service for executing task is needed, which is aware of the state of the tasks and might perform another action13:06
_gryftdurakov, it depends :)13:07
_gryftdurakov, for a VMs which have to be up and running, evacuation should be performed again13:08
_gryfso it will up and running eventually13:08
rakhmerovLimorStotland: can you please a couple of formatting issues?13:09
rakhmerovother than that looks ok13:09
rakhmerovI left my comments13:09
LimorStotlandok thanks rakhmerov13:09
tdurakov_gryf, guess, I got the point:)13:09
rakhmerovok, if we try to summarize:13:10
_gryfand this task IMO Mistral is perfectly matched - it might be configured to perform tasks (including the task, which would monitor the process of evacuation) and perform an action depending on result13:10
tdurakovlet's set up meeting next week, say Monday, need time to read all etherpad/links to be on the same page13:11
_gryftdurakov, you can also read the irc meeting logs for high availability13:11
tdurakovyep, good point!13:12
_gryfi'll put the links here13:12
rakhmerovmy summary: even though there's an API call in nova for this there can be a lot of issues happening so we need to monitor the process and make sure it will be completed. If we want to evacuate multiple VMs it's a problem because it's implemented on a client side and client can't be HA13:12
_gryfrakhmerov, the problem with HA of client is also true for evacuating single  VM13:13
rakhmerovyes, got it13:13
rakhmerovbecause it only sends a signal to start evacuation13:13
_gryfright13:13
rakhmerovw/o providing a guarantee that it will be completed13:13
_gryfcorrect13:13
tdurakovyep, we need to have more control over the process13:13
rakhmerov_gryf: I'm actually a little bit confused. Where are you based of?13:14
rakhmerovPoland?13:14
rakhmerovor US13:14
_gryfrakhmerov, yes13:14
_gryfit's Poland13:14
rakhmerovooh, I thought it was US13:14
rakhmerovgotcha13:14
_gryf:)13:14
rakhmerov_gryf: could you please suggest a couple of time slots for the next Monday?13:15
rakhmerovI'm not sure I can participate because I'll be on a trip but I'll do my best13:15
rakhmerovor Tue13:16
_gryfrakhmerov, anything will do from 9:00am UTC13:16
rakhmerovTue would work better for me13:16
rakhmerovok, from noon msk13:17
_gryfrakhmerov, I'll be able to participate from Mon-Thu13:17
tdurakovrakhmerov, maybe Tue then?13:17
rakhmerovyes13:17
rakhmerovI'll send an invite in a min13:17
tdurakovcool13:18
_gryfwait, Monday to Thursday13:18
_gryfso whenever you guys can in that period :)13:18
rakhmerovjust sent an invite13:20
tdurakovyep, thanks a lot, _gryf13:20
rakhmerovplease let me know if it works13:20
_gryfrakhmerov, 10am UTC?13:21
_gryf10:30 UTC, got it13:22
rakhmerovyes13:22
rakhmerov)13:22
_gryfrakhmerov, could you also add ddeja for this meeting?13:22
rakhmerovyes, give me his email please?13:24
rakhmerov_gryf: ^13:25
_gryfrakhmerov, already did :) checkout the query msg :)13:25
rakhmerovgot it13:27
rakhmerovthanks13:27
_gryfok. so we all set.13:27
openstackgerritLimor Stotland proposed openstack/mistral: If task fails on timeout - there is no clear message of failure  https://review.openstack.org/25976213:28
openstackgerritLimor Stotland proposed openstack/mistral: If task fails on timeout - there is no clear message of failure  https://review.openstack.org/25976213:29
*** dprince has joined #openstack-mistral13:30
_gryftdurakov, here are HA meetings logs - might be useful: http://eavesdrop.openstack.org/meetings/ha/2015/13:36
_gryftdurakov, and the first (missing) one: http://eavesdrop.openstack.org/meetings/ha__automated_recovery_from_hypervisor_failure/2015/ha__automated_recovery_from_hypervisor_failure.2015-11-16-09.00.html13:36
tdurakov_gryf, ok, will read them13:37
_gryfcool :)13:37
*** bobh has joined #openstack-mistral13:43
*** bobh has quit IRC13:47
lane_kongooh, i have read a good story of vm HA using mistral :-)13:56
*** bradjones has quit IRC14:16
*** bobh has joined #openstack-mistral14:46
*** bobh has quit IRC14:53
*** zhenguo has quit IRC14:56
*** dprince has quit IRC15:33
*** dprince has joined #openstack-mistral15:35
*** melisha has quit IRC15:49
*** melisha has joined #openstack-mistral15:51
*** bobh_ has joined #openstack-mistral16:17
openstackgerritMerged openstack/mistral-dashboard: UI: Task screen auto refresh  https://review.openstack.org/25993516:26
*** bobh has joined #openstack-mistral16:28
*** bobh_ has quit IRC16:31
*** bobh has quit IRC16:39
openstackgerritMerged openstack/mistral: If task fails on timeout - there is no clear message of failure  https://review.openstack.org/25976216:56
*** bobh has joined #openstack-mistral17:46
*** bobh has quit IRC17:48
*** Ephur has joined #openstack-mistral19:36
*** tonytan4ever has joined #openstack-mistral19:46
*** dprince has quit IRC21:30
*** bobh has joined #openstack-mistral21:41
*** openstack has joined #openstack-mistral22:22
*** openstack has joined #openstack-mistral22:37
*** bobh has quit IRC23:01
*** Ephur has quit IRC23:02
*** bobh has joined #openstack-mistral23:06
*** tonytan4ever has quit IRC23:18
*** tonytan4ever has joined #openstack-mistral23:19
*** tonytan4ever has quit IRC23:20
*** bobh has quit IRC23:54

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!