Thursday, 2018-10-18

*** bobh has quit IRC01:00
*** rakhmerov has joined #openstack-mistral03:16
*** akovi has joined #openstack-mistral04:18
akovi@vgoleg: clearing out the cache everywhere may be the right solution. It's a cache anyway. However, the whole rerun feature seems to be very much off-course in this respect. Just asking: why is it so that you are rerunning stuff instead of creating a normal excecution?05:27
*** hardikjasani has joined #openstack-mistral06:02
*** jtomasek has joined #openstack-mistral06:18
openstackgerritAkhil jain proposed openstack/mistral master: Add framework for mistral-status upgrade check  https://review.openstack.org/61151306:47
rakhmerovd0ugal: hi07:16
rakhmerovhere?07:16
*** shardy has joined #openstack-mistral07:23
rakhmerovd0ugal, akovi: hey, I just sent an email to openstack-dev about oslo messaging & "blocking" executor07:23
rakhmerovwould be good if you could say a couple supporting words07:24
*** openstackgerrit has quit IRC07:35
*** apetrich has quit IRC07:39
vgvolegakovi: e.g. service was unavailable and we perform rerun07:46
akovidoesn't sound like an idempotent solution :(07:46
akoviStill, I think it should be the WF that takes care of not running actions that neeed not be running07:47
akoviThe whole concept of rerun with overwriting history is not a particularly good design decision07:48
vgvolegok, I got you, rerun == deprecated feature07:52
vgvolegBut the problem is that it isn't deprecated and we found some mistral problems using it07:53
vgvolegMy question was about how to get rid of them07:55
vgvolegNot about advantages or disadvantages of rerun feature07:56
vgvolegFor now, caches may cause some problems in case of using multiple engine instances07:57
vgvolegI am not sure if I understand this issue right or not08:00
*** apetrich has joined #openstack-mistral08:00
d0ugalrakhmerov: sure, I'll take a look  shortly08:14
d0ugalI need to go out briefly this morning tho08:14
akoviyes, so the cache is building on the assumption that tasks in final states will not change08:18
*** shardy has quit IRC08:18
therverakhmerov: What doesn't work with pymysql and eventlet?08:18
akovirerun simply violates this08:18
akovione solution would be your proposal, to blow the cache if rerun is executed08:19
akoviyou will have to implement some enhancement that ensures all engines do this before beginning to deal with outdated tasks08:20
akoviI don't have an idea now how this should be implemented08:21
akovisynchronization between the engines exists only through the DB08:21
akovibut storing this in the DB would require making an additional check for the new state before every operation on the tasks08:22
akovitherve: we see that the engine process is getting overwhelmed by parallel tasks and starts failing towards external services08:23
therveakovi: Failing how?08:24
akovitherve: in our case RMQ hearbeats are missed and then the whole cluster crashes when the different mistral processes start reconnecting every few seconds08:24
therveThat doesn't sound closely related to eventlet08:25
therveMore a general concurrency issue08:25
akovisure08:25
akoviwith the blocking executor it happens less frequently and the service stays stable in the long run08:26
*** shardy has joined #openstack-mistral08:26
therveI'd say that ought to be fixed by configuring the concurrency properly08:27
akovican you help with this?08:27
therveSet executor_thread_pool_size for example08:27
*** openstackgerrit has joined #openstack-mistral09:15
openstackgerritRenat Akhmerov proposed openstack/mistral stable/rocky: Update OnClauseSPec task name criteria  https://review.openstack.org/61155009:15
rakhmerovapetrich: hi, were you going to take this one? https://bugs.launchpad.net/mistral/+bug/179365109:17
openstackLaunchpad bug 1793651 in Mistral "Backwards compatibility issue: when starting a workflow "params" can't be null now" [High,Confirmed]09:17
rakhmerovI may be mistaken09:17
rakhmerovjust want to confirm09:18
*** gkadam has joined #openstack-mistral09:21
*** gkadam has quit IRC09:21
rakhmerovakovi: as far as the cache, it simply needs to be cleared out on rerun09:24
rakhmerovit's safe to do09:25
rakhmerovbtw, I found an issue with that cache recently, will soon be pushed upstream09:25
rakhmerovtherve: Andras described his issues with rmq, I've also seen some other things. For example, we have a mechanism DB based lockes based on transactional properties where a thread should block till it gets a lock in DB09:28
rakhmerovand it doesn't work09:28
rakhmerovit's not blocked and moves forward09:28
rakhmerovtherve: it's still a very early feedback though and I keep investigating09:28
therverakhmerov: OK. That sounds fixable, but I don't know how hard it is obviously09:43
rakhmerovtherve: you mean fixable what?09:44
rakhmerovnot removing "blocking" executor for now?09:45
therverakhmerov: Your db locking mechanism09:45
rakhmerovaah09:45
rakhmerovit worked but without eventlet09:45
rakhmerovthat's the whole point09:45
rakhmerovif I run two processes and simulate a situation when one process (a thread within it) needs to block then it blocks09:46
rakhmerovbut yeah... we will see09:46
therveYeah that worked because you had no concurrency09:47
therveThat helps locking mechanism in general :)09:47
rakhmerovtherve: no-no09:56
rakhmerovI may have misunderstood09:56
rakhmerovit worked with different processes09:57
rakhmerova thread in one process would block until a thread in the other one released it09:57
rakhmerovit'd be totally fine if eventlet dispatched to another green thread in this situation09:58
rakhmerovbut it seems to let the same thread to move forward09:58
apetrichrakhmerov, I wasn't but I probably can10:13
rakhmerovapetrich: ooh, ok )10:17
rakhmerovso, if you could look.. :)10:17
rakhmerovshould be very simple10:17
openstackgerritAkhil jain proposed openstack/mistral master: Add framework for mistral-status upgrade check  https://review.openstack.org/61151310:17
apetrichrakhmerov, aye10:18
apetrichassigning to me.10:18
rakhmerovthanks!10:19
akovirakhmerov: yes, the cache should be cleared; the issue is how do you do this on all engine instances?10:44
rakhmerovakovi: ooh... right10:44
rakhmerovgot your point now10:44
rakhmerovI thought about having a TTL cache10:45
rakhmerovit's not 100% reliable although if set to a reasonable value it would help10:45
akoviunfortunately, a ttl would not be deterministic10:48
openstackgerritAndras Kovi proposed openstack/mistral master: Fix state change propagation in workflows  https://review.openstack.org/60796011:00
*** thrash|g0ne is now known as thrash11:44
*** apetrich has quit IRC12:21
*** apetrich has joined #openstack-mistral12:33
*** gkadam has joined #openstack-mistral12:34
openstackgerritRenat Akhmerov proposed openstack/mistral master: WIP: improving join  https://review.openstack.org/61046112:39
d0ugalrakhmerov: I thought the oslo team had agreed to keep the blocking executor?12:52
d0ugalhas there been a recent change and a move to remove it again?12:53
*** bobh has joined #openstack-mistral13:01
openstackgerritMerged openstack/mistral master: Reduce the concurrency in the 500 wb join Rally task  https://review.openstack.org/60891013:10
*** jrist has quit IRC13:11
*** jrist has joined #openstack-mistral13:13
*** akovi has quit IRC13:28
*** hardikjasani has quit IRC13:38
*** thrash is now known as thrash|biab13:56
*** gkadam has quit IRC13:59
*** gkadam has joined #openstack-mistral13:59
*** apetrich has quit IRC14:11
*** thrash|biab is now known as thrash14:16
*** apetrich has joined #openstack-mistral14:20
*** gkadam_ has joined #openstack-mistral14:33
*** gkadam has quit IRC14:35
*** gkadam_ has quit IRC15:38
*** shardy has quit IRC16:43
*** apetrich has quit IRC18:10
*** apetrich has joined #openstack-mistral18:24
*** bobh has quit IRC19:17
*** apetrich has quit IRC19:44
*** openstackgerrit has quit IRC20:36
*** bobh has joined #openstack-mistral22:37
*** bobh has quit IRC22:41

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!