Tuesday, 2018-08-14

*** jpenag has joined #zuul00:56
*** tflink has quit IRC00:57
*** Shrews has quit IRC00:57
*** jpena|off has quit IRC00:57
*** lennyb has quit IRC00:57
*** panda|ruck has quit IRC00:59
*** panda has joined #zuul01:02
*** tflink has joined #zuul01:04
*** Shrews has joined #zuul01:04
*** panda has quit IRC01:06
*** openstackgerrit has quit IRC01:06
*** panda has joined #zuul01:07
*** openstackgerrit has joined #zuul01:32
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: executor: enable add_host for trusted play and update inventory  https://review.openstack.org/59009201:32
*** jiapei has joined #zuul01:49
*** elyezer has quit IRC02:00
*** elyezer has joined #zuul02:03
*** elyezer has quit IRC02:35
*** elyezer has joined #zuul02:38
*** elyezer has quit IRC02:52
*** elyezer has joined #zuul02:53
*** elyezer has quit IRC03:20
*** elyezer has joined #zuul03:21
*** elyezer has quit IRC03:40
*** elyezer has joined #zuul03:44
*** jiapei has quit IRC03:59
*** elyezer has quit IRC05:02
*** elyezer has joined #zuul05:05
*** odyssey4me has quit IRC06:14
*** odyssey4me has joined #zuul06:14
*** elyezer has quit IRC06:40
*** elyezer has joined #zuul06:43
*** pcaruana has joined #zuul06:44
openstackgerritMarkus Hosch proposed openstack-infra/nodepool master: Move sphinx + deps to doc/requirements.txt  https://review.openstack.org/59156506:47
SpamapSwoooot.. AnsibleFest talk accepted!06:56
*** elyezer has quit IRC06:57
*** elyezer has joined #zuul07:01
*** gtema has joined #zuul07:18
SpamapStobiash: when you're around.. I'm curious why you have zuul-executor as a statefulset.07:48
SpamapSseems like it would work fine with just a deployment+volume07:48
gtemaSpamapS: yes, it should also work fine. I think it was just to keep "cached" info for each individual instance07:49
gtemastatefulset is the more recommended approach if there is state in place and, e.g., the instance name is important (the second is not very relevant here, compared to zookeeper)07:51
tobiashSpamapS: we're using the cinder driver for provisioning volumes. These are single attach. Thus a deployment would need the recreate strategy and rolling restarts/upgrades would not be possible.07:54
tobiashSpamapS: the second reason is that the executors need a real identity which can be reached by zuul-web for log streaming07:54
tobiashSpamapS: also having a constant identity is what we want with regards to statsd statistics07:56
SpamapSah interesting07:59
SpamapSI think having the caches sitting around as volumes is a good thing too.08:00
SpamapSI'm about 70% of the way through converting the openshift stuff to pure k8s08:00
SpamapSNow I'm dealing mostly with like, how to set up AWS's EKS and ECR with the right policies/users/etc.08:01
SpamapSanother weird thing is that kubectl apply doesn't like the yamls to be lists of objects08:04
SpamapSso I'm having to strip them down to single objects08:04
*** openstackstatus has quit IRC08:12
*** jpenag is now known as jpena08:27
*** sshnaidm|off is now known as sshnaidm08:48
*** elyezer has quit IRC08:53
*** elyezer has joined #zuul08:58
*** electrofelix has joined #zuul09:04
*** elyezer has quit IRC09:38
*** openstackstatus has joined #zuul09:42
*** ChanServ sets mode: +v openstackstatus09:42
*** elyezer has joined #zuul09:44
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: [wip] web: rewrite interface in react  https://review.openstack.org/59160409:56
tristanCmordred: i updated the react-zuul poc and proposed it in 591604. this version only includes the builds page, though the plumbing for white-label/multi-tenant should be working10:12
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: Do not abort node launch if failed node cannot be deleted  https://review.openstack.org/58985410:13
*** panda is now known as panda|ruck10:24
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: [wip] web: rewrite interface in react  https://review.openstack.org/59160410:40
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: [wip] web: rewrite interface in react  https://review.openstack.org/59160410:58
tristanCmordred: i shifted the base of the js things to the web directory so that it will be easier to move to a dedicated project, though that will raise the challenge of cross npm projects testing11:02
*** jpena is now known as jpena|lunch11:15
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: [wip] web: rewrite interface in react  https://review.openstack.org/59160411:32
*** elyezer has quit IRC11:39
*** elyezer has joined #zuul11:43
*** ssbarnea has quit IRC11:44
tristanChere is what zuul webui can look like with patternfly-react: logs.openstack.org/04/591604/4/check/zuul-build-dashboard-multi-tenant/7e7e6b8/npm/html/11:47
*** ssbarnea has joined #zuul11:51
*** jpena|lunch is now known as jpena12:21
*** rlandy has joined #zuul12:48
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: Do not abort node launch if failed node cannot be deleted  https://review.openstack.org/58985412:55
ShrewsFYI, out for a bit this morning to get my stitches removed. bbl12:57
mordredtristanC: interesting - it's gonna take me a bit to wrap my head around the differences in the javascript enough to be able to comment - but the builds and jobs pages seem to be working on both single and multi tenant13:02
*** elyezer has quit IRC13:10
tristanCmordred: cool, a couple of notes. it's using HashRouter as it seems easier to deploy, but we can use the BrowserRouter instead to keep the previous url path13:21
*** elyezer has joined #zuul13:21
tristanCmordred: and i used a redux store with a reducer for the initial /api/info call, but i don't know the logic exactly, so the rest of api calls are using '.then()' instead of the dispatch logic13:22
mordredtristanC: yeah - just like the angular I think it's likely there are a few things we'd need to learn13:43
mordredtristanC: I'm otherwise leaving some comments on the patch itself13:44
openstackgerritMarkus Hosch proposed openstack-infra/nodepool master: Add list of metrics provided to statsd  https://review.openstack.org/59023313:49
*** elyezer has quit IRC13:52
*** elyezer has joined #zuul13:55
mordredtristanC: silly question - in javascript and typescript the trailing ;'s are optional - does the jsx parser allow you to leave them out? (asking because I've come to enjoy the ;-less code)13:56
tristanCmordred: i think so, i put them because rjsx-mode warns about them by default13:58
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: Do not abort node launch if failed node cannot be deleted  https://review.openstack.org/58985414:01
tristanCmordred: thanks a lot for the prompt review. I'll try to add a status and stream page implementation tomorrow.14:08
mordredtristanC: cool. and ok - don't put TOO much work into it until we've had a chance to circle back around with corvus - and we've got a couple of infra things today that might dominate our time14:14
mordredtristanC: I think the builds/jobs pages have enough code in them to be able to look at what's going on to assess the differences14:15
corvusmnaser: ^ fyi because you're interested in this too14:33
* mnaser reads14:33
*** bstinson_ is now known as bstinson14:55
tobiashmhu: commented on 58985414:58
panda|ruckhow are the variables added to a zuul job configuration passed to the ansible playbook ? are they passed through inventory only ? Is there any way to pass them as extra-vars so they can't be overridden in the playbook itself ?15:05
mordredtobiash: aren't you supposed to be on vacation?15:11
clarkbmordred: panda|ruck I want to say secrets are treated as extra vars15:11
mordredclarkb, panda|ruck: yes - and also I think pabelanger was working on ability to pass arbitrary extra_vars15:11
mordredI do not remember if we landed that yet or not15:11
pabelangerhttps://zuul-ci.org/docs/zuul/user/config.html#attr-job.extra-vars15:11
pabelangeralready landed15:11
pabelangerbut, not as a file, need to add them into zuul.yaml for now15:12
panda|ruckmmmmhh15:12
mordredhttps://zuul-ci.org/docs/zuul/user/jobs.html#job-extra-variables15:12
mordredpabelanger beat me to it15:12
panda|ruckusing a file may certainly be convenient, but this is a start, thanks :)15:13
tobiashmordred: yes, I'm on vacation, but sometimes I like doing some reviews ;)15:13
pabelangerI think we could eventually use an extra-vars file from the repo too15:13
mordredtobiash: :)15:13
panda|ruckmordred: clarkb pabelanger thanks all15:13
pabelangerpanda|ruck: https://zuul-ci.org/docs/zuul/user/jobs.html#variables is a good read for zuul variable precedence15:15
clarkbwithout knowing what you are trying to do, I think one of the ideas with zuul job inheritance is that you can modify these things15:16
clarkbthe devstack base job and all of its children are a great example of this15:16
openstackgerritMarkus Hosch proposed openstack-infra/nodepool master: Add list of metrics provided to statsd  https://review.openstack.org/59023315:20
pabelangertobiash: do you have any check jobs in place to validate a github project needs to have repo.change_merge_setting setup?15:24
pabelangerI'm struggling to find that setting in our zuul docs15:24
tobiashpabelanger: no, we don't have this as a check job15:25
tobiashbut yes, that's needed if you want to do gating on github15:26
pabelangertobiash: also, are you managing your github projects via yaml at all? Or do users just use the webui to create them?15:38
tobiashpabelanger: they just use the webui15:39
*** darkwisebear has joined #zuul15:40
pabelangertobiash: and they follow docs to make sure branch protection / merge commits are properly set up?15:43
tobiashyes15:43
pabelangerk15:44
openstackgerritMarkus Hosch proposed openstack-infra/nodepool master: Add metric for image build result  https://review.openstack.org/59041215:44
tobiashthat's one reason why we gate our tenant config15:45
pabelangertobiash: yah, I want to do the same for sf.io15:45
pabelangerI'm sure with winterscale, we'd want to do the same, if supporting github15:46
*** pcaruana has quit IRC16:02
pabelangerany chance of landing https://review.openstack.org/591457/ today, adds nodepool info into emit-job-header role16:07
*** jpena is now known as jpena|off16:13
*** elyezer has quit IRC16:18
*** elyezer has joined #zuul16:19
darkwisebeartobiash: Is https://review.openstack.org/591565 sufficient to separate doc requirements from test requirements? It's the new prerequisite so that https://review.openstack.org/590233 may eventually land16:22
tobiashdarkwisebear: I think you also need to adapt tox.ini to not break local doc builds16:23
*** gtema has quit IRC16:32
*** darkwisebear has quit IRC16:35
*** rlandy is now known as rlandy|brb16:46
*** rlandy|brb is now known as rlandy17:24
*** Shrews has quit IRC17:52
pabelangertristanC: corvus: do you have a moment to look at http://paste.openstack.org/show/728030/18:09
pabelangerI am wondering if we have a bug in zuul, when we have the same project in 2 different connections18:09
pabelangerhttps://softwarefactory-project.io/r/13256/ was the change in question18:09
corvustristanC, mordred: the react patch looks promising.  i think i can follow it fairly well so far (i think it may be easier to follow than angular).  i think if you wanted to look at getting the jquery status page in there along with streaming, so we can see if there are potential problems with that, it would be a good use of time. also, maybe just so we can see some more navigation and html, maybe you could18:09
corvuslook at doing the "job" page (the thing we were talking about over the weekend)18:09
corvuspabelanger: biab.18:11
fungiso if we wanted to add some temporary info pointing people to the summit from https://zuul-ci.org/ where would that best fit into the page? i think i have a blurb on the way from the events marketing team to let people know about zuul content scheduled for berlin18:11
pabelangercorvus: tristanC: okay, I think it is a syntax issue in our tenant configuration18:14
corvuspabelanger: you found it?  (it's included twice under the same source (lines 884 and 904))18:18
pabelangercorvus: I did thanks, missed it the first time looking18:18
pabelangerremote:   https://softwarefactory-project.io/r/13414 Remove duplicate openstack-infra/tripleo-ci18:18
pabelangergoing to see how to best add a check job to protect users from making the mistake again18:18
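A minimal sketch of the kind of check discussed above, assuming the tenant configuration has already been parsed from YAML into Python lists and dicts, with projects listed per connection under `config-projects` and `untrusted-projects`. The function names and the connection name in the usage note are hypothetical, and this is not the logic of `zuul tenant-conf-check`.

```python
from collections import Counter


def project_name(entry):
    # An entry is either a bare project name or a one-key mapping of
    # project name -> options.
    if isinstance(entry, str):
        return entry
    return next(iter(entry))


def find_duplicate_projects(tenants):
    """Return (tenant, source, project) tuples for projects listed more than once."""
    duplicates = []
    for item in tenants:
        tenant = item["tenant"]
        for source, sections in tenant.get("source", {}).items():
            # Count every project named under this source, across both sections.
            counts = Counter(
                project_name(entry)
                for section in ("config-projects", "untrusted-projects")
                for entry in sections.get(section, [])
            )
            duplicates.extend(
                (tenant["name"], source, name)
                for name, count in sorted(counts.items())
                if count > 1
            )
    return duplicates
```

Called on a tenant whose hypothetical source `example-gerrit` lists `openstack-infra/tripleo-ci` twice, this would report that project once, which a check job could turn into a failure.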
corvusfungi: i don't think we have something ready-made for that (we already used all the tricks arcana had).  we could re-do the shaded banner at https://html5up.net/arcana fairly easily ("a responsive site template...") or we could make another call-to-action bar ("are you ready to continue your quest").  or we could do something new.18:24
fungicool, i'll see what i can come up with to propose18:24
fungithanks for the insights!18:25
*** electrofelix has quit IRC18:26
*** Shrews has joined #zuul18:48
*** elyezer has quit IRC19:02
*** elyezer has joined #zuul19:05
*** elyezer has quit IRC19:21
*** abelur has quit IRC19:27
*** abelur has joined #zuul19:27
*** zxiiro has quit IRC19:27
*** zxiiro has joined #zuul19:27
*** myoung has joined #zuul19:32
*** elyezer has joined #zuul19:41
pabelangercorvus: tristanC: decided to try zuul tenant-conf-check for the invalid rdo.yaml configuration from before, and it seems to pass validation. I'll look more into it tomorrow23:24
SpamapSwell we finally got too many parallel jobs for our single-node scheduler+executor to handle23:40
SpamapSPretty sure the governor isn't working for that box.23:41
fungithe load average governor?23:41
SpamapSIt's spiking up to a load of 10, and then tapers off to 3-4 as jobs finish.. but it never says "whoa that's too many" despite subsequently timing out jobs because the load is too high.23:41
SpamapSYeah it's just a cumulative load thing23:42
SpamapSIt's not sustaining enough that it doesn't pull in all the jobs we have (around 10) whenever somebody submits a PR23:42
SpamapSso it starts 10 jobs23:42
SpamapSthen they just battle for the CPU/RAM/Disk until the quick ones finish23:43
SpamapSI'm deploying 3 new executors to combat the problem. :)23:43
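A minimal sketch of a load-average governor of the kind being discussed, assuming a simple rule: stop accepting jobs once the 1-minute load average exceeds the CPU count times a multiplier. The function name and the default multiplier are illustrative, and this is not Zuul's actual implementation.

```python
import os


def should_accept_jobs(load_multiplier=2.5, load_avg=None, cpu_count=None):
    """Return True while the 1-minute load average is under the limit."""
    if load_avg is None:
        # os.getloadavg() returns the 1, 5 and 15 minute load averages (Unix only).
        load_avg = os.getloadavg()[0]
    if cpu_count is None:
        cpu_count = os.cpu_count() or 1
    return load_avg <= cpu_count * load_multiplier
```

Because the load average is a trailing figure, a check like this can still admit a whole burst of jobs before the load they generate shows up, which matches the behaviour described above.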
fungii dream of a day when zuul can request additional executors from nodepool and then delete them again once it no longer needs as many23:44
openstackgerritPaul Belanger proposed openstack-infra/zuul master: WIP: fix tenant-conf-check for duplicate projects  https://review.openstack.org/59186823:45
pabelangertristanC: corvus: working example of issue, I'd expect validation to fail23:45
pabelangerwill dig more into it in the morning23:45
SpamapSfungi: aye23:46
SpamapSWell I actually think we can also just let out of band auto scalers do that23:46
SpamapSI intend to set up Kubernetes that way.23:46
pabelangerSpamapS: make sure you update to latest hwe kernel, if you can. We saw a large performance boost in openstack-infra23:47
SpamapSpabelanger: dunno what that means. :)23:47
SpamapSI'm on CentOS723:47
SpamapSso .. :(23:47
pabelangerah23:47
SpamapSand not really interested in trying to upkernel23:48
pabelangeryah, not sure what the story on centos is. haven't updated kernels for sf.io either23:48
SpamapSbad enough I have to compile python3.5 ;)23:48
pabelanger++23:48
fungiright, i think it was more that our executors seemed to get a performance boost from kernels that no longer have such a severe performance penalty from spectre/meltdown mitigated microcode23:48
SpamapSah23:49
SpamapSSeems like you'd want spectre/meltdown mitigations.23:49
SpamapSor you're saying they improved the patch?23:49
clarkbSpamapS: yes, basically from 4.13 to 4.15 they made the performance pain go away largely23:50
clarkbSpamapS: there was a blog post by gkh iirc that basically said: so many performance improvements in 4.15, too bad they basically get us back to where we were pre meltdown23:50
funginewer kernels are able to take advantage of some of the options the microcode is able to expose to avoid taking as much of a performance hit23:51
fungi<citation needed>23:51
SpamapSOh yeah I remember that23:51
fungibasically we grew enough that we needed to add an additional executor, then we did a kernel upgrade on our executors and overall load average dropped enough that we could delete the extra executor again23:53
fungiso not a huge boost (given our scale) but still certainly noticeable23:54
fungii guess that suggests we saw at least a 10% improvement anyway23:54
clarkbbasically 4.14 was the released version at time of meltdown so it got the fix it fast patches23:55
clarkb4.15 and beyond are more complete aiui23:55
fungicentos 7 kernels... who knows23:55
SpamapSyeah23:55
SpamapSpatchypatchy23:55
clarkbI actually need to reboot to pick up 4.17 I think23:57
fungifor some reason 4.17 seems to have broken my ability to negotiate voltages higher than 5v over usb type-c on the gpd pocket23:59
fungiand 4.18 rc4 isn't fixing it23:59
fungiso i've had to stick to 4.16 on those until i can dig deeper23:59
