Thursday, 2018-11-01

*** shanemcd has quit IRC00:02
*** shanemcd has joined #zuul00:04
*** ssbarnea has quit IRC00:29
*** rlandy|bbl is now known as rlandy01:59
*** swest has quit IRC02:33
*** swest has joined #zuul02:47
*** bhavikdbavishi has joined #zuul03:10
*** threestrands has joined #zuul03:46
*** rlandy has quit IRC05:30
*** swest has quit IRC06:11
*** swest has joined #zuul06:12
*** chkumar|off is now known as chandankumar06:21
*** threestrands has quit IRC06:52
*** bhavikdbavishi has quit IRC07:02
*** pcaruana|elisa| has joined #zuul07:40
*** pcaruana|elisa| has quit IRC07:59
*** bhavikdbavishi has joined #zuul08:01
*** pcaruana has joined #zuul08:05
*** bhavikdbavishi has quit IRC08:09
*** bhavikdbavishi has joined #zuul08:09
*** ssbarnea has joined #zuul08:55
*** rcarrillocruz has quit IRC09:05
*** rcarrillocruz has joined #zuul09:11
*** nilashishc has joined #zuul09:16
*** electrofelix has joined #zuul09:32
*** panda|off is now known as panda09:56
*** chandankumar has quit IRC10:09
*** chandankumar has joined #zuul10:31
*** pcaruana has quit IRC11:05
*** bhavikdbavishi has quit IRC11:16
*** nilashishc has quit IRC11:45
*** pcaruana has joined #zuul11:52
*** bhavikdbavishi has joined #zuul11:54
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool master: Add tox functional testing for drivers  https://review.openstack.org/60951512:25
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool master: Add tox functional testing for drivers  https://review.openstack.org/60951512:26
*** goern has joined #zuul12:40
*** rlandy has joined #zuul12:41
*** zul has joined #zuul12:58
*** hashar has joined #zuul13:02
*** zul has quit IRC13:04
*** zul has joined #zuul13:06
Shrewstobiash: if you have a moment, my moving nodepool unit tests to a subdir in 609515 has caused failures in test_launcher.py for some reason. I can't quite see why, and I have to run to an eye appointment right now. Can you take a quick look and give me a hint? I've already fixed the TestDrivers failure, just not pushed it up.13:36
Shrewsbbiab13:36
*** hashar has quit IRC13:49
*** goern_ has joined #zuul13:51
tobiashlooking13:54
*** goern has quit IRC13:54
*** goern_ is now known as goern13:54
*** bhavikdbavishi has quit IRC13:58
tobiashweird, running the failing tests individually works14:09
mordredtobiash, Shrews: while you're looking at weird test failures - any clue why things are failing here: https://review.openstack.org/#/c/612169/ ?14:16
mordredthey both seem random14:16
tobiashmordred: that looks like a zk connection problem we still see occationally14:18
tobiashShrews: looks like the drivers list is empty sometimes when I enhance the error message: RuntimeError: fake: unknown driver, available: {}14:19
mordredtobiash: :(14:23
mordredtobiash, Shrews: maybe a global variable that's getting stepped on?14:25
tobiashShrews, mordred: in the failing cases Drivers.load tells me it's already loaded (which is not true14:27
mordredtobiash: well - Drivers.load does use a singleton list14:28
mordredso if two tests are running and the file is loaded in each of them - that state is going to stick around14:28
tobiashand at some point it initializes it to an empty list14:28
tobiashthat's why it says it;s already loaded14:28
tobiashbut I have no clue why this happens only in this change14:29
mordredyah - me either14:30
mordredunless moving the tests around just happened to change the ordering such that 2 tests that each want to load drivers wind up in the same process memory14:30
tobiashmordred: I fixed that emtpy list and now I get: FileNotFoundError: [Errno 2] No such file or directory: '/home/cloud/nodepool-test/nodepool/tests/driver'14:30
tobiashah, I think I have a clue...14:31
openstackgerritTobias Henkel proposed openstack-infra/nodepool master: Add tox functional testing for drivers  https://review.openstack.org/60951514:33
tobiashShrews, mordred: fixes these driver paths14:34
tobiashShrews, mordred: specifically https://review.openstack.org/#/c/609515/14/nodepool/tests/unit/test_drivers.py14:34
mordredtobiash: ah - because of the extra directory14:36
tobiashthat's why I don't like global variables ;)14:36
Shrewstobiash: yeah, that was the TestDrivers fix i mentioned. That doesn't fix the TestLauncher failures15:29
tobiashShrews: it did fix it locally15:29
Shrewshow?15:30
tobiashIt had a side effect of breaking driver loading15:30
Shrewsoh, i see15:31
Shrewstobiash: thx!15:31
tobiashNo problem15:32
Shrewsmordred: for your failing thing, is it possible you may be seeing another task manager problem that clarkb put up a fix for (https://review.openstack.org/#/c/612835/1/nodepool/tests/test_sdk_integration.py)?15:36
Shrewsmordred: the zk stuff was misleading in that case15:37
Shrewsmordred: i ask only because i saw the words "task manager" in your review, not that I've investigated15:37
fungicorvus: tobiash: we have some renewed interest in https://review.openstack.org/57855715:40
fungidoes it need anything else besides the todos in the commit message (documentation and release note?)15:40
tobiashWe run it at least since a while in production15:41
tobiashfungi: I think only docs and reno is missing15:41
corvustobiash, fungi: agreed15:43
pandadoes zuul compile the list of playbooks to run for a job dynamically or statically ? I mean when does it decide which pre-run/post-run to run ?16:03
corvuspanda: yes -- right before it starts the job.  it takes into account all job variants which apply to the change.16:04
corvus(job variants may or may not apply based on what branch the change is on)16:05
pandacorvus: is there a way to see this list in the job itself ?16:05
corvuspanda: yes, there's currently some debugging information emitted under the "zuul._inheritance_path" ansible variable.  it's not a stable api and is only meant for human consumption, but it should tell you what's going on.16:06
corvusfor example, in http://logs.openstack.org/85/605585/5/check/system-config-run-docker/3bb43f1/zuul-info/inventory.yaml you can see http://paste.openstack.org/show/733779/ which shows that it built up the final job it ran from 4 sources16:08
corvus(the numbers are line numbers, so you can track down which jobs / variants were used)16:08
pandacorvus: I know, that shows how many configurations you traverse, but doesn't show the list of playbooks are actually going to be run16:13
pandaalso to parse this corretly I would need to know the exact version of the file was used16:14
corvuspanda: well, the list of playbooks run is encoded in either the ara report, or the json output.  eg: http://logs.openstack.org/85/605585/5/check/system-config-run-docker/3bb43f1/job-output.json.gz16:15
corvuspanda: however, you used the future tense, and i'm using the past tense... so we may not be on the same page -- what is it that you want to do?16:15
pandacorvus: ooooohhh right! the json console16:16
pandacorvus: is ther a way to get this json in the run itself, not as an artifact at the end of it ?16:18
corvuspanda: what are you trying to do?16:19
pandacorvus: we are tracking down the requirements to reproduce locally the jobs. THere are essential part of the two nodes job for example (the multinode-bridge) that are done in pre-run playbooks. We need to run them too before the reproducer can kick in, or the initial environment will not be genuine.16:22
corvuspanda: ah, so you want to make a post-playbook to run at the end of the job which processes the job-output.json of the currently running job to generate a reproducer script which gets uploaded along with the logs?16:23
pandacorvus: yup16:24
*** pall is now known as pabelanger16:24
pandaand then it will probably create a black hole of recursion16:24
corvuspanda: https://zuul-ci.org/docs/zuul/user/jobs.html#var-zuul.executor.log_root is the directory containing job-output.json; the job should be able to read it, but it will only be partially written at that point.  i don't remember if there will be anything about the currently running playbook in it, or if it will only be written up to the previous playbook if you read it mid-stream.16:26
corvuspanda: so that's worth a try :)16:26
pandacorvus: that can be taken into account, the only problme is if mid-stream the json is not parsable16:27
corvuspanda: i think it should always be valid json16:27
corvus(i'm pretty sure we take care to make sure that's the case)16:27
pandacorvus: wow, perfect, I think we have a lot more than a shot, thanks!16:27
corvuspanda: we control the generation of that file via a custom ansible callback plugin in zuul, so if you need more information, we can probably add it16:28
corvuspanda: see http://git.zuul-ci.org/cgit/zuul/tree/zuul/ansible/callback/zuul_json.py#n7316:29
corvus(that's how all the pre/post stuff gets into the json)16:30
pandacorvus: I see a dump only for v2_playbook, is it updated at every step ?16:33
panda(not really familiar with the zuul code unfortunately)16:34
corvuspanda: that's more ansible than zuul -- but i *think* that means that we dump to the file after the completion of each playbook, so the file should have everything up to the end of the previous playbook, and nothing about the current playbook when you read it.16:36
SpamapShrm.. having pbrx issues16:36
SpamapShttp://paste.openstack.org/show/733781/16:36
Shrewsmordred: i'm becoming more convinced that we need to account for task manager threads in our test thread whitelist16:37
pandacorvus: perfect, will check, thanks.16:37
mordredShrews: nod. so - hopefully we're not too far off from getting rid of the task manager threads16:37
mordredSpamapS: does your project have an entry_points section in its setup.cfg?16:38
mordredSpamapS: (we should make that section of code more resilient and skip the per-console-script image creation if there are no console-script defined)16:39
clarkbShrews: we already do16:39
clarkbShrews: well sort of, we explicitly stop the taskmanager threads in nodepool16:40
clarkbShrews: the reason the openstacksdk taskmanager didn't work is those don't get registered with the driver so we don't call stop on them iirc16:40
clarkbwhich is why my fix to use the nodepool taskmanager made the problem go away16:40
SpamapSmordred: my project is zuul.. so.. yes16:41
SpamapSoh16:41
SpamapSlooks like the bash script has a bug16:42
SpamapSbig surprise16:42
SpamapSstand down16:42
Shrewsclarkb: but something weird is up with mordred's change to the task manager. it "feels" similar, but i can't quite yet put a finger on the issue16:42
*** pabelanger is now known as pall16:43
*** sshnaidm|ruck is now known as sshnaidm|afk16:48
manjeetscorvus, Hi https://github.com/openstack-infra/zuul/blob/master/doc/source/admin/examples/etc_zuul/main.yaml#L10 this maps to https://github.com/openstack-infra/zuul/blob/master/doc/source/admin/examples/etc_zuul/zuul.conf#L23 ?16:51
manjeetsand in https://github.com/openstack-infra/zuul/blob/master/doc/source/admin/examples/etc_zuul/zuul.conf#L23 I can use local file path in base url ?16:51
*** rfolco|rover is now known as rfolco|ruck16:58
corvusmanjeets: exactly17:01
manjeetscorvus, but what driver should be used ?17:03
corvusmanjeets: 'git', just line on line 2517:04
mordredtobiash: my brain just clicked with something I think you were talking about yesterday or the day before ... regarding camel cased statsd keys17:40
mordredtobiash: it's not actually openstacksdk that creates the camel cased keys - it's nodepool.task_manager._transform_task_name17:40
tobiashmordred: maybe we want to change that?17:41
mordredmaybe - I'm actually working on what to do about the task manager statsd logging when there is no longer a task manager17:42
*** goern has quit IRC17:44
mrhillsmanjust want to say that the notifications on the dashboard are awesome17:46
mrhillsmanbig high five, thumbs up, etc for whoever implemented that17:46
mrhillsmannot sure if this is the place but a timestamp would be very useful; i say that because i just noticed notifications exist and not sure if i already corrected the issue i got notification for or not17:48
AJaegermrhillsman: notifications? I haven't found those yet - tell me more, please17:50
fungiwhat notifications? i completely missed that, whatever it is17:50
funginow i'm intrigued17:51
AJaegergood, I'm not the only one ;)17:51
mrhillsmani do not see any zuul.o.o17:51
mrhillsmanbut on a new install i just put up via the docker containers17:51
mrhillsmanin the top-right next to "Tenant tenant-name"17:52
mrhillsmanthere is a bell icon17:52
fungiwhat sort of notifications is it displaying?17:52
mrhillsmanthat lets me know i have some config errors17:52
fungiaha!17:52
mrhillsmannot sure if they are old or new, pretty sure they are new17:52
mrhillsmani mean old lol17:52
mrhillsmanbut they are there :)17:53
mrhillsmanand i like it17:53
fungiyeah, config errors info page is relatively new17:53
mrhillsmanhehe17:53
AJaegerhttp://zuul.openstack.org/api/config-errors shows we have no errors ;)17:53
fungii guess i missed that you'd get a little bell icon on the main status if there are config errors17:53
fungithat's neat17:53
mrhillsmanbeing able to configure the notifications and having a timestamp would be extra lovely17:54
mrhillsmani was trying to find out more to see if i could add a patch17:55
mrhillsmanooo, i think i found it :)17:55
*** sshnaidm|afk is now known as sshnaidm|off18:00
corvusmrhillsman: ah, the notifications represent *current* config errors -- so they're never out of date.  so if you see an error there, it's current.  if you fix an error, it will go away.18:09
mrhillsmangood to know, thx corvus18:10
manjeetscorvus, for some reason zuul web server always shows loading nothing even though patches are now getting streamed18:24
manjeetshttps://pasteboard.co/HLbAMDA.png18:24
corvusmanjeets: try doing a shift-reload18:27
manjeetscorvus, no difference18:34
corvusmanjeets: hrm.  maybe one of the folks more familiar with the javascript stuff can suggest a debugging strategy.18:35
corvusmanjeets: oh, one thing to make sure of is whether zuul-web is running, and if there are any errors in its logs18:36
corvusmanjeets: if you're using containers, it should be the "web" container18:37
manjeetsok corvus and using that zuul-config I'm trying to post a comment on a patch in ci-sandbox using my own account18:38
manjeetshttp://paste.openstack.org/show/733859/18:38
manjeetsbut it returns host key verification failed ? I didn't get that same key is used for streaming events which works fine18:38
corvusmanjeets: until we merge https://review.openstack.org/608453, you'll need to put the host key into ~/.ssh/known_hosts in the executor container18:39
clarkbcorvus: thoughts on cutting a zuul release nowish since ifnra has been running nodepool and zuul for about a week with no major issues (openstacksdk being the only one but that is fixed indepdnently)18:39
corvusclarkb: yes that seems like a very good idea18:41
corvusclarkb: hrm, we did add "Filter file comments for existing files" after the current sha18:44
manjeetsoh thanks corvus another issue I see is may be because of image used for container some container I am able to get in but for executor it returns18:44
manjeetsrpc error: code = 2 desc = oci runtime error: exec failed: container_linux.go:247: starting container process caused "exec: \"bash\": executable file not found in $PATH"18:44
corvusmanjeets: you need to run '/bin/sh' instead of bash18:44
manjeetslol dumb me18:45
corvusclarkb: ideally, i'd like to land the pep8 job changes and test that out before release, but that's going to take a while.  we could release 3.3.0 with openstack's sha now and count on releasing 3.3.1 later.  or we could jump ahead and release master as 3.3.0, assuming our test coverage is good there.  :)18:45
corvusthat's the only substantial code change between openstack's sha and master18:46
clarkbI'd be fine with 3.3.1 after 3.3.0 to pick that up. Simply because oepnstack is running without that fix and things seem mostly happy18:46
clarkbfwiw I'm working on restarting nodepool services today to pick up the sdk changes18:47
corvusyeah, it's not a bug, per-se.  it's a safety belt in case someone has a bug in an in-line comment return (like we did).18:47
corvusthough, if we do release 3.3.0, it means we can't land the pep8 job change until we are certain it's working correctly (since if we break it with 3.3.0 out, we'll break other installs)18:48
SpamapSoh yay, cherrypy actually serves w/o nginx now18:48
SpamapS(in the pbrx built images)18:49
corvusclarkb: so should we wait just a bit longer to tag nodepool?18:52
clarkbcorvus: ya, also nodepool has a very recent tag anyway so is less urngent18:52
corvusclarkb, tobiash, Shrews, mordred: i think we're looking at zuul 3.3.0 @ d4a142b97b5ce4b0c15702f502e55965e89095eb  and nodepool 3.3.1 @ 7e1b8a7261d31208d853ecfa255da648ba54b656 [pending the ok after clarkb restarts openstack's nodepool as a sanity check].  look right?18:54
*** rfolco has joined #zuul18:54
clarkbcorvus: that is the sha1 that is being installed on the infra nodepool builders right now.18:55
*** rfolco|ruck has quit IRC18:56
tobiashcorvus: looks good to me18:58
Shrewscorvus: lgtm19:04
clarkbcorvus: re zuul release that appears to be what openstack ifnra is running now. I'm happy with that and I think gets a lot of improvements out to users before the summit which will be nice (worry that with summit fast approaching we won't be able to do 3.3.0 with inlien commenting fix before hand)19:04
*** pcaruana has quit IRC19:05
corvusyes, lots of improvements, and one dangerous pitfall :)19:09
manjeetscorvus, I added the host key to ~./ssh/known_hosts to executor container still see this http://paste.openstack.org/show/733859/19:14
*** panda is now known as panda|off19:20
*** caphrim007 has joined #zuul19:26
caphrim007corvus: am i understanding this correctly that, if a nodeset to a job is not provided, that the job will run on the zuul-executor node itself?19:26
*** electrofelix has quit IRC19:28
*** leifmadsen has left #zuul19:43
*** j^2 has joined #zuul19:48
corvuscaphrim007: ansible is always run from the zuul-executor (that's the ansible control host).  it's accessible as the implied localhost in the inventory (so to use it, you have to name it explicitly, eg, "hosts: localhost").  any job can perform limited actions on it, and jobs defined in config-repos can perform any action on it.20:10
corvuscaphrim007: (this is all true whether or not there are any actual hosts in the inventory)20:10
caphrim007thanks corvus!20:11
corvusmanjeets: hrm, i don't know why that wouldn't work.  maybe the wrong key?20:11
manjeetscorvus, copied same key from one I had in etc_zuul/ which works for gerrit event streaming20:12
corvusmanjeets: the gerrit event streaming doesn't have the same problem (it doesn't require an entry in known_hosts), so that may not be doing what you think.20:13
*** rlandy is now known as rlandy|brb20:26
*** rlandy|brb is now known as rlandy21:21
*** rcarrillocruz has quit IRC21:56
*** threestrands has joined #zuul22:17
*** j^2 has quit IRC22:39
*** caphrim007 has quit IRC23:01
mrhillsmancan someone help me understand bare vs contained role? https://zuul-ci.org/docs/zuul/user/config.html?highlight=galaxy#attr-job.roles.name23:47
mrhillsmanhttps://zuul-ci.org/docs/zuul/user/config.html?#attr-job.roles.name23:48
clarkbmrhillsman: bare role is a repo with a single role in it. Think openstack-ansible-nova. Contained is a repo with a bunch of roles like zuul-jobs23:50
clarkbat least that is my understanding of it23:50
mrhillsmanty sir23:50

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!