Wednesday, 2017-04-05

mordredjlk: that's quite yuck00:55
*** jamielennox is now known as jamielennox|away02:07
*** jamielennox|away is now known as jamielennox02:21
*** isaacb has joined #zuul06:25
*** hashar has joined #zuul07:37
openstackgerritDirk Mueller proposed openstack-infra/nodepool master: Add default logging file argument  https://review.openstack.org/44632407:49
*** bhavik1 has joined #zuul08:22
openstackgerritJavier Peña proposed openstack-infra/zuul master: Find fallback branch in zuul-cloner  https://review.openstack.org/44237008:37
openstackgerritJavier Peña proposed openstack-infra/zuul master: Find fallback branch in zuul-cloner  https://review.openstack.org/44237009:15
*** bhavik1 has quit IRC10:47
*** hashar has quit IRC11:06
*** hashar has joined #zuul11:07
*** isaacb_ has joined #zuul12:07
*** isaacb has quit IRC12:08
openstackgerritJavier Peña proposed openstack-infra/zuul master: Find fallback branch in zuul-cloner  https://review.openstack.org/44237012:33
*** pabelanger has quit IRC12:49
*** pabelanger has joined #zuul12:49
*** isaacb_ has quit IRC13:04
*** isaacb_ has joined #zuul13:05
*** isaacb_ has quit IRC13:13
pabelangermorning14:56
pabelangerShrews: jeblair: going to start the process to restart nodepool14:56
pabelangerthen move onto zuulv3-dev14:56
pabelangernl01.o.o == nodepool in this context14:56
Shrewsack14:58
pabelangerokay, restarted using nodepool-launcher binary now14:59
openstackgerritDavid Shrewsbury proposed openstack-infra/zuul feature/zuulv3: Add trigger developer doc  https://review.openstack.org/45369515:01
pabelangerokay, starting upgrade of zuulv3-dev15:10
pabelangerzuul restarted15:15
pabelangerbut going to stop because of an exception in the logs15:15
pabelangerhttp://paste.openstack.org/show/605520/15:16
pabelangerjeblair: you might be interested ^15:16
jeblairpabelanger: it looks like the "NoneType" it refers to is the project name, since the repr method of TriggerEvent includes the project name: <TriggerEvent ref-replicated None>15:19
jeblair(in other words, that was a ref-replicated event for project "None")15:19
jeblairpabelanger: btw, those are not fatal errors, and it's not in a loop.  you could actually keep it running.15:19
pabelangerokay, I can start it back up then15:20
pabelangerI'll see what changed to cause that15:20
pabelangerI also noticed we generated keys for secrets!15:21
pabelangerhowever, they are world readable :(15:21
jeblairmaybe the change to "support" ref-replicated...15:21
pabelangerI'll get a patch to fix that too15:21
jeblairpabelanger: cool.  they should be in a non-world readable directory, but yes, we should also fix that.  :)15:21
pabelangerjeblair: thanks, I'll look there first15:21
jeblairpabelanger: it looks like we just don't set the project attribute because the event is neither a change nor a ref-update.  i think we just need better handling of that, though, to be honest, i think we should ignore it and not bother even equeueing it.15:24
jeblairreplication-complete is a more useful event i think.15:24
jeblair(we don't have any support for actually doing anything with it anyway15:25
openstackgerritDavid Shrewsbury proposed openstack-infra/zuul feature/zuulv3: Clarify TriggerInterface documentation  https://review.openstack.org/45371315:31
*** hashar has quit IRC15:31
pabelangerso, it looks like we pulled down ansible 2.2.2 on the upgrade15:33
pabelangernow seeing some SSH errors15:33
*** hashar has joined #zuul15:33
pabelangernot sure if ansible side or nodepool15:33
*** hashar has quit IRC15:34
pabelangerssh works on images15:35
pabelangerlooking into ansible15:35
pabelangersudo error on zuulv3-dev will be me15:38
pabelangerk, ~/.ssh/known_hosts for the zuul users was populated with old host keys15:50
pabelangernot sure why, but I have purged them and going to see what could have added them15:51
clarkbdon't we tell the zuul user to not populate keys at all though?15:51
clarkbsince they are single use nodes with host keys that get deleted?15:51
pabelangerya, we have ansible use job specific known_hosts file15:51
openstackgerritJavier Peña proposed openstack-infra/zuul master: Find fallback branch in zuul-cloner  https://review.openstack.org/44237015:52
pabelangerHmm, that might not have been it15:56
pabelangerHmm16:01
pabelangerit looks like we might not be generating the known_host file properly16:02
pabelangerhttp://paste.openstack.org/show/605525/ is the current value16:02
pabelangerI don't believe that is valid, I think we need to also include the IP address16:03
pabelangerhttp://paste.openstack.org/show/605526/16:03
pabelangerfrom ssh-keyscan16:03
jeblairyeah, that looks like it's missing a bit :)16:10
openstackgerritPaul Belanger proposed openstack-infra/nodepool feature/zuulv3: Include IP address in SSH host key  https://review.openstack.org/45373416:14
pabelangerhttp://paste.openstack.org/show/605530/ ^ what is generated now16:14
jeblairpabelanger: we already have the ip address in zuul, why don't we assemble it there?16:16
pabelangerjeblair: I was going to do that but what if incorrectly mapped ipv4 to an ipv6 hostkey?16:17
pabelangeror, does hostkey work regardless of ipv4 / ipv616:17
jeblairpabelanger: i believe they are typically the some (though, theoretically, a sysadmin could run two different ssh deamons on the two different network stacks).16:18
pabelangerokay, I don't have a preference. I thought it would be safer to build it on the nodepool side, against the IP we scanned, since we had the indo16:19
pabelangerinfo*16:19
clarkbyou'll sometimes find that if you ssh via v6 and accept a key then ssh via v4 ssh will warn you but say the host keys match so its probably ok and carry on16:19
jeblairpabelanger: i think we have to weigh it against the likelihood of a nodepool consumer accessing the node over a different address than the one nodepool used.  i believe that is more likely than the case of a node having two different host keys.16:19
pabelangergood point16:21
*** robcresswell has joined #zuul16:23
robcresswellHello hello. I've been tasked with some research into creating a CI system that mirrors openstack-infra's; Is Zuul v3 considered usable enough / documented enough for people to get started with yet, or is it still very much in a science project stage?16:25
pabelangerjeblair: Hmm, how would I map the known_hosts data for multiple hosts?16:25
mordredhi robcresswell! I think it's somewhere in the middle of those two - it's not science project (we're running it on zuul changes) but it's also not production ready and still needs some features and may still make breaking file format changes before it is ready for production ...16:27
pabelangerrobcresswell: we are not recommending zuulv3 for production just yet. We are still working towards an stable API16:27
mordredrobcresswell: so the questions would be what is your timeframe and what is your interest in / tolerance for looking at something in an early-alpha kind of place16:27
robcresswellmordred, pabelanger: It's just a research task yet; we're a way off putting it in prod. I'm comfortable working off unstable APIs, just try to gauge progress16:28
robcresswellmordred: Is there much in the way of docs for the current progress?16:29
robcresswellcurrent state, even16:29
mordredawesome. if it's a research task, I'd definitely not spend much effort on zuul v2. also, we haven't spent much time documenting v3 extensively yet16:29
pabelangerOur zuulv3 status ML posts would be a good summary I think: http://lists.openstack.org/pipermail/openstack-dev/2017-March/113148.html16:30
mordred(you can see jobs reported from "zuul" on changes to zuul: https://review.openstack.org/#/c/453347/)16:30
jeblairyes, those periodic status emails are good16:30
mordredalso, https://storyboard.openstack.org/#!/board/41 is the workboard tracking work16:31
jeblairpabelanger: the host key should be an attribute of the node16:34
robcresswellpabelanger, mordred: This is promising, I'll spend some time reading. Thanks for the links. Due to the nature of the work, I'll probably look to v3.16:34
* robcresswell apologises in advance for stupid questions16:34
pabelangerjeblair: yes, that make sense.16:34
pabelangerneed more coffee16:34
mordredrobcresswell: we love stupid questions!16:35
jeblairrobcresswell: no worries.  oh, there's two more things that may help you put the pieces together in lieu of our actually having updated the documentation yet.  let me get links16:36
jeblairrobcresswell: the zuul v3 spec which mostly accurately describes what it's all about and how it's different from v2: http://specs.openstack.org/openstack-infra/infra-specs/specs/zuulv3.html16:37
jeblairrobcresswell: and this email i sent to the team before our hackathon at the PTG, which focuses on how we set up our bootstrap production instance: http://lists.openstack.org/pipermail/openstack-infra/2017-February/005131.html16:38
openstackgerritPaul Belanger proposed openstack-infra/zuul feature/zuulv3: Add host IP address to host_keys  https://review.openstack.org/45376516:45
robcresswellThanks for all the help jeblair :)16:46
clarkbShrews: I'm starting to write up nodepool operations docs. And noticing that we still have dib-image-list and dib-image-delete commands but no image-upload. Does it make sense to keep the dib-* commands if all you can do with them is list and delete?16:55
robcresswellTo elaborate on use case; I'm at Cisco, we have a bunch of plugins as well as other internal projects. We're looking to improve our CI situation, and the logical thinking was to mirror upstream setups; makes debugging/documentation/other help easier. So the overall investigation task is to look at current tooling, see where it's headed, and what the best way16:56
robcresswellto utilise it is. That might be configuring components ourselves, or operating at a higher level and using something like puppet-openstackci (?) to set things up.16:56
robcresswellI'm woefully ignorant on how it all works right now, so it'll be a few days of poking things with sticks and reading code.16:56
pabelangerclarkb: FWIW: I find image-delete complicate to use, too many inputs. So, I just end up using dib-image-delete, which gets all the things16:56
openstackgerritTobias Henkel proposed openstack-infra/zuul feature/zuulv3: Use ssh for git-upload-pack  https://review.openstack.org/43680216:56
pabelangercomplicated*16:57
clarkbpabelanger: so if you dib-image-delete it doesn't just remove it from disk?16:57
clarkb(thats what its supposed to do iirc)16:57
pabelangerright, it will first delete the DIB from clouds, then disk16:58
clarkbugh16:58
clarkb(I don't think thats the right behavior)16:58
pabelangeryou cannot delete a DIB if the image is uploaded16:58
pabelangercurrently16:58
clarkbright but thats why that command exists in the first place :)16:58
clarkbyou don't necessarily need ti local on disk if its happy in a cloud somewhere16:58
pabelangerya, if you have limited local HDD space, will be an issues17:00
pabelangerbut, image-delete likely needs a global UUID because the amount of flags needed today, is complex IMO17:01
*** jamielennox has quit IRC17:04
jeblairrobcresswell: cool, thanks -- yeah, puppet-openstackci is the best way to get a v2 zuul set up, and it's what we recommend for openstack third-party ci operators.17:04
jeblairclarkb: i don't think i've ever heard the use case for dib-image-delete to be to save hd space.17:05
clarkbjeblair: in old nodepool all it did was remove the image from disk, potentially allowing you to rebuild then upload a new image if disk constrained17:06
clarkbbut all the dib-* commands only affected "local" resources, nothing in clouds17:07
*** jamielennox has joined #zuul17:07
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Store project name in ref-replicated events  https://review.openstack.org/45379617:17
openstackgerritClark Boylan proposed openstack-infra/nodepool feature/zuulv3: Start adding operational docs to zuulv3  https://review.openstack.org/45379717:18
jeblairclarkb: the hierarchical structure we used for images in zk makes that more difficult.  it also makes things like 'image-delete' more difficult for the same reason.  i'm willing to chalk that up to a learning experience (we did say we wanted to take baby steps with zk and learn from it -- and indeed, everything else we've done in zk is much more flat) and am willing to change it to split it back out into separate tables.  that's not going to be ...17:21
jeblair... trivial, especially backwards compat.  but it's doable.  maybe as a lower priority.17:21
openstackgerritClark Boylan proposed openstack-infra/nodepool feature/zuulv3: Start adding operational docs to zuulv3  https://review.openstack.org/45379717:22
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Remove source from pipelines  https://review.openstack.org/45336217:23
clarkbjeblair: I think my only real concern with it now is it feels very redundant since we already have image-delete and image-list if we aren't going to treat the disk images as separate artifacts17:24
clarkbif we think the dib image details should be abstracted away thats fine but we shouldn't have dib-* specific commands inthat case17:24
jeblairclarkb: the naming there is awkward;  dib-image-* is "built image file on disk" and image-* is "uploaded image in a cloud".  so they are different.17:25
clarkbyup thats why I find the current behavior of dib-image-delete to not be what we want17:25
clarkbwe've sort of mashes the two sets together in a weird way for users17:25
jeblairclarkb: i accept that's not what you want.  but i don't understand how we've mashed them together.  they still seem very separate to me.  there is just now a one-way dependency (you can not have an image uploaded that you don't also have a built copy of)17:26
Shrewsclarkb: iirc, we cannot just delete the local dib build b/c we can't delete the zookeeper znode w/o first knowing we are the "owner" of the dib (find it on disk). and we can't delete the znode w/o first deleting the uploads. so it's a weird symbiotic relationship that we have now, that may not be ideal17:27
clarkbjeblair: previously dib-* meant only managing resources that are local17:28
clarkbjeblair: but now it means both local and cloud remote17:28
clarkbthats what I mean about mashing together17:28
jeblairclarkb: that is correct.  i do still think they are different commands that serve distinct purposes.17:28
clarkbbasically it was safe to dib-image-delete without affecting anything in the cloud17:28
jeblairclarkb: yes17:28
jeblairthat has changed.17:28
clarkbI'm not sure why we need a separate command from image-delete then17:29
pabelangeris that a result of using ZK, or because we wanted to not leak images in clouds?17:29
mordredso - image-delete would delete from the cloud but not disk, so the image on disk would get re-uploaded right?17:29
jeblairclarkb: image-delete removes a single uploaded image from one cloud.  dib-image-delete removes all of them.17:29
mordredah17:29
jeblairmordred: correct (unless you paused that cloud/image)17:29
clarkbjeblair: wouldn't a --all-clouds ro similar flag be a better way to express that?17:29
clarkbI just don't read `dib-image-delete` as "really go delete everything everywhere`17:30
jeblairclarkb: well, it also deletes the image file.17:30
clarkband I don't think our existing users will either17:30
clarkb(I've updated the docs to express taht in my change fwiw)17:30
clarkb(so at least the docs are correct now)17:30
jeblairclarkb: this is where documentation helps.  we can also change the command name if there's a better one.17:30
mordredyah - I agree the command name does not imply to me (without reading the docs) that the command will delete the local and remote images, but I grok the need for the two commands17:31
mordredI'd vote for figuring out a new command name, if we can17:31
* mordred has no better name suggestions yet though17:32
clarkbperhaps image-purge17:33
Shrewsclarkb: also, rbergeron and myself had a discussion about new doc things, so pinging her so that she is aware of your new operation doc17:33
clarkbrbergeron: ping ^ I pushed https://review.openstack.org/45379717:33
mordredimage-purge would imply to me that a bunch of things are going to get deleted17:33
clarkbmordred: aiui thats exactly what happens17:33
clarkbits completely removed from local disk and all the clouds17:33
pabelangerand, unless paused, will trigger a rebuild and reupload17:34
jeblairi think the key thing is that there are two concepts here: a dib-image-file, and an uploaded-image-in-a-cloud.  inside the nodepool source code, we now call those 'image' and 'upload', because of exactly this confusion.  i think good command names would help communicate that distinction to the user as well so they understand it.17:35
mordredwhat if we broke a BUNCH of backwards compat and called it "image-delete" and "upload-delete"17:36
jeblairso image-purge is a pretty good descriptor, however, it confuses the two concepts rather than elucidating them to the user.  if we call dib-image-delete image-purge, we back ourselves into a corner with dib-image-list.17:36
mordredor image-delete and image-upload-delete17:36
jeblairmordred: i like image-upload-delete.  it's more typing, but it's less awkward than 'upload-delete' which can be read as two verbs.  :)17:36
mordredyah17:37
jeblairmordred: two contradictory verbs, at that :)17:37
clarkband then have image-upload-list and image-list17:37
clarkbthat would work17:37
mordredwhereas image-upload-delete is "delete an image-upload"17:37
rbergeronclarkb: oh excellentay awesome, thanks for the heads up (i will take a peek. it surely is more riveting than, say, the all day phone call i am in)17:37
rbergeron(sigh)17:37
jeblairor we can even completely englishify it as 'delete-image-upload'17:37
mordredjeblair: now you're making crazy noises17:38
jeblair(i have no idea why we make all our command names backwards in openstackland)17:38
mordred(but I'm fan of verb-object myself)17:38
openstackgerritClark Boylan proposed openstack-infra/nodepool feature/zuulv3: Fix internal doc refs to renamed section  https://review.openstack.org/45380217:38
clarkbrbergeron: also ^17:38
clarkbjeblair: its so that you have to scroll aroundin your shell more to modify other objects17:39
mordredjeblair: I believe it's due to how we create comamnds via nested namespaces17:39
clarkbmordred: yes thats the real reason17:39
mordredso "openstack server create" can look up the server plugin to find the actoins it has and then find the create action on that17:39
clarkbits an easy way to allow server and network to register their own "delete" commands17:39
mordredwhereas openstack create server would have to parse and undersatnd the whole thing17:39
mordredyah17:39
jeblairmordred: ah, that's helpful.  but i think we still had 'nova image-list' a long time ago?17:39
mordredwell, that was written by termie or something, so all bets are off17:40
jeblairpoint17:40
jeblairanyway, sounds like 'delete-image-upload' may be a winner?17:40
clarkbrbergeron: I also have https://review.openstack.org/452964 for cleaning up zuuls doc warnings17:40
clarkbrbergeron: those two cleanup changes are probably good to get in before a ton of docs work happens as it will help us catch silly bugs (like the images refs in nodepool not existing)17:41
mordredjeblair: ++17:41
jeblair(i guess the agreement might change if you do that; 'list-images' would probably be preferable to 'list-image'.  maybe we can make both work.  :)17:41
jeblair(whereas 'image-list' is less impacted by grammar conventions)17:42
* jeblair registers for a grammar convention17:42
jeblairclarkb: thanks, made some corrections/suggestions17:51
jeblairclarkb, mordred: can one of you +3 https://review.openstack.org/453765 ?17:58
jeblairpabelanger, clarkb, mordred: can you review https://review.openstack.org/453796 ?17:59
jeblairboth of those are things we should get in asap to restart zuul and make it less spammy.17:59
pabelanger+218:00
pabelangerI need to run an errand to the bank18:00
pabelangerback shortly18:00
openstackgerritClark Boylan proposed openstack-infra/nodepool feature/zuulv3: Fix internal doc refs to renamed section  https://review.openstack.org/45380218:07
openstackgerritClark Boylan proposed openstack-infra/nodepool feature/zuulv3: Start adding operational docs to zuulv3  https://review.openstack.org/45379718:07
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Add host IP address to host_keys  https://review.openstack.org/45376518:10
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Remove source from pipelines  https://review.openstack.org/45336218:24
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Remove source from pipelines (1/2)  https://review.openstack.org/45336218:30
openstackgerritJames E. Blair proposed openstack-infra/zuul feature/zuulv3: Remove source from pipelines (2/2)  https://review.openstack.org/45382118:30
jeblairi split up that commit so my eyes bleed less18:30
SpamapSyeah wow18:32
*** harlowja has quit IRC18:33
*** hashar has joined #zuul18:37
openstackgerritMerged openstack-infra/zuul feature/zuulv3: Store project name in ref-replicated events  https://review.openstack.org/45379618:45
SpamapSjeblair: my my, you've been busy. I really am enjoying reviewing the multi-source stack.19:00
* Shrews currently attempting to grok that stack19:14
jlkoh that's lovely19:19
jlkRun full tox, two tests fail. Run just the failing test, it passes.19:19
jlk¯\_(ツ)_/¯19:19
jeblairjlk: are they the "dequeue" tests?  if so, they're probably just running long (they are ridiculously large tests)19:20
jlkno, it was a test_playbook I think.19:20
jlktests.unit.test_v3.TestAnsible.test_playbook19:20
jeblairjlk: oh.  that is also long (it waits 10 seconds for a job to time out), but not quite as busy.19:21
jlkah. Is there a way to extend the timeouts?19:21
jlkI think there was, I just forget what it was19:21
jeblairjlk: yeah, theres an env var... OS_TEST_TIMEOUT i think?19:21
jeblairyep19:22
jlkthanks19:23
SpamapSjlk: I've been running with OS_TEST_TIMEOUT=999919:31
SpamapSstill get random fails though19:31
SpamapSso dunno if there's more to it.19:31
jlkyeah I'm getting disappointingly inconsistent results.19:31
SpamapSI wonder if somewhere deep in zuul something else is setting up alarms19:31
jlkI narrowed all my tests down to just the github ones, one test fails. Just run that one test, it passes.19:32
jlkmakes me wonder if there is something weird with parallelization19:32
Shrewsjeblair: perhaps this will be obvious deeper in the stack, but is there a reason to have both the Source and Connection classes store canonical_hostname (https://review.openstack.org/451110)? There is already a connection between those two classes...19:32
SpamapSjlk: I've taken to running just the ones I care about locally, and (ab)using zuul to run the full suite.19:32
SpamapSjlk: testr does run with as many CPUs as you have by default.. but I've had equally bad results by running with concurrency of 219:33
jlkI'm about to do the same.19:33
jeblairShrews: yes -- though it may not be a good one.  :)  basically, it's an API decision -- a connection may not necessarily have a canonical_hostname (connections include gerrit and github, but also smtp servers, (future: irc servers, mqtt servers), etc).  but a source (as a thing where git repos live) must have one.  so i put it in the source API.  now, the gerrit connection does have one because it supports the source api, and that's where the ...19:37
jeblair... actual configuration happens, but that's essentially private to that connection class.19:37
Shrewsok19:41
jlkApologies for the incoming.19:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: support github pull reqeust labels  https://review.openstack.org/44451119:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Allow github trigger to match on branches/refs  https://review.openstack.org/44562519:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Set filter according to PR/Change in URL  https://review.openstack.org/44678219:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add support for requiring github pr head status  https://review.openstack.org/44939019:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Support for dependent pipelines with github  https://review.openstack.org/44529219:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add basic Github Zuul Reporter.  https://review.openstack.org/44332319:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Configurable SSH access to GitHub  https://review.openstack.org/44403419:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add 'push' and 'tag' github webhook events.  https://review.openstack.org/44394719:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add 'pr-comment' github webhook event  https://review.openstack.org/44395919:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Support for github commit status  https://review.openstack.org/44406019:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Better merge message for GitHub pull reqeusts  https://review.openstack.org/44564419:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Query changes only via relevant sources  https://review.openstack.org/44825719:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Encapsulate determining the event purpose  https://review.openstack.org/44524219:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Allow using webapp from connections  https://review.openstack.org/43983119:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Merge pull requests from github reporter  https://review.openstack.org/44446319:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add support for github enterprise  https://review.openstack.org/44925819:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Support GitHub PR webhooks  https://review.openstack.org/43983419:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: GitHub file matching support  https://review.openstack.org/44611319:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Add trigger capability on github pr review  https://review.openstack.org/44936519:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Log GitHub API rate limit  https://review.openstack.org/44615019:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Adds github triggering from status updates  https://review.openstack.org/45384419:49
openstackgerritJesse Keating proposed openstack-infra/zuul feature/zuulv3: Implement pipeline requirement on github reviews  https://review.openstack.org/45384519:49
Shrewswow19:50
Shrewsthat's going to exercise nodepool-launcher quite nicely19:50
jlkit's a rebase!19:50
SpamapSall zig for patches19:54
dkranzpabelanger: I have zookeeper running and was going to try to run nodepool with your playbooks. But I was not sure how I specify my particular config options. Any hints?19:55
*** harlowja has joined #zuul19:58
pabelangerdkranz: http://git.openstack.org/cgit/openstack-infra/nodepool/tree/devstack/plugin.sh?h=feature/zuulv3#n195 is the nodepool.yaml file we are testing in the gate19:58
pabelangerthat will have zookeeper settings to use19:59
dkranzpabelanger: ok, cool. How do I tell the playbook to use my file?19:59
pabelangerdkranz: did you create a playbook?19:59
dkranzpabelanger: no, I was using your ansible playbook. does it just do the install?20:00
pabelangerdkranz: if you are using ansible-role-nodepool, you can set the nodepool_file_nodepool_yaml_src variable to a copy of a local nodepool.yaml file. Then when we rerun ansible, the role will use your local copy20:01
pabelangerand place it on the remote node20:01
dkranzpabelanger: thanks, that is what I was looking for20:01
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Add ability to set an insecure prefix/suffix  https://review.openstack.org/45385120:05
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Create wrapper script for executor  https://review.openstack.org/45385220:05
* mordred hands dkranz a pie20:05
* SpamapS has begun playing a bit20:05
mordredSpamapS: neat!20:09
*** jamielennox is now known as jamielennox|away20:12
SpamapShttps://bugs.launchpad.net/bugs/1680227 <-- backport request for bubblewrap to Xenial20:14
openstackLaunchpad bug 1680227 in Yakkety Backports "Please backport bubblewrap 0.1.7-1 (universe) from zesty" [Undecided,New]20:14
SpamapSdoh20:16
SpamapSsilly script20:17
Shrewspabelanger: if we bring more projects into the zuulv3 voting realm, maybe we should up the max-servers & min-ready counts in the nodepool config at that time. pretty happy with how it's been performing lately20:18
Shrewsat least min-ready, rather. max-servers is probably fine20:19
pabelangerShrews: ya, I think we can get nodepool next. min-ready should be okay, but max I think we have 10 servers in each region ATM we can use20:19
pabelangerbut ya, we can bump if needed20:19
SpamapShrm20:32
SpamapSmordred: so .. when I run faillocal in bwrap.. I get this:20:44
SpamapS/home/clint/tmp/foo.txt:2017-04-05 13:41:41,505 zuul.AnsibleJob                  DEBUG    Ansible output: ERROR! Invalid callback for stdout specified: zuul_stream20:44
SpamapSmordred: but when I run it without brwap, I get this:20:45
SpamapS/home/clint/tmp/foo_unwrapped.txt:2017-04-05 13:42:56,934 zuul.AnsibleJob                  DEBUG    Ansible output: ERROR! Unexpected Exception: No module named 'zuul'20:45
SpamapSso now I'm really confused20:45
mordredSpamapS: SO ...20:47
mordredSpamapS: we copy zuul's ansible modules to a location (I think in /var) on zuul startup20:47
mordredSpamapS: so you may need to also bind-mount that locatoin20:47
SpamapSoh that's definitely it20:48
mordredI'm not sure about the second thing20:48
SpamapSI have a nice fresh empty /var20:48
mordredSpamapS: /var/lib/zuul/ansible/20:48
SpamapSstate_dir is the variable tho20:49
mordredSpamapS: we do that, incidentally, because pip install -U zuul deletes the files before writing new ones out20:49
SpamapSso need to pass that in as a thing to ro-bind20:49
mordred++20:49
SpamapSreally enjoying working with bubblewrap20:49
SpamapSif we can layer seccomp and SELinux/AppArmor on it, I'm going to be able to sleep I think.20:50
mordredwoot!20:50
jlkNice! only 5 of that series failed tests.20:53
mordredjlk: woot!20:57
jlkat least two are transient and should clear up with recheck.20:57
jlkso only a couple to focus on.20:57
SpamapSwe need to look closer into the transient fails we're all seeing20:58
SpamapSit's going to eat us alive20:58
Shrewsjeblair: +0 with comments on https://review.openstack.org/45159721:01
Shrewsfixing in a later review would be fine rather than rebase the entire stack for that21:05
Shrewsespecially since we don't even generate docs with those docstrings, atm21:11
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul feature/zuulv3: Add support for bwrap  https://review.openstack.org/45385121:11
* SpamapS jumps the spec gun a little21:11
SpamapS^^ this works btw.21:11
mordredSpamapS: WOOT21:11
jeblairShrews: ah thanks; i guess we have to add the tenant class21:11
jeblairSpamapS: \o/21:11
SpamapSYou just need bubblewrap backported.21:11
SpamapSWhich is why it's likely to fail its tests since I added bubblewrap to the bindep21:12
Shrewsjeblair: it's in datamodel.rst, but we don't use ":members:" with it. not sure if that's intentional21:12
SpamapSBut what you get is a super limited system that shoudl at least provide a nice onion layer for an attacker to escape through.21:12
jeblairShrews: probably that was just the edge of the api at that time; probably makes sense to add it, since i think these methods will be useful to drivers.21:13
SpamapSIt would be nice if we could figure out how to make ssh and ansible not require /dev/null21:13
jeblairspeaking of spec -- we should all give the spec a once over and make sure it's added to next tuesday's infra team agenda.21:13
Shrewsjeblair: also, i *think* the class itself must be documented before member docs will appear21:13
jeblairShrews: i think you're right21:13
Shrewsjeblair: i can clean that up (and my nit comments) with a review. gives my fingers something to do21:14
jeblairShrews: ++21:14
jeblairSpamapS: the spec says "security pants".  that tells me it's going to be good.21:15
mordredShrews: don't you have the backporting super powers?21:18
Shrewstab fail21:18
mordredgah21:18
mordredSpamapS: don't you have the backporting super powers?21:18
jeblairSpamapS: jhesketh left a q on the spec about pros/cons of seccomp21:19
Shrewspabelanger: hrm, RETRY_LIMIT failure still showing up for me: https://review.openstack.org/45371321:20
jeblairShrews, pabelanger: did we restart zuul since the fixes landed?21:21
Shrewsoh, maybe we need a zuul restart21:21
Shrewsyeah, that21:21
jlkoh boy21:21
jeblairShrews: want to go ahead and do that?  should be gtg now.21:21
jlkSo I just did all those rebases, but if this change goes in to add canonical host names, I'm going to have to rebase it all again.21:21
jlkoh well.21:21
Shrewsjeblair: sure. is it zl01 ?21:22
jeblairShrews: zuulv3-dev21:22
Shrewsheh. glad i asked21:22
jeblair:)21:22
jeblairjlk: yes, when i was writing it, i was thinking that it will have some impact on the github stack.  but i tried to keep it minimal.  but also, from our chat last week, i think it's essential for progress on it.  so hopefully worth it.21:23
Shrewsjeblair: do both scheduler and executor need restarting?21:23
jlkyeah it's necessary work. I am not upset21:23
Shrewsand is there an order?21:23
jeblairShrews: both (one bug in each)21:24
Shrewsok. restarted21:25
jeblairShrews: and order shouldn't generally matter.21:25
Shrewsjlk: lemme test my change before you try your stack  :)21:25
jlkI'm limited to one rebase a day, it's fine :)21:25
jeblairhaha21:26
jlkjeblair: question, something came up while we were thinking about limiting github API calls. With this new model you've done, is there a way that when an event comes in, and Zuul constructs the event object, some details of that event are carried through? For example, when an event is built up, we have to construct a PullRequest object, which takes API calls. That is so we can get data and feed Event attributes. Then the event goes off and is scheduled, then21:28
jlk picked up later by the Source I think. At that point, we're constructing a new PullRequest object from the event details, repeating many of the same API calls.21:28
jeblairSpamapS: security spec looks great to me.  jhesketh has 2 comments.21:28
Shrewshrm, zuul still no happy21:28
jlkWe're hoping at some point to be able to persist a data object from event throughout, so that we don't call the same APIs a bunch.21:28
jlknot sure if this change has anything to do with that.21:29
jlk( jamielennox|away can explain this better )21:29
Shrewsjeblair: pabelanger: hrm, /opt/zuul is out of date by days. is it not puppeted on zuulv3-dev?21:29
jeblairShrews: oh, right, i think pupet is disabled.21:30
jeblairShrews: so you'll want to do a manual pull there, then 'sudo pip install .' then restart again.21:30
Shrewsk21:30
mordredjlk: maybe some sort of caching layer perhaps?21:30
jeblairjlk: the gerrit driver maintains a cache of change objects, so that when we later need to getChange, it returns a cached object.  basically for the same reason (it performs many expensive api calls at event time)21:30
jlkmordred: we're doing a caching layer, yes.21:30
* mordred thinks jeblair said better words21:31
jeblairjlk: the sequence is: connection gets event; connection updates internally cached Change; connection creates TriggerEvent (which does not directly reference Change but describes it); connection submits TriggerEvent to scheduler; scheduler pops TriggerEvent; scheduler asks source for Change; source asks connection for Change; connection returns cached Change object.21:33
jlk]I see21:33
jeblair(we could consider attaching the Change object to the TriggerEvent directly, however, it turns out to be really useful to have the cache once you start doing dependencies between changes, so i think we need it anyway.)21:35
jeblairfungi, clarkb, mordred: i think it's worth giving the zuulv3 security spec at https://review.openstack.org/444495 a once-over this week so that we can put it on the infra agenda next week.21:40
Shrewszuul is happy again21:41
pabelangerya, we have puppet disabled still21:41
pabelangerwoot21:41
*** hashar has quit IRC21:43
fungiawesome. i already had it starred, will make sure to go through now21:49
*** harlowja has quit IRC22:05
openstackgerritMatthew Treinish proposed openstack-infra/nodepool master: Add mqtt module to enable pushing MQTT messages  https://review.openstack.org/45391522:12
*** jamielennox|away is now known as jamielennox22:13
SpamapSmordred: no I don't have backports access, and I relinquished my SRU privs too. Can still upload to dev.22:30
jlkugh this is frustrating. I can't get a full suite of tests to pass.22:51
jheskethMorning23:01
openstackgerritMatthew Treinish proposed openstack-infra/nodepool master: Add mqtt module to enable pushing MQTT messages  https://review.openstack.org/45391523:12
openstackgerritMatthew Treinish proposed openstack-infra/nodepool master: Enabled sending MQTT messages for node events  https://review.openstack.org/45392223:12
*** harlowja has joined #zuul23:25
SpamapSHrm, I wonder if we'll have to do some tap-dancing to get a backports package into bindep.23:28
jeblairSpamapS: i think the -backports repos are standard on our nodes23:31
SpamapSjeblair: right, I'm just not sure if packages that don't exist in the regular repo are pulled without a backports/ prefix (which is how you get newer packages that have been updated via backports)23:33
SpamapSmy apt brain is a bit fuzzy23:33
jeblairhuh, i thought they just showed up as candidates with later versions.23:34
SpamapSthey don't, the repo metadata has a setting to prevent that23:34
SpamapShttp://us.archive.ubuntu.com/ubuntu/dists/xenial-backports/Release   has "NotAutomatic: yes"  (amazing work there wordsmith apt engineers)23:35
jeblairSpamapS: oh.  then i now understand the problem you describe.  i'm now also curious as to the answer.  :)23:36
SpamapSI'm looking for candidate23:36
jeblairDisableInhibitNonAutomaticInstallation: true23:37
jeblairwould be so much clearer23:38
SpamapSoh hooray, it works23:38
SpamapSjeblair: so it looks like NotAutomatic just forces the apt source to 100 priority as if you apt pinned it that way.23:39
SpamapSso if the package does not exist in the main archive, you'll get the backported one.23:39
jeblairnice23:40
SpamapSbackports team looks a little overtaxed these days23:40
SpamapShttps://bugs.launchpad.net/xenial-backports23:41
SpamapS43 waiting23:41
SpamapSwell, 40.. a few are support reqs23:41
*** jamielennox is now known as jamielennox|away23:42
*** jamielennox|away is now known as jamielennox23:45

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!