Monday, 2018-08-27

*** jesusaur has joined #zuul00:21
SpamapStristanC: dmsimard any advice on an IAM policy to attach to nodepool's creds?00:27
SpamapSbotocore.exceptions.ClientError: An error occurred (UnauthorizedOperation) when calling the DescribeImages operation: You are not authorized to perform this operation.00:30
SpamapSdespite having ec2:* allowed00:30
tristanCSpamapS: I don't know, maybe look for the aws glance service name and add it?01:28
*** bhavikdbavishi has joined #zuul02:55
SpamapStristanC: aws's glance is ec2:*03:55
SpamapStristanC: I wonder how you get your boto creds in. I'm mounting mine in as ~/.aws/credentials03:55
SpamapStristanC: n/m.. somehow my pods are picking up the ARN for the worker node role, and not the one I made for nodepool04:10
tristanCSpamapS: iirc i run sudo -u nodepool aws configure04:11
tristanCwhat is ARN?04:12
SpamapSamazon resource n-something04:23
SpamapS??04:23
SpamapSdunno04:23
* SpamapS id detaching for a while04:23
SpamapSI think I got it figured out04:23
*** bhavikdbavishi has quit IRC06:16
*** pcaruana has joined #zuul06:49
*** dmellado has quit IRC07:00
*** dmellado has joined #zuul07:02
*** eumel8 has joined #zuul07:16
*** hashar has joined #zuul07:20
*** jpena|off is now known as jpena07:45
*** electrofelix has joined #zuul08:22
*** lennyb has joined #zuul09:46
*** hashar has quit IRC10:15
*** hashar has joined #zuul10:22
*** pcaruana has quit IRC10:32
*** pcaruana has joined #zuul10:32
*** threestrands has quit IRC11:05
*** hashar has quit IRC11:11
*** swest has joined #zuul11:16
*** jpena is now known as jpena|lunch11:22
*** hashar has joined #zuul11:24
*** snapiri has joined #zuul11:39
*** jpena|lunch is now known as jpena12:24
*** rfolco has joined #zuul12:29
*** rlandy has joined #zuul12:30
*** samccann has joined #zuul12:36
*** _ari_|pto has quit IRC12:48
*** myoung|training is now known as myou7ng12:57
*** myou7ng is now known as myoung12:57
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: rewrite interface in react  https://review.openstack.org/59160413:22
tristanCmordred: corvus: 591604 should now be feature complete with the angular/jquery code, and i've added a couple of jest tests for good measure.13:25
ShrewsSpamapS: speaking of containers, you may find https://review.openstack.org/593125 interesting13:29
pabelangerwould love to see a +3 this week to start testing them ^13:59
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: rewrite interface in react  https://review.openstack.org/59160413:59
dmsimardSpamapS: for the time being, I'm running with ~/.aws/credentials (from the perspective of nodepool) however the name of the credentials must be "default" right now14:16
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: rewrite interface in react  https://review.openstack.org/59160414:16
dmsimardhoping to be able to change that so it's possible to use multiple credentials (not unlike clouds.yaml)14:17
Shrewspabelanger: i cannot think of a better way to test the infinite launch, other than just waiting for manager.createServer_fails to reach < 114:17
Shrewspabelanger: i tried (unsuccessfully) a few things this morning, but that's the best i came up with14:19
corvustristanC: cool!  mordred is mostly afk until ~ the PTG; would you mind if we wait until he's back to merge it?14:20
pabelangerShrews: okay, I'll refresh the patch this morning and see if others can review. Thanks for helping14:20
Shrewscorvus: the iptables fail logs are really weird14:20
corvusShrews: neat!  let's go over to infra and look at those14:21
tristanCcorvus: sure, well if you are ok with this change, then i'll rebase the other angular page on top of it14:27
corvustristanC: yeah, i like the change, and i think mordred does too, and i haven't heard anyone argue to keep angular, so i think we're going to do it.  it's just a matter of having all hands on deck when it goes in.  :)14:29
tristanCcorvus: that's great to hear :-)14:30
*** ianychoi has quit IRC14:36
*** ianychoi has joined #zuul14:42
*** hashar is now known as hasharAway14:49
*** pcaruana has quit IRC15:29
SpamapSdmsimard: yeah for some reason nodepool isn't finding them in ~/.aws so I'm setting them as env vars instead15:32
SpamapSwhich actually kinda makes more sense anyway15:32
dmsimardtristanC: I was looking at the build ( http://logs.openstack.org/04/591604/22/check/zuul-build-dashboard/6e880c6/npm/html/status ) and it looks really cool. Filtering is instantaneous and the page seems faster to load (in firefox). I can't comment on the code but the result is nice, good job :D15:33
dmsimardSpamapS: in my world that means /var/lib/nodepool/.aws (since that's the homedir of nodepool)15:33
Shrewscorvus: can you reapply your +W on https://review.openstack.org/593125 ? looks like a random test failure15:38
corvusShrews: i just left a recheck comment (it has to go through check again anyway)15:39
Shrewsah15:39
dmsimardSpamapS: how are you setting those env vars ?15:39
tristanCdmsimard: thanks :) React is designed to only render element that needs update, thus status loading and filtering is indeed much faster15:46
tristanCit could be even faster if the create_tree method was rewritten to be per pipelines, it's still a copy of the legacy code that walk through all change_queues to figure out box indentation.15:47
SpamapSdmsimard: in the deployment container spec15:53
SpamapSdmsimard: in mine I just run nodepool as "root" since it's not real root inside the container. :-P15:53
dmsimardSpamapS: sure, I guess that reading env variables is provided by boto and not nodepool itself though.. I think the credentials file approach mirrors well what we have for openstack's clouds.yaml though so that would probably be what is the cleanest approach (from a driver standpoint)15:57
SpamapSdmsimard: Right, I want the credentials approach to work.. just not sure what's making it fail15:58
dmsimardSpamapS: are they under a [default] block ?15:58
dmsimardSpamapS: like http://paste.openstack.org/raw/728840/15:59
SpamapSdmsimard: it's failing the isfile check16:04
SpamapSbecause secrets mounted in are symlinks I guess16:04
dmsimardSpamapS: have a stack trace ? I can look into it16:04
*** jpena is now known as jpena|off16:09
SpamapSbah no, it's dumber than that16:09
SpamapSHelps if you name the fields right16:09
SpamapSdmsimard: this is my new stack trace16:13
SpamapShttp://paste.openstack.org/show/728842/16:13
SpamapSlooks like maybe that's just my AMI16:15
dmsimardSo that looks like an error in the code I wrote on that last patchset16:16
SpamapSyeah some ami's have 0 block device mappings16:17
SpamapSthey're instance-store-only16:17
dmsimardSpamapS: I guess line 136 at https://review.openstack.org/#/c/535558/10/nodepool/driver/aws/provider.py@136 was a bit naive16:17
SpamapSI'm trying an EBS AMI now16:18
dmsimardSpamapS: yeah, I was basically trying to address that some AMIs ship with an EBS by default and we definitely need to make sure they are set to delete on termination16:18
dmsimardLet me try and check what an AMI without EBS looks like16:18
SpamapSlol, and the EBS ones fail16:19
SpamapSException: The AMI %s has more than one block device16:19
SpamapSso ubuntu's images either have 0 b_d_m, or > 1.16:20
dmsimardSpamapS: are those AMIs on the marketplace ?16:20
SpamapSdmsimard: I have a quick patch for this, which is just to wrap it in an if block_device_mapping16:20
SpamapSdmsimard: https://cloud-images.ubuntu.com/locator/16:21
SpamapSthat's how I find 'em16:21
dmsimardSpamapS: ok, let me just have a look at what they look like16:21
SpamapSdmsimard: I'm testing a patch that just wraps the mapping check in a boolean check of block_device_mapping16:22
SpamapSin case you want to run with that on your next patchset16:22
* SpamapS has to task switch to real work now ;)16:25
SpamapSI'm hoping that over the next couple weeks I will evolve my k8s deployment into helm charts16:25
SpamapSand maybe we'll at least have a "got a k8s cluster? Want some zuul in it? -> chart"16:25
dmsimardSpamapS: looks like this from boto's perspective http://paste.openstack.org/raw/728844/16:26
*** pcaruana has joined #zuul16:26
dmsimardSpamapS: happy to try your patch out and submit a new patchset16:27
SpamapSdmsimard: right, try an hvm-instance16:27
SpamapSdmsimard: that will have 0 mappings16:27
dmsimardSpamapS: there's two AMIs in that paste, the first is hvm-instance and yeah it has nothing16:27
SpamapSoh right16:27
SpamapSyeah so that's what I'd want to use for test nodes16:28
SpamapSNo reason to spend $ on gp2 EBS16:28
dmsimardsure16:28
dmsimardSpamapS: if your patch was just to check if block_device_mappings was empty, I'll take care of it16:28
SpamapSand for those with Ebs, probably quite common to have > 116:28
SpamapSdmsimard: yeah that's the patch I'm going to test now16:28
SpamapSthough I'm switching to other stuff for a few hours16:28
dmsimardSpamapS: okay, the bdm > 1 part I'm not entirely sure how to handle it.. like that image has one ebs but three devices ? or is that actually three volumes ? the first has delete on termination but the other two do not16:29
* dmsimard prefers OpenStack16:30
*** electrofelix has quit IRC16:38
*** jlviva-viva is now known as jlvillal16:40
*** openstackgerrit has quit IRC17:04
*** openstackgerrit has joined #zuul17:06
openstackgerritMerged openstack-infra/zuul-jobs master: add-fileserver: fileserver_leading_path isn't from the secret  https://review.openstack.org/58776817:06
openstackgerritMerged openstack-infra/zuul master: Add container push job  https://review.openstack.org/59312517:18
Shrewsooh, neat. somebody merge something into zuul17:44
Shrews:)17:44
*** jimi|ansible has quit IRC18:28
SpamapSdmsimard: the non EBS ones are ephemeral storage18:59
dmsimardSpamapS: yeah I understood that part19:06
*** pcaruana has quit IRC19:13
SpamapSShrews: does that mean we'll get a dockerhub that we can FROM zuul:latest on?19:16
ShrewsSpamapS: yes19:17
pabelangerI was thinking about nodepool, how did we solve the diskimage-builder dependencies for nodepool-builder? Will pbrx be smart enough to pull those in from bindep in diskimage-builder?19:18
pabelangeror do you first build diskimage-builder container, then FROM it when doing nodepool-builder images19:20
Shrewspabelanger: yes, it uses bindep19:22
SpamapSShrews: neat19:23
pabelangerShrews: cool, so we don't need to add DIB dependencies directly into nodepool bindep.txt?19:25
pabelangereg: kpartx19:25
SpamapSsomething we might want to put in zuul's bindep: ssh client19:26
SpamapSFileNotFoundError: [Errno 2] No such file or directory: 'ssh-agent': 'ssh-agent'19:26
SpamapSthat's my current speed bump to running jobs in my AWS+K8S+Zuul monster19:26
Shrewspabelanger: shouldn't. it builds a "base" container first that contains all dependencies from bindep installed. then creates the other containers based on that one19:27
pabelangerokay cool19:28
Shrewspabelanger: i'm making the assumption that these things actually work. i haven't tested them, personally   :)19:29
pabelangeryah, we'll likely learn soon enough :)19:31
openstackgerritClint 'SpamapS' Byrum proposed openstack-infra/zuul master: Need SSH client for Ansible  https://review.openstack.org/59687519:37
SpamapS^^19:38
SpamapSneed that so the docker images get ssh client19:38
dmsimardcorvus: I'm able to find where the counters for SUCCESS/FAILURE are incremented for the different phases by the executor with "self.executor_server.statsd" but I'm unable to find where we might be reporting durations or timers.. is it done elsewhere ? The only duration/timer calculations I've seen were inferred from the database (i.e, for display in the web ui)20:09
corvusdmsimard: https://git.openstack.org/cgit/openstack-infra/zuul/tree/zuul/scheduler.py#n45220:10
dmsimardah, ty20:11
*** samccann has quit IRC20:18
*** samccann has joined #zuul20:19
*** rlandy is now known as rlandy|brb20:31
dmsimardcorvus: I'm not able to figure it out... :( I took for example the Zuul tox-py36 job and on graphite.o.o, neither "stats.zuul.tenant.openstack.pipeline.check.project.git_openstack_org.openstack-infra_zuul.master.job.tox-py36.SUCCESS" or "stats_counts.zuul.tenant.openstack.pipeline.check.project.git_openstack_org.openstack-infra_zuul.master.job.tox-py36.SUCCESS" seem to hold any kind of duration data20:32
dmsimardI did confirm that the timer is reporter properly and on the right key in my environment, the value was reported was "103275" which seems appropriate for a job that ran for 1m43s (103275/1000 = 103 seconds)20:35
corvusdmsimard: the equivalent timer key for that is stats.timers.zuul.tenant.openstack.pipeline.check.project.git_openstack_org.openstack-infra_zuul.master.job.tox-py35.SUCCESS.mean20:41
corvusdmsimard: well, the last part ("mean") is just one item of data recorded by timers, there's a slew of them; you'll see them if you expand under "SUCCESS"20:41
dmsimardOH20:42
dmsimardit's under timers20:42
corvusright, statsd puts different kinds of metrics under different roots20:42
dmsimardokay, I found it now -- awesome, thanks :D20:43
dmsimardcorvus: I'm not sure I understand why we're multiplying by 1000 -- the raw value appears to be in seconds already ?20:49
corvusdmsimard: right, that puts it in milliseconds.  statsd only deals with integers, so if we want subsecond, we multiply.  time in milliseconds is also semi-standard in javascript, so it's convenient in some cases.20:51
dmsimardoh, for precision -- got it20:51
*** rlandy|brb is now known as rlandy20:53
*** samccann has quit IRC21:08
*** hasharAway has quit IRC21:56
*** eventingmonkey has quit IRC23:26
*** eventingmonkey has joined #zuul23:27
openstackgerritJames E. Blair proposed openstack-infra/zuul-jobs master: write-inventory: add ansible_python_interpreter variable  https://review.openstack.org/59696123:30

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!