Thursday, 2017-06-15

*** rfolco has joined #openstack-dib00:49
*** rfolco has quit IRC00:49
*** rfolco has joined #openstack-dib01:03
*** rfolco has quit IRC01:03
*** isaacb has joined #openstack-dib03:22
*** isaacb has quit IRC03:24
*** brault has joined #openstack-dib05:18
*** brault has quit IRC05:19
*** andreas-f has joined #openstack-dib05:22
*** makowals has joined #openstack-dib05:59
*** makowals has quit IRC06:03
*** hashar has joined #openstack-dib06:39
*** makowals has joined #openstack-dib06:40
*** andreas-f has quit IRC07:07
*** aparnav has joined #openstack-dib08:44
*** yolanda_ has joined #openstack-dib08:59
lennybHi Team, what could be a reason for not passing 'compat=10.0' in nodepool-builder ?  I am bringing up new CI and I see that qemu-img fails due to a missing compat flag10:26
lennyb#link http://paste.openstack.org/show/612668/ 10:26
ianwlennyb: did you set it in nodepool.yaml?11:02
ianwsee https://git.openstack.org/cgit/openstack-infra/project-config/tree/nodepool/nodepool.yaml 11:03
*** yolanda__ has joined #openstack-dib11:30
*** yolanda_ has quit IRC11:30
*** yolanda__ is now known as yolanda12:05
lennybthanks, ianw. I've missed this var12:12
*** appu103 has joined #openstack-dib12:54
*** aparnav has quit IRC12:57
*** appu103 has quit IRC12:58
*** hashar has quit IRC13:47
*** hashar has joined #openstack-dib13:54
*** hashar has quit IRC14:04
*** hashar has joined #openstack-dib14:12
*** makowals has quit IRC14:47
*** makowals has joined #openstack-dib14:53
*** hashar has quit IRC15:31
*** andreas-f has joined #openstack-dib16:12
*** greghaynes is now known as greghayn117:18
*** greghaynes has joined #openstack-dib17:18
*** greghaynes has quit IRC17:22
*** greghaynes has joined #openstack-dib17:23
*** greghayn1 has quit IRC17:25
johnsomHmm, seeing a few:17:46
johnsomsubprocess.CalledProcessError: Command 'sudo mkfs -t ext4 -i 4096 -J size=64 -L cloudimg-rootfs -U f9dc428c-394e-427a-a6d6-18c49a822fa0 -q /dev/mapper/loop0p1' returned non-zero exit status 1 17:46
johnsomhttp://logs.openstack.org/74/474674/1/check/gate-octavia-v1-dsvm-py3x-scenario-multinode/6c43279/logs/devstacklog.txt.gz#_2017-06-15_16_59_05_414 17:46
johnsomNot sure what is causing DIB to fail there.  Two out of six gate tests failed with that17:47
johnsomAll run DIB17:47
greghaynesI wonder if out of disk space?18:31
*** Sukhdev has joined #openstack-dib19:26
*** yolanda has quit IRC21:45
ianwjohnsom: hmm, 2.6.0 did move to using kpartx, it's using /dev/mapper ... but for only 2 out of several to fail?21:50
johnsomYeah, it is really strange, same patch, six jobs that build images, just two failed with this error.21:50
johnsomOn a global requirements update no less21:51
johnsomThough we have seen this recently on a few other random patches21:52
ianwhmm, are we tracing the output?  i don't think so22:06
ianwoh yeah, we are ... that's right22:07
ianw2017-06-15 16:59:05.349 | 2017-06-15 16:59:05.349 INFO diskimage_builder.block_device.utils [-] Calling [sudo kpartx -avs /dev/loop0]22:08
ianw2017-06-15 16:59:05.358 | 2017-06-15 16:59:05.357 INFO diskimage_builder.block_device.utils [-] Calling [sudo mkfs -t ext4 -i 4096 -J size=64 -L cloudimg-rootfs -U f9dc428c-394e-427a-a6d6-18c49a822fa0 -q /dev/mapper/loop0p1]22:08
ianwthat's not very long between the kpartx call and the mkfs call22:08
ianwmaybe we should put a sleep in22:08
ianwhttps://bugs.launchpad.net/nova/+bug/1024586 -> reports that kpartx is not synchronous22:10
openstackLaunchpad bug 1024586 in OpenStack Compute (nova) "avoid the use of kpartx in file injection" [Medium,Invalid] - Assigned to Pádraig Brady (p-draigbrady)22:10
ianwfirst hit for "kpartx" and synchronous :)22:10
ianwsecond comment > I just noticed a mail to say in kernel Loop device partition scanning is unreliable22:11
ianwhaha, everything is terrible :)22:11
johnsomHa, yeah, that actually brings up some memories for a bug we had early in DIB22:15
ianwi wonder if "udevadm settle" is actually useful here, or it's just a placebo and a sleep(2) would be just as effective?22:17
johnsomMaybe a partprobe or kpartx -u?22:20
johnsomUgh, it's been a while since I dealt with this sort of thing, it is fuzzy22:21
ianwwe deliberately removed the partprobe so we didn't have to clean up two paths22:22
ianwbut, we did use to have a udevadm settle after it, looking at old logs22:22
ianwhttp://logs.openstack.org/91/470791/3/check/gate-dib-dsvm-functests-python2-ubuntu-trusty/9a08409/console.html#_2017-06-05_07_26_32_789660 22:22
ianwand we are running in "sync" mode (-s) which should not return until the partitions are made22:23
johnsomI'm not super familiar with udevadm settle, but it sounds reasonable from the man page.22:25
johnsomSleeps bother me as someone always invents a slower/busier host22:29
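The concern about fixed sleeps can be illustrated with a bounded poll. This is a minimal sketch, not something proposed in the channel or taken from diskimage-builder: the helper name `wait_for_path` and its parameters are hypothetical, and it assumes the failure really is a device node (such as /dev/mapper/loop0p1) appearing late, which the log has not yet confirmed.

```python
import os
import time


def wait_for_path(path, timeout=10.0, interval=0.1):
    """Return True once `path` exists, or False if `timeout` seconds pass first.

    Hypothetical helper: polls instead of sleeping for a fixed time, so a fast
    host returns immediately and a slow host gets up to `timeout` seconds.
    """
    deadline = time.monotonic() + timeout
    while True:
        if os.path.exists(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

A caller could run this between the kpartx call and mkfs and raise a descriptive error when it returns False, which would also distinguish a missing partition from other mkfs failures.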
*** rm_work has joined #openstack-dib22:29
johnsomianw Should I open an actual bug and throw up a patch?22:30
ianwthe other thing is, if it wasn't there, why didn't mkfs output anything22:30
ianwThe file /dev/mapper/loop0p3 does not exist and no size was specified.22:30
ianw^ mkfs output on a non-existent device for me using that command line22:30
johnsomYeah, I was wondering what happened to the output too.  I just thought DIB was eating it or since we don't have -x tracing on it wasn't printed.22:32
ianwohhhh ... no "-x", yeah it's debug level22:32
ianwis there a reason not to run with "-x"?  i've wanted to make that default level22:32
johnsomYeah, too chatty for us in day-to-day dev22:32
johnsomYeah, we tried it for a bit, but it is super chatty22:33
ianwit has got a *lot* better ...22:33
rm_worki LIKE -x22:33
rm_workfor the record22:33
rm_worki keep trying to add it22:33
ianwi've done stuff like turn off things like logging loop internals22:33
rm_workbut can't find the right spot (and my care level was admittedly low)22:34
johnsomFine, fine, we can make that 300k log file 600k...  grin.  I won't block it.22:34
ianwwe could turn the tracing of commands run under exec_sudo up to info level22:34
ianwit would be really great to see why that mkfs fails ... if it is a missing device, well clearly there's a bug in the "-s" mode of kpartx22:35
ianwi'm happy to work around it, but would like to understand why we are22:35
rm_workI will +2 adding "-x"22:35
rm_work:)22:35
ianwi do not believe we've seen this in the dib gate.  but maybe our hosts aren't as busy22:35
ianwif there are parts of "-x" spewing out useless boilerplate, i'm happy to turn that down22:36
johnsomWe probably turn a lot more patchsets too22:36
ianwtrue22:36
ianwso my first preference would be to catch one of these failures so we know what we're dealing with22:37
ianweither up the log level in the exec_sudo, or add "-x" to the call22:37
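The first option, logging a command's output at info level, might look roughly like the sketch below. It is an assumption-laden illustration, not DIB's actual exec_sudo: the function name `run_logged` is hypothetical, it omits the sudo prefix, and it simply merges stderr into stdout so a message such as mkfs's complaint about a missing device would land in the log.

```python
import logging
import subprocess

logger = logging.getLogger(__name__)


def run_logged(cmd):
    """Run `cmd` (a list of strings), log its output at INFO, raise on failure.

    Hypothetical stand-in for a command wrapper; not the real exec_sudo.
    """
    logger.info("Calling [%s]", " ".join(cmd))
    proc = subprocess.run(cmd, stdout=subprocess.PIPE,
                          stderr=subprocess.STDOUT,
                          universal_newlines=True)
    # Emit every output line at INFO so it is visible without "-x".
    for line in proc.stdout.splitlines():
        logger.info("%s: %s", cmd[0], line)
    if proc.returncode != 0:
        # Attach the captured output so the exception is self-explanatory.
        raise subprocess.CalledProcessError(proc.returncode, cmd,
                                            output=proc.stdout)
    return proc.stdout
```

With something like this, the failing mkfs call would leave its own error text next to the "Calling [...]" line instead of only an exit status.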
johnsomrm_work It's OCTAVIA_DIB_TRACING in gate_hook22:39
johnsomhttps://github.com/openstack/octavia/blob/master/devstack/plugin.sh#L62 22:39
rm_workk22:40
rm_workOCTAVIA_DIB_TRACING=True ? 22:40
rm_workOCTAVIA_DIB_TRACING=1 22:40
rm_workOCTAVIA_DIB_TRACING=-x 22:40
*** openstackgerrit has joined #openstack-dib22:40
openstackgerritOpenStack Proposal Bot proposed openstack/diskimage-builder master: Updated from global requirements  https://review.openstack.org/474798 22:40
johnsom 1 22:40
rm_workk i'll set that in ... one of these22:41
rm_workerr whatever i'll make it a thing22:41
ianwok, hopefully (?) it's a missing partition ...23:40
ianwnow we're driving the build from python, i wonder if it would be more practical to capture logs to a file23:41
ianwif we dup()'d stdout/stderr before exec() maybe...23:44
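The dup() idea could be sketched as follows. This is only an illustration under stated assumptions, not how diskimage-builder captures logs: the context manager name `redirect_fds_to_file` is hypothetical, and it assumes a POSIX host where file descriptors 1 and 2 are inherited by child processes across exec().

```python
import contextlib
import os
import sys


@contextlib.contextmanager
def redirect_fds_to_file(path):
    """Temporarily point fds 1 and 2 at `path`; child processes inherit them.

    Hypothetical helper: saves the original descriptors with dup() and
    restores them with dup2() when the block exits.
    """
    sys.stdout.flush()
    sys.stderr.flush()
    saved_out = os.dup(1)
    saved_err = os.dup(2)
    log_fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_APPEND, 0o644)
    try:
        os.dup2(log_fd, 1)
        os.dup2(log_fd, 2)
        yield
    finally:
        # Flush anything buffered in Python before switching back.
        sys.stdout.flush()
        sys.stderr.flush()
        os.dup2(saved_out, 1)
        os.dup2(saved_err, 2)
        for fd in (saved_out, saved_err, log_fd):
            os.close(fd)
```

Any subprocess started inside the `with` block writes its stdout and stderr to the file without each call site needing its own redirection.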
