Wednesday, 2019-10-30

*** rlandy has quit IRC00:02
*** panda has quit IRC00:02
*** panda has joined #openstack-infra00:05
*** jamesmcarthur has quit IRC00:15
*** tkajinam has joined #openstack-infra00:16
*** weifan has joined #openstack-infra00:26
*** jamesmcarthur has joined #openstack-infra00:29
openstackgerritMonty Taylor proposed opendev/system-config master: Clean up review comments  https://review.opendev.org/69200300:29
*** weifan has quit IRC00:30
*** goldyfruit___ has quit IRC00:42
*** michael-beaver has quit IRC00:43
*** jamesmcarthur has quit IRC00:44
*** ricolin has quit IRC00:45
openstackgerritMerged openstack/diskimage-builder master: Revert "Drop vhdutil dependency, use qemu-img"  https://review.opendev.org/69199800:52
*** pots has quit IRC00:54
*** pots has joined #openstack-infra00:55
ianwi'll release dib with ^00:55
ianw2.29.1 out ... will check in on deployment00:57
*** ricolin has joined #openstack-infra00:57
*** jamesmcarthur has joined #openstack-infra01:01
mordred:(01:02
ianwi'm trying having qemu-img set the tag to "tap\0"01:05
*** weifan has joined #openstack-infra01:06
ianwnot that carrying a patched qemu-img is really any easier ... well maybe a little01:06
mordredianw: if only rackspace would accept uploads in a realistic format01:07
*** jamesmcarthur has quit IRC01:13
*** weifan has quit IRC01:14
openstackgerritMonty Taylor proposed opendev/system-config master: Plumb through secure.config contents  https://review.opendev.org/69180001:53
*** yamamoto has joined #openstack-infra01:55
*** yamamoto has quit IRC01:56
*** yamamoto has joined #openstack-infra01:56
*** slaweq has joined #openstack-infra02:13
*** slaweq has quit IRC02:17
*** slaweq has joined #openstack-infra02:22
*** diablo_rojo has quit IRC02:24
*** slaweq has quit IRC02:27
*** uberjay has quit IRC02:27
*** uberjay has joined #openstack-infra02:29
ianwwell with a patched qemu-img i can upload an image that becomes active ...02:32
mordredianw: that's cool. it's just the tag change?02:32
ianwyou also have to fiddle the version to fool https://fossies.org/dox/xen-4.12.1/libvhd_8c_source.html#l0119302:33
mordredianw: but it *didn't* work with editing the file, right? was there maybe just an additional byte that needed changing or something perhaps?02:33
mordredah02:33
mordredso it's conceivable that we could just edit the header afterwards perhaps?02:33
ianwnot really as it fiddles the checksum02:36
ianwbut, possibly maintaining a patch for qemu which is alive, rather than removed bits of xen, might be easier at least02:36
*** slaweq has joined #openstack-infra02:37
mordredyeah02:45
mordredwe *might* even be able to convince someone in ubuntu to accept the patch02:46
mordredor even upstream02:46
*** slaweq has quit IRC02:52
ianwi think i can make a compelling case for including a flag upstream ... if this thing boots ...02:53
*** slaweq has joined #openstack-infra02:57
ianw23.253.245.130 ... it has booted02:57
*** yamamoto has quit IRC02:58
*** slaweq has quit IRC03:12
*** slaweq has joined #openstack-infra03:13
mordred\o/03:17
mordredthat's SO EXCITING03:17
mordredand it resized appropriately?03:17
mordredlooks like it: /dev/xvda1       15G  9.6G  4.4G  69% /03:18
* mordred hands ianw a box of celebration chickens03:19
ianwno, i don't think it has, i think that should be 40gb03:20
ianwthe images are close ... http://paste.openstack.org/show/785645/03:22
ianwbut the one using vhd-util has this batmap ... my test one was giving an error about missing batmaps prior to the version change03:22
*** slaweq has quit IRC03:22
*** slaweq_ has joined #openstack-infra03:23
ianwit appears to be a xen feature of vhd-util, it's not described in the offiical spec03:25
*** slaweq_ has quit IRC03:27
*** yamamoto has joined #openstack-infra03:29
*** slaweq_ has joined #openstack-infra03:32
mordredoh. well - piddle. I guess those were premature celebration chickens03:33
paladox:D03:34
mordredianw: so really supporting it in qemu-img would be like making another output format, like "xen" or something, that implements the xen extensions to vhd03:34
ianwmordred: upon further investigation, someone has sort of tried to implement the batmap with https://patchwork.ozlabs.org/project/qemu-devel/list/?submitter=6475003:35
ianw... patchwork ... my opinions best left unsaid03:36
*** slaweq_ has quit IRC03:37
*** psachin has joined #openstack-infra03:37
mordredyeah. that's *definitely* better than gerrit. I can *totally* follow what's going on there03:37
ianwi used vhd-utils to resize the test disk, made with (patched) qemu and it seemed to work -> Current disk size   : 33554432 MB (35184372088832 Bytes)03:38
*** rh-jelabarre has quit IRC03:38
ianwi feel like that is what xen/nova/something is doing in the background03:38
*** yamamoto has quit IRC03:39
mordredianw: when you say you used vhd-utils - you mean you did that on the image file? or on the booted vm?03:39
ianwmordred: on the image file ... vhd-util resize --debug  -n ./test.vhd -s $((32 * 1024 * 1024 )) -j resize.log03:39
mordredianw: if only we could upload in raw format and have the cloud transform that into vhd and do the resize for us ...03:40
ianwi wonder though, if i reupload ^^ if that will then boot with a 32gb root disk03:40
mordred(or really, if only we could upload in qcow2 everywhere, and have the clouds auto-reformat to their desired format as part of the uopload)03:41
ianw-rw-r--r-- 1 ianw ianw  16G Oct 30 03:33 test.vhd03:41
mordredianw: I'd be willing to bet it would03:41
mordredoh. it still takes 16G?03:41
mordredon disk?03:41
mordredis that unpatched vhd-util you used for the resize?03:41
ianwyes still 16gb, but whatever vhd-util is on the nb's03:42
*** slaweq_ has joined #openstack-infra03:42
ianwi will re-upload the image and see if we get  a bigger host03:43
*** slaweq_ has quit IRC03:47
mordredianw: if it works - we should try again with the vhd-util that's not from the ppa03:49
mordredI'm pretty sure nb has the ppa version03:50
*** slaweq_ has joined #openstack-infra03:52
*** slaweq_ has quit IRC03:57
mordredianw: actually - it seems like there is no version of vhd-util in ubuntu bionic without our ppa03:58
mordredjust to make this as complex as possible03:59
*** ykarel|pto has joined #openstack-infra04:01
*** ykarel|pto is now known as ykarel04:01
*** dave-mccowan has quit IRC04:02
*** jerryz has quit IRC04:07
*** todun has joined #openstack-infra04:08
ianw"message": "Image cannot be imported. Error code: '413'",04:17
*** udesale has joined #openstack-infra04:35
*** soniya29 has joined #openstack-infra04:59
*** ykarel is now known as ykarel|afk05:02
*** ykarel|afk has quit IRC05:02
*** janki has joined #openstack-infra05:17
*** sgw has quit IRC05:17
*** todun has quit IRC05:23
*** ykarel has joined #openstack-infra05:25
*** todun has joined #openstack-infra05:31
*** todun has quit IRC05:34
*** larainema has joined #openstack-infra05:39
*** yamamoto has joined #openstack-infra05:41
*** yamamoto has quit IRC05:55
*** kjackal has joined #openstack-infra05:56
*** kjackal has quit IRC06:08
AJaegerconfig-core, please review https://review.opendev.org/691986 to remove obsolete jobs06:21
*** lmiccini has joined #openstack-infra06:22
*** yamamoto has joined #openstack-infra06:23
*** igordc has quit IRC06:37
*** kopecmartin|off is now known as kopecmartin06:40
*** georgk has quit IRC06:40
*** fdegir has quit IRC06:40
*** georgk has joined #openstack-infra06:41
*** fdegir has joined #openstack-infra06:41
*** dpawlik has joined #openstack-infra06:58
*** ykarel is now known as ykarel|afk06:59
*** dpawlik has quit IRC07:03
*** pcaruana has joined #openstack-infra07:09
*** yamamoto has quit IRC07:09
*** dpawlik has joined #openstack-infra07:18
*** pgaxatte has joined #openstack-infra07:23
*** dciabrin has quit IRC07:24
*** rascasoft has quit IRC07:29
*** lpetrut has joined #openstack-infra07:35
*** kjackal has joined #openstack-infra07:36
*** yamamoto has joined #openstack-infra07:47
*** aedc has quit IRC07:49
*** pkopec has joined #openstack-infra07:55
*** florianf has joined #openstack-infra08:02
*** tkajinam has quit IRC08:03
*** ykarel|afk is now known as ykarel08:03
*** dchen has quit IRC08:07
*** yamamoto has quit IRC08:17
*** florianf has left #openstack-infra08:17
*** yamamoto has joined #openstack-infra08:18
*** chandankumar has quit IRC08:19
*** chandankumar has joined #openstack-infra08:20
openstackgerritMerged openstack/openstack-zuul-jobs master: Remove openstack-tox-py27-with-oslo-master  https://review.opendev.org/69198608:22
*** hashar has joined #openstack-infra08:27
*** slaweq_ has joined #openstack-infra08:30
*** gfidente has joined #openstack-infra08:34
*** yamamoto has quit IRC08:34
*** prometheanfire has quit IRC08:35
*** jpena|off is now known as jpena08:37
*** prometheanfire has joined #openstack-infra08:37
*** amoralej|off is now known as amoralej08:46
*** janki has quit IRC08:46
*** jchhatbar has joined #openstack-infra08:47
*** yamamoto has joined #openstack-infra08:47
*** lucasagomes has joined #openstack-infra08:51
*** lmiccini has quit IRC08:57
*** rpittau|afk is now known as rpittau08:58
*** ralonsoh has joined #openstack-infra08:58
*** dtantsur|afk is now known as dtantsur09:01
*** ykarel is now known as ykarel|lunch09:03
*** lmiccini has joined #openstack-infra09:06
*** vesper has joined #openstack-infra09:08
*** vesper11 has quit IRC09:09
*** iurygregory has joined #openstack-infra09:10
*** slaweq_ is now known as slaweq09:18
*** sshnaidm|afk is now known as sshnaidm09:21
*** yamamoto has quit IRC09:25
*** trident has quit IRC09:26
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - add support for git.tag.creation event  https://review.opendev.org/67993809:30
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - Support for branch creation/deletion  https://review.opendev.org/68511609:30
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - add support for git.tag.creation event  https://review.opendev.org/67993809:32
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - Support for branch creation/deletion  https://review.opendev.org/68511609:32
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - add the enqueue_ref unit test  https://review.opendev.org/68735109:32
*** dpawlik has quit IRC09:33
*** trident has joined #openstack-infra09:34
*** yamamoto has joined #openstack-infra09:34
*** elod has quit IRC09:37
*** tesseract has joined #openstack-infra09:38
*** dciabrin has joined #openstack-infra09:42
*** aedc has joined #openstack-infra09:46
*** dpawlik has joined #openstack-infra09:48
*** cgoncalves has quit IRC09:49
*** yamamoto has quit IRC09:50
*** elod has joined #openstack-infra09:57
*** cgoncalves has joined #openstack-infra09:58
*** ykarel|lunch is now known as ykarel10:13
AJaegercorvus, regarding the /opt/git changes on our images: We have lots of "/usr/zuul-env/bin/zuul-cloner --cache-dir /opt/git" - did we just break that?10:17
*** Tengu has quit IRC10:18
*** kjackal has quit IRC10:18
*** Tengu has joined #openstack-infra10:19
*** kjackal has joined #openstack-infra10:22
*** yamamoto has joined #openstack-infra10:25
*** rcernin has quit IRC10:27
*** pcaruana has quit IRC10:35
*** yamamoto has quit IRC10:37
openstackgerritFabien Boucher proposed zuul/nodepool master: Remove uneeded shebang and exec bit on some files  https://review.opendev.org/69210010:39
*** jtomasek has quit IRC10:44
*** openstackstatus has quit IRC10:44
*** mgoddard has quit IRC10:46
*** mgoddard has joined #openstack-infra10:47
*** jtomasek has joined #openstack-infra10:47
*** rfolco|off has joined #openstack-infra10:54
*** sulo has joined #openstack-infra10:54
*** panda is now known as panda|pto11:00
*** arxcruz is now known as arxcruz|lunch11:10
*** jchhatbar has quit IRC11:14
*** rh-jelabarre has joined #openstack-infra11:16
*** aedc has quit IRC11:17
*** sshnaidm has quit IRC11:23
*** goldyfruit___ has joined #openstack-infra11:29
*** rfolco|off has quit IRC11:35
openstackgerritHarald Jensås proposed openstack/diskimage-builder master: WIP: Add IPv6 support in dhcp-all-interfaces  https://review.opendev.org/69211011:40
*** dpawlik has quit IRC11:41
openstackgerritHarald Jensås proposed openstack/diskimage-builder master: WIP: Add IPv6 support in dhcp-all-interfaces  https://review.opendev.org/69211011:43
*** sshnaidm has joined #openstack-infra11:43
*** ramishra has quit IRC11:46
*** ramishra has joined #openstack-infra11:47
*** goldyfruit___ has quit IRC11:54
*** dciabrin has quit IRC11:55
*** jpena is now known as jpena|lunch11:59
*** pcaruana has joined #openstack-infra12:00
*** ykarel is now known as ykarel|afk12:02
dtantsurhi folks! what's the unbound service installed in the CI? it conflicts with dnsmasq from bifrost, apparently..12:02
*** dpawlik has joined #openstack-infra12:08
*** larainema has quit IRC12:09
*** dpawlik has quit IRC12:12
*** rlandy has joined #openstack-infra12:13
openstackgerritHarald Jensås proposed openstack/diskimage-builder master: WIP: Add IPv6 support in dhcp-all-interfaces  https://review.opendev.org/69211012:17
fricklerdtantsur: iiuc we install a local resolver mainly in order to be able to direct requests via ipv6 when that is natively available, to avoid drops in cloud providers' overloaded ipv4 nat devices12:18
*** arxcruz|lunch is now known as arxcruz12:18
dtantsurack12:18
dtantsurI think I've figured why the CI is failing, it's not related12:19
*** markvoelker has quit IRC12:25
*** hashar is now known as hasharAway12:25
*** markvoelker has joined #openstack-infra12:25
*** udesale has quit IRC12:26
*** hasharAway has quit IRC12:32
*** hashar has joined #openstack-infra12:34
*** hashar is now known as hasharAway12:35
*** aedc has joined #openstack-infra12:39
*** sulo has quit IRC12:41
*** Goneri has joined #openstack-infra12:49
fungifrickler: the main reason we install unbound is to have a local dns cache on the node so that it doesn't query the same records from an external resolver over and over, as the latter increases chances of failure from intermittent network issues/packet loss12:56
*** eharney has quit IRC12:57
fungiif we didn't already want that, there would have been other ways to set preferred resolver addresses on different providers12:57
*** rfolco has joined #openstack-infra13:01
*** jpena|lunch is now known as jpena13:02
*** psachin has quit IRC13:02
*** sgw has joined #openstack-infra13:03
*** goldyfruit___ has joined #openstack-infra13:04
*** ramishra has quit IRC13:07
*** yamamoto has joined #openstack-infra13:08
*** soniya29 has quit IRC13:11
*** amoralej is now known as amoralej|lunch13:12
*** hasharAway has quit IRC13:13
*** xek has joined #openstack-infra13:13
*** yamamoto has quit IRC13:13
*** hashar has joined #openstack-infra13:14
*** nicholas has joined #openstack-infra13:14
*** hashar_ has joined #openstack-infra13:15
*** pkopec has quit IRC13:19
*** pkopec has joined #openstack-infra13:19
*** mriedem has joined #openstack-infra13:22
*** yamamoto has joined #openstack-infra13:36
*** hashar_ has quit IRC13:37
*** hashar has quit IRC13:38
*** hashar has joined #openstack-infra13:38
*** yamamoto has quit IRC13:41
*** ramishra has joined #openstack-infra13:41
*** yamamoto has joined #openstack-infra13:41
*** ykarel|afk is now known as ykarel13:45
*** kopecmartin is now known as kopecmartin|scho13:48
*** dciabrin has joined #openstack-infra13:50
*** dpawlik has joined #openstack-infra13:54
*** amoralej|lunch is now known as amoralej13:58
*** dave-mccowan has joined #openstack-infra14:00
*** eharney has joined #openstack-infra14:03
Shrewsinfra-root: Most of mordred's topic:gerrit-images changes have 3 +2's now. Do we want to begin landing those?14:03
*** ykarel is now known as ykarel|meeting14:04
*** mattw4 has joined #openstack-infra14:04
clarkbShrews: I was planning on lettibg mordred land what he had tested on -dev but wasnt sure what those were so left off +As14:04
Shrewsclarkb: k. probably best to let mordred control the landing14:04
AJaegerclarkb: good morning. Good progress on openSUSE 150 changes but stein is causing problems. osa is fixing their branches right now.14:05
AJaegerclarkb: I wonder whether we should force-merge https://review.opendev.org/677181 for bifrost?14:06
*** mattw4 has quit IRC14:16
*** diablo_rojo has joined #openstack-infra14:16
*** jaosorior has joined #openstack-infra14:21
*** jamesmcarthur has joined #openstack-infra14:24
*** goldyfruit_ has joined #openstack-infra14:28
*** goldyfruit___ has quit IRC14:31
*** jamesmcarthur has quit IRC14:31
*** anteaya has joined #openstack-infra14:37
openstackgerritHarald Jensås proposed openstack/diskimage-builder master: WIP: Add IPv6 support in dhcp-all-interfaces  https://review.opendev.org/69211014:43
*** rkukura has quit IRC14:47
*** Goneri has quit IRC14:49
*** ykarel|meeting is now known as ykarel15:01
*** xek_ has joined #openstack-infra15:03
*** amoralej is now known as amoralej|off15:03
*** jpena is now known as jpena|off15:04
*** yamamoto has quit IRC15:04
*** xek has quit IRC15:05
*** eharney has quit IRC15:05
*** jaosorior has quit IRC15:06
*** efried1 has joined #openstack-infra15:07
*** yamamoto has joined #openstack-infra15:08
*** weshay is now known as weshay|rover15:08
*** efried has quit IRC15:08
*** efried1 is now known as efried15:08
*** rfolco is now known as rfolco|ruck15:08
*** yamamoto has quit IRC15:12
*** rfolco|ruck is now known as rfolco|rucker15:13
openstackgerritDavid Shrewsbury proposed opendev/base-jobs master: Remove buildset_proxy reference  https://review.opendev.org/69216715:16
*** dpawlik has quit IRC15:16
Shrewsclarkb: ^^ should fix us15:16
fungigonna go grab lunch and run some pre-travel errands but will return in a couple hours15:17
*** eharney has joined #openstack-infra15:19
*** rkukura has joined #openstack-infra15:19
*** elod has quit IRC15:19
*** elod has joined #openstack-infra15:19
*** lpetrut has quit IRC15:22
AJaegerinfra-root, anything wrong with nodepool? Looking at http://grafana.openstack.org/d/T6vSHcSik/zuul-status?orgId=1 I see a max of 1064 nodes but only 500 in use. and there are requests for more.15:23
AJaegerinfra-root, looks like Rackspace is down, see http://grafana.openstack.org/d/8wFIHcSiz/nodepool-rackspace?orgId=115:24
*** gyee has joined #openstack-infra15:24
*** rfolco|rucker has quit IRC15:26
corvus Detailed error for node 936cf8e3-3dfc-4dab-8725-dfdf94d9371d: Exceeded maximum number of retries. Exceeded max scheduling attempts 3 for instance 936cf8e3-3dfc-4dab-8725-dfdf94d9371d. Last exception: ['SR_BACKEND_FAILURE_110', '', "VDI resize failed [opterr=Command ['/usr/bin/vhd-util', 'revert', '--debug', '-n', '/15:26
corvusclarkb, Shrews, AJaeger, fungi: did i see people talking about images yesterday?15:27
AJaegercorvus: yes, saw something in backlog, let's check that...15:27
Shrewsnot sure15:28
AJaegermordred, ianw ^15:28
AJaegerhttp://eavesdrop.openstack.org/irclogs/%23openstack-infra/latest.log.html#t2019-10-30T00:55:3615:28
AJaegerianw wanted to release a new diskimage-builder with a fix15:28
*** jamesmcarthur has joined #openstack-infra15:29
AJaegerwhich he did - https://review.opendev.org/691998 is in 2.29.115:29
AJaegerwhere those images build with older dib and we need to rebuild with current?15:30
Shrewscorvus: is https://review.opendev.org/692167 the right way to handle that, or should we have a check for a running buildset_proxy before attempting to gather the logs? The separation between starting containers and gathering logs for them made that difficult to catch.15:32
corvusShrews: yep that's right, thanks.  it's gone and not coming back.15:32
Shrewscool15:33
corvusShrews, AJaeger: there are a lot of old instances in rax-dfw15:35
corvuslike this: http://paste.openstack.org/show/785659/15:36
Shrewsempty properties? shouldn't that have nodepool specific info?15:37
corvusyeah, maybe they leaked because of that?15:37
Shrewsnothing in nodepool changed around that.15:37
Shrewsthat would definitely cause a leak though15:37
*** dciabrin has quit IRC15:37
corvusanyone have any idea how i can ask for the quota via openstack cli?15:39
Shrewsi can't find that server id in launcher logs15:39
corvusit's ancient look at the dates15:39
nicholasin novaclient it used to be quota-show , not sure if it's same command in openstack cli15:39
*** michael-beaver has joined #openstack-infra15:39
corvusnicholas: Service 'network' is disabled because its configuration could not be loaded.15:40
Shrewscorvus: oh, i was looking at updated date15:40
Shrewswhich is still older than our logs15:40
*** jamesmcarthur has quit IRC15:41
openstackgerritMerged opendev/base-jobs master: Remove buildset_proxy reference  https://review.opendev.org/69216715:42
corvusAJaeger, Shrews: i have to go catch a plane.  i'm sorry i couldn't help more.  i still have no idea what our quota is currently set to, so i can't even investigate whether it has changed or we are exceeding it.  i think that's still something to pursue.15:43
corvusif anyone figures out how to check quota in openstack, please update https://docs.openstack.org/infra/system-config/nodepool.html15:43
AJaegercorvus: safe travels!15:47
nicholastalking about rax quota ?15:47
*** armax has joined #openstack-infra15:48
AJaegeryes15:48
*** Goneri has joined #openstack-infra15:49
nicholassomeone set it to 015:49
*** goldyfruit___ has joined #openstack-infra15:49
*** jaosorior has joined #openstack-infra15:51
*** goldyfruit_ has quit IRC15:52
AJaegerany idea why? are you from rackspace?15:52
nicholasi'm not sure why, but i am from rackspace15:53
nicholassomething about using a bad base image15:54
AJaegerthat should have been solved with new diskimage-builder. Seems we need an infra-root to check that we have working images up again.15:55
AJaeger(and some are just on the way to Shanghai for the summmit)15:55
*** ykarel is now known as ykarel|away15:55
AJaegernicholas: thanks!15:55
* AJaeger cannot do anything.15:55
nicholasi see there's a ticket as well15:55
AJaeger#status log Rackspace quota is at 0 apparently due to broken base images.15:56
*** armax has quit IRC15:56
clarkbthe issue is related to qemu-img built images from dib15:56
clarkbdib was updated to revert that change15:56
clarkbnew images should be built correctly15:56
AJaegerclarkb: do we have new images now?15:57
clarkbI dont know. Slow start today. But I can check soon15:57
AJaegerinfra-root, we lost our boot - #status is not working15:57
*** xarses_ has joined #openstack-infra15:57
AJaegerit's one of those days ;/15:57
corvusinfra-root: i see the ticket in the openstackjenkins account; we should probably verify that we only have correct images uploaded, delete all instances, then ask on the ticket for quota to be reinstated.  there's more info with problematic image ids on the ticket.15:57
* corvus really afks again15:58
clarkbnicholas: the issur is vhd utils checks if it made the image and if it didnt it fails even though the image qemu-img created is fine (tested by updating qemu-img to claim it is vhd-utils)15:59
clarkbnicholas: I believe that ianw found all that tap codehadbeen removed from xen evebtually too15:59
*** xarses has quit IRC16:00
clarkbthis is a case where nodepool's behavior is actually really annoying16:04
clarkbI want to delete all the centos-8 images from rax16:04
clarkbbut as soon as I do nodepool will reupload the bad image iirc16:05
clarkbbut I don't want to delete the bad image because only the vhd is bad and not the other formats16:05
*** kjackal has quit IRC16:05
clarkbya its uploading the bad centos-8 image now :(16:07
Shrewsclarkb: i believe if you pause the image, it will neither be rebuilt or uploaded16:07
clarkbShrews: we want it to be built16:08
clarkbwhich is the problem16:08
*** xarses_ has quit IRC16:08
*** armax has joined #openstack-infra16:09
Shrewsclarkb: in dfw, i see several active instances without anything in 'properties'. from the april/may time period, looks like. should i begin manually deleting those?16:11
*** kjackal has joined #openstack-infra16:11
Shrews(no idea how they end up with no metadata)16:11
clarkbShrews: what are their instance names? I would expect them to be not booted by nodepool if they don't have properties16:12
clarkbbut if the instance names match nodepool instance names then I would double check for holds and if not held then delete16:12
Shrewsubuntu-xenial-rax-dfw-*, centos-7-rax-dfw-*, ubuntu-bionic-rax-dfw-*16:12
clarkbprobably fine then16:13
Shrewswe don't have any holds that old16:13
*** mattw4 has joined #openstack-infra16:14
*** jamesmcarthur has joined #openstack-infra16:18
clarkbok this is actually really awkward. I keep thinking I can delete images that are bad and then they start getting reuploaded16:19
clarkbI'm going to stop deleting images and focus on queuing up image builds16:19
Shrewsstarting to delete old instances, from oldest to newest16:21
*** igordc has joined #openstack-infra16:21
*** jamesmcarthur has quit IRC16:22
clarkbonly nb02 is building an image according to dib-image-list but nb01 has a dib process running16:22
*** jamesmcarthur has joined #openstack-infra16:22
*** kopecmartin|scho is now known as kopecmartin|off16:24
clarkbI'm going to clean up nb01 and reboot and bring it back up again so that it communicates to zk hopefully16:24
clarkbShrews: I think we need a way to represent a state where we want remove an image from a particular provider (and not all providers) while allowing rebuilds16:25
clarkba different kind of pause basically16:25
*** pgaxatte has quit IRC16:26
*** pgaxatte has joined #openstack-infra16:27
clarkbbased on image build times I expect it will be towards the end of the day before we can turn rax back on16:34
clarkbif anyone has better ideas on how to convince nodepool to use older images after deleting newer images while still building replacement images that may speed it up16:34
*** ricolin has quit IRC16:41
*** hashar has quit IRC16:49
*** cjohnston has joined #openstack-infra16:53
Shrewsinfra-root: ok, rax ORD and DFW are both cleaned up of nodepool created instances that had an empty properties value. I have no idea how that could have happened, but most seemed to be from last year, and one as recently as Jan 2019. Probably something we may want to keep an eye on. I left a few that had some values in properties. Will have to look at those later after lunching.16:54
*** lucasagomes has quit IRC16:54
clarkbok both builders are building images now16:55
clarkbwe have ~8 images to build I think16:56
clarkbI have issued explicit built requests for all of the images that I believe need updating16:56
*** diablo_rojo has quit IRC16:57
*** rpittau is now known as rpittau|afk16:59
clarkbAJaeger: nicholas ^ fyi at this point we are just waiting for all the new images to be built using vhd-util so that rax xen doesn't complain16:59
clarkbwhen that is done we can update the ticket and request a revert on the quota change16:59
*** hashar has joined #openstack-infra16:59
AJaegerclarkb: thanks!17:01
*** openstackstatus has joined #openstack-infra17:03
*** ChanServ sets mode: +v openstackstatus17:03
clarkbAJaeger: statusbot is back now17:03
clarkb(I restarted it17:03
clarkb#status log Rax quota set to 0 while we rebuild images with vhd-util instead of qemu-img so that they can be resized in rax. Once images are updated we should request quote be reset back to normal17:04
openstackstatusclarkb: finished logging17:04
*** rfolco has joined #openstack-infra17:04
AJaegerwelcome back, openstackstatus ;)17:05
*** dayou has quit IRC17:12
clarkbthese are the images I think we need updates on after the current point in time. debian-stretch, debian-buster, fedora-29, gentoo-17, opensuse-150, opensuse-tumbleweed, ubuntu-bionic17:14
clarkbthe other images have been updated already17:14
clarkbgentoo and fedora-29 are building now and that should take just over an hour. figure ~4.5 hours or so probably for all of them?17:15
*** dayou has joined #openstack-infra17:15
clarkbI'm going to pop out for breakfast since we are waiting anyway, nodepool should build all of those images as it finishes others17:16
*** dpawlik has joined #openstack-infra17:23
*** Garyx_ has quit IRC17:25
fungiokay, back from lunch/errands and seeing lots of pings. will try to follow up in order17:26
*** pgaxatte has quit IRC17:28
*** weifan has joined #openstack-infra17:29
*** xek_ has quit IRC17:29
*** weifan has quit IRC17:31
fungiahh, looks like things have settled down and we're just watching for images to rebuild now17:31
*** weifan has joined #openstack-infra17:31
clarkblooks like nb02 disconnected from zk too?17:37
clarkbit is still building the gentoo iamge but dib-image-list has no record of it17:37
clarkbwe may want to clean up that server too17:37
*** jamesmcarthur has quit IRC17:38
*** jamesmcarthur has joined #openstack-infra17:39
clarkbya I'm going to get that started17:39
*** jamesmcarthur has quit IRC17:44
*** hashar has quit IRC17:44
*** dtantsur is now known as dtantsur|afk17:49
*** jamesmcarthur has joined #openstack-infra17:51
*** Garyx has joined #openstack-infra17:51
clarkbfedora-29 is uploading now17:52
clarkbbionic is building17:52
*** jamesmcarthur has quit IRC17:58
clarkbnb02 is back in the rotation18:00
*** jamesmcarthur has joined #openstack-infra18:00
*** hashar has joined #openstack-infra18:01
*** jamesmcarthur has quit IRC18:05
*** jerryz has joined #openstack-infra18:06
*** pcaruana has quit IRC18:10
*** jamesmcarthur has joined #openstack-infra18:13
openstackgerritMatt Riedemann proposed opendev/elastic-recheck master: Add query for nova functional test fail bug 1850682  https://review.opendev.org/69220318:13
openstackbug 1850682 in OpenStack Compute (nova) "functional tests in rocky randomly fail with "Build of instance was re-scheduled: Cannot modify readonly field uuid"" [Undecided,Confirmed] https://launchpad.net/bugs/185068218:13
*** eharney has quit IRC18:14
*** jamesmcarthur has quit IRC18:15
*** jamesmcarthur has joined #openstack-infra18:16
*** chandankumar is now known as raukadah18:17
*** dciabrin has joined #openstack-infra18:18
*** hwoarang has quit IRC18:21
*** hwoarang has joined #openstack-infra18:22
*** jamesmcarthur has quit IRC18:23
*** jamesmcarthur has joined #openstack-infra18:24
*** dpawlik has quit IRC18:26
*** ykarel|away has quit IRC18:31
*** aedc has quit IRC18:36
*** dpawlik has joined #openstack-infra18:37
nicholasclarkb: let me know if there's anything i can do to help out18:40
*** dciabrin has quit IRC18:42
*** Goneri has quit IRC18:46
*** jamesmcarthur has quit IRC18:58
*** jamesmcarthur has joined #openstack-infra18:58
*** tesseract has quit IRC19:03
*** jamesmcarthur has quit IRC19:03
*** eharney has joined #openstack-infra19:03
*** pcaruana has joined #openstack-infra19:05
*** dpawlik has quit IRC19:16
*** jamesmcarthur has joined #openstack-infra19:16
*** jaosorior has quit IRC19:21
*** prometheanfire has quit IRC19:21
*** prometheanfire has joined #openstack-infra19:21
*** hashar has quit IRC19:24
*** gfidente is now known as gfidente|afk19:28
*** pcaruana has quit IRC19:29
*** jamesmcarthur has quit IRC19:34
*** Goneri has joined #openstack-infra19:36
*** kjackal has quit IRC19:39
*** pcaruana has joined #openstack-infra19:39
*** kjackal_v2 has joined #openstack-infra19:39
efriedo/ infra19:40
efriedYesterday mordred helpfully held the node for one of the failing jobs here https://review.opendev.org/#/c/691980/ (not sure which one) and set up my ssh key.19:40
efriedBut I'm having trouble connecting, and could use some help...19:40
efriedThe problem is possibly that I can't IPv6 from here, but I'm not even sure how to figure that out (never IPv6ed in my life).19:41
clarkbefried: if on linux you want to run `ip addr` or `ifconfig` in a terminal to get interface and address listings you are looking for an ipv6 address that is marked global19:42
clarkbif you don't have one of those then no ipv6 global routing from your host19:42
clarkbif you do and it isn't working then we'll have to dig in more19:42
clarkbif on windows I think you can run ipconfig19:42
clarkbnot sure on osx19:43
efriedI'm on bionic19:43
*** aedc has joined #openstack-infra19:43
efriedip addr | grep global only shows a v4 addr19:43
clarkbthere is a good chance that you don't have ipv6 wherever you currently are then. I have this problem too and bounce through a cheap ovh instance. You can also set up a hurricane electric tunnel19:44
clarkbalso most cellphone networks are ipv6 enabled hwoever when you tether on them they tend to be ipv4 only :/19:44
efriedegads, all of those things sound complicated. Chances of me doing a lot of this are pretty slim; is it worth it for a one-off?19:44
clarkbif you want access to an instance in an ipv6 only cloud its your only option19:45
efriedI'm on home network (wifi to my cable modem)19:45
clarkbwe can't add ipv4 to those instances19:45
* clarkb double checks the hold list19:45
efriedokay, which thing is easiest?19:45
clarkbyes that is a fortnebula instance which is an ipv6 only cloud19:46
efriedoh, donnyd has been able to set up an ipv4 jump station for me in the past...19:46
efriedseemed to be able to do it by snapping his fingers19:46
donnydefried: what do you need19:46
efriedbut it's all magic from where I sit.19:46
efrieddonnyd: a way to ipv4 to...19:47
efried[pm'd]19:47
efriedclarkb: how much longer is that hold good for? Like another 2.5h or so?19:48
clarkbI don't think they expire by default19:48
clarkbshould be good until you tell us it isn't19:48
efriedokay, mordred said something about 24h19:49
clarkboh hrm let me double check on the zuul side then19:49
efriedFYI I'm unbreaking python-openstackclient builds, which went pear-shaped when something or other switched them to py3 (oops, they hadn't been testing under py3 prior to that)19:49
efriedand need the node because I can't run that thing locally19:50
clarkbautohold list doesn't show any timeout19:50
clarkbShrews: ^ how do we check that?19:50
clarkbinfra-root we are down to opensuse-tumbleweed, gentoo-17, debian-buster and debian-stretch images needing to go to rax19:51
clarkbI've been trying to delete the older images as the new ones update too to make it clear which are good19:51
efriedclarkb, Shrews: it may have just been speculation on mordred's part http://eavesdrop.openstack.org/irclogs/%23openstack-sdks/%23openstack-sdks.2019-10-29.log.html#t2019-10-29T22:22:58 -- this could be just a wild goose chase.19:51
donnydefried: finger snap complete19:58
efriedkablam19:58
sshnaidmcores, please review fix for os_image module in ansible: https://github.com/ansible/ansible/pull/6395920:05
ianwclarkb / everyone : sorry for the mess on that, i thought they'd clear faster20:07
clarkbianw: no worries. I think this is only weird/painful due to how nodepool works20:08
donnydlmk if there is anything else I can do to help :)20:08
clarkbmaybe we can brainstorm how to do that better at the summit/forum/ptg20:08
clarkbbasically we can't force nodepool to use an older image now and also build new images20:08
clarkbwhich would've soled the problem quickly20:08
ianwmordred: re our conversation, i have tried every permutation i can come up with creating vhd's with patched qemu and while i can get them to boot, they don't grow20:08
*** pcaruana has quit IRC20:09
ianwthe "only" difference is the vhd-utils based images have the "batmap" table (the xen extension not mentioned in the microsoft vpc spec)20:09
ianwalthough it looks like xen's resize utils should handle "legacy" images without it, it seems like they don't20:09
ianwi'm going to write this all up in a note so that we don't have to debug this far again20:10
* fungi imagines a note saying something to the effect of "this is here while we wait for xen to finally die"20:10
*** gfidente|afk has quit IRC20:11
fungiit's unfortunate that they would extend a specification in undocumented ways and not work to get support for that extension into mainstream tooling20:11
*** rfolco has quit IRC20:12
fungibut given the dwindling interest in xen, i have doubts anyone cares enough to solve that properly now20:12
*** jamesmcarthur has joined #openstack-infra20:13
*** aedc has quit IRC20:14
*** Goneri has quit IRC20:22
ianwfungi: yeah, the problem is that qemu-img creates a tempting close replica.  there's patches out there that add support for the batmap *and* overlay disks20:23
ianwit's all stuffed in random mailing list posts as is the development  model20:24
ianwI also found blktap sort of exists ... but has this rather comforting INSTALL note from 7+ years ago ... https://github.com/xapi-project/blktap/blob/master/INSTALL20:26
*** hashar has joined #openstack-infra20:28
*** jamesmcarthur has quit IRC20:29
*** adriant has quit IRC20:31
*** iokiwi has quit IRC20:31
fungiindeed, that really makes me want to use it20:34
*** diablo_rojo has joined #openstack-infra20:35
ianwnicholas: so can you see logs of instances booting on the rax side?20:39
*** goldyfruit___ has quit IRC20:42
*** ralonsoh has quit IRC20:46
*** iokiwi has joined #openstack-infra20:49
fungioh, also, mordred suggested there was no vhd-util in ubuntu, looks like it was dropped from debian/unstable in january: https://bugs.debian.org/91790720:49
openstackDebian bug 917907 in ftp.debian.org "RM: blktap -- RoQA; obsolete, FTBFS" [Normal,Open]20:49
*** jamesmcarthur has joined #openstack-infra20:50
ianwyeah, fair enough as upstream xen rf -rf'd the whole thing20:50
fungias a result the last ubuntu lts release to include a blktap-utils package was xenial: https://packages.ubuntu.com/blktap-utils20:51
*** dciabrin has joined #openstack-infra20:54
*** weifan has quit IRC20:54
*** markvoelker has quit IRC20:55
*** markvoelker has joined #openstack-infra20:55
*** hashar has quit IRC20:55
*** dciabrin_ has joined #openstack-infra20:58
ianwyeah, this whole thing is a yak shave from me thinking i want to have a bionic builder (and pabelanger note on the same)20:58
*** markvoelker has quit IRC20:59
*** jamesmcarthur has quit IRC21:00
*** weifan has joined #openstack-infra21:00
*** dciabrin has quit IRC21:02
*** hashar has joined #openstack-infra21:03
*** ociuhandu has joined #openstack-infra21:05
*** jamesmcarthur has joined #openstack-infra21:07
nicholasianw: yes21:08
*** jamesmcarthur_ has joined #openstack-infra21:09
ianwnicholas: so i'd be interested to see why the instance i've created with a patched qemu-img hasn't managed to grow it's disk, or if it tried21:09
ianwit's running @ 104.239.143.79 now21:10
ianwid is 1f535e76-3e8f-4d6b-80b4-06313f1a150c21:11
*** jamesmcarthur has quit IRC21:12
*** panda|pto has quit IRC21:14
nicholaswhich region is that21:14
clarkbwe are now waiting on buster and stretch. buster is uploading now and stretch is building21:14
clarkbianw: ^ fyi21:14
clarkbwe should be good to go in about 1.5 hours I think21:15
ianwnicholas: DFW21:15
clarkbI've tried to delete the bad images for all the other images so far as well21:15
*** panda has joined #openstack-infra21:18
*** jamesmcarthur_ has quit IRC21:24
nicholasianw: it looks like there are a few vbd's attached... grepping the UUIDs generates a ton of logs..21:24
nicholaswhich device? might help me target better21:24
ianwnicholas: hrm, that should just have one main disk21:25
ianwi has booted from the image 95e9b40b-4161-45e8-bbdb-02f0dc29bb9a21:25
ianw"ubuntu-bionic-qemu" ... it's one i generated with a patched qemu-img to set the creator to 'tap\0' with a low version number21:26
*** jamesmcarthur has joined #openstack-infra21:27
nicholasyeah i see the instance, it's showing 3 vbds: hda, xvde, hdd21:27
*** ociuhandu has quit IRC21:27
nicholasi'm not a xen guru fwiw :) i used to be on the ops/eng team back in the day but i've been on a different team for a few years now21:27
ianwumm, so xvde would be the ephemeral storage ... not that one.  hda i guess?21:27
ianw /dev/xvdd: UUID="2019-10-30-08-05-30-00" LABEL="config-2" TYPE="iso9660"21:28
ianwwould be configdrive21:28
*** jamesmcarthur has quit IRC21:29
*** ociuhandu has joined #openstack-infra21:29
jrosserperhaps some console streams may have gone awol again21:30
jrosseri believe this is running right now http://zuul.openstack.org/stream/8f6bfb46f4234d0ea5c02bb32207bcfb?logfile=console.log21:30
*** jamesmcarthur has joined #openstack-infra21:32
nicholasianw: my xen fu is rusty, but it appears to have successfully attempted it: [1739] ['/usr/bin/vhd-util', 'resize', '--debug', '-s', '40960', '-n', '/var/run/sr-mount/da64c4c1-e5b8-2ac7-b24a-ccad9e4cfdf7/600e1f41-d11c-4e45-b5f5-6acbdd24cb16.vhd', '-j', '.journal-600e1f41-d11c-4e45-b5f5-6acbdd24cb16']21:34
ianwxvda    202:0    0 16.3G  0 disk21:35
ianw└─xvda1 202:1    0 16.3G  0 part /21:35
ianwyeah, unfortunately the host doesn't seem to see that :/21:35
openstackgerritIan Wienand proposed openstack/diskimage-builder master: vhd-util : note on Xen/RAX images  https://review.opendev.org/69223421:36
ianwnicholas: ^ that's pretty much the summary at this point21:37
ianwclearly other customers are not banging down rax's door to create custom vhd images from raw files ...21:38
ianwi'm guessing this all works better from a windows environment21:38
*** iurygregory has quit IRC21:40
*** ociuhandu has quit IRC21:43
*** jamesmcarthur has quit IRC21:44
*** jamesmcarthur has joined #openstack-infra21:44
*** jmccrory has joined #openstack-infra21:46
*** jamesmcarthur has quit IRC21:49
*** jamesmcarthur has joined #openstack-infra21:50
nicholasianw: i need to head afk for the evening, but if you need any logs or anything, you can update the ticket and the ops folks can pull it for you21:51
ianwnicholas: np, thanks for the help21:53
clarkbjrosser: I don't knwo that we restarted any of the executors after we caught hte last failed set21:55
clarkbfungi: ^21:55
*** ociuhandu has joined #openstack-infra21:55
*** jamesmcarthur has quit IRC21:55
jrosserclarkb: ah i guess thats quite invasive?21:55
*** jamesmcarthur has joined #openstack-infra21:56
clarkbjrosser: it stops all running jobs and restarts them21:56
fungii thought they got restarted again after that for a new zuul version21:57
fungibut checking21:57
openstackgerritHarald Jensås proposed openstack/diskimage-builder master: WIP: Add IPv6 support in dhcp-all-interfaces  https://review.opendev.org/69211021:59
fungiyeah, they did, but since the last restart an oom incident has killed the finger listener process on ze09 and ze1222:00
*** jamesmcarthur has quit IRC22:00
fungithey were all restarted around 16:00z on 2019-10-2422:01
fungiso a little less than a week ago22:01
clarkbnow down to just stretch22:03
clarkbthe build for stretch should complete soon then we wait for uploads and I think we are good22:03
*** ociuhandu has quit IRC22:04
clarkband now uploading stretch22:09
*** ociuhandu has joined #openstack-infra22:10
*** hashar has quit IRC22:15
*** ociuhandu has quit IRC22:15
*** kjackal_v2 has quit IRC22:16
*** hashar has joined #openstack-infra22:17
*** ociuhandu has joined #openstack-infra22:20
efriedmordred, clarkb, donnyd: I'm done with that node and the v4 jumper. Thank you for the help!22:23
donnydAny time efried22:23
*** igordc has quit IRC22:25
*** weifan has quit IRC22:26
*** goldyfruit___ has joined #openstack-infra22:27
*** ociuhandu has quit IRC22:31
*** ociuhandu has joined #openstack-infra22:33
*** pkopec has quit IRC22:33
*** ociuhandu has quit IRC22:37
*** ociuhandu has joined #openstack-infra22:38
*** mriedem has quit IRC22:40
*** hashar has quit IRC22:42
*** ociuhandu has quit IRC22:43
*** jklare has quit IRC22:45
*** jklare has joined #openstack-infra22:48
*** dchen has joined #openstack-infra22:48
clarkbianw: nicholas fungi AJaeger I think we are good to go in rax now22:48
clarkbstretch miages uploaded and older ones deleted22:49
clarkbI'm not in a great spot to go update the ticket, maybe ianw can do that? but I think all our images should boot now22:49
ianwclarkb: will do22:50
*** adriant has joined #openstack-infra22:51
clarkbianw: note the fedora-30 images are ~20 days old so are unaffected (I'm guessing that is pre dnf stopped working)22:51
*** goldyfruit___ has quit IRC22:51
clarkbother than that all the images should be from within the last 12 hours or so22:51
ianwclarkb: yep, we can turn that back on now, the dnf fixes have made it to mainline22:52
*** pkopec has joined #openstack-infra22:52
*** markvoelker has joined #openstack-infra22:56
*** fresta has quit IRC22:59
*** fresta has joined #openstack-infra22:59
*** rlandy has quit IRC23:00
*** markvoelker has quit IRC23:00
*** tkajinam has joined #openstack-infra23:01
*** tkajinam has quit IRC23:01
ianwi updated the ticket, with links to https://review.opendev.org/692234 ... maybe rax have an interest in fixing this, maybe not ... happy to work with them if they do, we might be able to get qemu-img making compatible images23:03
*** tkajinam has joined #openstack-infra23:04
*** slaweq has quit IRC23:05
*** slaweq has joined #openstack-infra23:10
*** weifan has joined #openstack-infra23:11
*** mattw4 has quit IRC23:14
*** slaweq has quit IRC23:16
mordredclarkb: if you have a sec, feel like reviewing https://review.opendev.org/#/c/692003/ - which is the patch containing the fixes from your review of the first patch?23:18
clarkbdone23:19
mordredclarkb: woot. thanis23:21
mordredtahnks23:21
mordredzomg23:21
mordredT H A N K S23:21
clarkbdon't worry I can't type either :)23:21
clarkbianw: nicholas If I understand correctly we are waiting for rax to revert the quotas back to what they were now?23:32
fungithat sounds right based on my reading of scrollback from earlier23:36
*** ociuhandu has joined #openstack-infra23:43
openstackgerritClint 'SpamapS' Byrum proposed zuul/zuul-jobs master: Remove argument to ssh-keygen for key size  https://review.opendev.org/69224423:44
*** EmilienM has quit IRC23:47
*** ociuhandu has quit IRC23:48
*** EmilienM has joined #openstack-infra23:48
*** weifan has quit IRC23:59
*** jklare has quit IRC23:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!