Thursday, 2019-10-10

*** slaweq has quit IRC00:01
*** jamesmcarthur has joined #openstack-infra00:14
*** markvoelker has joined #openstack-infra00:19
*** jamesmcarthur has quit IRC00:21
*** jamesmcarthur has joined #openstack-infra00:26
*** markvoelker has quit IRC00:30
*** diablo_rojo has joined #openstack-infra00:41
*** goldyfruit has joined #openstack-infra00:43
*** hongbin has joined #openstack-infra00:52
*** jamesmcarthur has quit IRC00:53
*** jamesmcarthur has joined #openstack-infra00:54
*** dychen has quit IRC00:58
*** jamesmcarthur has quit IRC01:01
*** hwoarang has quit IRC01:03
*** slaweq has joined #openstack-infra01:11
*** yamamoto has quit IRC01:12
*** hwoarang has joined #openstack-infra01:12
*** jamesmcarthur has joined #openstack-infra01:15
*** slaweq has quit IRC01:15
*** yamamoto has joined #openstack-infra01:17
*** jamesmcarthur has quit IRC01:28
*** markvoelker has joined #openstack-infra01:33
*** michael-beaver has quit IRC01:46
*** jamesmcarthur has joined #openstack-infra02:01
*** apetrich has quit IRC02:09
*** threestrands has joined #openstack-infra02:13
*** jamesmcarthur has quit IRC02:30
*** jamesmcarthur_ has joined #openstack-infra02:30
*** roman_g has quit IRC02:33
*** markvoelker has quit IRC02:40
*** jamesmcarthur_ has quit IRC02:45
*** ricolin has joined #openstack-infra02:52
*** ramishra has joined #openstack-infra02:53
*** larainema has joined #openstack-infra02:53
*** jamesmcarthur has joined #openstack-infra02:53
*** xinranwang has joined #openstack-infra02:57
*** slaweq has joined #openstack-infra03:11
*** rh-jelabarre has joined #openstack-infra03:11
*** yamamoto has quit IRC03:14
*** yamamoto has joined #openstack-infra03:15
*** slaweq has quit IRC03:15
*** diablo_rojo has quit IRC03:18
*** diablo_rojo has joined #openstack-infra03:19
*** yamamoto has quit IRC03:20
*** hongbin has quit IRC03:23
*** dchen has quit IRC03:24
*** dchen has joined #openstack-infra03:24
*** psachin has joined #openstack-infra03:34
*** goldyfruit has quit IRC03:48
*** hongbin has joined #openstack-infra03:49
*** udesale has joined #openstack-infra04:00
*** ramishra has quit IRC04:00
*** whoami-rajat has joined #openstack-infra04:02
*** jamesmcarthur has quit IRC04:04
*** slaweq has joined #openstack-infra04:11
*** ykarel has joined #openstack-infra04:13
*** rh-jelabarre has quit IRC04:14
*** slaweq has quit IRC04:16
*** ramishra has joined #openstack-infra04:17
*** yamamoto has joined #openstack-infra04:21
*** ociuhandu has joined #openstack-infra04:30
*** ociuhandu has quit IRC04:35
*** kjackal has joined #openstack-infra04:37
*** hwoarang has quit IRC04:37
*** hwoarang has joined #openstack-infra04:38
*** hongbin has quit IRC04:41
*** markvoelker has joined #openstack-infra04:41
*** markvoelker has quit IRC04:46
*** xenos76 has joined #openstack-infra04:48
*** dave-mccowan has quit IRC04:51
*** rakhmerov has joined #openstack-infra04:52
*** xenos76 has quit IRC05:00
*** pcaruana has joined #openstack-infra05:07
*** xenos76 has joined #openstack-infra05:09
*** slaweq has joined #openstack-infra05:11
*** slaweq has quit IRC05:15
*** yamamoto has quit IRC05:16
*** odicha has joined #openstack-infra05:16
ianwtristanC / clarkb / donnyd : dropped a comment in https://review.opendev.org/#/c/686749/11 but after testing today, I'm fairly convinced this is a NM issue where it times out waiting for a permanent link-local address and then gives up trying to configure ipv605:16
ianwI have filed : https://bugzilla.redhat.com/show_bug.cgi?id=176017905:16
openstackbugzilla.redhat.com bug 1760179 in NetworkManager "IPv6 address never assigned, possibly "linklocal6: waiting for link-local addresses failed due to timeout"" [Unspecified,New] - Assigned to lkundrak05:16
ianwi'm out for today, but I think that we might have luck making glean just wait a bit to make sure DAD has happened and the link-local address is permanent before starting networkmanager05:17
*** yamamoto has joined #openstack-infra05:17
openstackgerritSimon Westphahl proposed zuul/nodepool master: Sort waiting static nodes by creation time  https://review.opendev.org/68727105:19
openstackgerritMerged zuul/zuul master: Include session expired reason in API fetch error message.  https://review.opendev.org/68697605:24
openstackgerritMerged zuul/zuul master: Ensure tenant web_root url has a trailing slash  https://review.opendev.org/67682605:28
*** pcaruana has quit IRC05:35
*** kjackal has quit IRC05:36
*** jtomasek has quit IRC05:50
*** jaosorior has joined #openstack-infra06:04
*** roman_g has joined #openstack-infra06:10
*** pgaxatte has joined #openstack-infra06:20
*** surpatil has joined #openstack-infra06:21
*** yamamoto has quit IRC06:22
*** yamamoto has joined #openstack-infra06:25
*** kjackal has joined #openstack-infra06:34
*** threestrands has quit IRC06:36
*** threestrands has joined #openstack-infra06:36
*** iurygregory has joined #openstack-infra06:38
*** threestrands has quit IRC06:41
*** hwoarang has quit IRC06:49
*** pcaruana has joined #openstack-infra06:51
*** yamamoto has quit IRC06:53
openstackgerritSimon Westphahl proposed zuul/zuul master: Spec for allowing circular dependencies  https://review.opendev.org/64330906:54
openstackgerritSimon Westphahl proposed zuul/zuul master: Add optional support for circular dependencies  https://review.opendev.org/68535406:55
*** jaosorior has quit IRC06:57
*** yamamoto has joined #openstack-infra06:58
*** zhangfei has joined #openstack-infra06:58
*** tesseract has joined #openstack-infra06:59
*** hwoarang has joined #openstack-infra07:01
*** kopecmartin|off is now known as kopecmartin07:02
*** slaweq has joined #openstack-infra07:02
*** gfidente has joined #openstack-infra07:02
*** ykarel is now known as ykarel|lunch07:04
*** xinranwang has quit IRC07:07
*** rcernin has quit IRC07:07
*** ccamacho has joined #openstack-infra07:08
*** ccamacho has quit IRC07:09
*** ccamacho has joined #openstack-infra07:09
*** ricolin has quit IRC07:10
*** tosky has joined #openstack-infra07:12
*** jpena|off is now known as jpena07:13
openstackgerritSimon Westphahl proposed zuul/nodepool master: Sort waiting static nodes by creation time  https://review.opendev.org/68727107:19
openstackgerritSimon Westphahl proposed zuul/nodepool master: Don't touch static nodes that are allocated  https://review.opendev.org/68780607:19
*** pkopec has joined #openstack-infra07:23
*** FlorianFa has quit IRC07:24
*** Florian has joined #openstack-infra07:25
*** yamamoto has quit IRC07:25
*** eernst has joined #openstack-infra07:28
*** yamamoto has joined #openstack-infra07:33
*** zbr has joined #openstack-infra07:33
*** elod has quit IRC07:36
*** apetrich has joined #openstack-infra07:43
*** elod has joined #openstack-infra07:44
*** zbr has quit IRC07:44
*** eernst has quit IRC07:56
*** trident has quit IRC07:58
*** trident has joined #openstack-infra08:01
*** zbr has joined #openstack-infra08:02
*** ralonsoh has joined #openstack-infra08:02
*** ociuhandu has joined #openstack-infra08:03
openstackgerritSimon Westphahl proposed zuul/nodepool master: Don't touch static nodes that are allocated  https://review.opendev.org/68780608:03
openstackgerritSimon Westphahl proposed zuul/nodepool master: Sort waiting static nodes by creation time  https://review.opendev.org/68727108:03
*** lucasagomes has joined #openstack-infra08:04
*** ociuhandu has quit IRC08:08
*** tkajinam has quit IRC08:10
*** kjackal_v2 has joined #openstack-infra08:10
*** kjackal has quit IRC08:11
*** arxcruz|rover is now known as arxcruz08:11
*** rpittau|afk is now known as rpittau08:13
fricklerprometheanfire: can you take a look at https://review.opendev.org/682635 please? gentoo dib tests are failing for some time now08:25
fricklersee also https://review.opendev.org/68263908:26
*** yamamoto has quit IRC08:29
*** derekh has joined #openstack-infra08:31
*** yamamoto has joined #openstack-infra08:31
*** jtomasek has joined #openstack-infra08:33
*** dchen has quit IRC08:36
*** markvoelker has joined #openstack-infra08:44
*** yamamoto has quit IRC08:48
*** diablo_rojo has quit IRC08:49
*** markvoelker has quit IRC08:50
*** lennyb has quit IRC08:57
*** lennyb has joined #openstack-infra08:58
*** dtantsur|afk is now known as dtantsur09:01
*** diablo_rojo has joined #openstack-infra09:05
*** e0ne has joined #openstack-infra09:06
*** Florian has quit IRC09:19
*** FlorianFa has joined #openstack-infra09:19
openstackgerritMatthieu Huin proposed zuul/zuul master: Zuul Web: add /api/user/authorizations endpoint  https://review.opendev.org/64109909:23
openstackgerritMatthieu Huin proposed zuul/zuul master: Reduce sleep to avoid race conditions  https://review.opendev.org/68472609:24
*** ykarel|lunch is now known as ykarel09:26
*** yamamoto has joined #openstack-infra09:29
*** ricolin has joined #openstack-infra09:30
*** gfidente has quit IRC09:33
*** gfidente has joined #openstack-infra09:42
*** diablo_rojo has quit IRC09:47
*** udesale has quit IRC09:52
*** udesale has joined #openstack-infra09:53
*** ykarel is now known as ykarel|afk09:54
*** udesale has quit IRC10:00
*** udesale has joined #openstack-infra10:01
*** ociuhandu has joined #openstack-infra10:03
*** ociuhandu has quit IRC10:19
*** ociuhandu has joined #openstack-infra10:28
openstackgerritThierry Carrez proposed opendev/puppet-ptgbot master: Deploy the etherpads.html file  https://review.opendev.org/68785010:31
openstackgerritThierry Carrez proposed opendev/puppet-ptgbot master: Deploy glyphicon font files  https://review.opendev.org/68785110:31
*** xek_ has joined #openstack-infra10:34
openstackgerritMerged opendev/irc-meetings master: Update StoryBoard meeting day/time  https://review.opendev.org/68764410:43
*** zhangfei has quit IRC10:51
*** slaweq_ has joined #openstack-infra10:57
*** ociuhandu has quit IRC10:59
*** slaweq has quit IRC10:59
openstackgerritMerged zuul/nodepool master: Add port-cleanup-interval config option  https://review.opendev.org/68702411:00
*** jpena is now known as jpena|lunch11:04
*** udesale has quit IRC11:09
*** yamamoto has quit IRC11:24
*** jhesketh has quit IRC11:29
*** ykarel|afk is now known as ykarel11:32
*** larainema has quit IRC11:34
*** ociuhandu has joined #openstack-infra11:37
*** ociuhandu has quit IRC11:42
*** jhesketh has joined #openstack-infra11:43
*** weshay|ruck is now known as weshay11:48
*** ociuhandu has joined #openstack-infra11:48
*** yamamoto has joined #openstack-infra11:50
*** jpena|lunch is now known as jpena11:58
*** rh-jelabarre has joined #openstack-infra12:02
*** goldyfruit has joined #openstack-infra12:03
*** rfolco has joined #openstack-infra12:05
*** rfolco is now known as rfolco|ruck12:07
openstackgerritSimon Westphahl proposed zuul/nodepool master: Don't touch static nodes that are allocated  https://review.opendev.org/68780612:10
openstackgerritSimon Westphahl proposed zuul/nodepool master: Sort waiting static nodes by creation time  https://review.opendev.org/68727112:10
*** AJaeger has quit IRC12:12
*** goldyfruit has quit IRC12:14
*** rlandy has joined #openstack-infra12:15
*** AJaeger has joined #openstack-infra12:17
*** markvoelker has joined #openstack-infra12:21
*** Goneri has joined #openstack-infra12:24
*** derekh has quit IRC12:24
*** tmorin has joined #openstack-infra12:28
tmorinhi folks (infra-root)12:32
tmorinI have a change that's +1+W but not being merged (stuck in "ready to submit" state, the one change it depends on has been merged months ago)12:32
tmorineven after rechecks (quite a few) and trying to go again through +1+W from the initial state...12:32
tmorinI'm hoping someone could perhaps check what is happening  ?  https://review.opendev.org/#/c/63642212:33
tmorin^^ slaweq_12:34
*** jaosorior has joined #openstack-infra12:34
*** tmorin has quit IRC12:34
*** tmorin has joined #openstack-infra12:34
fricklertmorin: I think you need to rebase that change, it is on top of https://review.opendev.org/#/c/636962/1 while PS3 of that has merged12:36
slaweq_tmorin: I just rebased https://review.opendev.org/#/c/63642212:45
tmorinthanks frickler12:45
tmorinthanks slaweq_ , I just saw that12:45
tmorinfrickler: aren't there cases (most ?) where gerrit can be smart enough to rebase on its own ?12:46
*** mriedem has joined #openstack-infra12:50
*** markvoelker has quit IRC12:51
openstackgerritJeremy Stanley proposed opendev/puppet-openstack_infra_spec_helper master: Block minitest 5.12.1  https://review.opendev.org/68788412:51
fungitmorin: not when the explicit parent of the change is an outdated patchset12:52
fungibecause that parent will never appear in the git history12:53
fungigerrit doesn't rebase changes, it only merges them12:53
fungiand it can't merge a change which has a parent that isn't in the repository12:53
fungior at least isn't in that branch12:53
*** udesale has joined #openstack-infra12:55
*** aaronsheffield has joined #openstack-infra12:56
*** dtantsur is now known as dtantsur|afk12:57
*** ihti has quit IRC12:58
*** anteaya has quit IRC13:00
*** ihti has joined #openstack-infra13:01
openstackgerritJeremy Stanley proposed opendev/puppet-ptgbot master: Deploy the etherpads.html file  https://review.opendev.org/68785013:05
openstackgerritJeremy Stanley proposed opendev/puppet-ptgbot master: Deploy glyphicon font files  https://review.opendev.org/68785113:06
*** dpawlik has joined #openstack-infra13:13
*** priteau has joined #openstack-infra13:13
*** trident has quit IRC13:14
*** trident has joined #openstack-infra13:15
*** psachin has quit IRC13:15
*** michael-beaver has joined #openstack-infra13:18
*** dpawlik has quit IRC13:18
tmorinthanks for the explanation frickler, fungi!13:22
*** dpawlik has joined #openstack-infra13:22
*** goldyfruit has joined #openstack-infra13:23
openstackgerritMonty Taylor proposed zuul/zuul-registry master: HEAD object after PUT  https://review.opendev.org/68768113:30
fungiinfra-puppet-core: can i get an expedited approval on a gem pin in https://review.opendev.org/687884 to fix our centos-7 puppet jobs?13:31
pabelanger+213:31
fungijob results on latest patchsets of 687850 and 687851 show it's working13:32
*** david-lyle is now known as dklyle13:33
*** goldyfruit has quit IRC13:35
*** goldyfruit has joined #openstack-infra13:37
openstackgerritMonty Taylor proposed zuul/zuul-registry master: HEAD object after PUT  https://review.opendev.org/68768113:40
fungithanks pabelanger!13:43
fungii went ahead and self-approved so we don't block puppet module changes13:43
*** tmorin has left #openstack-infra13:43
*** dave-mccowan has joined #openstack-infra13:43
*** eharney has joined #openstack-infra13:47
*** rkukura_ has joined #openstack-infra13:52
*** tosky has quit IRC13:55
*** rkukura has quit IRC13:55
*** rkukura_ is now known as rkukura13:55
*** yamamoto has quit IRC13:56
*** ccamacho has quit IRC13:56
*** ccamacho has joined #openstack-infra13:56
openstackgerritMerged opendev/puppet-openstack_infra_spec_helper master: Block minitest 5.12.1  https://review.opendev.org/68788413:56
*** spsurya has joined #openstack-infra13:57
*** diablo_rojo has joined #openstack-infra14:00
*** surpatil has quit IRC14:00
*** dklyle has quit IRC14:02
*** dklyle has joined #openstack-infra14:04
*** mriedem has quit IRC14:04
*** mriedem has joined #openstack-infra14:05
*** georgk has quit IRC14:06
*** fdegir has quit IRC14:06
openstackgerritMerged opendev/puppet-ptgbot master: Deploy the etherpads.html file  https://review.opendev.org/68785014:06
openstackgerritMerged opendev/puppet-ptgbot master: Deploy glyphicon font files  https://review.opendev.org/68785114:06
*** georgk has joined #openstack-infra14:07
*** fdegir has joined #openstack-infra14:07
*** odicha has quit IRC14:10
*** sreejithp has joined #openstack-infra14:13
*** ociuhandu has quit IRC14:14
*** markvoelker has joined #openstack-infra14:15
*** adriant has quit IRC14:29
*** iokiwi has quit IRC14:29
*** adriant has joined #openstack-infra14:31
*** iokiwi has joined #openstack-infra14:31
*** yamamoto has joined #openstack-infra14:36
*** jpena is now known as jpena|off14:39
*** pcaruana has quit IRC14:39
*** yamamoto has quit IRC14:41
*** dave-mccowan has quit IRC14:41
*** chandankumar is now known as raukadah14:43
*** pgaxatte has quit IRC14:43
*** xenos76 has quit IRC14:44
*** xenos76 has joined #openstack-infra14:45
*** ociuhandu has joined #openstack-infra14:48
*** jamesmcarthur has joined #openstack-infra14:52
*** ociuhandu has quit IRC14:53
*** yamamoto has joined #openstack-infra14:55
*** ociuhandu has joined #openstack-infra14:58
*** xenos76 has quit IRC15:00
openstackgerritFrode Nordahl proposed openstack/project-config master: Add OVN charms  https://review.opendev.org/68792515:00
*** pcaruana has joined #openstack-infra15:01
*** xenos76 has joined #openstack-infra15:01
AJaegerconfig-core, please review ianw's CentOS 8 stack starting at https://review.opendev.org/#/c/68744515:06
openstackgerritSean McGinnis proposed openstack/project-config master: Add stable notifications to openstack-glance  https://review.opendev.org/68793115:10
*** ykarel is now known as ykarel|afk15:11
*** ociuhandu has quit IRC15:12
openstackgerritFrode Nordahl proposed openstack/project-config master: Add OVN charms  https://review.opendev.org/68792515:17
*** ociuhandu has joined #openstack-infra15:18
AJaegerthanks, mnaser !15:21
mnaserinfra-root: i think it would be good if someone +2'd this and watched it -- https://review.opendev.org/#/c/687453/215:22
mnasernp AJaeger15:22
*** gyee has joined #openstack-infra15:22
*** pcaruana has quit IRC15:26
fungiapproved, i'll set a reminder to check the image build log once that's deployed15:27
AJaegerthanks, fungi! I'm sure ianw will check as well once he's awake ;)15:29
*** eernst has joined #openstack-infra15:29
*** zbr has quit IRC15:29
*** ociuhandu has quit IRC15:30
*** zbr has joined #openstack-infra15:31
openstackgerritMerged openstack/project-config master: infra-pkg-needs: Update pkg-maps for CentOS 8, select chronyd  https://review.opendev.org/68744515:32
openstackgerritMerged openstack/project-config master: zuul-worker: no selinux python2 libs on CentOS 8  https://review.opendev.org/68744615:32
openstackgerritMerged openstack/project-config master: infra-package-needs: fix haveged install for all CentOS releases  https://review.opendev.org/68744715:32
openstackgerritMerged openstack/project-config master: nodepool/elements : use abstracted commands  https://review.opendev.org/68652415:33
openstackgerritMerged openstack/project-config master: Remove explicit set of DIB_SIMPLE_INIT_NETWORKMANAGER  https://review.opendev.org/68745215:33
*** yamamoto has quit IRC15:34
*** slaweq_ is now known as slaweq15:34
*** ociuhandu has joined #openstack-infra15:35
prometheanfirefungi: yep, looks good15:37
openstackgerritMerged openstack/project-config master: CentOS 8 initial deployment  https://review.opendev.org/68745315:40
corvusfungi, mordred, clarkb: the gerrit maintainers would like us to take a lok at https://review.opendev.org/68553315:41
*** jtomasek has quit IRC15:41
*** ykarel|afk is now known as ykarel15:41
*** dave-mccowan has joined #openstack-infra15:42
clarkbcorvus: it is avalid feature in many gerrit installs, wouldnt itbe better to accept the flag and fail if the gerrit cant support it rather than remove it entirely?15:43
*** lucasagomes has quit IRC15:45
*** ociuhandu has quit IRC15:45
fungiseems like it's already basically deprecated in gerrit 2.15, so suggesting that folks who need to use that feature on an older gerrit deployment should avoid upgrading git-review could make sense15:46
*** rpittau is now known as rpittau|afk15:47
*** ociuhandu has joined #openstack-infra15:48
mordredI'm torn - I like supporting older things - but even in older gerrits it's a feature that doesn't exactly do what people think it does15:48
clarkbya, but we disable it in our gerrit and return an error to git review15:49
clarkbwe didnt rm it from git review15:49
mordredya15:49
*** Goneri has quit IRC15:51
*** kmalloc has left #openstack-infra15:52
corvusother options: keeping it around until 3.1 is the oldest supported release?  emitting a warning that it's deprecated and will be removed?15:53
fungiwe don't seem to test it, so not even sure if that feature is actually working15:53
fungiat a minimum it deserves a release note, but sure a deprecation warning, and then removing at the following release would be gentler15:54
*** roman_g has quit IRC15:54
*** jaosorior has quit IRC15:55
*** ociuhandu has quit IRC15:56
*** roman_g has joined #openstack-infra16:01
*** eernst has quit IRC16:05
*** vkmc has joined #openstack-infra16:06
*** yamamoto has joined #openstack-infra16:08
*** mriedem is now known as mriedem_lunch16:13
*** yamamoto has quit IRC16:15
*** udesale has quit IRC16:19
*** jpena|off is now known as jpena16:19
*** igordc has joined #openstack-infra16:20
*** Goneri has joined #openstack-infra16:22
*** dklyle has quit IRC16:30
*** david-lyle has joined #openstack-infra16:30
*** kopecmartin is now known as kopecmartin|off16:31
*** david-lyle is now known as dklyle16:31
*** ociuhandu has joined #openstack-infra16:33
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add docker buildset test  https://review.opendev.org/68795316:34
*** Goneri has quit IRC16:34
*** ociuhandu has quit IRC16:38
*** ccamacho has quit IRC16:39
*** pcaruana has joined #openstack-infra16:40
*** dpawlik has quit IRC16:48
fungihttps://nb01.openstack.org/centos-8-0000000001.log16:53
fungiianw: we have centos-8 images, it looks like16:53
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Run docker and podman push/pull tests  https://review.opendev.org/68769216:54
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add docker buildset test  https://review.opendev.org/68795316:55
pabelangerfungi: ianw: Ooooh, nice!16:55
fungistill uploading in all providers, but i'll see if we get any nodes building once they populate16:55
pabelangerYah, that would be cool. If works out of box, we'll totally at it to zuul.a.c to test too16:56
*** e0ne has quit IRC16:56
*** jamesmcarthur has quit IRC17:03
*** jamesmcarthur_ has joined #openstack-infra17:04
*** gfidente has quit IRC17:07
*** rlandy is now known as rlandy|brb17:08
*** ociuhandu has joined #openstack-infra17:10
*** ykarel is now known as ykarel|away17:11
corvusfungi, mordred, pabelanger: the example ansible facts in the documentation looks familiar: https://docs.ansible.com/ansible/latest/user_guide/playbooks_variables.html#variables-discovered-from-systems-facts17:15
pabelangerindeed17:15
corvusthat's a great idea -- and they could have redacted way less info :)17:16
corvusgo ahead and throw in those ssh host keys17:16
clarkbha17:17
corvusanyway, i was going to go look up how to get the uid of the user zuul was running as, and i end up getting the actual value in the docs!  that's some spot-on documentation17:17
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Run docker and podman push/pull tests  https://review.opendev.org/68769217:21
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add docker buildset test  https://review.opendev.org/68795317:21
openstackgerritAdam Coldrick proposed opendev/storyboard-webclient master: Adds Migration Docs to Dashboard  https://review.opendev.org/68023517:22
*** priteau has quit IRC17:23
*** dpawlik has joined #openstack-infra17:25
*** Goneri has joined #openstack-infra17:27
fungicentos-8 images have gone to a ready state in rax-dfw and rax-ord so far17:29
*** dpawlik has quit IRC17:29
clarkbdo we expect them to have the same NM problems on FN and limestone?17:29
clarkbalso any idea if further debugging was done there?17:29
fungithe first min-ready node is building in ord now17:30
clarkbI'm about to page all that back in and look at booting some upstream images for comparison17:30
fungiclarkb: not sure17:30
fungiclarkb: see overnight scrollback from ianw though, he opened an upstream bug i think17:30
clarkbooh this is excellent reading17:31
*** mriedem_lunch is now known as mriedem17:34
*** ykarel|away has quit IRC17:35
fungii think the centos-8 image isn't booting successfully in rax-ord17:37
clarkbI wonder if the solicitation delay affects things with that timeout in ianw's bug17:37
clarkbI'm going to build an image without that delay being updated17:37
fungifalse alarm. may have been an nova cache update delay. this time it went ready! 104.130.211.12 2001:4801:7827:102:be76:4eff:fe10:6c9017:38
*** ociuhandu has quit IRC17:38
*** ociuhandu has joined #openstack-infra17:39
*** ricolin has quit IRC17:39
fungii'm timing out ssh'ing into it via ipv6 though17:39
fungiand ipv4 for that matter17:39
fungican't establish a socket on 22/tcp17:40
fungiand no replies to icmp echo request17:41
clarkbthat could be the NM issue17:41
clarkbbecause current glean can't configure ipv6 on rax on centos17:41
fungioh, it got deleted17:41
clarkband ipv4 is what breaks with current glean17:41
*** Goneri has quit IRC17:41
fungiand now building again17:41
fungiwhy would it have gone ready?17:41
clarkbthat I do not knlow17:42
fungiseems like the launcher shouldn't have listed it as ready if it was just going to delete it17:42
fungithe next build went straight to deleting17:43
fungi| 0012251618 | rax-iad              | centos-8             | 54abb4e5-c42e-41c8-a3aa-3174392c8a84 | 104.130.4.224   | 2001:4802:7802:104:be76:4eff:fe20:e39  | deleting | 00:00:00:04  | unlocked |17:43
clarkbis it timing out against ssh?17:43
fungiso we probably need to get a console log17:43
clarkbya probably boot one by hand and check the console17:43
*** jamesmcarthur_ has quit IRC17:44
fungino, strangely the launcher log just says it's deleting an unused node, after doing the full dance to collect the host key17:44
fungiso nodepool thinks it should boot the node, but also thinks it should delete it17:44
fungi?!?17:44
*** rlandy|brb is now known as rlandy17:45
fungihttp://paste.openstack.org/show/782735/17:45
pabelangeryou can have nodepool collection console log, via api, if cloud supports it17:45
clarkbpabelanger: rax does not support it17:46
fungiyeah, i actually suspect there's nothing wrong with the node it booted though17:46
pabelangerclarkb: ah, only rax is failing?17:46
clarkbfungi: nodepool only boots based on min ready and demand17:46
fungipabelanger: only rax has tried to boot it so far17:46
clarkbfungi: maybe min ready weirdness?17:46
clarkbfungi: ya I think rax is where we satisfy min ready by default so that makes sense17:46
fungimin-ready is set to 1, or it presumably wouldn't be booting any at all17:46
*** ramishra has quit IRC17:51
clarkbok removing the RA delay sysctl setting 5/5 instances get working ipv6 on centos on fn17:53
clarkbnow checking to see if they all got ipv4 configured too17:53
clarkbyup they all have ipv4. I'm going to test fedora next17:55
clarkbI think this may be what causes us to tickle the bug that ianw filed17:55
clarkband I guess having explicit config for ipv6 in NM causes it to not ignore interfaces as we had hoped17:55
clarkbif that is the case I think we update glean and dib and tag them together at roughly the same time17:55
*** jpena is now known as jpena|off17:57
clarkbI'll need to test these centos and fedora images on all the clouds too probably17:57
clarkbsince they all use a slightly different varient of glean behavior :(17:57
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: WIP: experimenting with using ZK for fake driver  https://review.opendev.org/68715017:59
*** ociuhandu has quit IRC17:59
*** jamesmcarthur has joined #openstack-infra18:00
*** ccamacho has joined #openstack-infra18:02
clarkbok 5/5 ipv6 setups work on fedora29 too without solicitation delay. only 3/5 ipv4 setups work18:13
clarkbit feels like we can have ipv6 or ipv4 but if you'd like to have both then you need to look in another castle18:13
clarkbianw: ^ to tl;dr removing the router solicitation delay seems to fix ipv6 configuration, but we go back to having problems with ipv4 in some cases18:15
*** efried is now known as efried_pto18:16
*** igordc has quit IRC18:19
*** ykarel|away has joined #openstack-infra18:23
pabelangerlooking at https://launchpad.net/~openstack-ci-core/+archive/ubuntu/vhd-util we don't have bionic packages, which is needed for DIB / and rackspace.  Could we try to rebuild xenial dpkg for bionic?18:28
fungipabelanger: and there's not one included directly in bionic/universe now?18:30
pabelangerfungi: no, I think we carry an out of tree patch, IIRC18:30
fungiahh18:30
pabelangerwhen I last tried to use vhd-util directly for vhd, I don't believe it worked18:31
clarkbya its an out of tree patch :/18:33
clarkbupstream fedora-29 image takes forever to bring up networking to the point where I thought it had failed18:33
clarkbhowever it does bring up both ipv4 and ipv6 with cloud init (at least on a single attempt I need to boot a bunch more tests since fail rate seems to be ~40%)18:33
clarkbI notice that it does not explicitly configure ipv6 in sysconfig and the only ipv4 option we don't use is the one for persistent dhcp18:34
clarkbit also doesn't set NM_CONTROLLED=yes but nmcli implies it is actually NM controlled18:34
clarkbit is possible that PERSISTENT_DHCLIENT is the behavior change we need for ipv4 so I will be testing that after lunch18:35
*** yamamoto has joined #openstack-infra18:35
*** e0ne has joined #openstack-infra18:37
*** yamamoto has quit IRC18:40
*** e0ne has quit IRC18:41
*** Goneri has joined #openstack-infra18:44
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: Fix builder shutdown race in tests  https://review.opendev.org/68796518:49
openstackgerritMerged opendev/storyboard-webclient master: Adds Migration Docs to Dashboard  https://review.opendev.org/68023518:53
openstackgerritMerged opendev/storyboard master: Link development.rst to contributing.rst  https://review.opendev.org/64596018:56
*** prometheanfire has quit IRC18:57
*** prometheanfire has joined #openstack-infra18:58
openstackgerritFrode Nordahl proposed openstack/project-config master: Add OVN charms  https://review.opendev.org/68792518:59
*** ykarel|away has quit IRC19:00
fungiafter moving logs to swift (i think) the build-javascript-content job result for opendev/storyboard-webclient has stopped being usable for anything involving interactions with the storybaord-dev.o.o api or authenticating with openid: https://99957bd7ffedb79bb17e-02cf1f4ef0de29ab49209009be295d1d.ssl.cf2.rackcdn.com/680235/2/gate/build-javascript-content/4fb6c68/npm/html/19:02
fungiwe did something similar to solve those sorts of problems for the zuul dashboard preview builds, right?19:03
clarkbthere is the zuul proxy thing but that mostly has to do with rooting the uris at /19:04
openstackgerritFrode Nordahl proposed openstack/project-config master: Add OVN charms  https://review.opendev.org/68792519:07
*** yamamoto has joined #openstack-infra19:07
*** kjackal_v2 has quit IRC19:08
openstackgerritFrode Nordahl proposed openstack/project-config master: Add OVN charms  https://review.opendev.org/68792519:09
*** kjackal has joined #openstack-infra19:11
*** whoami-rajat has quit IRC19:12
*** yamamoto has quit IRC19:12
openstackgerritMerged zuul/nodepool master: Don't touch static nodes that are allocated  https://review.opendev.org/68780619:26
*** ociuhandu has joined #openstack-infra19:26
*** bnemec has quit IRC19:29
*** pkopec has quit IRC19:29
*** bnemec has joined #openstack-infra19:30
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: Fix builder shutdown race in tests  https://review.opendev.org/68796519:32
fungioh, yeah hrm...19:34
fungiin this case it's more a problem of cors permission i think19:34
openstackgerritMerged zuul/nodepool master: Sort waiting static nodes by creation time  https://review.opendev.org/68727119:38
*** Goneri has quit IRC19:40
clarkbOk I didn't end up finding lunch and just went ahead and tested adding PERSISTENT_DHCLIENT. It seems to have been more reliable. My first 6 boots of fedora nd centos each worked (12 total boots)19:41
clarkbthen I wrote a script that would boot fedora, ssh in via ipv6 and check ipv4 in a loop and that caught a failure almost immediately19:42
clarkbThe difference between upstream images and ours must be in boot timing/races or some other network manager config19:42
*** yamamoto has joined #openstack-infra19:43
clarkbI think the next step is to enable NM debug logging and then reproduce, but I'm running out of steam on this19:43
clarkbif it is still a valid option I Think we should consider not using NM19:44
fungiyeah, it doesn't seem well-suited to this use case19:45
clarkbif NM is required (I think that was the concern that newer fedora/rhel/centos would require it) then we need to probably have a heart to heart with upstream19:46
clarkbthe docs are really bad ( like really bad ), the behavior is unexpected and not logged (when it decides to ignore an interface you've explcitly told it to not ignore via NM_MANAGED=yes and similar config)19:47
*** yamamoto has quit IRC19:48
clarkbcuriously I've recently started having similar problems on my local desktop19:48
fungiand also it seems to be just plain unreliable due to timing races19:48
clarkbremember all those reboots I did for apparomor?19:49
clarkbwell now NM comes up and doesn't configure any interfaces until I restart it19:49
clarkbthankfully (heh not really) I'd already run into this behavior with glean and know that restarting it likely fixes it19:49
clarkbI expect my problems on the desktop are also timing races19:50
clarkbif anyone is wondering where to find teh docs for RH's sysconfig + NM configuration it is in gnome19:51
clarkbnot in the RH docs as far as I can tell19:51
clarkbhttps://developer.gnome.org/NetworkManager/stable/nm-settings-ifcfg-rh.html I Guess because the rh nm settings plugin is actually an upstraem NM plugin19:52
*** ociuhandu has quit IRC19:54
*** igordc has joined #openstack-infra20:03
EmilienMhey folks20:03
EmilienMERROR Ansible plugin dir /var/lib/zuul/builds/228ffd2f4c70427bb4cb895178dd67a7/ansible/pre_playbook_1/role_0/tripleo-ansible/roles/tripleo-container-manage/filter_plugins found adjacent to playbook /var/lib/zuul/builds/228ffd2f4c70427bb4cb895178dd67a7/ansible/pre_playbook_1/role_0/tripleo-ansible/roles/tripleo-container-manage in non-trusted repo.20:03
EmilienMit seems like it doesn't like my customer filter plugin20:04
pabelangerEmilienM: yah, zuul won't load top-level plugins for security reasons20:04
pabelangersince they would run on executor side20:05
EmilienMwhat should I do?20:05
pabelangerto work in untrusted, you'd need to move them20:05
paladoxfyi if you use gerrit.wikimedia.org we have upcoming maintenance https://lists.wikimedia.org/pipermail/wikitech-l/2019-October/092664.html :)20:05
pabelangerEmilienM: or see how to move them to trusted context20:05
pabelangerEmilienM: in this case, you likely can used them with nested ansible20:06
EmilienMpabelanger: I need to go afk a little, if you can comment on https://review.opendev.org/#/c/686196/ please20:07
pabelangersure, can look in a bit20:07
EmilienMthx20:07
*** michael-beaver has quit IRC20:08
*** jamesmcarthur has quit IRC20:08
*** jamesmcarthur has joined #openstack-infra20:09
*** eharney_ has joined #openstack-infra20:12
*** eharney has quit IRC20:12
*** eharney_ is now known as eharney20:13
*** jamesmcarthur has quit IRC20:14
*** yamamoto has joined #openstack-infra20:14
*** pcaruana has quit IRC20:15
*** Goneri has joined #openstack-infra20:19
*** yamamoto has quit IRC20:19
*** jamesmcarthur has joined #openstack-infra20:25
*** jamesmcarthur has quit IRC20:27
*** e0ne has joined #openstack-infra20:28
*** spsurya has quit IRC20:28
*** jamesmcarthur has joined #openstack-infra20:30
ianwpabelanger: i'll look at the bionic packages20:32
clarkbianw: I wrote a lot above doing further NM fiddling . Ithink our RA solicit delay may be the cause of the delay that cause NM to timeout20:32
clarkbremoving the solicit delay fixes that problem but brings back the "ipv4 doesn't work becase NM won't configure the interface now"20:33
*** kjackal has quit IRC20:35
ianwclarkb: yeah, just looking ... how long was the ra delay?20:36
clarkbI then tried the one difference I Found on an upstream image compard with ours and it is the PERSISTENT_DHCLIENT=yes setting. Setting that doesn't fix the NM refuses to configure interface beacuse something else got there first20:36
clarkbianw: I think we are at 30 second now20:36
clarkbwe started at 1020:36
clarkbI undid the changeto update it and kernel seems to default to 1 second20:36
ianwhrm, i guess that would explain hitting that timeout20:38
clarkbit is really odd to me that the NM_MANAGED flag seems to be ignored20:38
ianwi never saw the ipv4 failures ... do you now have an image that replicates that?20:38
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add docker buildset test  https://review.opendev.org/68795320:39
clarkbianw: ya clarkb-test-glean-fedora3 and clarkb-test-glean-centos3 do it on fn20:39
clarkbit isn't 100% failure though20:40
clarkbits the same problem that we put the solicit delay in place to fix20:40
clarkb`nmcl c show` shows two ens3/eth0's20:40
clarkbone that is a stand in because kernel managed the itnerface and the other for the interface we asked NM to configure that it refuses to configure20:40
*** factor has quit IRC20:42
clarkbclarkb-test-glean-fedora exhibits this issue in fn20:42
clarkbI think we've got two different bugs playing off of each other. Effectively forcing us to have working ipv4 or working ipv6 but not both consistently20:43
ianwclarkb: interesting, because i was booting clarkb-test-glean-fedora yesterday to debug that timeout, and didn't see this20:43
clarkbianw: ya those older images had the solicit delay and would get ipv4 reliably but not ipv620:43
ianwoh, ok, so the image is updated?20:43
clarkbI built new images without the solicit delay thinking it might explain the bug you filed20:43
clarkbianw: so that is an instance name20:43
*** jamesmcarthur has quit IRC20:43
clarkbclarkb-test-glean-fedora3 is the image that instance is built on20:44
clarkb(name collisions across resource types)20:44
*** jamesmcarthur has joined #openstack-infra20:44
*** eharney has quit IRC20:45
ianwok trying fedora3 image now20:45
clarkbyou can ssh into clarkb-test-glean-fedora via ipv6 then ifconfig and nmcli c show and nmcli d show to see what is going on there20:45
clarkbits basically the same problem as before we added the solicit delay.20:46
clarkbianw: note it isn't a 100% failure so you may have to loop a few times to catch one20:46
*** roman_g has quit IRC20:48
*** e0ne has quit IRC20:48
ianwok, trying now20:49
*** factor has joined #openstack-infra20:49
*** FlorianFa has quit IRC20:49
clarkboh wait that host may use a different ssh key beause it was part of my test loop on bridge. If you su to me on bridge you'll be able to ssh from the key I generated there for this task20:49
*** jamesmcarthur has quit IRC20:50
clarkbsorry I had been using the infra root keys previosuly but wanted to do an automated loop check to see if we were still catching this problem and set up a new ssh key forthat on bridge20:50
*** jamesmcarthur has joined #openstack-infra20:50
*** jamesmcarthur has quit IRC20:52
ianwwell i am able to log into a host iwth that image20:53
*** tesseract has quit IRC20:53
ianwi've rebooted 15 times now and not seen an issue :/20:53
clarkbianw: ipv6 works20:53
clarkbonly ipv4 fails and only sometimes (it was my 13th boot that it failed)20:54
clarkb13th new instance boot, not reboots of the same host20:54
ianwahh, ok ... i am clearing the glean file.  i wonder if there's other persistent state20:55
ianwclarkb: we really need to capture it with debugging on.  is the .qcow2 somewhere i can guestfish into it and update the config file?20:57
clarkbianw: yes nb01.openstack.org:~clarkb/something-fedora.qcow220:58
clarkbit should be the file with the latest timestmap20:58
clarkb(sorry I killed my ssh agent and haven't dug out the physical media to reload keys yet)20:58
ianwok, will play now20:59
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add podman buildset test  https://review.opendev.org/68798620:59
*** xenos76 has quit IRC20:59
clarkbianw: that image includes the PERSISTENT_DHCLIENT change to the ifcfg files and the removal of changing the RA solicit delay21:00
clarkbotherwise it should be the same as the images I had built previously21:00
*** slaweq has quit IRC21:01
*** FlorianFa has joined #openstack-infra21:01
ianwtest-glean-nm-updates-fedora.qcow221:01
ianwat least the centos-8 build went ok21:04
*** jamesmcarthur has joined #openstack-infra21:07
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add podman buildset test  https://review.opendev.org/68798621:08
*** FlorianFa has quit IRC21:08
*** markvoelker has quit IRC21:09
*** sreejithp has quit IRC21:11
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add podman buildset test  https://review.opendev.org/68798621:11
*** slaweq has joined #openstack-infra21:11
*** rfolco|ruck has quit IRC21:13
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add podman buildset test  https://review.opendev.org/68798621:14
*** slaweq has quit IRC21:17
*** FlorianFa has joined #openstack-infra21:21
*** xek_ has quit IRC21:22
*** benj- has quit IRC21:28
*** benj has joined #openstack-infra21:31
*** benj is now known as Guest6942321:31
*** e0ne has joined #openstack-infra21:34
*** jamesmcarthur has quit IRC21:46
*** ociuhandu has joined #openstack-infra21:55
*** ociuhandu has quit IRC21:59
*** jbadiapa has quit IRC22:02
*** yamamoto has joined #openstack-infra22:03
*** trident has quit IRC22:03
*** mriedem has quit IRC22:04
*** trident has joined #openstack-infra22:05
*** yamamoto has quit IRC22:08
*** ralonsoh has quit IRC22:12
ianwclarkb: replicated, with debug logs22:18
clarkbprogress22:19
*** rlandy is now known as rlandy|bbl22:23
*** e0ne has quit IRC22:24
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Remove unused file from functional test  https://review.opendev.org/68799822:28
*** yamamoto has joined #openstack-infra22:30
*** igordc has quit IRC22:36
clarkbianw: are you able to share the logs (I'm mostly curious to see what they look like with debugging enabled)22:40
ianwclarkb: yep, one tick22:40
ianwclarkb: have you ever maanged to boot & get into a upstream fedora image?22:40
clarkbianw: yes, I did that using the fedora29 image on FN with ssh keys metadata set and config drive enabled22:41
ianwhrm, i'm trying that and no joy, but with my own image modified with nm22:42
ianwhttps://people.redhat.com/~iwienand/bad.txt22:42
clarkbianw: the image I used is uploaded in FN and called upstream-fedora29 or similar22:42
ianwno, i tell a lie, it's up now ... it just took a while22:42
ianwplatform: signal: link   added: 2: eth0 <DOWN;broadcast,multicast> mtu 1450 arp 1 ethernet? not-init addrgenmode eui64 addr FA:16:3E:03:BE:62 driver virtio_net rx:0,0 tx:0,022:44
ianwon the upstream image, eth0 is starting in "DOWN" state ...22:44
clarkboh ya it does take a long time22:45
clarkbI thought it had failed too then tried again a few minutes later and it worked22:45
clarkbOct 10 21:51:57 ianw-test-glean-debug NetworkManager[854]: <debug> [1570744317.8035] Connection 'ens3' differs from candidate 'System ens3' in ipv4.method22:45
clarkbI think ^ is sort of the first clue as to why this is happening on the failed case22:45
ianwyep, that's the key message i think, where it decides "can't touch this"22:45
*** rcernin has joined #openstack-infra22:46
ianwi wonder if cloud-init is clearing an RA addresses and downing then interface?22:46
clarkbSystem ens3 has ipv4.method set to auto (this comes from our config) and ens3 has it set to disabled22:46
clarkb(thinking out loud here, messages like that should be well above debug level imo)22:47
ianwonce again, "eth0 <DOWN;broadcast,multicast>" on upstream image22:48
clarkbdowning the interface so that NM sees it as fresh and new? That could be22:49
clarkbcould probably make an image with modified glean that does that without too much trouble22:50
clarkbthough that may still race?22:50
clarkbsince the interface could be UP'd between glean.sh running and NM starting22:50
ianwwhat would do that though?22:51
clarkbI think it would have to be cloud init or a udev rule?22:52
ianwmaybe that's it ... something in udev?22:53
ianwwe set our own udev rules right?22:54
clarkbya I think we use udev rules to trigger glean.sh against specific interfaces?22:55
* clarkb looks22:55
clarkbya glean/init/glean-udev.rules22:56
clarkbhas a one liner that appears to be systemd specific saying "when you udev add a network interface add a systemd wants rule for glean.sh to run against taht interface"22:56
clarkbianw: we could potentially add a udev rule that down's the interface on add from the start22:57
clarkbthen NM would bring it up22:57
clarkb(and that should avoid the stray RAs?22:58
ianwit seems to be a pretty big difference here ... i wonder what brings it up22:59
ianwi wonder if it's the predicable ntework naming ... cloud-init is still using eth023:00
clarkbwe get a different set of udev rules when changing name schemes right?23:01
clarkbI suppose that could be it23:02
*** aaronsheffield has quit IRC23:05
*** tkajinam has joined #openstack-infra23:06
ianwOct 10 21:51:55 localhost kernel: virtio_net virtio0 ens3: renamed from eth023:06
ianwperhaps that brings it up?23:06
clarkbthat could be23:07
clarkbSUBSYSTEM=="net", ACTION=="add", ATTR{addr_assign_type}=="0", RUN+="ip link set $name down"23:07
clarkba rule like ^ might do what we want?23:07
*** ccamacho has quit IRC23:10
*** markvoelker has joined #openstack-infra23:10
clarkbthe fedora default is to devbiosname23:10
clarkbthe upstream image might be setting biosdevname=0 on the kernel command line?23:11
*** slaweq has joined #openstack-infra23:11
clarkbI know we've set kernel parameters with dib in the past, we should be able to set biosdevname=0 and test with that23:11
*** diablo_rojo has quit IRC23:11
clarkbianw: actually centos doesn't biosdevname23:12
ianwnet.ifnames=023:13
clarkbclarkb-test-glean-centos3 is the equivalent image but for centos7 (and it is the centos image on nb01 in my homedir if you want to modify it)23:13
clarkbI did 6 boots of centos on that image without problems23:14
clarkbmaybe we see if centos 7 has the problem at all and if so that should rule this out?23:14
*** markvoelker has quit IRC23:15
*** slaweq has quit IRC23:16
ianwjust seeing if net.ifnames makes any difference to initial interface state23:16
ianw2001:470:e045:8000:f816:3eff:fe03:be62 port 22: Connection refused ... seems to have locked me out :/23:17
clarkboops23:18
ianwclarkb: do you have a host can try sshing to 192.168.48.151 ?23:19
ianwwait, i'm in how23:20
ianwnow23:20
clarkbfwiw I should have a test node I booted previously I could bounce through23:20
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add podman buildset test  https://review.opendev.org/68798623:25
donnydclarkb: so this is a little strange http://grafana.openstack.org/d/3Bwpi5SZk/nodepool-fortnebula?orgId=1&from=now-24h&to=now23:27
donnydI have been watching this issue for the last week or so23:27
donnydthere are huge chunks of time where nodepool is reporting the vm23:27
donnydas deleting23:27
donnydbut they are deleted in a few seconds from FN23:28
*** diablo_rojo has joined #openstack-infra23:28
donnydso I am not quite sure what to do from my end...23:29
donnydbut its missing a lot of CI cycle time23:29
clarkbdonnyd: nodepool will actually poll nova to check that the delete succeeded23:29
clarkbis it possible that nova isn't actually reporting those deletes as completed via the api?23:30
*** vesper11 has quit IRC23:30
donnydwhen I do an openstack server list during one of these events I don't see any not reporting as ACTIVE23:30
*** vesper11 has joined #openstack-infra23:31
*** goldyfruit has quit IRC23:31
clarkbdonnyd: ya so nodepool may have asked nova to delete them and the state didn't change23:31
clarkb(nodepool will retry)23:32
donnydbut it seems to take quite a while, and it just started doing it like a week or so ago23:32
donnydI will keep an eye on it23:32
donnydnothing is busted.. just want the community to get the most FN can give23:32
clarkbdonnyd: in those cases it might be helpful to check the api logs for incoming delete requests and see if nova failed to handle them23:33
ianwclarkb: still seems to rename them, even with out the cmdline option23:33
donnydkk23:33
*** dchen has joined #openstack-infra23:34
ianw"KVM guests exclusively using virtio-net type interfaces can safely set net.ifnames=0"23:35
ianwmaybe we should be setting it anyway23:35
ianw    subprocess.check_call(['ip', 'link', 'set', 'dev', iface, 'up'])23:39
ianwhttps://opendev.org/opendev/glean/src/branch/master/glean/cmd.py#L1134 might be our smoking gun here23:40
clarkbhrm it does seem odd that we would do that when we write the config after the fact and then rely on the init system to up the network with the correct config23:44
clarkbI think this is an optimization to only configure interfaces with an active carrier link23:44
clarkbbut maybe that isn't worth doing23:44
clarkbalso maybe we can check that without UPing the interfaces23:45
ianwyeah, this fits almost exactly ... interface comes up ... sometimes gets the RA ... networkmanager doesn't touch it by design because it thinks it's configured by something else23:45
clarkbianw: I would argue this is also a bug in NM because we've explicitly told NM you manage this interface23:46
clarkbvia the NM_MANAGED flag23:46
clarkbif we weren't setting that then ok fine the behavior kind of makes sense23:46
clarkbip link show foo gives me a NO-CARRIER attribute on an unplugged rj45 jack23:47
clarkbthe interface is up though23:47
* clarkb downs it to see if that changes23:47
ianwyeah, i mean according to -> https://bugs.debian.org/cgi-bin/bugreport.cgi?att=1;bug=755202;filename=irc-log.txt;msg=156 that's basically, as they say "intended but sub-optimal behaviour"23:47
clarkbhrm maybe this interface was down23:49
clarkbI need a more interesting network setup on this machine to be able to compare between interfaces23:49
openstackgerritJames E. Blair proposed zuul/zuul-registry master: Add podman buildset test  https://review.opendev.org/68798623:49
clarkbianw: the sysfs carrier attribute check should actually be sufficient23:52
clarkbianw: there is a good chance we can just delete that ip link set foo up command23:52
clarkbI bet if I git blame this we'll actually get some commit about how this was added to fix baremetal use cases23:53
*** gagehugo has quit IRC23:54
clarkbwow that code actually comes from disk image builder says the commit that mordred wrote23:55
ianwyeah, was just looking at cloud-init which uses "sys/net/devname/carrier is 1"23:55
clarkbianw: also since we are predominantly systemd and udev driven now we should only be touching interfaces that exist and are expected to do a thing23:55
clarkbbut even when we aren't I don't know what that wait is supposed to accomplish. I guess it gives time for "hardware" to establish that l1 connection23:56
clarkb(wouldn't your pre linux boot stuff do that though?)23:57
clarkbI think my vote is to remove the ip link up then we can do some exhaustive boot tests across all the things (ugh) and if they work just roll with it23:57
ianwyeah, just playing with that on my test host now23:58
clarkbTheJulia actually updated that exec call to be compat with older python at some point. Makes me think that we probably only hit that code path on baremetal23:59
clarkb(we would've seen issues with it on our VMs otherwise)23:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!