Friday, 2018-09-28

*** strigazi has quit IRC00:01
*** vabada has quit IRC00:01
*** strigazi has joined #openstack-infra00:02
*** vabada has joined #openstack-infra00:02
ianwclarkb: do you have thoughts on https://review.openstack.org/#/c/605583/ .  we could do that, or i could put in a simpler "if fedora { write only ipv4 nameservers }" with a reference to the bug00:03
*** bobh has quit IRC00:04
*** eernst has quit IRC00:06
clarkbmy only concern with scoping it to fedora is other newer distros likely have the issue too? and we may not remember to exclude them as well00:06
openstackgerritGoutham Pacha Ravi proposed openstack-infra/project-config master: [manila-ui] Don't run python35 tests until Rocky  https://review.openstack.org/60589300:07
openstackgerritMerged openstack-infra/zuul master: Don't report non-live items in stats  https://review.openstack.org/60554000:09
ianwclarkb: yeah, maybe the larger change is best, i'll unwip it for comments.  i have built with it, but my attempt to upload it to rax failed with an OverLimit Retry... (HTTP 413) whatever that means00:09
*** eernst has joined #openstack-infra00:10
openstackgerritGoutham Pacha Ravi proposed openstack-infra/project-config master: [manila-ui] Don't run python35 tests until Rocky  https://review.openstack.org/60589300:13
*** eernst has quit IRC00:14
*** eernst has joined #openstack-infra00:17
*** sthussey has quit IRC00:18
*** yamamoto has joined #openstack-infra00:21
*** eernst has quit IRC00:23
*** eernst has joined #openstack-infra00:24
*** hamzy has joined #openstack-infra00:25
*** eernst has joined #openstack-infra00:27
*** jamesmcarthur has quit IRC00:28
*** jamesmcarthur has joined #openstack-infra00:29
*** eernst has quit IRC00:31
*** bobh has joined #openstack-infra00:33
*** jamesmcarthur has quit IRC00:34
*** anteaya has quit IRC00:34
*** longkb has joined #openstack-infra00:34
*** rlandy has quit IRC00:36
*** gyee has quit IRC00:51
*** jamesmcarthur has joined #openstack-infra00:55
*** jamesmcarthur has quit IRC00:59
*** smarcet has joined #openstack-infra01:03
*** diablo_rojo has quit IRC01:05
*** bobh has quit IRC01:10
*** shardy has quit IRC01:10
openstackgerritIan Wienand proposed openstack-infra/nodepool master: Normalise more of the API stats calls  https://review.openstack.org/60589801:12
*** felipemonteiro has joined #openstack-infra01:15
*** shardy has joined #openstack-infra01:18
*** dpawlik has joined #openstack-infra01:21
*** harlowja has quit IRC01:24
*** dpawlik has quit IRC01:26
*** jamesmcarthur has joined #openstack-infra01:28
*** bobh has joined #openstack-infra01:31
*** smarcet has quit IRC01:39
*** bobh has quit IRC01:42
*** smarcet has joined #openstack-infra01:45
*** zzzeek has quit IRC01:48
*** zzzeek has joined #openstack-infra01:49
*** bobh has joined #openstack-infra01:49
*** bobh has quit IRC01:53
*** mrsoul has quit IRC01:55
*** jamesdenton has joined #openstack-infra02:07
*** yamamoto has quit IRC02:16
*** ykarel has joined #openstack-infra02:18
*** hongbin has joined #openstack-infra02:20
*** smarcet has quit IRC02:24
*** stakeda has joined #openstack-infra02:26
*** ykarel has quit IRC02:32
*** smarcet has joined #openstack-infra02:34
*** smarcet has quit IRC02:43
*** smarcet has joined #openstack-infra02:45
*** imacdonn has quit IRC02:51
*** imacdonn has joined #openstack-infra02:51
*** felipemonteiro has quit IRC03:00
*** roman_g has quit IRC03:04
*** ykarel has joined #openstack-infra03:13
*** dpawlik has joined #openstack-infra03:22
*** yamamoto has joined #openstack-infra03:24
*** eernst has joined #openstack-infra03:26
*** dpawlik has quit IRC03:27
*** psachin has joined #openstack-infra03:29
openstackgerritIan Wienand proposed openstack-infra/system-config master: Initial port of install-docker role  https://review.openstack.org/60558503:33
*** hongbin has quit IRC03:38
openstackgerritIan Wienand proposed openstack-infra/project-config master: Grafana: set zuul node requests yaxis min  https://review.openstack.org/60588603:45
*** graphene has quit IRC03:50
*** yamamoto has quit IRC03:50
*** graphene has joined #openstack-infra03:51
*** graphene has quit IRC03:55
*** graphene has joined #openstack-infra03:56
*** njohnston has quit IRC04:02
*** graphene has quit IRC04:20
*** graphene has joined #openstack-infra04:21
*** graphene has quit IRC04:28
*** graphene has joined #openstack-infra04:29
*** yamamoto has joined #openstack-infra04:41
*** toabctl has quit IRC04:48
AJaegerconfig-core, please put https://review.openstack.org/598323 and https://review.openstack.org/605893 on your review queue04:54
*** jamesmcarthur has quit IRC04:57
*** jamesmcarthur has joined #openstack-infra05:01
*** jamesmcarthur has quit IRC05:06
*** graphene has quit IRC05:08
*** ramishra has joined #openstack-infra05:09
*** graphene has joined #openstack-infra05:09
*** udesale has joined #openstack-infra05:09
*** ykarel_ has joined #openstack-infra05:11
*** ykarel has quit IRC05:14
*** ykarel__ has joined #openstack-infra05:15
*** ykarel_ has quit IRC05:18
*** ykarel_ has joined #openstack-infra05:20
*** yamamoto has quit IRC05:21
*** ykarel__ has quit IRC05:23
*** dpawlik has joined #openstack-infra05:23
*** ykarel__ has joined #openstack-infra05:24
*** ykarel_ has quit IRC05:27
*** dpawlik has quit IRC05:29
*** ykarel_ has joined #openstack-infra05:29
*** yamamoto has joined #openstack-infra05:29
*** ykarel__ has quit IRC05:32
*** quiquell|off is now known as quiquell05:41
*** chkumar|off is now known as chandankumar05:46
*** e0ne has joined #openstack-infra05:52
openstackgerritMerged openstack-infra/project-config master: Grafana: set zuul node requests yaxis min  https://review.openstack.org/60588605:57
*** gfidente has joined #openstack-infra06:07
quiquellGood morning all06:07
quiquellAJaeger: I think we have a review in the wrong queue06:09
quiquellAJaeger: https://review.openstack.org/#/c/594511/06:09
*** pcaruana has joined #openstack-infra06:11
*** smarcet has quit IRC06:21
*** verdurin has quit IRC06:23
*** e0ne has quit IRC06:24
*** yamamoto has quit IRC06:27
*** mtreinish has joined #openstack-infra06:27
*** dpawlik has joined #openstack-infra06:27
*** eernst has quit IRC06:28
*** jamesmcarthur has joined #openstack-infra06:29
chandankumarianw: Hello06:31
chandankumarianw: it appears that the Zuul queue is very long, 92 hours in post. Is there anything we can do to minimize it, or is it expected?06:31
*** dpawlik has quit IRC06:32
*** dpawlik has joined #openstack-infra06:32
*** jamesmcarthur has quit IRC06:34
*** mrsoul has joined #openstack-infra06:38
*** bhavikdbavishi has joined #openstack-infra06:40
AJaegerquiquell: what do you mean?06:43
AJaegerchandankumar: http://lists.openstack.org/pipermail/openstack-dev/2018-September/134867.html06:44
*** jtomasek has joined #openstack-infra06:44
jaosoriorchandankumar: well, we (tripleo) are taking most of the resources. And our timeout issues are still present. So... fixing our timeout issues (which are better than last week), will help in this.06:46
AJaegerquiquell: you need to rebase 594511, it's not current, see the orange dot beside parent06:47
AJaegerjaosorior: 594511 is tripleo, quiquell asked about it, see above as FYI ^06:48
*** e0ne has joined #openstack-infra06:51
*** icey has quit IRC06:54
quiquellAJaeger: thanks06:59
chandankumarAJaeger: Thanks !07:00
*** florianf|afk has quit IRC07:01
*** shardy has quit IRC07:01
*** shardy has joined #openstack-infra07:02
*** quiquell is now known as quiquell|brb07:04
*** yamamoto has joined #openstack-infra07:04
egonzalezhi, ask.openstack.org is down07:06
*** jamesmcarthur has joined #openstack-infra07:07
*** icey has joined #openstack-infra07:07
dpawlikegonzalez: ask here :D07:09
xinliangianw: ping07:09
*** jamesmcarthur has quit IRC07:11
*** ginopc has joined #openstack-infra07:11
*** florianf has joined #openstack-infra07:12
*** rcernin has quit IRC07:12
AJaegerxinliang: best leave a message so that he can read it once he comes back - or somebody else might be able to help.07:14
xinliangianw: The kolla-debian-building-arm job can build, but it gets stuck because there is no disk space07:15
xinlianghttp://logs.openstack.org/59/557659/24/experimental/kolla-build-debian-source-arm64/0c15810/job-output.txt.gz#_2018-09-27_15_15_53_15944407:15
AJaegerinfra-root, ask.openstack.org is down ;(07:15
*** ssbarnea|bkp has quit IRC07:16
xinliangAJaeger: thanks, leave a message:)07:17
AJaegerxinliang: we have 80 GB, see https://docs.openstack.org/infra/manual/testing.html - if you hit that, you need to rework your job07:18
*** aojea has joined #openstack-infra07:18
AJaegerxinliang, I wonder what partition is used, you might want to add some strategic "df" commands for debugging...07:19
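A minimal sketch of the kind of "df"-style checkpoint suggested above, written in Python rather than as a shell step. The function name `usage_report` and the default path are hypothetical and only for illustration; a real job would more likely just run `df` at a few strategic points.

```python
import shutil

GIB = 1024 ** 3  # bytes per GiB


def usage_report(paths=("/",)):
    """Return one human-readable usage line per path (df-like)."""
    lines = []
    for path in paths:
        usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
        lines.append(
            "{}: total={:.1f}GiB used={:.1f}GiB free={:.1f}GiB".format(
                path, usage.total / GIB, usage.used / GIB, usage.free / GIB
            )
        )
    return lines


if __name__ == "__main__":
    # Print a checkpoint for the root filesystem.
    for line in usage_report():
        print(line)
```

Calling this before and after the heavy build steps would show which filesystem fills up and whether the root partition was actually grown.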
xinliangAJaeger: Yes, the flavor is big enough. We found an issue with resizing root before. Not sure if it has been fixed yet07:22
xinliangwill check with "df"07:22
*** jamesmcarthur has joined #openstack-infra07:29
*** dpawlik has quit IRC07:32
*** dpawlik has joined #openstack-infra07:34
*** jamesmcarthur has quit IRC07:34
*** dpawlik has quit IRC07:34
*** dpawlik has joined #openstack-infra07:34
*** quiquell|brb is now known as quiquell07:35
*** dpawlik has quit IRC07:36
*** shu-mutow has joined #openstack-infra07:36
*** markvoelker has quit IRC07:36
*** markvoelker has joined #openstack-infra07:37
*** markvoelker has quit IRC07:42
*** hashar has joined #openstack-infra07:43
*** jpena|off is now known as jpena07:43
*** longkb has quit IRC07:44
*** tosky has joined #openstack-infra07:50
*** quiquell is now known as quiquell|brb07:55
*** longkb has joined #openstack-infra07:55
*** jamesmcarthur has joined #openstack-infra07:56
*** alexchadin has joined #openstack-infra07:57
*** jpich has joined #openstack-infra07:58
*** jamesmcarthur has quit IRC08:01
*** alexchadin has quit IRC08:01
*** rossella_s has joined #openstack-infra08:01
*** bauzas is now known as PapaOurs08:03
*** quiquell|brb is now known as quiquell08:05
*** rossella_s has quit IRC08:09
*** rossella_s has joined #openstack-infra08:10
*** alexchadin has joined #openstack-infra08:11
*** dpawlik has joined #openstack-infra08:12
*** alexchadin has quit IRC08:16
*** alexchadin has joined #openstack-infra08:18
xinlianggrowroot still not working for arm64 node, ianw, AJaeger08:24
xinlianghttp://logs.openstack.org/59/557659/25/experimental/kolla-build-debian-source-arm64/ff00a12/job-output.txt.gz#_2018-09-28_07_57_15_53952808:24
AJaegerxinliang: thanks for investigating - I cannot help, hope others can08:25
*** alexchadin has quit IRC08:25
xinliangAJaeger: that's fine.08:26
*** stephenfin is now known as finucannot08:27
xinliangThis patch: https://review.openstack.org/#/c/578265/  merged and should fix this issue.08:27
xinliangposted by ianw08:28
*** alexchadin has joined #openstack-infra08:28
*** ykarel__ has joined #openstack-infra08:28
AJaegerok, then let's wait for ianw - might need to wait until Monday...08:28
xinliangok08:28
*** ykarel_ has quit IRC08:31
*** derekh has joined #openstack-infra08:37
*** markvoelker has joined #openstack-infra08:37
*** olivierb has joined #openstack-infra08:38
fricklerask.o.o seems fine for me now, maybe the usual morning outage took a bit longer?08:49
openstackgerritIan Wienand proposed openstack-infra/nodepool master: Normalise more of the API stats calls  https://review.openstack.org/60589808:49
AJaegerfrickler, ianw , please put https://review.openstack.org/598323 and https://review.openstack.org/605893 on your review queue.08:50
ianwxinliang: weird, i can't ssh into the debian arm64 node we have.  the others are like 100 days old :/  i'm going to kill them all and cycle them, see what happens08:51
AJaegeryeah, number of periodic jobs and post jobs is slowly going down - still LARGE backlog08:51
xinliangianw: i see no ready arm64 nodes on london cloud. they are building08:54
*** roman_g has joined #openstack-infra08:54
xinliangianw: i notice that when job post to run there is no ready node for it. node building just in time08:55
ianwright, i just killed the old ones so min-nodes is kicking in for them08:55
*** vivsoni_ has quit IRC08:56
*** ykarel__ is now known as ykarel08:59
*** markvoelker has quit IRC08:59
openstackgerritMerged openstack-infra/project-config master: [manila-ui] Don't run python35 tests until Rocky  https://review.openstack.org/60589309:01
*** tosky has quit IRC09:02
ianwxinliang: i can't ssh into the debian one ... which suggests to me the same thing we were seeing before with the config drive not being mounted and the ssh keys not being rolled out09:02
ianwfor a xenial host "/dev/sda3        75G  8.8G   63G  13% /"09:03
ianwwhich looks right09:03
*** tosky has joined #openstack-infra09:03
xinliangianw: but i can see sr0 device from the log http://logs.openstack.org/59/557659/25/experimental/kolla-build-debian-source-arm64/ff00a12/zuul-info/host-info.primary.yaml09:04
*** ykarel is now known as ykarel|lunch09:04
openstackgerritbrandon zhao proposed openstack/ansible-role-cloud-launcher master: use include_tasks instead of include  https://review.openstack.org/60601109:07
xinliangianw: there is the node booting log: https://uk.linaro.cloud/project/instances/6d62f976-3025-4569-9243-6b35d2651ef4/console09:08
xinliangnot sure if it helps09:08
ianwxinliang: see anything from glean in there?  i don't have the login details handy09:09
AJaegerianw, then  https://docs.openstack.org/infra/manual/testing.html - is wrong, it states 80 GB. I didn't realize at first that this is ARM - do we need to update the doc?09:10
xinliangianw: paste one: http://paste.openstack.org/show/731075/09:10
*** alexchadin has quit IRC09:10
*** rossella_s has quit IRC09:11
xinliangianw: sorry just one log is ubuntu node's.09:11
xinliangianw: this one is debian's , it has glean things: http://paste.openstack.org/show/731076/09:13
ianwyeah the debian one is in cn-109:13
*** rossella_s has joined #openstack-infra09:13
xinliangianw: so currently, we can't specific london node to run job?09:15
openstackgerritMerged openstack-infra/irc-meetings master: update api-sig meeting times  https://review.openstack.org/60580809:15
xinliangspecify09:15
ianwno, it will balance between them09:16
ianwlinaro has all the right keys in the cloud09:16
ianwxinliang: whatever debian this image is, it's not the most recent one09:20
ianw 3      17.8MB  14.8GB  14.8GB  ext4         "root"09:20
ianwit's got the quotes09:20
xinliangif so growpart will not work, right?09:23
ianwdib on nb03 is 2.16.009:23
ianwlooks like puppet is failing on it09:24
*** pbourke has quit IRC09:24
*** alexchadin has joined #openstack-infra09:25
*** pbourke has joined #openstack-infra09:25
xinliangianw: there might be a problem. I mean using node on cn cloud. nodes on the cloud are not working due to networking issue09:25
ianwno, it seems like puppet is not running on nb03, so it hasn't been updated to the latest dib.  so the images it has built are out of date i guess09:26
ianwi'm trying a manual puppet run to see what's up there09:26
ianwCould not get latest version: undefined method `[]' for nil:NilClass ?09:26
cmurphyianw: hi do you want debugging help?09:28
ianwcmurphy: maybe :)  i'll see if i can get some sort of sensible error with kick.sh on nb0309:29
cmurphyo709:30
ianwone of the problems with the dib puppet was that pip installing it on arm took a long time due to it building everything under the sun, and it was timing out.  i'm pretty sure i merged a fix for that though09:31
*** armax has quit IRC09:31
ianwok, so this is the problem http://paste.openstack.org/show/731083/09:34
ianwcomes from http://git.openstack.org/cgit/openstack-infra/puppet-diskimage_builder/tree/manifests/init.pp#n7909:35
ianwwhich looks pretty straightforward to me :/09:35
cmurphymust be a bug in the openstack_pip provider09:35
ianwyeah, that's what i'm thinking, has there been updates to that lately?09:36
cmurphynot since last year09:36
ianwroot@nb03:~# pip --version09:37
ianwpip 18.009:37
ianwwas that recently released or something?09:37
ianwnot really, july09:37
cmurphyit's coming from either http://git.openstack.org/cgit/openstack-infra/puppet-pip/tree/lib/puppet/provider/package/openstack_pip.rb#n23 or http://git.openstack.org/cgit/openstack-infra/puppet-pip/tree/lib/puppet/provider/package/openstack_pip.rb#n28 so either the output of `pip list --outdated` or `pip show diskimage-builder` is unexpected09:39
ianwhttp://paste.openstack.org/show/731088/ is the output, looks about right to me for show09:40
ianwhttp://paste.openstack.org/show/731089/ looks about right too09:41
*** toabctl has joined #openstack-infra09:42
cmurphyit's looking for 'Latest: ' but the header is 'Latest'09:43
ianwahhh, yes, the output is quite different09:45
ianwit's expecting something that looks like "cryptography (1.2.3) - Latest: 2.3.1 [wheel]"09:46
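The mismatch described above can be sketched in Python. This is not the Ruby fix proposed for puppet-pip, only an illustration of tolerating both output shapes of `pip list --outdated`: the older one-line form quoted above, and a column form with a "Latest" header. The exact column layout used here (header row, dashed separator row, whitespace-separated fields) is an assumption, and `latest_versions` is a hypothetical helper name.

```python
import re

# Older one-line form, e.g. "cryptography (1.2.3) - Latest: 2.3.1 [wheel]"
LEGACY_RE = re.compile(r"^(?P<name>\S+) \((?P<version>[^)]+)\) - Latest: (?P<latest>\S+)")


def latest_versions(output):
    """Map package name -> latest version from `pip list --outdated` text."""
    lines = [ln for ln in output.splitlines() if ln.strip()]

    # First try the legacy "name (x) - Latest: y [type]" form.
    latest = {}
    for line in lines:
        match = LEGACY_RE.match(line)
        if match:
            latest[match.group("name")] = match.group("latest")
    if latest:
        return latest

    # Otherwise look for an assumed column header: Package Version Latest ...
    header_index = None
    for index, line in enumerate(lines):
        if line.split()[:3] == ["Package", "Version", "Latest"]:
            header_index = index
            break
    if header_index is None:
        return latest  # unrecognised output: report nothing rather than guess

    for line in lines[header_index + 1:]:
        fields = line.split()
        # Skip the dashed separator row and anything too short to be data.
        if len(fields) < 3 or set(line.strip()) <= {"-", " "}:
            continue
        latest[fields[0]] = fields[2]
    return latest
```

Returning an empty mapping for unrecognised output avoids the kind of nil dereference seen in the puppet run, at the cost of silently treating the package as up to date.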
cmurphythat's annoying09:47
*** ykarel|lunch is now known as ykarel09:48
ianwcmurphy: it's pip 18 i guess :/09:48
ianwmaybe we're just pinned on other servers and haven't noticed09:49
ianwfor the immediate issue, i've installed pip 9.0.1 on nb03 and that should get it going ...09:50
ianwcmurphy: is it ok if i throw handling later pip in openstack_pip on your plate?  i'm on pto for a bit09:51
cmurphyianw: on it09:51
openstackgerritColleen Murphy proposed openstack-infra/puppet-pip master: Fix openstack_pip provider for pip 18  https://review.openstack.org/60602109:52
cmurphyianw: something like that maybe ^09:52
*** jtomasek has quit IRC09:56
*** markvoelker has joined #openstack-infra09:57
ianwcmurphy: ++ !09:57
openstackgerritMerged openstack-infra/zuul master: replace dict.update by a dict merge in zuul_return  https://review.openstack.org/60205409:57
ianwxinliang: so .. nb03 now has dib 2.17.0 ... so i'll trigger some fresh arm64 builds09:58
ianwtalk about yak shaving!09:58
*** longkb has quit IRC09:58
cmurphyheh09:59
*** scroll is now known as hfjvjffju09:59
*** alexchadin has quit IRC10:05
xinliangianw: thanks, will try the new nodes10:11
*** jamesmcarthur has joined #openstack-infra10:12
ianwxinliang: cool, you can see the status @ http://nl01.openstack.org/dib-image-list and http://nl01.openstack.org/image-list10:14
*** xinliang has quit IRC10:16
*** e0ne has quit IRC10:16
*** jamesmcarthur has quit IRC10:16
*** markvoelker has quit IRC10:18
openstackgerritIan Wienand proposed openstack-infra/zuul-sphinx master: Add attr-overview directive  https://review.openstack.org/60498010:18
*** alexchadin has joined #openstack-infra10:22
openstackgerritIan Wienand proposed openstack-infra/nodepool master: Use zuul-sphinx for configuration layout  https://review.openstack.org/60427410:23
openstackgerritIan Wienand proposed openstack-infra/nodepool master: Add overview of config options  https://review.openstack.org/60498410:23
*** e0ne has joined #openstack-infra10:25
openstackgerritIan Wienand proposed openstack-infra/system-config master: [WIP] Provision graphite01.o.o via docker container  https://review.openstack.org/60602810:27
*** alexchadin has quit IRC10:29
*** yamamoto has quit IRC10:31
*** yamamoto has joined #openstack-infra10:32
*** yamamoto has quit IRC10:38
*** bhavikdbavishi has quit IRC10:38
*** alexchadin has joined #openstack-infra10:39
openstackgerritAndreas Jaeger proposed openstack-infra/openstack-zuul-jobs master: Remove release-openstack-server and publish-xstatic templates  https://review.openstack.org/53183010:39
*** stakeda has quit IRC10:43
*** olivierb has quit IRC10:47
*** AJaeger has quit IRC10:56
*** njohnston has joined #openstack-infra10:56
*** dtantsur|afk is now known as dtantsur10:57
*** njohnston has left #openstack-infra10:58
*** rfolco has quit IRC11:07
*** jpena is now known as jpena|lunch11:07
*** AJaeger has joined #openstack-infra11:07
*** ssbarnea|bkp has joined #openstack-infra11:08
*** alexchadin has quit IRC11:08
*** xinliang has joined #openstack-infra11:09
xinliangianw: root resize still not working : http://logs.openstack.org/59/557659/25/experimental/kolla-build-debian-source-arm64/f1088a3/job-output.txt.gz#_2018-09-28_10_53_05_99182711:09
*** rossella_s has quit IRC11:21
*** rossella_s has joined #openstack-infra11:22
*** rfolco has joined #openstack-infra11:26
*** yamamoto has joined #openstack-infra11:26
*** rossella_s has quit IRC11:26
*** rossella_s has joined #openstack-infra11:27
ianwxinliang: the new images haven't finished uploading11:31
ianwhttp://nl01.openstack.org/image-list11:31
*** udesale has quit IRC11:32
*** rossella_s has quit IRC11:33
openstackgerritColleen Murphy proposed openstack-infra/puppet-pip master: Fix openstack_pip provider for pip 18  https://review.openstack.org/60602111:34
*** rossella_s has joined #openstack-infra11:37
*** dpawlik has quit IRC11:37
*** agopi|brb is now known as agopi11:40
*** ssbarnea|bkp has quit IRC11:40
*** panda|off is now known as panda11:42
*** shu-mutow has quit IRC11:43
*** yolanda has joined #openstack-infra11:44
*** rossella_s has quit IRC11:53
*** mrsoul has quit IRC11:54
*** rossella_s has joined #openstack-infra11:55
*** Bhujay has joined #openstack-infra11:58
*** EmilienM is now known as EvilienM12:00
*** jpena|lunch is now known as jpena12:05
*** rossella_s has quit IRC12:09
*** rossella_s has joined #openstack-infra12:10
openstackgerritMatthieu Huin proposed openstack-infra/zuul master: web: add tenant and project scoped, JWT-protected actions  https://review.openstack.org/57690712:10
openstackgerritMatthieu Huin proposed openstack-infra/zuul master: CLI: add create-web-token command  https://review.openstack.org/60538612:10
*** rossella_s has quit IRC12:14
*** rossella_s has joined #openstack-infra12:15
*** rh-jelabarre has joined #openstack-infra12:15
*** yamamoto has quit IRC12:16
*** rfolco has quit IRC12:18
*** weshay is now known as weshay_ruck12:23
*** rlandy has joined #openstack-infra12:24
openstackgerritneilsun proposed openstack-infra/zuul master: Add type check for zuul conf  https://review.openstack.org/59191712:25
*** ykarel_ has joined #openstack-infra12:28
*** boden has joined #openstack-infra12:29
*** mriedem has joined #openstack-infra12:29
*** ykarel has quit IRC12:31
*** ykarel_ is now known as ykarel12:31
*** nicolasbock_ has joined #openstack-infra12:32
*** agopi is now known as agopi|brb12:34
*** jcoufal has joined #openstack-infra12:34
*** agopi|brb has quit IRC12:39
*** tpsilva has joined #openstack-infra12:40
*** graphene has quit IRC12:42
*** trown|outtypewww is now known as trown12:43
*** graphene has joined #openstack-infra12:44
openstackgerritMohammed Naser proposed openstack-infra/project-config master: Temporarily bump up capacity by 50 VMs  https://review.openstack.org/60605812:44
*** rossella_s has quit IRC12:45
*** jamesmcarthur has joined #openstack-infra12:45
openstackgerritMohammed Naser proposed openstack-infra/project-config master: Revert "Temporarily bump up capacity by 50 VMs"  https://review.openstack.org/60605912:45
mnaserinfra-root: ^ can someone promote this to top of the check queue and gate afterwards?12:45
*** rossella_s has joined #openstack-infra12:47
AJaegermnaser: the first one I hope ;)12:47
mnaserAJaeger: aha, yes12:47
AJaegerthanks a lot, mnaser !12:47
AJaegerfungi, frickler, are you around to help ? ^12:48
mnaseror maybe we can drag dmsimard back out :P12:48
mnaserconsidering east coast12:48
cmurphyawesome mnaser12:48
mnaser:)12:49
AJaegerconfig-core, a long but mechanic change to update release jobs, please review https://review.openstack.org/598323 . Could we give dhellmann a +2A, please?12:49
AJaegerpabelanger: are you around?12:49
dmsimardI'm here12:49
AJaegerdmsimard: know how to promote a change?12:49
AJaegerhttps://review.openstack.org/#/c/60605812:50
openstackgerritneilsun proposed openstack-infra/zuul master: Add type check for zuul conf  https://review.openstack.org/59191712:51
dmsimardAJaeger: there's docs for it so I suppose https://zuul-ci.org/docs/zuul/admin/client.html#promote12:51
*** hashar is now known as hasharAway12:51
mnaserseems straight forward12:52
mnaserzuul promote --tenant openstack --pipeline check --changes 606058,112:52
AJaegermnaser: currently the situation is slowly getting under control, the backlog on nodes is not really growing so far http://grafana.openstack.org/d/T6vSHcSik/zuul-status12:52
*** rossella_s has quit IRC12:52
AJaegerZuul even got through 40 periodic jobs and some post jobs...12:53
dmsimardmnaser: we need to enqueue in gate first12:53
mnaseroh yes12:54
mnaserdmsimard: or rather check i think you mean there12:54
*** rossella_s has joined #openstack-infra12:54
dmsimardit's in gate and promoted12:54
mnaseralso12:54
mnaseri really really really think we should look into smaller instance sizes for smaller jobs12:55
mnaserit'll help use our resources much more efficiently12:55
AJaegerdmsimard: thanks!12:55
dmsimard#status log (dmsimard) enqueued https://review.openstack.org/606058 to gate and promoted it to increase nodepool capacity12:55
mnaserdmsimard: thank you so much :>12:55
*** kgiusti has joined #openstack-infra12:55
openstackstatusdmsimard: finished logging12:55
*** bhavikdbavishi has joined #openstack-infra12:55
dmsimardmnaser: yes, in fact we can do that now12:55
openstackgerritGabriele Cerami proposed openstack-infra/zuul-sphinx master: Raise an error if a file in zuul.d is empty  https://review.openstack.org/60606212:56
dmsimardmnaser: nodepool is quota aware so instead of using max-servers we can use max-ram/max-cores and use different flavors for different jobs12:56
mnaseri think that'd be really beneficial.. for example running doc jobs on a 1 core / 1g instance12:56
mnaserthat way.. 8 doc jobs can run at once instead for example12:56
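The arithmetic behind that estimate can be sketched as follows. It assumes, for illustration only, that a standard test node is 8 cores / 8 GB (the log does not state this) and that a pool is capped by a core budget and a RAM budget, as max-cores and max-ram limits would do. `max_concurrent` is a hypothetical helper, not nodepool code.

```python
def max_concurrent(max_cores, max_ram_mb, flavor_cores, flavor_ram_mb):
    """How many nodes of one flavor fit inside a core and RAM budget."""
    if flavor_cores <= 0 or flavor_ram_mb <= 0:
        raise ValueError("flavor sizes must be positive")
    # The tighter of the two budgets decides how many nodes fit.
    return min(max_cores // flavor_cores, max_ram_mb // flavor_ram_mb)


if __name__ == "__main__":
    # Budget equal to one assumed standard node: 8 cores, 8 GB.
    budget_cores, budget_ram_mb = 8, 8 * 1024
    print(max_concurrent(budget_cores, budget_ram_mb, 8, 8 * 1024))  # standard flavor
    print(max_concurrent(budget_cores, budget_ram_mb, 1, 1024))      # small docs flavor
```

Under those assumptions, the budget that holds one standard node holds eight 1 core / 1 GB nodes, which matches the "8 doc jobs" figure in the discussion.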
openstackgerritGabriele Cerami proposed openstack-infra/zuul-sphinx master: Raise an error if a file in zuul.d is empty  https://review.openstack.org/60606212:58
fungimnaser: we would just enqueue directly to the gate and then no need to promote as the project-config queue is basically empty anyway12:59
mnaserfungi: gotcha12:59
fungidmsimard: were you handling that, or should i?12:59
AJaegermnaser, fungi , could either of you review https://review.openstack.org/598323 , please? I know it's long - but really mechanical...13:00
mnaserfungi: dmsimard already did :)13:00
AJaegerfungi: it's done13:00
fungithanks dmsimard and mnaser!13:00
dmsimardfungi: I did it and I did a status log13:00
*** jaosorior has quit IRC13:00
mnaserAJaeger: i was confused why `release-openstack-server` jobs were replaced by `publish-to-pypi-python3`13:00
dmsimardfungi: I am wondering if we should really keep that backlog of >300 periodic jobs13:00
mnaserits a change in behaviour.. but the commit message didnt explain why or what13:00
fungimnaser: the release team is working on publishing server projects to pypi now13:01
mnaserok, the commit message seemed to imply that it was just moving to a python3 release job, but let me see13:01
fungirelease-openstack-server was basically like publish-to-pypi-python3 except it skipped the actual pypi upload13:01
fungidhellmann: ^ can you confirm?13:01
mnaseryeah i figured, but dont we need to make sure all these projects have pre-configured stuff in pypi?13:01
mnaserso the job doesnt fail?13:01
mnaserthe acls and allowing openstackci to upload to them13:02
fungipretty sure they're going around registering the missing ones13:02
AJaegermnaser: we discussed on IRC yesterday, the release team will take care of that13:02
fungiwith a few exceptions (keystone, magnum, congress) which need coordination with previous registrants on pypi to possibly allow us to take over those names13:02
mnasercool, in that case, it's ok with me13:02
AJaegermnaser: there was an email as well to openstack-dev (I agree, the commit message could be more verbose)13:03
AJaegerthanks, mnaser13:03
mnaseryeah let's get it rolling, release team is accessible enough to talk to should there be any issues13:04
toskyuhm, couldn't you have make release-openstack-server derive from publish-to-pypi-python3, instead of replacing all jobs?13:04
toskymade*13:04
*** dpawlik has joined #openstack-infra13:05
AJaegertosky: sure, we could have just changed the template - but then have two templates that do the same...13:05
toskyok13:05
fungideduplication of jobs13:06
fungior rather of templates in this case13:06
*** rfolco has joined #openstack-infra13:06
openstackgerritMerged openstack-infra/project-config master: Temporarily bump up capacity by 50 VMs  https://review.openstack.org/60605813:06
mnaserum13:07
mnaserdo the current doc job builds for other languages?13:08
fungium?13:08
*** dpawlik has quit IRC13:08
mnaseri'm seeing the OSA docs getting translated to german (yay!!!) but i dont see a link in our docs to show those translations13:08
fungiif the pofiles are in the repo they should...13:08
AJaegermnaser: we started with first repos - eumel8 has started. I think OSA is one of the three guinea pigs ;)13:08
mnaserwell thats why i was wondering what we have to tweak13:08
*** dpawlik has joined #openstack-infra13:08
mnaserhttps://review.openstack.org/#/c/605990/113:08
AJaegermnaser: best talk with dhellmann and eumel813:09
mordredyeah - we discussed translated docs at the ptg (exciting)13:09
AJaegermnaser: I think the building is not done - just pushing to translation server and back.13:09
mnaserAJaeger: ahhh okay13:09
mnaseroh13:09
mnaserthere's a `build-tox-manuals-checklang` job13:09
mnaserwhich we dont run13:09
mnasereumel8: whenever you're around, let me know what it takes to make it possible for the docs to be seen in HTML :)13:10
AJaegermnaser: that is only for openstack-manuals and friends, don't use it13:11
mnaserfine i'll make my own then if i cant use yours >:(13:11
mnaser:p13:11
AJaegermnaser: idea is to enhance docs tox environment - so you use openstack-tox-docs ;)13:12
AJaegermnaser: I'm happy to converge in the end to a common job for this if that's the way forward...13:12
AJaegermnaser: right now checklang has some "strange" requirements13:13
*** psachin has quit IRC13:13
eumel8mnaser: dunno. There were some discussions during PTG which I didn't attend. Best to ask dhellmann or ianychoi. My first shot was wrong: https://review.openstack.org/#/c/604568/ Now I don't know how to proceed.13:13
*** agopi|brb has joined #openstack-infra13:14
*** agopi|brb is now known as agopi|afk13:14
AJaegermnaser: http://lists.openstack.org/pipermail/openstack-dev/2018-September/134609.html13:14
openstackgerritMerged openstack-infra/project-config master: switch all official python projects to python3 publishing job  https://review.openstack.org/59832313:16
*** felipemonteiro has joined #openstack-infra13:18
AJaegermnaser: btw. feel free to get back your 50 nodes anytime - and self approve https://review.openstack.org/#/c/606059/ ...13:18
*** zul has joined #openstack-infra13:18
mnaserAJaeger: yep, that's the plan13:19
mnaserthank you13:19
mnaseri guess we'll have to wait till bridge kicks a nodepool run13:20
*** ramishra has quit IRC13:22
openstackgerritAndreas Jaeger proposed openstack-infra/openstack-zuul-jobs master: Remove release-openstack-server and publish-xstatic templates  https://review.openstack.org/53183013:22
AJaegermnaser: followup cleanup ^13:24
AJaegermnaser: ignore, needs more work ;(13:24
mnaserfungi: sorry to bother you but could you get puppet running on nodepool? i just want to make sure everything starts cleanly so i get back to other stuff13:24
dhellmanneumel8 : mordred was going to work with you and ianychoi on updating the underlying job. If you work on a script that does the translation build for an existing HTML build created by "tox -e docs" I think that would be a good next step.13:24
*** efried is now known as fried_rice13:25
*** derekh has quit IRC13:26
*** derekh has joined #openstack-infra13:26
fungimnaser: i can kick the launchers manually, sure13:27
fungijust a sec13:27
eumel8dhellmann: ok, then I have to catch mordred and ianychoi, thx.13:28
mnaserfungi: thank you :)13:33
*** brokencycle has joined #openstack-infra13:33
brokencycleHi! I am unhappy with the website, www.openstack.org13:33
mnaserbrokencycle: whats up?13:33
dhellmanneumel8 : your existing script is probably a good start, it just needs to take into account the right starting assumptions13:33
mnaserif there's something in specific i can help point to the right folks to help address any issues13:34
*** david-lyle has joined #openstack-infra13:34
brokencyclemnaser: In particular, it seems to be impossible to open some parts of the website with a right click in a new tab.13:34
mnaserbrokencycle: are you talking about the summit schedule pages?13:34
AJaegerfungi, didn't clarkb disable manually OVH? I see the graph growing far too much for nodepool...13:35
brokencycleEg. when I look at the project list, https://www.openstack.org/software/project-navigator/deployment-tools, if I right-click on any of them, the new tab just opens as 'www.openstack.org'. If I left-click on the same link, I get to the actual project.13:35
brokencycleBut I want to right click and open the project's page in a new tab, not the existing one.13:36
mnaserbrokencycle: you are right, looking at the html code, i see <a href> but without a link there13:36
mnaserbrokencycle: could you pm me your email and i can start an email thread with someone that can help you?13:37
*** dklyle has quit IRC13:37
*** bhavikdbavishi has quit IRC13:39
mnaserbrokencycle: voila, fired off an email, foundation staff are awesome when it comes to this so i expect things to clear up soon13:39
mnaserthanks for letting us know13:39
AJaegerfungi, did you see my comment above?13:41
eumel8dhellmann: from my understanding you want everything to build into the repo. But that requires to have the similar script in each repo with translation. I tried to centralize it, in the wrong way. Second solution would be to bring this script into the repo like in openstack-manuals, so tox -edocs builds the whole documentation13:43
*** zzzeek has quit IRC13:43
mnaserdoes anyone know how we can make pbr generate version for project via cli?13:44
mnaseropenstack ansible maintains a hard coded variable for the version we're at right now13:44
mnaserwe'd like to use pbr instead, we can run a local lookup somehow13:44
mnaserpbr info or pbr sha hasnt helped too much13:44
*** zzzeek has joined #openstack-infra13:45
dmsimardmnaser: https://github.com/openstack/ara/blob/master/ara/__init__.py13:45
mnaserah so we might need to run a python script13:45
dmsimardmnaser: it probably wouldn't be too hard to do a one liner ?13:45
mnaseri mean probably cleaner to get a small python script probably.. i think13:45
mnaserthat way we run it with our own python virtualenv13:45
mnaserah damn13:47
mnaserbut we don't actually install the package locally13:47
*** zzzeek has quit IRC13:48
mnaseroh we do nevermind13:48
dmsimardmandre: /opt/venv/bin/python -c 'import pbr.version; print(pbr.version.VersionInfo("foo").version_string())' ?13:48
dmsimarder, mnaser ^13:49
mnaserlemme try that13:49
*** zzzeek has joined #openstack-infra13:49
mnaser /opt/ansible-runtime/bin/python -c 'import pbr.version; print(pbr.version.VersionInfo("openstack-ansible").release_string())'13:49
mnaserworks perfectly13:50
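[Editor's note] The one-liner above depends on pbr being importable in that virtualenv. As a rough alternative sketch, assuming the package is installed in the interpreter being used (which mnaser confirms at 13:48), the standard library can read the same installed-package metadata. The helper name installed_version is hypothetical and not part of pbr or openstack-ansible.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version string for dist_name, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        # The distribution is not installed in this interpreter's environment.
        return None


if __name__ == "__main__":
    # "openstack-ansible" is the distribution name used in the one-liner above.
    print(installed_version("openstack-ansible"))
```

This only reports what is installed, so it will not reflect uncommitted git state the way a pbr lookup in a source tree might.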
dmsimard\o/13:50
fungiAJaeger: clarkb increased max-servers in bhs1 to something like 20 yesterday and it seemed to be mostly holding but going above that ended up with more port leak/pileup13:51
*** agopi|afk is now known as agopi13:53
dhellmanneumel8 : the job can check out the doc tools repo so we don't have to have a copy of the script everywhere13:54
dhellmanneumel8 : or we can put the script in the repo where the job is defined13:54
dhellmannmnaser : "python setup.py --version"13:54
mnaserdhellmann: wow that was an obvious one13:55
mnaserlol13:55
*** yamamoto has joined #openstack-infra13:55
AJaegerfungi: looking at grafana: All is fine again13:55
dhellmannmnaser :-)13:55
*** eernst has joined #openstack-infra13:59
*** felipemonteiro has quit IRC14:03
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: remove job settings for ironic repositories  https://review.openstack.org/59247214:06
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Simplify vexxhost nodepool configuration  https://review.openstack.org/60546914:08
pabelangerAJaeger: mnaser: dmsimard: fungi: ^ should reduce some copypasta for vexxhost for nodepool14:09
*** rossella_s has quit IRC14:10
AJaegermnaser: I'd like you to +2 first ;) Happy to +2 afterwards...14:12
mnaserpabelanger: AJaeger added a comment.. i'd like us to bring ca-ymq-1 to queens before we bring bfv back14:13
eumel8dhellmann: That confused me because I was in the docs tool repo and AJaeger mentioned it's the wrong place ;)14:13
pabelangermnaser: Oh, I didn't see that was a different14:13
pabelangeryah, that won't work then14:13
*** rossella_s has joined #openstack-infra14:14
AJaegereumel8, dhellmann, we discussed putting this in a role in ansible. But yes, script can live anywhere...14:14
mnaserinfra-root: i suspect some form of quota is being hit at sjc1 .. any nodepool logs to help me bump up the right thing?14:16
AJaegereumel8: maybe I read too much in dhellmann's email...14:16
*** jamesmcarthur has quit IRC14:18
*** jamesmcarthur has joined #openstack-infra14:18
*** bnemec is now known as beekneemech14:20
eumel8AJaeger, dhellmann: my focus was also to build docs locally. With an Ansible Role you need to download first and setup Ansible for testing the docs. That looks complicated to me14:21
AJaegerdhellmann: ironic is done, isn't it?14:22
corvusmnaser: openstack.exceptions.SDKException: Error in creating the server: Build of instance ef7fbe70-d2da-433c-8368-234de8a20db3 aborted: VolumeSizeExceedsAvailableQuota: Requested volume or snapshot exceeds allowed gigabytes quota. Requested 80G, quota is 5120G and 5120G has been consumed.14:22
dhellmannAJaeger , eumel8 : we want the job to run the script so we don't have to update tox.ini. I don't really mind where we put the script, as long as it is in a place where we can update it if we have to.14:24
dhellmannAJaeger : I'm still catching up; dealing with TC election stuff this morning14:24
dhellmanneumel8 : if you write the script so the ansible role can call it, that should give us the best of both worlds.14:25
AJaegerdhellmann: and I would put the script in openstack-zuul-jobs then14:25
dhellmannAJaeger : sounds good to me14:25
AJaegerdhellmann: just run your goal tools script after pushing ironic for some +2As;)14:26
*** edmondsw_ has joined #openstack-infra14:26
eumel8dhellmann: okay, will think about it, thx14:27
*** edmondsw has quit IRC14:29
*** edmondsw_ is now known as edmondsw14:29
*** agopi is now known as agopi|afk14:29
*** roman_g has quit IRC14:30
*** quiquell is now known as quiquell|off14:31
*** bobh has joined #openstack-infra14:32
*** electrofelix has quit IRC14:34
mnasercorvus: thanks, let me do some math14:36
mnasercorvus: disk quota bumped to 6144 which should put us in a good place14:36
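[Editor's note] The arithmetic behind the quota bump can be sketched as follows. It assumes every node requests the same 80G volume reported in the error at 14:22; the function names are hypothetical.

```python
def volumes_that_fit(quota_gb, volume_gb):
    """How many whole volumes of volume_gb fit inside quota_gb."""
    return quota_gb // volume_gb


def headroom_gb(quota_gb, consumed_gb):
    """Gigabytes still available under the quota."""
    return quota_gb - consumed_gb


if __name__ == "__main__":
    # Figures from the error message and the bump above.
    print(volumes_that_fit(5120, 80))   # old quota
    print(volumes_that_fit(6144, 80))   # new quota
    print(headroom_gb(6144, 5120))      # room gained if usage stays at 5120G
```

Under that assumption the old quota was exactly full, which is why a single further 80G request was rejected.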
*** jamesmcarthur has quit IRC14:38
*** jamesmcarthur has joined #openstack-infra14:38
*** jamesmcarthur has quit IRC14:41
*** jamesmcarthur has joined #openstack-infra14:42
mnasercool i think we're good14:44
mnaseri see ~142 in use14:44
*** felipemonteiro has joined #openstack-infra14:44
*** armax has joined #openstack-infra14:45
*** gfidente is now known as gfidenteN00b14:46
*** jamesmcarthur has quit IRC14:46
openstackgerritMerged openstack-infra/system-config master: Only replicate openstack namespaces to github  https://review.openstack.org/60548614:52
AJaegerzuul experts, I'm confused https://review.openstack.org/#/c/593884/ runs openstack-tox-py35 but we disabled that with change https://review.openstack.org/605893 . Is that change not rolled out? Or anything wrong with it?14:53
*** jamesmcarthur has joined #openstack-infra14:55
AJaegercorvus, mordred, any ideas? ^14:56
*** e0ne has quit IRC14:58
*** Bhujay has quit IRC14:58
*** jamesmcarthur has quit IRC14:59
*** jamesmcarthur has joined #openstack-infra15:00
AJaegertbarron just asked the same question in #zuul - we can discuss there as well15:01
tbarronAJaeger: ty :)15:01
*** e0ne has joined #openstack-infra15:02
AJaegertbarron: I'm still puzzled ;)15:03
corvustbarron, AJaeger: 2018-09-28 14:30:30,576 DEBUG zuul.layout: Pipeline variant <Job openstack-tox-py35 branches: None source: openstack-infra/openstack-zuul-jobs/zuul.d/project-templates.yaml@master#515> matched <Change 0x7f183b99f15:03
corvus6a0 openstack/manila-ui 593884,2>15:03
*** smarcet has joined #openstack-infra15:03
corvustbarron, AJaeger: that points to this: http://git.openstack.org/cgit/openstack-infra/openstack-zuul-jobs/tree/zuul.d/project-templates.yaml#n51515:04
AJaegercorvus: so, the "branches: ^(?!stable/(ocata|pike|queens)).*$" get ignored?15:05
corvusthat project does use that project-template15:05
corvusAJaeger: for that invocation, not for the other15:05
AJaegercorvus: so, we need to remove the template as well here?15:05
corvusAJaeger: yes15:05
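[Editor's note] A simplified model of what corvus describes, not Zuul's actual code: the job is listed twice for the project, once with the branch regex and once via the project-template with "branches: None", and it runs if any of those entries matches. The function names are hypothetical; the regex is the one AJaeger quotes.

```python
import re

# Branch matcher quoted at 15:05: everything except three stable branches.
BRANCH_RE = re.compile(r"^(?!stable/(ocata|pike|queens)).*$")


def variant_matches(branches, branch):
    """A variant with no branch matcher (None) matches every branch."""
    if branches is None:
        return True
    return branches.match(branch) is not None


def job_runs(variants, branch):
    """Simplified rule: the job runs if any listed variant matches."""
    return any(variant_matches(v, branch) for v in variants)


if __name__ == "__main__":
    # Only the regex-limited entry: excluded branches are skipped.
    print(job_runs([BRANCH_RE], "stable/pike"))
    # Regex entry plus the template entry with branches: None.
    print(job_runs([BRANCH_RE, None], "stable/pike"))
```

In this model the unrestricted template entry always wins, which is why removing the template from project-config is the fix.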
AJaegercorvus: ah, did that semantic change in the last months? I might have missed that...15:05
*** sthussey has joined #openstack-infra15:05
corvusAJaeger, tbarron: you can add the project-template in-repo to just the branches it should apply to15:06
AJaegertbarron: know what to do? I'm happy to +2 a change to remove the template completely15:06
AJaegercorvus: that's what they will do as part of python3-first migration.15:06
AJaegertbarron: I think corvus is right, you converted master etc already15:06
AJaegertbarron: so, just remove from project-config the rest and you get what you want...15:06
AJaegertbarron: that allows you to finish conversion15:07
*** dave-mccowan has joined #openstack-infra15:07
AJaegertbarron: still with us?15:07
tbarronAJaeger: just trying to follow :)15:08
corvusAJaeger: (also, btw, you can find the same debug information in the "_inheritance_path" variable here: http://logs.openstack.org/84/593884/2/check/openstack-tox-py35/d7efc50/zuul-info/inventory.yaml)15:08
AJaegercorvus: ah, thanks!15:09
AJaegertbarron: ok, waiting for a change by manila team and will guide you through it...15:10
AJaegertbarron: change for project-config15:10
* tbarron is fetching fresh project-config15:12
*** dave-mccowan has quit IRC15:13
tbarronAJaeger: should I be looking at zuul.d/projects.yaml?  openstack-manila-ui project?15:16
AJaegertbarron: yes, similar to https://review.openstack.org/#/c/605893/15:16
AJaegertbarron: just remove everything py35 related from manila-ui ;)15:16
*** rossella_s has quit IRC15:16
AJaegertbarron: the delete key is your key to success for that change ;)15:17
*** rossella_s has joined #openstack-infra15:19
openstackgerritTom Barron proposed openstack-infra/project-config master: Remove py3 jobs for manila-ui project  https://review.openstack.org/60611415:20
tbarronAJaeger: ^^15:20
*** bhavikdbavishi has joined #openstack-infra15:23
*** ginopc has quit IRC15:24
*** yamamoto has quit IRC15:25
AJaegertbarron: one line too much - otherwise ok15:25
openstackgerritTom Barron proposed openstack-infra/project-config master: Remove py3 jobs for manila-ui project  https://review.openstack.org/60611415:28
AJaegertbarron: LGTM, +2 - any other config-core to +2A ^, please? That allows manila team to finish python3-first imports...15:28
tbarronAJaeger: ty!15:29
AJaegertbarron: due to backlog, this will take at least two hours until we have it tested...15:29
tbarronAJaeger: kk, gouthamr will be getting up in Seattle by then :)15:29
AJaeger;)15:30
tbarronthough it looks like he was working quite late last night15:30
AJaegerthen he deserves his rest..15:32
*** ykarel is now known as ykarel|away15:35
*** lbragstad is now known as elbragstad15:38
*** zul has quit IRC15:41
fungiclarkb: did you see the reply from amorin? looks like we should be okay to crank bhs1 back up to max again15:42
fungii'll prep the change15:42
clarkbfungi: I haven't yet, still booting my day ++ to getting things rolling15:45
mnaserfungi: i think it happened indirectly15:45
mnaserwhen you kicked off nodepool15:45
fungioh, maybe15:46
*** adriancz has quit IRC15:46
fungialso my internet at the house here is out, so i may not be pushing any changes for a bit anyway15:46
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Move openstackdocstheme api-ref job in-tree  https://review.openstack.org/60461015:47
clarkbfungi: maybe manually set it to 80 again on nl04?15:51
clarkbthat seemed to trigger the behavior quickly last time15:51
*** yamamoto has joined #openstack-infra15:51
mrhillsmanhow does one trigger periodic pipeline? i tried but maybe i am doing it wrong15:52
mrhillsmanalso using github15:53
mrhillsmanzuul enqueue-ref --tenant openlab --trigger github --pipeline cloud-provider-openstack-acceptance-test-e2e-conformance-stable-branch-v1.12 --project kubernetes/cloud-provider-openstack --ref refs/heads/master15:53
mrhillsmani think the --ref is wrong maybe?15:53
clarkbthe trigger is periodic not github I think15:54
fungiclarkb: i ran kick.sh against nl* a little while ago, so as mnaser noted i may have inadvertently cranked it back up to max anyway15:54
clarkbfungi: ah ok does that ignore the emergency file?15:54
mrhillsmanah ok15:57
mrhillsmantrigger is timer15:57
fungiclarkb: http://grafana.openstack.org/d/BhcSH5Iiz/nodepool-ovh?orgId=1&var-region=ovh-bhs1 would suggest so15:58
clarkbok we'll have to watch it then. I will remove nl04 from the emergency file now16:01
fungithanks16:02
fungii also will need to perform some local surgery to tether the machine from which i ssh to openstack servers if this internet outage persists, so logging into things in general is a bit of a pain at the moment16:02
dhellmannI see the post queue is continuing to grow. Where did we come down on the decision about queue priorities yesterday?16:12
*** graphene has quit IRC16:12
clarkbdhellmann: I think you mostly convinced corvus that it could be changed at this point (since we don't run coverage jobs (or at least many coverage jobs) there anymore)16:12
clarkbdhellmann: are you interested in pushing the change up to update the priority on post or do you want one of us to do it?16:13
dhellmannif I propose a patch, does landing it trigger the change or does zuul need to restart?16:13
dhellmannI'm happy to do it if we think it's a good idea16:13
*** graphene has joined #openstack-infra16:13
clarkbdhellmann: I want to say landing it is sufficient since that is part of the reloadable config. What I don't know is if existing node requests will have their priorities updated16:13
clarkbShrews: ^ may know off the top of his head16:14
dhellmannwhat are valid values for "precedence"? high and low? or high, medium, and low?16:14
*** fried_rice is now known as fried_rolls16:15
dhellmannI see a promote pipeline in there now; is anything using that?16:15
AJaegerclarkb: we run coverage only on stable branches - but I tried to update master everywhere and will continue so...16:15
clarkbhttps://zuul-ci.org/docs/zuul/user/config.html#attr-pipeline.precedence high normal low16:15
dhellmannclarkb : thanks16:15
clarkbdhellmann: oddly only the infra CD jobs use promote that I know of. Its sort of a WIP16:15
clarkbAJaeger: yup I think it was your work refactoring that that helped16:15
dhellmannok, I'll leave promote alone for now16:16
AJaegerlet me rephrase: clarkb: we run coverage *in post* only on stable branches - but I tried to update master everywhere to move cover to check and will continue so...16:16
AJaegerclarkb: ianw and my work...16:16
clarkbAJaeger: gotcha, still much fewer coverage jobs running in post then16:16
AJaegerclarkb: yes, much fewer. I considered it not worth updating stable branches for this.16:17
*** aojea has quit IRC16:17
*** dpawlik has quit IRC16:18
*** manjeets has joined #openstack-infra16:19
clarkbthank you AJaeger ianw and mnaser for those project-config reviews16:20
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: make pipeline precedence progressively higher  https://review.openstack.org/60612916:20
dhellmannclarkb : let me know what you think of that ^16:21
Shrewsclarkb: priorities of existing requests will not change16:21
dhellmannah, well, that's a shame16:21
*** mriedem is now known as mriedem_lunch16:21
AJaegerdhellmann, clarkb that makes periodic and check the same priority - right now periodic is low and check is normal. Is that kind of change ok?16:23
dhellmannhmm16:24
dhellmannit seems we have only 3 available levels but 4 desired levels16:24
*** gyee has joined #openstack-infra16:24
AJaegeryep16:24
clarkbya that is an unfortunate gearman protocol exposure16:25
dhellmannah16:25
dhellmanntoo bad it's not just an integer I guess16:25
clarkbwe may be able to workaround it now that we use zk for the node requests, but would require changes16:25
clarkb(but the three values are a holdover from gearman for sure)16:25
dhellmannI think having periodic and check both set to low is probably ok16:26
dhellmannthe point is to stop later parts of the process from being hung up if jobs keep entering the earlier part16:26
Shrewsyeah, for zk requests, it's just a number. we can have as many as we want in reality16:26
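[Editor's note] A rough sketch of the idea in this exchange: three named precedence levels mapped onto numbers, with requests served lowest number first and oldest first within a level. The numeric values and the pipeline-to-precedence assignments below are illustrative assumptions, not taken from the project-config change or from Zuul's source.

```python
# Illustrative numbers only (assumption); lower sorts first.
PRIORITY = {"high": 100, "normal": 200, "low": 300}


def queue_order(requests):
    """Order (pipeline, precedence, sequence) tuples by priority, then age."""
    return sorted(requests, key=lambda r: (PRIORITY[r[1]], r[2]))


if __name__ == "__main__":
    # Hypothetical assignment: check and periodic share the lowest level.
    requests = [
        ("check", "low", 1),
        ("post", "high", 2),
        ("periodic", "low", 3),
        ("gate", "normal", 4),
    ]
    for pipeline, precedence, seq in queue_order(requests):
        print(pipeline, precedence, seq)
```

Because the key is just an integer, a fourth level would only need another entry in the mapping, which is the point Shrews makes about ZooKeeper requests.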
clarkbinfra-root https://review.openstack.org/#/c/605583/1 is a fix for the dns problems we've had on fedora during infra jobs (other jobs use the normal base job and should be fine)16:28
clarkbianw found a bug in unbound in the process too16:28
*** zul has joined #openstack-infra16:28
*** ykarel_ has joined #openstack-infra16:29
*** ykarel|away has quit IRC16:30
*** ykarel__ has joined #openstack-infra16:31
*** yamamoto has quit IRC16:31
*** ykarel_ has quit IRC16:31
clarkbdhellmann: I think we should set the precedence on the third party check pipeline at the end of that file too. Otherwise lgtm16:33
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: make pipeline precedence progressively higher  https://review.openstack.org/60612916:33
clarkb(I left comments on the change)16:33
dhellmannclarkb : I just saw your comment and did update the precedence16:33
clarkbperfect16:33
* dhellmann goes for food16:35
clarkbfungi: reading the ovh bhs1 graph I don't think the cloud is entirely happy, but at the same time it isn't spiralling out of control16:36
clarkbfungi: likely other growing pains on the new version that happen to be less fatal to nodepool16:36
fungiyeah, looks like boot and delete are taking a while maybe16:36
*** roman_g has joined #openstack-infra16:37
clarkbhopefully nodepool's usage of that cloud region is constructive feedback for ovh :)16:38
*** ianychoi has quit IRC16:39
clarkbjungleboyj: is http://logs.openstack.org/22/600722/1/gate/legacy-tempest-dsvm-neutron-full/fe7320a/job-output.txt.gz#_2018-09-28_15_52_01_683952 a known issue with cinder/tempest?16:39
AJaegerdhellmann: ok to approve https://review.openstack.org/592472 now? Ironic should be ready...16:41
clarkbmriedem_lunch: melwitt in http://logs.openstack.org/06/604906/5/gate/openstack-tox-py35/c633c4e/ nova uses MoxStubout which is apparently deprecated and creates a very large log file. Might be something worth cleaning up to make it easier to read the logs when things fail16:43
AJaegerclarkb: could you +2A https://review.openstack.org/606114  - to help manila to finish python3-first, please? I expect it takes some more time to pass tests...16:43
*** ianychoi has joined #openstack-infra16:44
clarkbAJaeger: done16:44
*** Dobroslaw has quit IRC16:45
*** dtantsur is now known as dtantsur|afk16:45
clarkbmriedem_lunch: melwitt as for why that job failed it appears that the nova os profiler test never returned control back to the calling process after completing its test run16:45
clarkbthen the job timed out16:46
*** smarcet has quit IRC16:46
*** rossella_s has quit IRC16:51
*** jpich has quit IRC16:52
*** mdbooth has joined #openstack-infra16:53
mdboothHello, looking at https://docs.openstack.org/infra/elastic-recheck/readme.html#adding-bug-signatures but I don't see which repo that stuff is in16:53
clarkbmdbooth: it is in the elastic-recheck repo itself16:54
clarkbopenstack-infra/elastic-recheck16:54
*** priteau has joined #openstack-infra16:54
mdboothclarkb: I don't know? I'll check it out and look, thanks.16:54
clarkbmdbooth: there is a link to them in that document16:54
clarkbthe git.openstack.org link16:54
*** rossella_s has joined #openstack-infra16:54
mdboothclarkb: I'm aware from PTG trivia night that infra has more repos than anybody else :P16:54
mordredmdbooth: the question is - does infra have more repos than the rest of openstack combined?16:55
*** derekh has quit IRC16:55
clarkbmriedem_lunch: following up on the yum install bug, the vast majority of the hits on that are from tripleo and in those tripleo cases it fails due to dns resolution failures to the infra mirror. Checking all but one of our mirrors have a dns ttl of an hour  (I will fix the one with a 5 minute ttl), but this affects all the cloud regions so don't expect that is the cause16:55
mdboothmordred: I wouldn't be surprised :)16:55
clarkbmriedem_lunch: I expect that something in the jobs themselves is causing problems for dns16:56
clarkbmriedem_lunch: those jobs don't fail 100% of the time because yum will try other mirrors if the first one doesn't resolve16:56
jungleboyjclarkb:  That one doesn't look familar to me.16:57
jungleboyjsmcginnis:  ^^16:57
clarkbjungleboyj: smcginnis ok I think http://status.openstack.org/elastic-recheck/gate.html#1794143 is the bug for that, we are just behind on indexing so the more recent occurrences haven't shown up there16:57
*** jpena is now known as jpena|off16:57
clarkb(took me a while to dig that up)16:58
*** ykarel__ has quit IRC17:00
clarkbhttp://status.openstack.org/elastic-recheck/gate.html#1793370 causes job failures, but if zuul is doing its job properly all of those failures should be retried (because network connectivity losses like that should trigger a retry)17:00
clarkbI wonder if we can easily tell if zuul is retrying those jobs17:00
mordredclarkb: zuul isn't going to retry those, as those are job-content failures in post jobs17:02
clarkbmordred: not all of them are post, some of them are copying ssh keys in pre17:04
clarkb(actually it seemed a lot of them were because if networking to the instance is flaky we hit it early rather than late)17:04
clarkbmordred: the title is too specific17:04
clarkbmordred: http://logs.openstack.org/98/603498/3/gate/openstack-tox-pep8/84b0c29/job-output.txt#_2018-09-27_21_44_42_572812 is an example. It actually fails early because networking doesn't work. Then we also fail trying to collect the logs in post17:05
clarkbthat job should be restarted right?17:06
clarkbhttp://logs.openstack.org/50/603050/3/gate/openstack-tox-py36/f29117b/job-output.txt#_2018-09-26_08_47_23_846898 same with that one17:06
*** bobh has quit IRC17:07
clarkbhttp://logs.openstack.org/12/602112/3/gate/openstack-tox-py36/9612f0c/job-output.txt#_2018-09-26_08_47_22_578970 and so on17:07
jungleboyjclarkb:  That elastic recheck bug looks different as that is on a retype, not an extend.17:08
mordredclarkb: yah - if pre failed we should totally re-try it17:08
clarkbjungleboyj: ah ok17:08
*** psachin has joined #openstack-infra17:08
mordredclarkb: I wonder if we can detect that we're in a post job that's associated with a job that failed in pre and send an additional *something* to elasticsearch?17:09
clarkbmordred: ok thanks for confirming my understanding of that. I had deprioritized debugging that problem because it looks like it's across all the providers and we should retry in many of the cases17:09
clarkbmordred: maybe a zuul indication of whether or not the failure is fatal?17:09
*** gfidenteN00b has quit IRC17:10
mordred++17:10
mordredclarkb: because collecting info on things that failed in pre is still useful - but also being able to filter out those failures since zuul handles them as 'expected' types of failures we can retry on17:10
jungleboyjThere are issues around actions like volume extension that can be teased out depending on the load on the system.  I am guessing this is one of these edge cases where it takes longer than expected.17:10
melwittclarkb: thanks for the heads up. will look at that17:11
*** trown is now known as trown|lunch17:11
clarkbmordred: yup exactly.17:12
clarkbmordred: checking the two py36 job failures against the changes they ran against there are no reported py36 failures to those changes. I think that means the job retries are working as expected17:13
clarkbmordred: I expect too that in the old system the vast majority of these failures were weeded out by our ready script17:13
clarkbmordred: but now we've shifted that into zuul jobs themselves17:13
*** bobh has joined #openstack-infra17:13
mordredyah17:13
*** mdbooth has quit IRC17:15
clarkbmwhahaha: looking at logstash for http://status.openstack.org/elastic-recheck/gate.html#1708704 it seems that the yum install of dstat in the overcloud has this issue quite a bit. I think that points at something in the job around installing dstat that makes dns flaky? It is weird that that one package install (when all the other package installs are happening) is a problem17:17
mwhahahaclarkb: maybe we're pulling it from a different repo? not sure. i don't think that's the cause but rather a symptom17:18
clarkbmwhahaha: also we don't seem to collect the dstat logs in the overcloud, the logs are collected from the undercloud though17:18
*** bobh has quit IRC17:21
mwhahahaclarkb: so i pulled up the logstash and a job where that happened was actually a successful job, we didn't fail on dstat17:22
mwhahahahttp://logs.openstack.org/20/603220/1/gate/tripleo-ci-centos-7-scenario003-multinode-oooq-container/43093ac/job-output.txt17:22
clarkbjungleboyj: http://logs.openstack.org/44/598244/2/gate/openstack-tox-lower-constraints/b21ebac/job-output.txt.gz#_2018-09-28_17_03_18_365816 that is the bug I linked?17:22
clarkbmwhahaha: correct, it is failing over to some other mirror17:23
mrhillsmanin the gate pipeline i see "Queue: integrated", how do you set the value there; i.e. i want "Queue: arbitrary"17:24
*** jtomasek has joined #openstack-infra17:24
clarkbmwhahaha: mostly pointing it out because the behavior is odd and it is happening a lot17:24
mwhahahaclarkb: so it's actually failing post deployment, that's really weird17:24
mwhahahaweshay_ruck: -^ fyi17:24
jungleboyjclarkb:  No, that is a different one but we were made aware of that one yesterday.17:25
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Use bionic for openstack-manuals publishing  https://review.openstack.org/60614717:25
*** anteaya has joined #openstack-infra17:25
clarkbmrhillsman: https://zuul-ci.org/docs/zuul/user/config.html#attr-project.%3Cpipeline%3E.queue17:25
jungleboyjThat one is being looked into.17:25
mrhillsmanty sir17:25
mwhahahai wonder if unbound is getting stomped on by a dnsmasq or something17:25
AJaegerconfig-core, could I get a +2A on https://review.openstack.org/606147 to fix openstack-manuals publishing, please?17:25
*** yamamoto has joined #openstack-infra17:26
*** rossella_s has quit IRC17:27
*** harlowja has joined #openstack-infra17:27
mwhahahaclarkb: i figured it out, it's designate17:28
clarkbmwhahaha: neat17:28
mwhahahaclarkb: it's conflicting with unbound on scenario00317:28
mwhahahaso post-deployment, dns no longer works17:28
mwhahahabeekneemech, weshay_ruck -^17:28
mwhahahalet me file a bug17:28
*** rossella_s has joined #openstack-infra17:30
*** roman_g has quit IRC17:31
*** eernst has quit IRC17:32
beekneemechmwhahaha: Isn't unbound running on the undercloud though?17:34
mwhahahabeekneemech: multinode, it's run on both. and the default resolv.conf points to 127.0.0.117:34
beekneemechOr does it run on both?17:34
beekneemechAh. :-/17:34
mwhahahabeekneemech: https://bugs.launchpad.net/tripleo/+bug/179504317:34
openstackLaunchpad bug 1795043 in tripleo "designate's named is conflicting with unbound in CI scenario003" [High,Triaged]17:34
mwhahahanot completely sure why, but it's pretty consistent on scenario003 according to logstash17:34
clarkbmriedem_lunch: for the pip no packages found issue, the two most recent occurrences of that were ara trying to install a package that didn't support the local version of python. Previous to that we had the broken mirrors in limestone and gra1 both of which should be fixed17:34
clarkbmriedem_lunch: we should keep tracking that but I think its a non issue for the last ~4 days17:35
mwhahahaspeaking of pip, did the version of ansible recently get updated on the images?17:35
mwhahahawe've noticed something is pip installing ansible 2.6.417:35
openstackgerritMerged openstack-infra/zuul master: Fix node leak on job removal  https://review.openstack.org/60552717:35
mwhahahawhich has broken some of our stable jobs17:35
* weshay_ruck reading through it17:36
clarkbmwhahaha: devstack-gate installs its own ansible in a virtualenv which was updated. I don't think other things are expected to use that ansible install17:37
clarkbit is a devstack gate implementation detail and not a contract with everyone else17:37
mwhahahaclarkb: yea we're seeing it on the actual host itself17:37
mwhahahastarting as of 2 days ago17:37
clarkbmwhahaha: I don't think we install ansible on the test nodes themselves out side of the jobs17:37
mwhahahasomething is, not sure what though because we use packages or a venv17:38
*** smarcet has joined #openstack-infra17:38
mwhahahahttp://logs.openstack.org/24/567224/109/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/8b476ee/logs/undercloud/var/log/extra/pip.txt.gz vs http://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/448effe/logs/undercloud/var/log/extra/pip.txt.gz17:38
*** eernst has joined #openstack-infra17:39
clarkbI don't think it is infra17:39
mwhahahabut it should be 2.4.4.0 from the package http://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/448effe/logs/undercloud/var/log/extra/rpm-list.txt.gz17:39
mwhahahak i'll continue to poke at it17:39
fungicould it be the old devstack-gate logic which uses ansible to orchestrate the overlay networking for multi-node configurations (or something oooq lifted from d-g a while back)?17:40
clarkbfungi: that is in a dedicated d-g virtualenv in /tmp17:40
fungiahh, nm17:40
clarkbmwhahaha: could it possibly be ara?17:40
clarkbI don't think ara deps on ansible but maybe that changed?17:40
clarkbfungi: we intentionally did it that way for this reason. People run ansible in the jobs and ansible is not really consistent over minor version releases for compat17:41
mwhahahayea not sure, yet. our logs point to older versions being pip installed17:41
clarkbansible does not show up in our dib elements so doubt it is coming from that17:41
logan-yes newer ara versions have an ansible version requirement, if you dont pin ara it will pull in newer ansible. broke a few jobs where i had it unpinned last week17:42
mwhahahak i'll keep trying to dissect the logs then17:42
fungithanks logan-!17:43
clarkbdmsimard: ^ you are probably interested in this17:43
dmsimardI am dmsimard and I might be interested in this17:43
*** e0ne has quit IRC17:44
dmsimardlogan-: newer ansible ? the pin is currently >=2.4.5 which is the lowest version not currently EOL :p17:45
dmsimardhttps://docs.ansible.com/ansible/latest/reference_appendices/release_and_maintenance.html17:45
dmsimardmwhahaha: ^17:46
dmsimardclarkb: ara 0.x does depend on ansible because it leverages ansible to configure itself (ansible.cfg etc)17:47
mwhahahaK that's probably the issue17:47
clarkbdepending on how you use pip >=2.4.5 can pull in 2.6 say if you already have 2.5 installed17:48
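[Editor's note] A simplified sketch of why a ">=2.4.5" requirement can land on 2.6.x: any version at or above the floor satisfies it, and an installer that prefers the newest candidate takes the highest one. This is not pip's resolver; it ignores pre-releases, pip's upgrade strategy, and whatever is already installed. The candidate list is illustrative, built from version numbers mentioned in this log, and the helper names are hypothetical.

```python
def parse(version, width=4):
    """Turn '2.4.5' into (2, 4, 5, 0) so versions compare numerically."""
    parts = [int(p) for p in version.split(".")]
    return tuple(parts + [0] * (width - len(parts)))


def best_match(candidates, minimum):
    """Newest candidate that is >= minimum, or None if nothing qualifies."""
    ok = [c for c in candidates if parse(c) >= parse(minimum)]
    return max(ok, key=parse) if ok else None


if __name__ == "__main__":
    # Illustrative candidates (assumption): the packaged version, the
    # floor of the requirement, and the version seen being pip installed.
    candidates = ["2.4.4.0", "2.4.5", "2.6.4"]
    print(best_match(candidates, "2.4.5"))
```

Pinning the dependent package, as mwhahaha suggests for quickstart, narrows the candidates rather than changing this selection rule.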
*** jamesmcarthur has quit IRC17:48
mwhahahaweshay_ruck: we probably need to pin ara in quickstart17:50
*** agopi|afk is now known as agopi17:51
*** harlowja has quit IRC17:51
*** mriedem_lunch has quit IRC17:52
*** diablo_rojo has joined #openstack-infra17:52
*** harlowja has joined #openstack-infra17:52
*** auristor has quit IRC17:54
*** mriedem has joined #openstack-infra17:54
weshay_ruckmwhahaha, ara==0.15.017:55
dmsimard0.15.0 is kind of old17:56
*** manjeets has quit IRC17:56
*** david-lyle has quit IRC17:56
dmsimardMay 3rd17:56
dmsimard0.16.1 was released 24 days ago17:56
dmsimardthis is the pin for 0.15.0: https://github.com/openstack/ara/blob/41427039de3b9ed1859bb3afdc1f8629e6c72a7a/requirements.txt#L417:57
*** e0ne has joined #openstack-infra18:02
*** TheJulia is now known as needssleep18:02
*** auristor has joined #openstack-infra18:02
*** e0ne has quit IRC18:05
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: ensure the twine check command runs in the correct directory  https://review.openstack.org/60615218:05
dhellmannconfig-core: I think ^^ fixes an issue with the new packaging check job, as exhibited in the failure on http://logs.openstack.org/24/591624/2/gate/test-release-openstack-python3/3cf3d56/ara-report/result/3e1b200d-0ec8-4b96-9882-ece83edbe0e8/18:06
clarkblooking18:06
*** jcoufal has quit IRC18:08
clarkbdhellmann: I'm not sure if zuul_work_dir is valid in that context, but gave a path that is based on zuul inventory vars that should work18:11
clarkb(we use this alternate path in the post.yaml playbook)18:11
dhellmannclarkb : hmm, ok. I copied that out of the other playbook but maybe it was in a role or something18:12
dhellmannI see lots of other uses of that variable in18:13
dhellmannhttp://codesearch.openstack.org/?q=zuul_work_dir&i=nope&files=&repos=18:13
*** smarcet has quit IRC18:13
openstackgerritMerged openstack-infra/project-config master: Remove py3 jobs for manila-ui project  https://review.openstack.org/60611418:13
clarkbI think they define it in their defaults file let me check18:13
dhellmannsince I'm still learning, when you say "valid in that context" what is different about that context than any of the others?18:13
dhellmannah18:13
clarkbhttp://git.openstack.org/cgit/openstack-infra/zuul-jobs/tree/roles/bindep/defaults/main.yaml like that18:13
clarkblooks like that value might be better than the one I gave though as it is probably rooted18:14
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: ensure the twine check command runs in the correct directory  https://review.openstack.org/60615218:14
clarkbdhellmann: the ansible vars are global so if one of the roles in the check.yaml playbook defines zuul_work_dir it will be valid, but if they don't it won't be valid18:14
dhellmannyay for namespaces18:14
clarkbdhellmann: but I think it is bad to rely on side effects like that18:14
dhellmannok, I've updated it to use zuul.project.src_dir18:14
dhellmannyeah, I don't want to have it suddenly fail if something else is changed18:15
*** anteaya has quit IRC18:15
dhellmannI saw zuul in the name and assumed it was being defined by zuul. TIL18:15
clarkbdhellmann: the zuul.foo vars should be defined by zuul in the inventory and are safe to use anywhere in the job18:16
dhellmannyeah, but not zuul_foo18:17
clarkbthinking about this removing aliases like http://git.openstack.org/cgit/openstack-infra/zuul-jobs/tree/roles/bindep/defaults/main.yaml#n5 might be a good idea18:17
clarkbit really isn't any shorter to type or harder to understand but will be more consistent18:17
dhellmannyeah18:17
clarkb(I think some of the motivation there is to make these roles useable outside of zuul, but not sure how realistic that is)18:18
*** smarcet has joined #openstack-infra18:18
*** hasharAway is now known as hasharRlyAwy18:22
*** manjeets has joined #openstack-infra18:25
*** e0ne has joined #openstack-infra18:25
*** bobh has joined #openstack-infra18:26
*** yamamoto has quit IRC18:28
*** bobh has quit IRC18:31
AJaegerdhellmann: I'll approve the ironic python3-first change now...18:33
dhellmannAJaeger : ack18:33
dhellmannand thank you18:33
clarkbdhellmann: AJaeger I see you have debugged the stackviz issues with python3 in the past. Seems like we are still hitting that, is that a known issue?18:34
dhellmannstackviz doesn't ring any bells18:34
clarkbok thanks I'm working on a reproducer locally18:35
AJaegerclarkb: I have? Sorry, forgotten ;(18:36
clarkbbased on git logs it looks like you all added python3.6 testing? its ok I think I have a minimal ish reproduction18:36
*** trown|lunch is now known as trown18:37
AJaegerclarkb: yes, but debugging was only fixing bindep.txt - and then it worked by magic ;)18:37
*** bobh has joined #openstack-infra18:38
clarkbhttp://logs.openstack.org/71/605271/1/check/tempest-full-py3/cb623b6/job-output.txt#_2018-09-28_04_00_03_179985 is what I am looking at and appears to be some interaction between shutil.copyfileobj and input that isn't utf818:38
clarkbya sys.stdin has an encoding that is platform dependent, utf8 in this case, but we are trying to copy the data into another buffer and that triggers a fault because the input isn't utf818:40
clarkbI can trigger the bug by reading from sys.stdin as well18:40
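A minimal sketch of the failure mode described here, simulating a UTF-8 text-mode stdin with `io.TextIOWrapper` over in-memory bytes. The sample bytes are made up, and the two workarounds shown (copying raw bytes, or decoding with `errors="replace"`) are generic options, not necessarily what the stackviz fix does.

```python
import io
import shutil

# Made-up input containing bytes that are not valid UTF-8.
raw = b"ok line\n\xff\xfe not utf-8\n"

# A strict UTF-8 text wrapper behaves like sys.stdin under a UTF-8 locale.
text_in = io.TextIOWrapper(io.BytesIO(raw), encoding="utf-8")
try:
    shutil.copyfileobj(text_in, io.StringIO())
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True

# Option 1: copy bytes to bytes (the in-memory analogue of using sys.stdin.buffer).
binary_out = io.BytesIO()
shutil.copyfileobj(io.BytesIO(raw), binary_out)

# Option 2: keep text mode but substitute U+FFFD for undecodable bytes.
lenient_in = io.TextIOWrapper(io.BytesIO(raw), encoding="utf-8", errors="replace")
lenient_out = io.StringIO()
shutil.copyfileobj(lenient_in, lenient_out)

print(strict_failed)
print(binary_out.getvalue() == raw)
```

Copying bytes preserves the input exactly; the lenient decode loses the original bytes but never raises.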
*** rossella_s has quit IRC18:40
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Add api-ref job to ironic-inspector  https://review.openstack.org/59989018:41
*** david-lyle has joined #openstack-infra18:41
*** rossella_s has joined #openstack-infra18:42
*** david-lyle is now known as dklyle18:42
*** bobh has quit IRC18:42
*** smarcet has quit IRC18:43
*** mriedem has quit IRC18:44
openstackgerritMerged openstack-infra/project-config master: remove job settings for ironic repositories  https://review.openstack.org/59247218:44
*** felipemonteiro has quit IRC18:47
*** e0ne has quit IRC18:50
*** rossella_s has quit IRC18:50
*** smarcet has joined #openstack-infra18:55
*** rossella_s has joined #openstack-infra18:57
*** anteaya has joined #openstack-infra18:59
mnaseris logstash having issues19:02
mnaseror is it just really behind19:02
*** rossella_s has quit IRC19:02
*** rossella_s has joined #openstack-infra19:03
AJaegermnaser: 86k jobs behind according to http://grafana.openstack.org/d/T6vSHcSik/zuul-status?orgId=119:03
mnaserAJaeger: is that..normal? :p19:03
AJaegermnaser: this week nothing is normal ;(19:03
AJaegermnaser: I don't think it's normal but don't check this often...19:04
*** smarcet has quit IRC19:04
clarkbit isn't normal, it's due to the large number of jobs we are running19:07
clarkbthe gate resets cause that because we run a bunch of jobs then start them all over again19:07
*** bobh has joined #openstack-infra19:09
mwhahahadmsimard: is ara packaged in rpm form yet?19:09
*** elbragstad has quit IRC19:10
*** elbragstad has joined #openstack-infra19:10
AJaegercorvus: do you want to review dhellmann's change of priorities for our pipelines? https://review.openstack.org/#/c/606129/19:13
*** florianf is now known as florianf|afk19:14
*** bobh has quit IRC19:23
*** bobh has joined #openstack-infra19:23
clarkbdhellmann: AJaeger fwiw there is no python side test suite for stackviz19:24
*** graphene has quit IRC19:24
clarkbI'm adding a couple tests and in the process have found other bugs with python3, but this fix should fix our usage of it I think19:24
AJaegerok19:24
dhellmannclarkb : those patches are part of the goal work this cycle.19:25
pabelangerdhellmann: AJaeger: clarkb: not related to current topic, but with pipelines, anything in the release / pre-release / release-post pipelines is managed by the release team, right?19:26
dhellmannpabelanger : those jobs are triggered by tags, and some tags are added after we review release things, but some are also pushed directly by unofficial teams or teams that aren't part of "The OpenStack Release" (tm)19:27
AJaegerpabelanger: no - unofficial repos can self-tag19:27
AJaegerpabelanger: why?19:28
pabelangerdhellmann: AJaeger: okay, let me ask differently, is the release team using the tags pipeline?19:28
dhellmannpabelanger : I think the release notes build jobs run there19:29
pabelangerAJaeger: nothing specific to openstack, trying to work out pipelines for github based zuul.19:29
*** e0ne has joined #openstack-infra19:29
pabelangerand couldn't remember what went into tags vs release pipelines19:29
fungipabelanger: you can check the regexes. all three of tag, pre-release and release are triggered from tags19:30
dhellmannrelease and pre-release used to run separate jobs. I think now we have them configured to run the same job for python but I don't know about other languages19:30
clarkbpabelanger: tags is any tag. release is semver pbr appropriate tags19:30
pabelangerYah, that's what I was seeing. Guess releases is mostly semver things19:31
mnaserhey uh19:31
mnaserim not late to the party but19:31
dhellmannpre-release is alpha, beta, rc19:31
mnaserpackethost is borked right19:31
fungiactually we never did trim release down to just pbr-appropriate version patterns because it's used for non-python projects and things like xstatic packages which may have additional version components19:31
mnaser0 in use, 0 ready, 95 deleting? 28 building19:31
dhellmannrelease is a version number without alpha, beta, or rc19:31
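A sketch of the tag / pre-release / release split as described in the last few messages. The regexes are simplified assumptions for illustration, not the patterns in the real pipeline configuration (which, as noted above, are looser for release), and `pipelines_for` is a hypothetical helper.

```python
import re

# Assumed patterns: a release is dotted numbers only; a pre-release
# is dotted numbers followed by an alpha, beta, or rc marker.
RELEASE = re.compile(r"^\d+(\.\d+)*$")
PRE_RELEASE = re.compile(r"^\d+(\.\d+)*(a|b|rc)\d+$")


def pipelines_for(tag):
    """Return the pipelines a pushed tag would trigger under these assumptions."""
    pipelines = ["tag"]  # the tag pipeline sees every tag
    if PRE_RELEASE.match(tag):
        pipelines.append("pre-release")
    elif RELEASE.match(tag):
        pipelines.append("release")
    return pipelines


for example in ["1.0.0", "1.0.0.0rc1", "2.0.0.0b3", "some-other-tag"]:
    print(example, pipelines_for(example))
```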
pabelangermnaser: yah, for a while. clarkb they tried working on it at PTG19:31
fungimnaser: packethost has been basically broken since before the ptg19:32
mnaseroh since the PTG19:32
mnaserouch19:32
mnaserum19:32
mnaserwhat if i ask nicely for root access19:32
mnaseron the machines/platform19:32
pabelangerdhellmann: okay, cool. That helps19:32
fungistudarus has root access to them, i think?19:32
dmsimardmwhahaha: It's packaged in fedora, for CentOS the only packages there are were made by tristanC for software factory19:32
dmsimardmwhahaha: it probably wouldn't be too hard to pull his package in RDO if you wanted19:33
dmsimardI don't have the bandwidth to do the legwork though19:33
mnaserfungi: i mean if you/infra-core agrees to it, i'd like to have my hand at fixing whats wrong..19:33
mnaserso maybe if you want to email him about it19:33
mwhahahadmsimard: k i'll try and round up someone19:33
mnaser100 nodes would be nice19:33
dhellmannpabelanger : as I said, the jobs that ran for those types of version numbers used to be different. That change we just merged to update to the new python packaging job may make that less important now, but I haven't reviewed the jobs for other types of artifacts lately19:34
*** e0ne has quit IRC19:34
pabelangerdhellmann: understood19:34
fungimnaser: i don't think we have access to the machines19:35
fungistudarus might19:35
mnaserfungi: he does.  right, but i'm guessing a request from infra-root to let me rather than me emailing asking for it might be more reasonable :)19:35
fungiahh, i see19:35
pabelangermnaser: I mean, I'd +2 a patch to disable it, to help reduce launcher errors in grafana for nodepool. But that is just me :)19:36
clarkbdhellmann: AJaeger mtreinish https://review.openstack.org/60618419:36
mnaseri'd love to fix it and i think i could get it done with the right access19:36
fungimnaser: he might go for that, worth asking i suppose. let's see what clarkb thinks when he has a moment19:36
clarkboh looks like there is already a change for that19:37
clarkbthis will teach me to check for open changes before writing a change, but now I feel like I can review the other change19:37
clarkbhttps://review.openstack.org/#/c/555388/3 is the other change and it doesn't pass tests, I have rechecked it to get logs to figure out why19:39
clarkbmy fix also fixes the file input case19:39
clarkbmnaser: fungi: I'm fine with it, but I am not sure how much access studarus has either19:39
mnaserclarkb: ill write up an email19:39
clarkbthat was one of the things I found out at the PTG, he is an openstack admin but not necessarily root on the control plane? something like that19:39
clarkbanyway the qa team should really get on top of that or we should consider removing stackviz from our jobs19:40
*** hasharRlyAwy is now known as hasharAway19:42
AJaegerconfig-core, could you put the following changes on your review queue, please? https://review.openstack.org/605583  https://review.openstack.org/604610 https://review.openstack.org/606147 https://review.openstack.org/605128 https://review.openstack.org/60488919:43
timothyb89clarkb: I think your patch is better, though removing stackviz would be a good option if nobody is using it19:49
clarkbtimothyb89: mostly I mention that option because I realized how old your change is without it getting much attention from the team that should be responsible19:49
*** mriedem has joined #openstack-infra19:50
clarkbtimothyb89: I think fixing it is a fine option too if the QA team gets a fix in (I'm fine with either patch, if yours goes in first I will rebase mine to add the file provider fix too)19:50
clarkblooks like core membership is actually pretty minimal there, should we add the rest of the qa team to it?19:50
* AJaeger thanks clarkb for reviewing and waves good night19:51
timothyb89clarkb: yours seems to supersede it in all ways that matter so I'm happy to abandon19:51
timothyb89clarkb: but stackviz has been essentially unmaintained for > 1 year, if nobody's benefitting from it removal would be pragmatic19:52
clarkbtimothyb89: I would've updated your change if I had noticed it but wasn't until I saw the conflicts with that I noticed :( sorry19:52
*** Emine has quit IRC19:52
timothyb89clarkb: it's all good, mainly I don't want my old intern project to be a continual time sink for you guys :)19:52
clarkbeh I think it is useful when it works (which is the python2 jobs currently)19:53
clarkbthese are python3 transition pains19:53
clarkbto be expected19:53
clarkbthe time based graph that shows test overlap and resource usage is actually quite useful imo19:54
*** mdbooth has joined #openstack-infra19:54
clarkbtimothyb89: just earlier today jungleboyj mentioned that a bug with cinder seemed to be resource contention related and stackviz shows us the info to figure that out19:54
timothyb89clarkb: huh, well, glad to hear it's still in use :)19:55
*** Emine has joined #openstack-infra19:55
openstackgerritMerged openstack-infra/project-config master: Adding openstack/octavia-lib project  https://review.openstack.org/60488920:01
mriedemclarkb: have you seen this one yet? http://logs.openstack.org/28/605828/1/check/neutron-grenade-multinode/40ddb0f/logs/grenade.sh.txt.gz#_2018-09-28_00_17_54_19820:03
*** smarcet has joined #openstack-infra20:04
clarkbmriedem: I have not20:06
*** openstackgerrit has quit IRC20:07
*** rossella_s has quit IRC20:08
*** rossella_s has joined #openstack-infra20:09
*** mdbooth has quit IRC20:10
*** psachin has quit IRC20:10
mriedemguh logs/undercloud/var/log/extra/logstash.txt20:10
mriedemthere is that giant single indexed file20:10
mriedemclarkb: is ^ killing e-s?20:11
clarkbmriedem: probably, I'd have to go grep logs though20:11
clarkbmriedem: I can do that after lunch20:11
*** bhavikdbavishi has quit IRC20:12
mriedemapparently this shows up a lot but it's mostly not causing failures20:12
mriedemhttp://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22Exception%20in%20thread%5C%22%20AND%20message%3A%5C%22(most%20likely%20raised%20during%20interpreter%20shutdown)%5C%22%20AND%20NOT%20tags%3A%5C%22logs%2Fundercloud%2Fvar%2Flog%2Fextra%2Flogstash.txt%5C%22&from=7d20:12
clarkbmriedem: just sent a followup on the zuul queue backlog thread too20:12
mriedemclarkb: thanks20:12
*** smarcet has quit IRC20:14
mriedemprometheanfire: have any paramiko releases / upper-constraints been approved lately?20:14
mriedemactually nvm,20:15
mriedemi think this is just the job getting killed from being slow20:15
mriedemhttp://logs.openstack.org/28/605828/1/check/neutron-grenade-multinode/40ddb0f/job-output.txt.gz#_2018-09-28_02_21_33_47574020:16
clarkbmriedem: http://logs.openstack.org/70/605070/1/check/kuryr-kubernetes-tempest-daemon-containerized-octavia-py36/d6c9fbd/controller/logs/screen-kubelet.txt.gz has killed at least one log worker. It is 32MB large after compression20:16
mriedemyup http://status.openstack.org/elastic-recheck/#168654220:17
mriedemcripes almighty20:17
clarkbwe should get kuryr to clean that up20:17
clarkbmriedem: http://logs.openstack.org/66/582466/34/check/tripleo-ci-centos-7-containers-multinode-queens/b42e065/job-output.txt.gz?level=INFO killed another worker20:18
clarkb(they OOM)20:18
clarkbas did http://logs.openstack.org/59/603059/6/check/monasca-tempest-python-cassandra/5b6697c/logs/screen-monasca-persister.txt.gz 18MB large after compression20:19
clarkbshould have monasca clean that up too20:19
*** smarcet has joined #openstack-infra20:19
mwhahahawhat killed the worker? the job output?20:19
clarkbmwhahaha: yes, basically the job output being too large and we OOM20:20
mriedemdmellado: see re http://logs.openstack.org/70/605070/1/check/kuryr-kubernetes-tempest-daemon-containerized-octavia-py36/d6c9fbd/controller/logs/screen-kubelet.txt.gz20:20
mwhahahaalso that's a log from 9 weeks ago?20:20
clarkbmwhahaha: ya that is when the worker died, I'm just going through worker by worker and finding what crashed them20:20
mwhahahaclarkb: you sure that's the cause or a side effect?20:21
clarkbmwhahaha: pretty sure it is the cause. We get these multiple hundred megabyte log files that cause the processors to OOM and the worker crashes20:21
clarkbOOMkiller gets invoked on them20:21
mwhahahaclarkb: yea but job-output.txt from tripleo is not that big20:21
clarkbmwhahaha: ya in this case with it being an old log that isn't on the log servers anymore we probably can't know for sure20:22
mwhahahathe tripleo job-output is typically ~100k20:22
clarkbmwhahaha: we can likely ignore that one for now since it's in a hard to debug state20:22
mriedemi pinged witek over in -monasca20:23
mwhahahak do let me know if you wander across any larger ones. we do try and keep those down20:23
mriedemso these logs aren't using the oslo log format so rather than just index INFO logs we're indexing everything?20:23
mriedemis the registration to index these logs at all still in system-config?20:24
clarkbmwhahaha: will do20:24
clarkbmriedem: yes we are indexing everything in that case20:24
prometheanfiremriedem: :D20:24
clarkbmriedem: no it moved with zuulv3 let me find it20:24
clarkbmriedem: project-config/playbooks/base/post-logs.yaml is what calls the submit-logstash-jobs role in the roles dir of that repo20:25
*** yamamoto has joined #openstack-infra20:26
*** rossella_s has quit IRC20:26
*** rlandy is now known as rlandy|brb20:27
*** rossella_s has joined #openstack-infra20:29
clarkbhttp://logs.openstack.org/36/528336/12/check/neutron-tempest-dvr-ha-multinode-full/f897fea/logs/screen-q-svc.txt.gz some really large neutron files20:29
mriedemi don't know how to turn this off for these jobs20:30
mriedemthat neutron HA one is a 3-node job20:30
clarkbI think the config is a regex we could negative lookahead on them?20:31
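A sketch of the negative-lookahead idea, using the screen log names that came up in this discussion. The pattern and the `SCREEN_LOG` name are assumptions for illustration; the actual submit-logstash-jobs config may express the exclusion differently.

```python
import re

# Match screen-*.txt(.gz) files, except the ones known to be very large.
# The (?!...) group rejects a match when the excluded name follows "screen-".
SCREEN_LOG = re.compile(
    r"(?:^|/)screen-"
    r"(?!(?:kubelet|monasca-persister|mistral-engine|ovn-northd)\.)"
    r"[^/]*\.txt(?:\.gz)?$"
)

# Relative paths taken from the log URLs mentioned above.
paths = [
    "controller/logs/screen-kubelet.txt.gz",
    "logs/screen-monasca-persister.txt.gz",
    "logs/screen-q-svc.txt.gz",
    "logs/screen-ovn-northd.txt.gz",
    "controller/logs/screen-mistral-engine.txt.gz",
]

indexed = [p for p in paths if SCREEN_LOG.search(p)]
print(indexed)
```

The lookahead consumes no characters, so the rest of the pattern still matches the file name as usual for anything not on the exclusion list.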
mriedemi couldn't find the definition of submit-logstash-jobs via codesearch.o.o20:31
clarkbmriedem: project-config/roles/20:32
clarkblooks like the config is in defaults/main.yaml20:32
mriedemok i see it20:33
mriedemhttp://git.openstack.org/cgit/openstack-infra/project-config/tree/roles/submit-logstash-jobs/defaults/main.yaml#n79 yeah?20:34
mriedemso kubelet and monasca-persister20:34
mriedemon it20:34
clarkbya that file20:34
*** openstackgerrit has joined #openstack-infra20:35
openstackgerritMerged openstack-infra/project-config master: ensure the twine check command runs in the correct directory  https://review.openstack.org/60615220:35
mriedemhttps://storyboard.openstack.org/#!/story/200391120:36
clarkbhttp://logs.openstack.org/76/597876/1/check/networking-ovn-tempest-dsvm-ovs-release/976ea81/logs/screen-ovn-northd.txt.gz is 16MB large and caused a crash20:37
*** anteaya has quit IRC20:37
*** agopi is now known as agopi|brb20:37
clarkbfound another monasca persister crash too20:37
clarkbhttp://logs.openstack.org/99/602599/1/gate/mistral-rally-task/9c9a906/controller/logs/screen-mistral-engine.txt.gz 33MB20:38
mriedemhttps://bugs.launchpad.net/kuryr-kubernetes/+bug/179506720:39
openstackLaunchpad bug 1795067 in kuryr-kubernetes "screen-kubelet.txt is causing logstash index OOM errors" [Undecided,New]20:39
mriedemthat's mistral20:39
clarkbyup and ovn20:39
clarkb(sorry I'm just sort of throwing them out as I go through and investigate)20:39
mriedemhttps://bugs.launchpad.net/mistral/+bug/179506820:41
openstackLaunchpad bug 1795068 in Mistral "screen-mistral-engine.txt size is causing logstash index OOM" [Undecided,New]20:41
*** agopi|brb has quit IRC20:41
openstackgerritMatt Riedemann proposed openstack-infra/project-config master: Blacklist logstash indexing of some very large screen logs  https://review.openstack.org/60619720:45
clarkbthat ovn one and the mistral one are pretty common20:46
clarkbas is the monasca persister20:46
mriedemoh i'll update with the ovn one20:46
mriedemit used to be that you had to opt into logstash indexing....20:47
*** kgiusti has left #openstack-infra20:47
mriedemnow everyone gets it for free?20:47
mriedemi mean, by default?20:47
mriedemthat seems pretty reckless when there are projects that don't know how their CI is setup20:48
*** rlandy|brb is now known as rlandy20:49
clarkbmriedem: we always indexed all jobs20:51
clarkbwhat has changed is the ruleset for finding logfiles20:51
clarkbso its a bit more greedy now particularly with screen-* log files20:51
openstackgerritMatt Riedemann proposed openstack-infra/project-config master: Blacklist logstash indexing of some very large screen logs  https://review.openstack.org/60619720:52
mriedemwell hopefully ^ helps20:53
clarkbmriedem: +2 thanks. any other config-core willing to review ^20:53
mnaserclarkb: mriedem +W20:54
*** bobh has quit IRC20:54
bodenis there a kosher way to run another projects python UTs in the gate? for example: https://review.openstack.org/#/c/60586120:54
clarkbboden: look at the requirements repo, they run unittests of a variety of projects as part of checking new dependencies20:55
clarkbboden: it does so by invoking the tox in the target repo not the tox in the test with repo if that makes sense20:56
bodenclarkb: will it require playbooks?20:56
clarkbboden: https://git.openstack.org/cgit/openstack/requirements/tree/.zuul.d/cross-jobs.yaml20:56
*** panda has quit IRC20:57
*** panda has joined #openstack-infra20:58
bodenclarkb: ack thanks20:58
*** shardy has quit IRC21:00
*** PapaOurs is now known as bauzas21:05
openstackgerritClark Boylan proposed openstack-infra/system-config master: Add zuul user to bridge.openstack.org  https://review.openstack.org/60492521:11
openstackgerritClark Boylan proposed openstack-infra/system-config master: Manage user ssh keys from urls  https://review.openstack.org/60493221:11
clarkbhaving CI of the infra things is really handy21:12
*** yamamoto has quit IRC21:15
*** rossella_s has quit IRC21:17
*** rfolco has quit IRC21:18
*** rossella_s has joined #openstack-infra21:20
*** hasharAway has quit IRC21:22
*** agopi|brb has joined #openstack-infra21:28
*** slaweq has quit IRC21:28
*** bobh has joined #openstack-infra21:30
*** rossella_s has quit IRC21:31
*** rossella_s has joined #openstack-infra21:31
clarkbze07 seems to have leaked build dirs (we aren't cleaning those up on start I guess?) due to its having crashed and needing a reboot21:35
clarkbI am going to clean out the older directories by hand to avoid running out of disk there21:35
*** mriedem has quit IRC21:37
*** boden has quit IRC21:37
*** priteau has quit IRC21:42
*** rossella_s has quit IRC21:49
clarkbmnaser: mriedem the logstash queue is trending in the downward direction at the rate of ~3k jobs per hour21:51
clarkbwe should catch up over the weekend21:51
clarkbinfra-root I offered to restart the apache server on the etherpad server to clear out any stale connections prior to ansiblefest as they will be using our etherpad server I guess.21:51
clarkbI am going to do that now21:52
fungithanks clarkb21:52
clarkband done21:52
clarkbnow to fix mirror.dfw.rax.openstack.org dns21:53
fungiwe've had recent-ish trouble tickets about two or three zuul executors whose host hypervisor servers underwent emergency reboots, so ze07 may not be the only one21:54
clarkbmirror.dfw.rax.openstack.org should have a CNAME record with ttl of 3600 now instead of 30021:56
clarkbthe A and AAAA records were fine21:57
clarkbfungi: the zuul status grafana graphs don't show others as having the same issue (ze02 does too but for other reasons, it is an old one with bigger git repos iirc)21:57
fungiahh21:58
*** smarcet has joined #openstack-infra22:00
*** bobh has quit IRC22:03
clarkbfinding a day that works for this opendev discussion is difficult. Maybe I will try to draft an email instead22:17
clarkbsilly ansiblefest travel22:17
*** EvilienM is now known as EmilienM22:20
*** fried_rolls is now known as efried22:20
*** panda is now known as panda|off22:26
*** rlandy has quit IRC22:27
*** jamesmcarthur has joined #openstack-infra22:36
*** smarcet has quit IRC22:37
*** eernst has quit IRC22:40
*** smarcet has joined #openstack-infra22:40
*** jamesmcarthur has quit IRC22:40
*** tosky has quit IRC22:42
*** ijw has joined #openstack-infra22:48
*** elbragstad has quit IRC22:48
*** tpsilva has quit IRC22:52
*** pbourke has quit IRC22:56
*** pbourke has joined #openstack-infra22:56
*** pbourke has quit IRC22:59
*** pbourke has joined #openstack-infra23:00
*** felipemonteiro has joined #openstack-infra23:00
*** elbragstad has joined #openstack-infra23:01
melwittclarkb: opened https://bugs.launchpad.net/nova/+bug/1795086 FYI. didn't find how it could be happening yet23:03
openstackLaunchpad bug 1795086 in OpenStack Compute (nova) "nova.tests.unit.test_profiler.TestProfiler.test_all_public_methods_are_traced sometimes does not return" [Low,Confirmed]23:03
openstackgerritClark Boylan proposed openstack-infra/system-config master: Manage user ssh keys from urls  https://review.openstack.org/60493223:05
clarkbmelwitt: thanks23:05
*** brokencycle has quit IRC23:07
*** elbragstad has quit IRC23:11
johnsomHi there, can someone bootstrap the octavia-lib-core group in gerrit with the octavia-core group?  https://review.openstack.org/#/admin/groups/1951,members  Thank you!23:12
clarkbjohnsom: done23:14
johnsomThanks!23:14
*** yamamoto has joined #openstack-infra23:15
fungiclarkb: 604932 worries me for reasons i can't quite put my finger on23:19
fungihow often are those public keys likely to change?23:20
clarkbfungi: I don't think we've decided at this point, but corvus was quite interested in having that behavior. I can be convinced either way23:23
clarkbthere are definitely upsides and drawbacks to both approaches23:23
clarkbin particular not needing to rotate those keys ourselves is nice but could inadvertently add keys we don't want23:23
clarkbfungi: that change is a result of feedback corvus had on the parent change23:24
fungiit's probably no less secure than proposing and reviewing the retrieved keys to the config repository, i'm just trying to find good arguments to help convince myself that's the case23:25
clarkbfungi: https://review.openstack.org/#/c/604925/5/playbooks/bridge.yaml note the current setup dynamically adds the keys we just can't ssh as that user from anywhere23:26
*** felipemonteiro has quit IRC23:26
clarkbfungi: in my change to get CD working with a new user I dropped that in the first change for simplicity, then followed up with the second change which does the dynamic thing23:26
*** sthussey has quit IRC23:27
fungiahh, yes. so it's not a regression, however the interim state might have been a more secure (if less convenient) choice23:29
clarkbI'm happy to only merge the first change and manage that a bit more directly if we aren't comfortable with the second change23:29
clarkbshould get corvus' input though23:29
fungiwe're basically trusting the https cert on zuul either way, but putting the certs in git means more people an attacker might theoretically need to mitm. also a much shorter window of opportunity23:31
*** gyee has quit IRC23:33
*** smarcet has quit IRC23:33
fungis/putting the certs/putting the public keys/23:34
*** jamesmcarthur has joined #openstack-infra23:34
*** mdbooth has joined #openstack-infra23:36
*** jamesmcarthur has quit IRC23:39
*** mdbooth has quit IRC23:42
johnsomZuul Ansible question. If you set host-vars and/or group-vars does that mean the "vars:" block is totally ignored? This appears to be what I am seeing.23:44
johnsomThe way I read the docs (which very well could be wrong) is that the "vars:" block from the parent would still be honored, but applied to both, and could be overridden by "host-vars" and "group-vars".23:45
johnsomboth being both nodes in a two node nodeset23:46
*** mdbooth has joined #openstack-infra23:46
clarkbI think the ansible variable precedence takes effect23:47
clarkbI'm not sure what that is for host and group and zuul vars23:47
johnsomI am trying to do a "native" two node tempest gate with devstack-tempest as a parent, but all of the "vars:" from the parent, DATABASE_PASSWORD etc. are not showing up in the local_conf.txt.23:49
fungiby "the docs" you mean https://zuul-ci.org/docs/zuul/user/config.html#attr-job.vars i guess?23:49
johnsomCorrect23:50
johnsomI was hoping I was doing something wrong and I don't have to duplicate all of these settings.23:50
fungii thought they were all merged with precedence simply being used to determine which value wins when there are conflicts, but i could be misremembering23:51
johnsomhttps://review.openstack.org/#/c/605163/ if you want to have a look. (though sorry for asking late on a Friday)23:51
clarkbjohnsom: there is no controller group but you set group vars for that group23:53
johnsomMaybe the variable override only goes one layer deep, so the host-vars with "devstack_localrc" completely replaces the "var:" devstack_localrc23:53
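A sketch of the difference being guessed at here: a one-level override replaces the whole `devstack_localrc` dict, while a recursive merge keeps the parent's keys. This only illustrates the two behaviours; it makes no claim about which one Zuul or Ansible applies, and the variable values are made up.

```python
import copy


def shallow_override(base, override):
    """Top-level keys in override replace those in base wholesale."""
    merged = dict(base)
    merged.update(override)
    return merged


def deep_merge(base, override):
    """Nested dicts are merged key by key; other values are replaced."""
    merged = copy.deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


# Hypothetical values standing in for the parent job's vars and the child's host-vars.
parent_vars = {"devstack_localrc": {"DATABASE_PASSWORD": "example-password"}}
host_vars = {"devstack_localrc": {"EXAMPLE_SETTING": "1"}}

shallow = shallow_override(parent_vars, host_vars)
deep = deep_merge(parent_vars, host_vars)

print(shallow["devstack_localrc"])
print(deep["devstack_localrc"])
```

If the parent's settings vanish from the generated local conf, the shallow behaviour is the one to suspect; checking the resulting inventory would confirm it.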
clarkbI also dont see a subnode grouo23:54
clarkb*group23:54
johnsomsubnode is line 19, but yes, I see that controller is wrong.23:54
johnsomI don't think that is this issue however, since the localrc stuff is all in the host-vars.23:55
clarkbwhat does the resulting inventory look like?23:59
clarkbthat is usually a good place to start when figuring this stuff out23:59
*** yamamoto has quit IRC23:59
