Thursday, 2022-09-01

*** dviroel is now known as dviroel|out00:09
*** rlandy|bbl is now known as rlandy01:44
rlandyjm1[m]: added https://bugs.launchpad.net/tripleo/+bug/1988347 - fs064 bug  - master and wallaby01:48
*** rlandy is now known as rlandy|out02:20
*** ysandeep|out is now known as ysandeep05:36
soniya29chandankumar, hello06:29
soniya29around?06:29
soniya29can you help me out with this patch:- https://code.engineering.redhat.com/gerrit/c/testproject/+/39726406:29
soniya29c8 sc10 kvm internal standalone is failing on tempest06:31
jm1hell#oooq06:42
jm1hello#ooq06:42
*** jm1|ruck is now known as jm1|rover06:42
soniya29jm1, hello06:43
chandankumarsoniya29:  checking06:44
soniya29chandankumar, thanks06:46
*** jpena|off is now known as jpena07:38
soniya29pojadhav tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset039 is failing with post_failure issue07:44
soniya29is this issue known/reported?07:44
*** pojadhav is now known as pojadhav|ruck07:44
pojadhav|rucksoniya29, jm1 having more idea about this but I can see some fs39 failures on integration lines some notes on RR hackmd as well.07:51
pojadhav|ruckjm1, did we report any bug against fs39 ?07:52
soniya29i dont see any cix against it07:52
pojadhav|ruckneed to check first whether it is consistent one or not07:53
soniya29https://review.rdoproject.org/zuul/builds?job_name=tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset039&skip=007:55
pojadhav|rucksoniya29, issue is not consistent one, everytime It fails with different error07:55
soniya29pojadhav|ruck, ack08:00
jm1pojadhav|ruck, soniya29: we are having trouble with a lot of intermittent failures on c9 wallaby and c9 master. see rr notes in section "Intermittent Failures" for some ideas of what will be solved by a simple rerun08:03
soniya29jm1, well i have rechecked for 2 times :)08:04
jm1soniya29: 2 times is nothing, you can get worried after 5 times 😋 no kidding, fs64 on c9 master had to be rekicked 5 times yesterday until it passed08:05
soniya29jm1, i have rechecked once more, let's see08:05
jm1soniya29: this one is known intermittent https://logserver.rdoproject.org/44/852844/2/openstack-check/tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset039/3163945/job-output.txt08:06
soniya29jm1: ack08:06
jm1soniya29: did you rebase your patches on top of master? because this looks somehow familiar https://logserver.rdoproject.org/26/855126/3/openstack-check/tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset039/94c69f0/job-output.txt08:07
soniya29jm1, i have rebased the patch now, waiting for the new results08:10
jm1soniya29: it should pass eventually. we had a couple of passes of fs39 today08:11
soniya29jm1, that would be great08:11
jm1pojadhav|ruck, rlandy|out: we had a promotion of c9 master yesterday 🥳 fs64 finally passed. trying to do the same for c9 wallaby fs64 again.. it MUST pass at some point ^^08:15
pojadhav|ruckjm1, great08:15
jm1ysandeep, chandankumar: did you see this error before? any hint on what to do? https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-centos-9-push-master/ea8ed37/logs/report.html08:20
ysandeepjm1, looking08:20
ysandeephttps://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-centos-9-push-master/ea8ed37/job-output.txt08:21
ysandeep2022-09-01 01:20:54.644089 | primary |   "msg": "Depsolve Error occured: \n Problem: problem with installed package catatonit-3:0.1.7-7.el9.x86_64\n  - package podman-2:4.2.0-3.el9.x86_64 conflicts with catatonit provided by catatonit-3:0.1.7-7.el9.x86_64\n  - cannot install the best candidate for the job",08:21
jm1ysandeep: omg08:21
jm1ysandeep: thanks!08:21
jm1ysandeep, chandankumar: lets see if a rerun solves that08:22
ysandeepfyi.. there was a bug related to catatonit few days back: https://bugs.launchpad.net/tripleo/+bug/1985981 08:22
ysandeephttps://review.opendev.org/c/openstack/tripleo-quickstart/+/853142 was temp workaround 08:23
ykarelhi is issue with undercloud upgrade known?08:24
ykarellast 2 runs fails due to same reason08:24
ykarelhttps://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ci-centos-9-undercloud-upgrade08:24
ykareljm1|rover, pojadhav|ruck ^08:25
ysandeephttps://32f730b73845cbdfd0fc-4cf4bb096d4774f141f32d30757edf40.ssl.cf1.rackcdn.com/853411/2/check/tripleo-ci-centos-9-undercloud-upgrade/b6a6d75/logs/undercloud/home/zuul/undercloud_upgrade.log08:27
jm1ysandeep: thanks!08:27
ysandeephmm, another deps issue: Depsolve Error occurred: \n Problem: package os-net-config-15.2.1-0.20220629114404.6505f24.el9.noarch requires NetworkManager-ovs, but none of the providers can be installed\n  - package NetworkManager-ovs-1:1.39.10-1.el9.x86_64 requires NetworkManager(x86-64) = 1:1.39.10-1.el9, but none of the providers can be installed\n  - package NetworkManager-ovs-1:1.39.12-1.el9.x86_64 requires NetworkManager(x86-64) = 1:1.39.12-1.el9, but 08:27
ysandeepnone of the providers can be installed\n  - package NetworkManager-ovs-1:1.39.6-1.el9.x86_64 requires NetworkManager(x86-64) = 1:1.39.6-1.el9, but none of the providers can be installed\n08:27
ykarelyes both runs failed it ^08:28
pojadhav|ruckykarel, this is not a known issue yet, new bug...08:28
ykarelpojadhav|ruck, ack ok please check then08:29
pojadhav|ruckreporting new bug against ^^08:29
ykarelmay be due to mirror sync08:29
ysandeepykarel: yes, we hit a similar issue last month when there was a mirror sync issue: https://trello.com/c/b5yeyBus/2667-cixlp1984175tripleociproa-tripleo-ci-centos-9-undercloud-upgrade-cannot-install-both-networkmanager-11395-1el9x8664-and-networkm08:29
ykarelysandeep, ok then likely it's same08:31
ykarelmight reopen previous bug itself with fresh details08:31
jm1ysandeep, ysandeep, pojadhav|ruck: it passed 2 hours earlier, so i would simply... wait? 08:32
jm1ykarel: ^08:32
ykareljm1, i think it's still good to check what's still missing08:33
jm1ysandeep, pojadhav|ruck, ykarel: looks like waiting and rerunning is the most successful issue handling strategy we have for a while now... 😬08:34
ykarelif mirrors are up to date then no action needed08:34
ysandeepthe last 2 runs failed, which means we now have issues (if the latest job passed then okay), otherwise I would open a bug; most probably the content-provider issue you highlighted above also has the same cause (mirror sync)08:35
pojadhav|ruckysandeep, ykarel, jm1 : I have opened a bug https://bugs.launchpad.net/tripleo/+bug/1988397 if we have green runs then will close out.08:36
toskyarxcruz: in fact that python-tempestconf change passed after a recheck, but I left a comment about... a removed comment08:39
ysandeepykarel, jm1 pojadhav|ruck I think we still have mirror issues:-08:42
ysandeepjob have .40 version installed08:42
ysandeephttps://32f730b73845cbdfd0fc-4cf4bb096d4774f141f32d30757edf40.ssl.cf1.rackcdn.com/853411/2/check/tripleo-ci-centos-9-undercloud-upgrade/b6a6d75/logs/undercloud/var/log/extra/package-list-installed.txt08:42
ysandeep~~~08:42
ysandeepNetworkManager.x86_64                            1:1.40.0-1.el9                        @baseos                       08:42
ysandeepNetworkManager-libnm.x86_64                      1:1.40.0-1.el9                        @baseos 08:42
ysandeep~~~08:42
ysandeepI don't see .40 rpms for NetworkManager on rackspace mirror: http://dfw.mirror.rackspace.com/centos-stream/9-stream/AppStream/x86_64/os/Packages/08:42
ykarelysandeep, Thanks for checking08:44
ykarelfacebook mirror looks good08:46
ykarelrackspace one not updated since 10 hours08:47
ykarelhttp://dfw.mirror.rackspace.com/centos-stream/timestamp.txt08:47
jm1ysandeep, ykarel: and neither on http://mirror.mtl01.iweb.opendev.org/centos-stream/9-stream/AppStream/x86_64/os/Packages/08:49
jm1ysandeep, ykarel: ^ this has not been updated since 2022-08-23?08:50
ykarelyes ^ syncs from mirror which is rax currently so should be same 08:50
ykarelhttp://mirror.mtl01.iweb.opendev.org/centos-stream/timestamp.txt08:50
ykarel2022-09-01T06:08:45,769043757+00:0008:50
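For anyone repeating the mirror freshness check above, here is a minimal Python sketch of turning a `timestamp.txt` value into an age in hours. It assumes the file contains a single timestamp in the format pasted above (comma-separated nanoseconds followed by a UTC offset); the function names are hypothetical and the sub-second part is simply dropped.

```python
from datetime import datetime, timezone


def parse_mirror_timestamp(text: str) -> datetime:
    """Parse a value such as '2022-09-01T06:08:45,769043757+00:00'.

    The nanosecond fraction after the comma is discarded; only the
    whole-second part and the UTC offset are kept.
    """
    text = text.strip()
    head, _, rest = text.partition(",")
    if not rest:
        # No fractional part: assume the string is already ISO 8601.
        return datetime.fromisoformat(head)
    offset = rest[-6:]  # trailing '+HH:MM' or '-HH:MM'
    return datetime.fromisoformat(head + offset)


def age_hours(stamp: datetime, now: datetime) -> float:
    """Return how many hours lie between the mirror timestamp and 'now'."""
    return (now - stamp).total_seconds() / 3600


stamp = parse_mirror_timestamp("2022-09-01T06:08:45,769043757+00:00")
now = datetime(2022, 9, 1, 8, 47, tzinfo=timezone.utc)  # illustrative "now"
print(f"mirror last synced {age_hours(stamp, now):.1f} hours before 'now'")
```

In practice the text would come from fetching `timestamp.txt` on each mirror; the fetch is left out so the sketch runs offline.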
jm1ykarel: ack, thnks!08:51
jm1pojadhav|ruck: added your bug with description to rr notes, thanks for logging it!09:04
pojadhav|ruckjm1, ack09:04
* pojadhav|ruck lunch09:05
jm1ysandeep, pojadhav|ruck, rlandy|out: having issues with rr notes "Sorry, you've reached the max length this note can be. Please reduce the content or divide it to more notes, thank you!" 😱09:46
ysandeeptime to archive :D09:46
jm1limit seems to be 100.000 chars09:46
jm1ysandeep: good that we have thursday09:46
jm1ysandeep, rlandy|out, pojadhav|ruck: hackmd forces me to stop working 😋09:47
ysandeepjm1, pojadhav|ruck  Are you writing a book on ruck rover this week :D09:47
pojadhav|ruckysandeep, jm1 : I made very short updates.. this book is written by jm1  :D09:48
jm1pojadhav|ruck: let me clean up the old days...09:49
pojadhav|ruckjm1, sure  may be today and yesterday's updates are fine about upstream for RR hand off09:50
jm1pojadhav|ruck: left today in rr notes and moved the rest to new rr doc09:55
jm1pojadhav|ruck: rr stuff until yesterday is linked in "Previous RR notes"09:55
pojadhav|ruckjm1, ack09:55
jm1pojadhav|ruck: the doc is much more responsive now ^^09:56
jm1ysandeep: regarding book: i logged the reasons for all intermittent failures in rr notes to track down real issues. a couple of those intermittent failures are happening REALLY often09:58
pojadhav|ruckjm1, thats great !09:58
ysandeepsure, whatever works for rr tag team :)09:59
pojadhav|ruckysandeep, do we need CIX card for https://bugs.launchpad.net/tripleo/+bug/1988397 ??10:04
ysandeepIf its still happening and blocking patches from getting merged then yes10:05
pojadhav|ruckack thank you!10:06
jm1pojadhav|ruck: i would leave it open because we have other c9 master and c9 wallaby jobs which might be failing for the same bug (on different packages though)10:10
* jm1 lunch10:10
ysandeepykarel|lunch, chandankumar In downstream metadata issue is back, Could you please review https://review.rdoproject.org/r/c/config/+/44717 and https://review.rdoproject.org/r/c/config/+/44718 that enable config drive in our jobs for PSI env.10:21
ysandeepreviewbot, please add in review list: https://review.rdoproject.org/r/q/topic:enable_config_downstream10:22
reviewbotI could not add the review to Review List10:22
*** rlandy|out is now known as rlandy10:31
rlandyjm1: lol - I never knew hackmd would be the limiting factor10:32
rlandyjm1: pojadhav|ruck: want to sync quickly?10:36
rlandywe will need to resync with new rr later10:37
pojadhav|ruckrlandy, sure10:37
rlandyysandeep: can I just merge https://review.rdoproject.org/r/q/topic:enable_config_downstream?10:38
rlandystill testing/ danger?10:39
ysandeeprlandy, I have tested locally it works(I was not able to test using testproject with pre-run, as ovb pre triggers first.), I have requested ykarel for review - let's wait for his review.10:40
rlandyoh I see waiting for ykarel to review10:40
ysandeeprlandy, yes10:41
rlandyysandeep: also https://review.rdoproject.org/r/c/config/+/44717/4/roles/ovb-manage/tasks/find_undercloud_uuid.yml would run everywhere10:41
ysandeeprlandy, yes we need that to run everywhere because our C9 image is virt-customize build currently and we want to mount config-drive there.10:42
rlandypojadhav|ruck: waiting for jm1 10:44
pojadhav|ruckack10:44
rlandyysandeep: chandankumar: let's meet for a few re: travel10:51
ysandeepack10:52
rlandyhttps://meet.google.com/gux-rwxv-wwg?pli=1&authuser=010:52
* soniya29 tea break10:54
ysandeepchandankumar: you around o/10:55
chandankumarysandeep: sorry I was afk11:07
chandankumarysandeep: still on call?11:07
ysandeepchandankumar, yes11:07
ysandeepplease jump on11:07
Tenguhello there! apparently there's once again a mirror issue or something? Problem: package os-net-config-15.2.1-0.20220629114404.6505f24.el9.noarch requires NetworkManager-ovs, but none of the providers can be installed11:11
Tengufull trace: https://6831567f6cd28d1c548e-a47ddc14dbf9c2ea3d1835942b1575b6.ssl.cf1.rackcdn.com/854360/4/check/tripleo-ci-centos-9-undercloud-upgrade/5583fdc/logs/undercloud/home/zuul/undercloud_upgrade.log11:11
ysandeepTengu, yes mirror issues again, its known11:13
Tenguok!11:14
jm1rlandy, pojadhav|ruck: ready for sync11:15
jm1chandankumar, arxcruz: as i was creating rr notes anyway, i have created a new rr doc for you and added links to rr status page https://hackmd.io/0RguTgCAQsiZz_SnF5209A11:17
rlandyjm1: pojadhav|ruck: ok - ... https://meet.google.com/ybb-ddea-xmo?pli=1&authuser=0 - should be quick11:18
arxcruzjm1 oh, great, thanks!!!11:18
ysandeepchandankumar, could you please +w https://review.rdoproject.org/r/c/config/+/4471811:20
ysandeepchandankumar++11:21
ysandeeppojadhav|ruck, jm1 fyi.. please let me know incase any ovb job fails on ovb stack setup.. fyi.. we have merged https://review.rdoproject.org/r/q/topic:enable_config_downstream to fix downstream jobs.11:23
pojadhav|ruckrlandy, fyi https://bugs.launchpad.net/tripleo/+bug/198839711:25
*** dviroel|out is now known as dviroel11:27
jm1pojadhav|ruck: did the one job which required rebooting the bm node pass?11:27
jm1ysandeep: awesome, thanks :)11:28
pojadhav|ruckjm1, it is still running 11:28
pojadhav|ruckjm1, jfyi https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/status#27623011:29
pojadhav|ruckchandankumar, rlandy : allow list discussion 11:30
chandankumarjoining11:30
rlandyone sec11:31
jm1pojadhav|ruck: ah! atm it is deploying the overcloud so it rebooting the node seemed to have "solved" the issue, great ^^11:32
jm1pojadhav|ruck: need help with anything?11:32
pojadhav|ruckjm1, nope.. will let you know.. thanks!11:33
jm1pojadhav|ruck: okidoki :) then i will chase some bugs11:34
ysandeepjm1, run that job again to crosscheck.. I wonder you will hit the same issue.11:34
ysandeepregarding bm11:34
jm1ysandeep: which job? pojadhav|ruck's bm job which is still wip?11:35
ysandeepjm1, yes11:35
jm1pojadhav|ruck: ^ ;)11:36
pojadhav|ruckjm1, ysandeep : ack11:36
* pojadhav|ruck tea break.. brb in few mins11:48
*** ysandeep is now known as ysandeep|afk11:51
sshnaidmI have standalone and multinode Centos9 jobs failing on ansible-podman repo, they run on RDO. Was there any change to centos9 nodesets recently? Like from yesterday. https://review.rdoproject.org/zuul/builds?pipeline=github-check&skip=011:59
sshnaidmOr new image of centos9 built on RDO for standalone/multinode nodeset, because OVB is fine12:01
* sshnaidm is trying to recall what can break12:01
rlandythere were reviews up to rebuild those nodes using dib12:04
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.12:04
sshnaidmrlandy, the error: "Error: invalid policy in \"/etc/containers/policy.json\": Unknown key \"keyPaths\"" 12:07
sshnaidmprobably it has prebaked incompatible /etc/containers/policy.json 12:08
sshnaidmfor podman12:08
sshnaidmrlandy, can someone take a look?12:09
rlandysshnaidm: yeah - sorry - was on another channel12:09
rlandylet me get you those reviews12:09
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.12:09
rlandydasm|off: pls see #rhos-ops when you are in - we're taking down zuul - questioning latest rr tool patches querying  zuul12:10
ykareljm1, ysandeep|afk rax mirror still not refreshed, pushed the revert https://review.opendev.org/c/opendev/system-config/+/85545712:10
rlandysshnaidm: this was the ticket: https://issues.redhat.com/browse/RHOSZUUL-701 - doesn't look completed though12:15
sshnaidmdpawlik, hey, did you do changes to centos9 images according to this ticket? ^^12:16
sshnaidmrlandy, ack, thanks, let's see who can fix it..12:16
dpawliksshnaidm: today I updated centos 9 stream images to be up to date12:17
dpawlikbut for upstream-centos-9-stream image, we are still working on it12:17
sshnaidmdpawlik, now we know whom to blame! :D12:17
dpawliko lol12:17
dpawlikwhats happening?12:17
sshnaidmdpawlik, kidding, can you please see the problem above? 12:17
rlandydpawlik: you're having quite a EoD week :)12:18
sshnaidmdpawlik, I run standalone/multinode on RDO and it started to fail from yesterday: "Error: invalid policy in \"/etc/containers/policy.json\": Unknown key \"keyPaths\""12:18
sshnaidmdpawlik, probably wrong version of /etc/containers/policy.json for podman in image12:18
sshnaidmdpawlik, logs: https://review.rdoproject.org/zuul/builds?pipeline=github-check&skip=012:19
sshnaidmdpawlik, we use centos9 image from RDO nodeset and run upstream job on it12:19
dpawlikbut 12:19
dpawlikimages has been updated today12:19
dpawlikand upstream centos 9 stream image is not available yet12:19
dpawlikdue https://review.opendev.org/c/openstack/diskimage-builder/+/85501412:19
dpawlikso the issue is mostly in podman, not in image :)12:20
dpawlikbut similar issue I see on IBM instance12:20
sshnaidmhmm, I'd double check since this started to happen only on centos9 images in RDO, even OVB images in RDO work fine (it's a different nodeset)12:21
dpawliks/has/have been updated today12:21
sshnaidmdpawlik, the problem is only with standalone/multinode nodeset12:21
dpawliko12:21
dpawlik< we should use upstream DIB base images >12:22
sshnaidmdpawlik, iirc this is what used: https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/nodesets.yaml#L552-L56312:24
sshnaidm"faked"12:24
sshnaidmprobably you don't fake it anymore :)12:25
rlandyhmmm ... we're still a no show on OVB stacks in downstream ...  Resource CREATE failed: WaitConditionTimeout: resources.baremetal_env.resources.bmc.resources.bmc_wait_condition: 0 of 1 received 12:28
rlandychecking merge of bmc template12:28
rlandybmc_image: bmc-template 12:29
rlandystill using old bmc template12:29
rlandypojadhav|ruck: ^^ note diff OVB failure12:30
pojadhav|ruckrlandy, ack12:31
bhagyashrisrlandy, hi, let me know when you free want to talk about 17.1 or may be we can talk in scrum 12:33
rlandybhagyashris: hey - any movement on that bug to unblock you?12:33
rlandyjust looking at downstream OVB12:33
rlandyneeds debug12:33
rlandylet's chat at scrum12:34
bhagyashrisbogdando, submitted this patch https://code.engineering.redhat.com/gerrit/c/openstack-tripleo-heat-templates/+/426525 but this is not fixing issue  12:34
bhagyashrisi talked with him on rhos-dev12:35
bhagyashrisso what i got to know that12:35
bhagyashris we will not allow fresh install of 17.1 on rhel 812:36
bhagyashristhat is not something we will support12:36
bhagyashristhe only way to get to 17.1 on rhel8 is to deploy with 16.2 and upgrade12:36
bhagyashristhere is to be no scale out or fresh install of 17.1 on rhel 8.4 hosts12:36
bhagyashrisso i am not sure about the exact plan12:37
bhagyashrisyou can check <sean-k-mooney> messages on rhos-dev12:37
*** ysandeep|afk is now known as ysandeep12:38
rlandybhagyashris: ok - let's talk at scrum about the right way to go here12:40
rlandypojadhav|ruck: this needs debug12:40
rlandyI am looking at the BMC console12:40
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/72/190672/275/check/periodic-tripleo-ci-rhel-9-ovb-3ctlr_1comp-featureset001-internal-rhos-17.1/5aba9c7/logs/bmc_275_90843-console.log12:40
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/72/190672/275/check/periodic-tripleo-ci-rhel-9-ovb-3ctlr_1comp-featureset001-internal-rhos-17.1/5aba9c7/logs/failed_ovb_stack.log12:40
pojadhav|ruckrlandy, ^^ for this logs right ??12:41
rlandypojadhav|ruck: look at failed OVB jobs12:41
rlandysee the times12:41
rlandycross check OVB stack failure12:41
pojadhav|ruckok12:41
dpawlikshould we hold the node and check what's going on? 12:43
rlandysorry - was on something else12:50
rlandylooking at console of passed job12:50
ysandeeprlandy, https://sf.hosted.upshift.rdu2.redhat.com/logs/58/418658/21/check/periodic-tripleo-ci-rhel-9-ovb-3ctlr_1comp-featureset001-internal-rhos-17.1/b9302c9/logs/bmc_21_9836-console.log12:51
* ysandeep debugging failure12:51
rlandyyep - comparing12:52
*** dasm|off is now known as dasm12:54
dasmo/12:54
dasmrlandy: will check it in a few12:54
dasmwrt killing zuul12:54
ysandeeprlandy, fyi.. disabling ovb cleanup script in downstream to debug bmc failure12:55
rlandydasm: pls chat to dpawlik and see conversation on #rhos-ops13:00
pojadhav|ruckrlandy, akahat, chandankumar, ysandeep : scrum13:01
dasmjm1: where are we hosting telegraf which runs all queries to zuul?13:05
jm1dasm: dashboard-ci.tripleo.org13:11
jm1dasm: and downstream cockpit as well13:11
chandankumarhttps://review.opendev.org/q/topic:tripleo-ansible-ee13:13
dasmjm1: ack, thx. i accessed both locations.13:14
chandankumarhttps://review.opendev.org/c/openstack/tripleo-ci/+/843836/25/zuul.d/standalone-jobs.yaml13:14
bhagyashrisrlandy, here is test project patch https://code.engineering.redhat.com/gerrit/c/testproject/+/424992 logs https://sf.hosted.upshift.rdu2.redhat.com/logs/92/424992/27/check/periodic-tripleo-ci-rhel-8-standalone-rhos-17.1/d96df59/logs/undercloud/home/zuul/standalone_deploy.log here is definition patch https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42429313:15
bhagyashrisfixed is here https://code.engineering.redhat.com/gerrit/c/openstack-tripleo-heat-templates/+/42652513:15
ysandeeprlandy, recent run passed stack create: https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/stream/1c30dad6bbb54e038df97d93d3da9897?logfile=console.log13:15
ysandeeprlandy, may be transient issue earlier13:15
rlandyysandeep: good sign13:16
rlandymaybe it took a while to get everything in sync13:16
ysandeepre-enabling stack cleanup script13:16
chandankumarrlandy: https://review.opendev.org/c/openstack/tripleo-ci/+/850736 please +w it13:16
ysandeeprlandy, probably13:17
bhagyashrisnew patch fix for upstream : https://review.opendev.org/c/openstack/tripleo-heat-templates/+/85550313:17
jm1dasm: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/43307/28#message-d0baab4ed36f3b27cafda6e76f1cc74261bb019d13:19
dasmjm1: dpawlik mentioned that the issue started before 07/1813:20
dasmmy change probably exposed that issue13:20
jm1dasm: my comment is unrelated to zuul crashes13:21
dasmjm1: answering your question: it's fetching ZUUL_JOBS_LIMIT because i'm querying zuul jobs in batches. if the number would be variable based on jobs number, we would miss responses. 13:21
dasmwe could end up with duplicated jobs, because eg. job1, job1, job1 ran 3x and not any other13:22
dasmit's tricky. i'm gonna look into that one more time tho13:22
jm1dasm: but its running once for all jobs13:22
dasmjm1: what do you mean?13:23
jm1dasm: the call done by this requests.get is e.g. curl 'https://review.rdoproject.org/zuul/api/builds?job_name=periodic-tripleo-ci-centos-9-standalone-baremetal-master&job_name=periodic-tripleo-ci-centos-9-scenario012-standalone-baremetal-master&job_name=periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-baremetal-master&job_name=periodic-tripleo-ci-centos-9-scenario000-multinode-oooq-container-updates-baremetal-master&limit=1000'13:25
rlandychandankumar: question on https://review.opendev.org/c/openstack/tripleo-ci/+/85073613:26
jm1dasm: but maybe you wanted get 1000 results per job?13:27
soniya29https://review.opendev.org/q/topic:tempest_allow_list13:28
dasmjm1: no, i'm actually asking for "all jobs last 1000 results" not "one job 1000 results".13:28
rlandychandankumar: if that is ok for all jobs - ie: having the stream9 image on 8 jobs etc.13:28
ysandeepclosed config drive epic :D13:28
rlandythen I can merge13:28
pojadhav|ruckhttps://review.opendev.org/q/topic:tempest_allow_list13:28
dasmjm1: in the past we were issuing query per job13:28
jm1dasm: ok, great!13:30
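The single batched request discussed above can be sketched in Python as follows. This is only an illustration of how such a URL is assembled, not the actual rr tool code: the endpoint, the repeated `job_name` parameter and `limit=1000` are taken from the curl line quoted above, while the helper name `builds_url` is hypothetical.

```python
from urllib.parse import urlencode

# Endpoint as quoted in the curl example in this log.
ZUUL_BUILDS_API = "https://review.rdoproject.org/zuul/api/builds"


def builds_url(job_names, limit=1000):
    """Build one query URL covering several jobs at once.

    'job_name' is repeated once per job; 'limit' caps the combined
    result list, so it is not a per-job limit.
    """
    params = [("job_name", name) for name in job_names]
    params.append(("limit", limit))
    return f"{ZUUL_BUILDS_API}?{urlencode(params)}"


jobs = [
    "periodic-tripleo-ci-centos-9-standalone-baremetal-master",
    "periodic-tripleo-ci-centos-9-scenario012-standalone-baremetal-master",
]
print(builds_url(jobs))
```

Because the limit applies to the whole response, a frequently rerun job can take up most of the returned entries, which is the trade-off dasm describes when comparing this with the older one-query-per-job approach.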
* dviroel coffee13:30
chandankumarrlandy: yes, it will be only used in tripleo-ee container13:31
chandankumarit is not going to affect anything else13:31
toskyarxcruz: sooo, for that python-tempestconf patch :)13:51
arxcruztosky sorry, o forgot to check, let me see it 13:52
arxcruztosky lol, it was me who wrote the patch, let me fix it, i might remove it by accident 13:53
toskyarxcruz: thanks13:55
arxcruztosky done 13:55
akahatMixed OS compute job: https://review.rdoproject.org/r/c/rdo-jobs/+/44415, https://review.rdoproject.org/r/c/testproject/+/44499, https://review.opendev.org/c/openstack/tripleo-quickstart/+/85386013:57
akahatfolks please take a look when you are free. Thanks :)13:57
arxcruztosky i'm assuming that was your only complain ;)14:09
toskyarxcruz: yes, it was :)14:12
*** jm1|ruck is now known as jm1|rover14:13
arxcruztosky great :) 14:17
pojadhav|ruckrlandy, bm job running here https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/status#2098714:19
rlandyty14:20
*** ysandeep is now known as ysandeep|afk14:22
*** pojadhav|ruck is now known as pojadhav|afk14:23
toskychandankumar: if you have some time for https://review.opendev.org/c/openinfra/python-tempestconf/+/849127, it passed in the previous review and the review being tested now just adds a few comments14:30
dasmdviroel: jm1 can you +2+W ? https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4472414:32
dasmlet's make it work again14:32
rlandybhagyashris: mind if I edit your patch?14:35
jm1dasm: did you test it in cockpit?14:38
dasmjm1: yes, dpawlik said no issues with zuul atm14:41
dasmupstream cockpit14:41
dasmwe need to have the same on downstream one14:41
rlandyopenstack-tripleo-heat-templates.noarch       14.3.1-1.20220829144516.f7e97cb.el8osttrunk   @delorean-component-tripleo 14:42
rlandylet's see if this works with promoted content14:43
toskychandankumar: thanks!14:49
rlandybhagyashris: rebuilding containers with new content14:55
rlandyjob is in rerun14:55
bhagyashrisrlandy, np you can edit15:04
*** pojadhav|afk is now known as pojadhav|ruck15:06
dviroeldasm: W+115:10
dasmdviroel: ack, thx. dpawlik we're good to go15:10
*** ysandeep|afk is now known as ysandeep15:21
ysandeepdasm, dviroel should we also tune interval if each run takes ~100 mins... https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44724/1#message-fbd3542bcb0f2dfe9f96ab95374b389a97cbf23c15:22
dasmysandeep: why would it run 100 minutes?15:24
dasmit's a simple API query15:24
dviroeldasm: because of the sleep time15:24
ysandeepdasm, I am reading dviroel comment - If we consider the average time, it will take ~100 minutes to finish. 15:24
dasmit's not one after another. it starts all of jobs at once15:25
dasmthe random here spreads it across few seconds15:25
dasmthat's enough to not overwhelm zuul15:25
dasmdoes it make sense?15:26
ysandeepdasm, yeah.. looks like exec plugin executes all the commands in parallel15:27
ysandeepI am reading docs atm: https://github.com/influxdata/telegraf/blob/master/plugins/inputs/exec/README.md#exec-input-plugin15:27
dviroeldasm: hum, correct, they run in parallel, so my sentence is not correct there15:27
dasmysandeep: that was the main issue with queries. all of them started at once, causing zuul to sweat15:28
dasm++15:28
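To make the parallel versus sequential point above concrete, here is a small Python sketch. It assumes each command sleeps a random 0 to 179 seconds before running, mirroring `sleep $((RANDOM % 180))`; the number of commands (60) is a made-up value for illustration, and `start_offsets` is a hypothetical helper.

```python
import random


def start_offsets(num_commands, max_jitter=180, seed=None):
    """Return one random start delay (in seconds) per command."""
    rng = random.Random(seed)
    # randrange(180) yields 0..179, the same range as RANDOM % 180.
    return [rng.randrange(max_jitter) for _ in range(num_commands)]


offsets = start_offsets(num_commands=60, seed=1)

# If the commands ran one after another, the sleeps would add up.
sequential_wait = sum(offsets)
# When all commands start at once, only the longest sleep matters.
parallel_wait = max(offsets)

print(f"sequential: {sequential_wait / 60:.0f} min of sleeping in total")
print(f"parallel:   last query starts after {parallel_wait} s")
```

With parallel execution the queries are spread over at most three minutes, so the jitter smooths the load on zuul without stretching the overall collection interval.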
rlandyjm1: guess what???15:37
rlandyperiodic-tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset064-wallaby  "success": true,15:38
rlandyrekicked periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset064-wallaby15:39
rlandyoops15:39
rlandyhttps://review.rdoproject.org/r/c/testproject/+/4466215:40
rlandyfor fs001 and fs03515:40
*** ysandeep is now known as ysandeep|out15:40
ysandeep|outsee you all tomorrow o/15:40
dviroelo/15:40
dasmysandeep|out: o/15:41
jm1rlandy: periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset064-wallaby is still wip?15:45
rlandywip?15:53
rlandyin what way?15:53
* dviroel lunch15:59
*** dviroel is now known as dviroel|lunch15:59
jm1rlandy: the job is still running. not sure why you get success true16:02
jm1rlandy: https://review.rdoproject.org/zuul/status/change/44661,816:03
jm1rlandy: last pass of that job was 2 days ago https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset064-wallaby&skip=016:04
rlandyhttps://review.rdoproject.org/zuul/stream/b8fb1de2e9df42128a4024000ca052a2?logfile=console.log16:05
jm1rlandy: oh its uploading logs and has passed tempest 😱16:06
rlandyjm1; I know - do the happy dance16:06
jm1rlandy: a promotion of c9 wallaby, our present for chandankumar and arxcruz ;)16:07
rlandyjm1++ yep16:07
arxcruzjm1 many thanks! I like that a lot 16:07
jm1rlandy, arxcruz: it took 10 runs until it passed 🙄16:09
rlandyjm1: as we used to say in south africa ... https://mymemory.translated.net/en/Afrikaans/English/aanhouer-wen16:11
jm1rlandy, arxcruz: it is gathering logs for 50 minutes now. not sure whether it's failing again...16:11
rlandydlrn reported16:12
arxcruzyeah, if dlrn report, doesn't matter the logs16:12
jm1rlandy, arxcruz: oh ok great16:12
rlandyhttp://promoter.rdoproject.org/promoter_logs/container-push/20220901-153339.log16:12
arxcruzalthough i would check with the rdo team about the logs 16:12
rlandypromoting16:12
jm1arxcruz: go ahead if you want ;) i am eod now16:12
jm1rlandy: with c9 wallaby being promoted i will be eod16:13
rlandyjm1: have a good night16:13
rlandyI am just watching bhagyashris's work16:13
rlandyafter that will look into ovb rerun16:13
rlandyfor downstream components16:13
jm1rlandy: ok thanks! it looks like downstream is making more trouble than upstream this week16:14
rlandylike having twins - each one takes a turn to make you crazy16:14
jm1rlandy: 😂16:14
rlandymy sister has twins16:14
jm1rlandy: not sure i could handle that for more than a week ;)16:15
rlandylol -she has 5 other kids16:15
jm1rlandy: oha... we were 5 kids in total and even that was a lot of fun..16:16
* jm1 out for today, have a nice evening!16:16
arxcruzrlandy 7 in total?16:24
arxcruzjesus christ...16:24
rlandyarxcruz: yep16:24
rlandybhagyashris: so I think this is fixed16:32
rlandyerror is later now16:32
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/build/7aaac9ad36f8402682dbe9d1aab9cf4816:38
*** jpena is now known as jpena|off16:50
rlandylunch - brb17:11
*** dviroel|lunch is now known as dviroel17:18
dviroeldasm: i think that you can vote now https://softwarefactory-project.io/r/c/config/+/2597317:33
dviroel:)17:33
dasmchecking17:33
dasm> Forbidden17:34
dasmlemme try relogging17:34
dasmnope, clean browser, the same17:35
dasmdviroel: not so "fun fact" i'm getting emails from softwarefactory about updates.17:36
dasmand i can see it when not logged in. but after logging in -- Forbidden17:36
dviroeldasm: weird, i see that apavec added you to reviewer list - maybe still missing something on your account, you should ping him17:38
dasmi tried that yesterday17:41
dasmthe same outcome17:41
dasmdviroel: i can access it now. However it doesn't change that I'm not an expert in this particular area :D17:46
dviroeldasm: of course you are17:49
dasmdviroel: True. We all are experts. In everything :)17:50
dviroeldasm: yeah, nothing that you can't learn in 5min reading ;)17:51
rlandyis the cockpit stuck again18:29
dviroellooks like it is working18:32
dviroelrlandy: you don't see updates there?18:32
dviroelmaybe its missing new data18:33
rlandydviroel: cockpit is ok - w c9 pushed containers but didn't promote18:34
rlandywill look at that after meetings18:34
dviroelhum18:35
rlandydviroel: http://promoter.rdoproject.org/promoter_logs/centos9_wallaby_2022-09-01T16:19.log18:56
rlandysays 1 hash promoted18:56
rlandyhttp://dashboard-ci.tripleo.org/d/HkOLImOMk/upstream-and-rdo-promotions?orgId=1 - says 3 days ago18:57
rlandydasm: ^^ per our conversation earlier18:57
rlandylooks like that promo was not picked up18:57
rlandyalso w c8 promoted today18:57
rlandyso cockpit is out of date18:57
rlandydviroel: rcastillo|rover: dasm: only one review on review hackmd - needs infar w+18:58
rlandyso I think we can skip review time18:58
dviroelrlandy: yeah19:01
rlandyok19:01
rlandyso ... dasm .. think we have your first cockpit task :)19:01
dasmi see some issue in logs. dpawlik our recent change doesn't work19:03
dasm> Error in plugin: exec: exit status 1 for command 'sleep $((RANDOM % 180)); ruck_rover.py --influx --release wallaby --distro centos-9 --component cloudops': sleep: unrecognized option '--influx'19:03
dasminteresting, because there is no issue when running the same thing manually19:04
dasmi need to see why it's doing that19:04
rlandydpawlik is probably end of day now19:11
dasmhe's definitely eod. i wanted to give him some heads up19:13
jm1[m]@dasm: how did you test locally? did you restart the telegraf container after changing the config?19:53
jm1[m]@dasm: how about changing telegraf config to '/bin/bash -c "sleep $(( … )); ruck_rover.py … cloudops"'?19:57
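The `sleep: unrecognized option '--influx'` error and the `/bin/bash -c` suggestion can be illustrated with a short Python sketch. It assumes the command string is split into words and executed directly, without a shell, which is what the error message suggests; Python's `shlex.split` is used here only as a stand-in for that splitting and is not the code telegraf runs.

```python
import shlex

# Command as it appears in the telegraf error message in this log.
direct = "sleep $((RANDOM % 180)); ruck_rover.py --influx --release wallaby --distro centos-9 --component cloudops"

# Without a shell, the whole line becomes arguments of 'sleep':
# '$((RANDOM' is never expanded and ';' does not separate commands.
direct_argv = shlex.split(direct)
print(direct_argv[0], "receives", direct_argv[1:])

# Wrapping the line in 'bash -c "..."' hands it to a shell as ONE
# argument, so arithmetic expansion and ';' work as intended.
wrapped = f'/bin/bash -c "{direct}"'
wrapped_argv = shlex.split(wrapped)
print(wrapped_argv)
```

This would also explain why the command worked when typed manually inside the container: an interactive shell performs the expansion that a direct exec skips.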
dviroeldasm: ^ you can test inside the running container to see if works20:01
* dviroel lets test in prod20:02
dasmrlandy: dviroel i ran it inside the container. it didn't complain20:09
dasmi'm gonna give it a one more try.20:09
dviroelack20:11
dasmi'm checking for second best option to run the command20:13
rlandyok20:15
dasmrlandy: your idea works: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44725 cc dviroel 20:29
* rlandy looks20:35
rlandydasm: dviroel: should I merge that?20:36
rlandybroken anyways20:36
dasmrlandy: please go ahead. indeed20:36
dasmi ran the command inside the container. this time i've seen it actually gathering results20:37
rlandydasm: pls keep an eye on the cockpit data20:37
dasmlast time apparently i didn't check long enough to catch the issue20:37
* dviroel bbl20:57
*** dviroel is now known as dviroel|afk20:57
*** dasm is now known as dasm|off21:42
dasm|offo/21:42
*** dviroel|afk is now known as dviroel23:55
