Tuesday, 2021-11-23

05:58 *** pojadhav is now known as pojadhav|afk
06:30 <akahat|rover> soniya29|ruck, please take a look when you are free: https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train
06:36 *** pojadhav|afk is now known as pojadhav
09:21 *** ykarel is now known as ykarel|lunch
09:58 <soniya29|ruck> akahat|rover, sure
10:22 *** dviroel|afk is now known as dviroel
10:25 <arxcruz> marios: https://review.opendev.org/c/openstack/tripleo-ci/+/816991 i think what you said makes sense, i forgot about the lookup('env')
10:29 <marios> arxcruz: cool. i was staring at it this morning and just thinking there *must* be some way.. i didn't test it but i think it should work man. sorry to go on about it, it is just one more thing that will be maintenance and then only you and me really know you did it the way you did it etc etc. our repos have way too much of that
10:29 <marios> arxcruz: thanks for having another go
10:30 <marios> arxcruz: will you try the test with tht or some other repo ? can be master undercloud upgrade job
10:30 <marios> arxcruz: looks ok from quick look i
10:31 <marios> arxcruz: will review again in my next round
10:35 <arxcruz> marios: but it won't work with a dnm test or am i wrong ?
10:37 <marios> arxcruz: why what will not work /me thinking ? if you depends-on the tripleo-ci patch
10:54 *** jpena|off is now known as jpena
11:06 <arxcruz> marios: yeah, i don't think it will work, well, it will work for the upgrade content provider, but not for the undercloud-upgrade
11:06 <arxcruz> isn't this the issue we are facing
11:06 <arxcruz> or you want to test the content-provider ?
11:19 *** rlandy|out is now known as rlandy|ruck
11:28 <rlandy|ruck> ysandeep: looking better downstream :)
11:28 <rlandy|ruck> 16.2 promoted
11:29 <rlandy|ruck> did you rerun fs035 on ovb
11:29 <ysandeep> rlandy|ruck, yeah i am tracking component promotions now
11:29 <ysandeep> rlandy|ruck, on 16.2
11:29 <ysandeep> ?
11:29 <rlandy|ruck> ysandeep: yep on 16.2
11:29 <rlandy|ruck> otherwise that test is out of criteria
11:30 <ysandeep> rlandy|ruck, we already have a green run on that hash for fs035 - yesterday
11:31 <rlandy|ruck> ok - good
11:31 <rlandy|ruck> ysandeep: need any help on component promotions
11:31 <rlandy|ruck> going to ruck/rover sync now
11:31 <rlandy|ruck> we can pick up the rest
11:31 <rlandy|ruck> so you can carry on
11:31 <ysandeep> rlandy|ruck, nah all good, left notes on tripleo-ci chat channel on what's the status now
11:32 <ysandeep> rlandy|ruck, fyi.. envc has a hardware failure, i have opened a ticket with lab team
11:32 *** pojadhav is now known as pojadhav|afk
11:32 <rlandy|ruck> ysandeep: ack - I see thanks
11:33 <rlandy|ruck> akahat|rover: soniya29|ruck: hi- let's ruck/rover sync
11:33 <rlandy|ruck> https://meet.google.com/njw-nrxe-gxk
11:34 <rlandy|ruck> akahat|rover: soniya29|ruck: ^^
11:42 <rlandy|ruck> arxcruz: are you on the cockpit box?
11:42 <arxcruz> rlandy|ruck: no
11:43 <arxcruz> rlandy|ruck: but ananya left a tmux open
11:43 <arxcruz> don't touch it :D
11:43 <rlandy|ruck> arxcruz: ugh
11:43 <rlandy|ruck> why?
11:43 <rlandy|ruck> do you know what happened here?
11:43 <arxcruz> 12:58:07 <frenzy_friday> I will be on PTO from monday till dec 3. ci health rdo and opensearch both running on the same health server. There is a tmux already with the logs (window 0 and 1) - in case people start using come looking for health
11:44 <rlandy|ruck> arxcruz: can you join https://meet.google.com/njw-nrxe-gxk
11:50 <rlandy|ruck> https://bugs.launchpad.net/tripleo/+bug/1950916
11:52 <rlandy|ruck> https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/818229/1/roles/standalone/templates/standalone_config.yaml.j2#b49
11:52 <rlandy|ruck> akahat|rover: ^^
12:05 *** ykarel|lunch is now known as ykarel
12:12 <rlandy|ruck> AND "result" !='RETRY' AND "result" !='RETRY_LIMIT'
12:17 *** pojadhav|afk is now known as pojadhav
12:28 <ysandeep> #Folks retrospective mtg in 2 mins
12:31 <ysandeep> rlandy|ruck,  soniya29|ruck akahat|rover rcastillo arxcruz ^^
12:33 <ysandeep> rlandy|ruck,  soniya29|ruck akahat|rover rcastillo arxcruz retro time
12:34 <ysandeep> rlandy|ruck, do you want us to start without you?
12:34 <rlandy|ruck> ysandeep; sorry - joining
12:37 <pojadhav> arxcruz, https://meet.google.com/kkp-bejs-vvo
12:37 <pojadhav> arxcruz, https://miro.com/app/board/o9J_lhP-0mY=/
12:37 <arxcruz> pojadhav: gracias
12:41 *** ykarel is now known as ykarel|afk
13:16 <dviroel> marios: sshnaidm extra doc for libvirt driver https://github.com/bogdando/oooq-warp/blob/master/docs/CI-reproducer.md
13:17 <sshnaidm|afk> dviroel, nice, would be great to have it included in our reproducer..
13:19 <dviroel> yes, for sure
13:24 *** sshnaidm|afk is now known as sshnaidm
13:26 <marios> dviroel: thanks
13:28 <dviroel> wrt 'TODO' in code, it is easier to search for work that you left behind :P
13:28 <dviroel> e.g https://codesearch.opendev.org/?q=TODO%5C(dviroel%5C)&i=nope&literal=nope&files=&excludeFiles=&repos=
13:28 <marios> +1
13:29 <ysandeep> arxcruz, sshnaidm, rlandy, marios, ysandeep, bhagyashris, svyas, soniya29, pojadhav, akahat, chandankumar, frenzy_friday, anbanerj, dviroel tripleo-ci-community call in 1 min
13:29 <ysandeep> ceph squad will join us today, if you have other items please add in agenda: https://hackmd.io/MMg4WDbYSqOQUhU2Kj8zNg?both
13:58 <soniya29|ruck> rlandy|ruck, i will be joining meeting in 5 mins
14:02 *** ykarel|afk is now known as ykarel
14:02 <soniya29|ruck> rlandy|ruck, i am in
14:03 <rlandy|ruck> soniya29|ruck: few minutes
14:03 <soniya29|ruck> rlandy|ruck, okay, no problem
14:04 <rlandy|ruck> just finishing up on community call
14:04 <rlandy|ruck> soniya29|ruck: ^^
14:08 *** amoralej|off is now known as amoralej
14:10 <jfrancoa> hey, bringing the question here: what does DISK_FULL mean in https://review.opendev.org/c/openstack/tripleo-heat-templates/+/817718 ? logs were not collected either, so it's hard to know what happened
14:14 <ysandeep> interesting.. so one of the job runs gave DISK_FULL - tripleo-ci-centos-8-standalone-upgrade-victoria https://zuul.opendev.org/t/openstack/build/fefdad3293cb40ab9766b696472e07d3 : DISK_FULL in 2h 43m 42s
14:14 <ysandeep> I haven't seen that myself, maybe others here have seen that, if not i think it's worth checking with infra team what that means.
14:15 <ysandeep> marios, ykarel sshnaidm ^^ if you have seen this
14:16 <marios> ysandeep: not seen it but sounds like infra issue?
14:16 <ysandeep> jfrancoa, is that consistent with this job?
14:16 <jfrancoa> ysandeep: so, this has been the first time I've got this far
14:17 <jfrancoa> I could relaunch
14:17 <ysandeep> jfrancoa: yes please recheck, let's see if this is consistent
14:17 <jfrancoa> ysandeep: if you suggest it, I'll just recheck and see if it happens again
14:17 <jfrancoa> ysandeep: ack, thanks
14:17 <jfrancoa> I'll keep you posted with the update
14:17 <jfrancoa> ysandeep: thanks!
14:17 * ysandeep will try to check documentation and see if i get something about DISK_FULL
14:18 <rlandy|ruck> soniya29|ruck: periodic-tripleo-ci-centos-8-standalone-full-tempest-api-master
14:18 <rlandy|ruck> jfrancoa: new zuul status
14:18 <rlandy|ruck> added since last update
14:18 <rlandy|ruck> we hit it occasionally
14:18 <rlandy|ruck> arxcruz: ^^ someone is hitting that status
14:19 <rlandy|ruck> soniya29|ruck: periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-master
14:20 <rlandy|ruck> soniya29|ruck: python3 roles/rrcockpit/files/telegraf_py3/ruck_rover.py --release master
14:20 <ykarel> ysandeep, i have seen that long back, and i think fixed that too for the issue at that time
14:20 <ykarel> let me check if i can find the patch
14:20 <ykarel> was related to log collection iirc
14:26 <ykarel> ysandeep, https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/662767 was the fix that time
14:29 <ysandeep> ykarel: thanks!
14:33 <ykarel> ysandeep, upstream i think executor disk limit per job is 5GB, so somehow that job is consuming that much and thus fails
14:35 <ysandeep> ykarel, I have requested jfrancoa to recheck, if we get capture of logs (hopefully not cross 5gb on each run).. we can check what's taking space there.
14:35 <ysandeep> maybe we can add a dnm patch to add "du -csh" stats before log collection.
14:36 <ykarel> yes would be good to root cause it
14:37 <ysandeep> jfrancoa: let's see how current run goes, otherwise we go that route ^^
14:37 <ykarel> i had used https://review.opendev.org/c/openstack/tripleo-ci/+/662481/3/roles/run-test/templates/oooq_common_functions.sh.j2 in past
14:39 <jfrancoa> ykarel: cool, I'll add that then
14:39 <jfrancoa> ykarel: thanks a lot
14:40 <ysandeep> ykarel++ looks good we can restore/ use something similar.
14:42 <ykarel> https://zuul.opendev.org/t/openstack/builds?result=DISK_FULL
14:43 <ysandeep> jfrancoa, you will need to restore that patch first if you want to add as depends on
14:52 <jfrancoa> ysandeep: ohh right..I didn't realize it was abandoned
14:52 <jfrancoa> restored and rebased
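
Editor's note: the "du -csh before log collection" idea discussed above could be approximated with a small Python helper. This is only a rough sketch under stated assumptions: the function names (`dir_size_bytes`, `largest_entries`) are hypothetical and come from neither tripleo-ci nor the linked patch, and the helper sums apparent file sizes rather than the on-disk block usage that `du` reports. The 5GB per-job executor limit is ykarel's recollection from the chat, not a confirmed value.

```python
import os


def dir_size_bytes(path):
    """Sum the apparent sizes of all regular files below path (symlinks skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            if os.path.islink(file_path):
                continue
            try:
                total += os.path.getsize(file_path)
            except OSError:
                # File vanished or is unreadable; ignore it in this sketch.
                pass
    return total


def largest_entries(log_dir, top=10):
    """Return up to `top` (size_in_bytes, name) pairs for the direct children
    of log_dir, largest first."""
    sizes = []
    for entry in os.scandir(log_dir):
        if entry.is_dir(follow_symlinks=False):
            size = dir_size_bytes(entry.path)
        elif entry.is_file(follow_symlinks=False):
            size = entry.stat(follow_symlinks=False).st_size
        else:
            continue
        sizes.append((size, entry.name))
    sizes.sort(reverse=True)
    return sizes[:top]
```

Printing the result of `largest_entries()` for the directory that is about to be collected, just before the log collection step, would show which subtree is approaching the limit.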
15:01 *** ysandeep is now known as ysandeep|out
15:07 *** dviroel is now known as dviroel|lunch
15:48 <jbadiapa> rlandy|ruck, I think the https://bugs.launchpad.net/tripleo/+bug/1950383 is not exactly the same as https://bugs.launchpad.net/tripleo/+bug/1951260
15:48 <jbadiapa> after second thoughts it seems to me that they are not related
15:50 <rlandy|ruck> jbadiapa: can remove duplicate
15:50 <rlandy|ruck> done
15:51 <jbadiapa> I'm reviewing the bug regarding UEFI
15:52 <jbadiapa> cos the current issue is the introspection is failing and the VMs on libvirt are set as bios, but configuring them as uefi does not work either :/
15:53 <jbadiapa> rlandy|ruck, thx
15:54 <rlandy|ruck> jbadiapa: trying setting to bios
15:55 *** ykarel is now known as ykarel|away
15:56 <jbadiapa> rlandy|ruck: If we use bios the VMs don't start as the overcloud-full image is uefi
16:05 *** ysandeep|out is now known as ysandeep
16:05 <ysandeep> Cedric also had to make changes in tripleo-labs to move OC nodes to UEFI
16:05 <ysandeep> https://github.com/cjeanner/tripleo-lab/commit/426b9eff44205d7810439684ceaa219c2d966286
16:06 <jbadiapa> ysandeep, I've already tried that  https://review.opendev.org/c/openstack/tripleo-quickstart/+/818766
16:06 <jbadiapa> but it didn't work on my lab
16:08 *** dviroel|lunch is now known as dviroel
16:08 <Tengu> jbadiapa: we can discuss a bit once I'm back, next Monday, if you want
16:08 <ysandeep> jbadiapa, ack, I see you already logged the bugs, rlandy|ruck if you are joining hardprov bug scrub today, maybe we can discuss these issues there.
16:08 <jbadiapa> Tengu, sure
16:09 <jbadiapa> Tengu, thx....
16:09 <Tengu> took me a hell of a time (with Harald) to get it working. secure-boot is a terrible thing.
16:09 <ysandeep> rlandy|ruck, imho.. we need to request sbaker to once try the tripleo-quickstart himself and make sure manual deployments are working as expected per his plans.
16:10 <ysandeep> i have triggered check-ci-minimal on https://review.opendev.org/c/openstack/tripleo-quickstart/+/818766 to check where it's failing.
16:11 <rlandy|ruck> ack - bringing this all to the scrub
16:12 <ysandeep> jbadiapa, recent hardprov changes around whole_disk_images and some other changes have broken oooq manual deployments, we are trying to better align with them to solve these issues.
16:12 <ysandeep> rlandy|ruck: thanks!
16:13 <jbadiapa> ysandeep, I know... I experienced those changes.
16:14 <jbadiapa> the whole_disk_images but the new movement was to set the uefi images
16:14 <rlandy|ruck> yep
16:14 <rlandy|ruck> we had one pass
16:14 <rlandy|ruck> but now we have the second issue
16:15 <rlandy|ruck> https://bugs.launchpad.net/tripleo/+bug/1951752
16:15 <rlandy|ruck> two patches to try set back to bios
16:15 <rlandy|ruck> testing that now
16:15 <rlandy|ruck> merged a few days ago
16:15 <rlandy|ruck> it's a never ending joy
16:15 <rlandy|ruck> Tengu: what happened to PTO????
16:15 <dviroel> marios: check your last testproject run, $HOME should be /home/zuul now, the default user is zuul now
16:17 <marios> dviroel: ah
16:17 <ysandeep> rlandy|ruck, i think Tengu misses us from time to time, so he comes online to see what we are doing :D
16:18 <rlandy|ruck> ysandeep: of course - who wouldn't miss us???
16:18 <marios> thanks dviroel :)
16:18 <dviroel> ;)
16:27 *** ysandeep is now known as ysandeep|out
16:48 <marios> sshnaidm: dviroel: rlandy|ruck: i bumped the zuul-discuss thread there fyi http://lists.zuul-ci.org/pipermail/zuul-discuss/2021-November/001752.html
16:49 <rlandy|ruck> looking
16:49 <marios> dviroel: i held the node again today and confirmed that *without* installing anything explicitly (just the galaxy install) or setting any paths i was able to run the module on localhost without problems so it really must be a zuul limitation http://pastebin.test.redhat.com/1010415
16:50 <sshnaidm> marios, in order to use collections ansible requires them to be installed on the controller, installing them on hosts won't help..
16:50 <sshnaidm> marios, to use collections it should be possible to install them on zuul executor
16:50 <rlandy|ruck> localhost?
16:50 <marios> dviroel: if you see the latest debug https://logserver.rdoproject.org/17/36817/3/check/tripleo-stream9-development-buildimage-overcloud-hardened-uefi-full-master/5c1daf9/job-output.txt it can run it after manual install but then it fails on the include-role task
16:51 <marios> sshnaidm: ok i was going to reach out to you about that tomorrow
16:51 <marios> sshnaidm: you mean, if i add a pre which targets the zuul.executor (but in this case, it is only one node anyway?)
16:51 <marios> sshnaidm: and do the ansible-galaxy install there it might work?
16:52 <marios> sshnaidm: but if you see my latest patch, i actually have an explicit install like https://review.opendev.org/c/openstack/tripleo-ci/+/818222/30/roles/tripleo-build-jobs-repos/tasks/main.yaml
16:52 <sshnaidm> marios, I mean the installation should be on zuul executor machine in infra, which triggers and runs all jobs everywhere, it's not under our control
16:52 <marios> sshnaidm: i see ... :/
16:52 <marios> sshnaidm: then the collection will be available to all zuul you mean
16:53 <marios> sshnaidm: hmm... is that what you are doing with the collection patches i saw something related
16:53 <marios> sshnaidm: like adding them to infra zuul collections?
16:53 <sshnaidm> marios, and it should be "preinstalled" on this zuul executor machine, because ansible can't detect the new collections in the same run, need to respawn it probably
16:53 <sshnaidm> marios, it works well for "nested" ansible, like in my patches
16:54 <marios> sshnaidm: right but here it isn't nested
16:54 <marios> sshnaidm: it is native this is the problem no quickstart
16:54 <sshnaidm> there the "ansible controller" is the host we have
16:54 <marios> cos for nested we are already using that collection
16:54 <marios> sshnaidm: can i catch you tomorrow morning mate
16:54 <sshnaidm> marios, yep, in native collections it won't work
16:54 <marios> sshnaidm: and we can discuss if you have time
16:54 <sshnaidm> marios, sure
16:54 <sshnaidm> ping me
16:55 <marios> sshnaidm: yeah will do but i am confused ... cos 18:54 < sshnaidm> marios, yep, in native collections it won't work
16:55 <marios> sshnaidm: you mean it won't work even if i have it on the zuul executor?
16:55 <marios> or if you have it on the executor then it should work natively
16:55 <sshnaidm> it will work if you have it on executor, but we won't have it on executor, because no one will give such access :)
16:55 <marios> sshnaidm: k thanks
16:56 <sshnaidm> and think that you need to change this collection
16:56 <sshnaidm> executor is just one host, which version of collection to have there
16:56 <sshnaidm> and if you change a collection there, it affects all jobs in this time
16:57 <sshnaidm> so even if we have such access, it won't help much
16:57 <marios> sshnaidm: so there is still something missing on the zuul side right i mean for 'native zuul support for galaxy collections'
16:57 <sshnaidm> marios, yes
16:58 <dviroel> marios: ack, i was taking a look at the latest run
16:58 <marios> sshnaidm: like i'm wondering if my post was not needed but http://lists.zuul-ci.org/pipermail/zuul-discuss/2021-November/001752.html sounds like it is still not clear
16:58 <marios> k thanks sshnaidm lets talk more tomorrow will ping you whenever you have time
16:58 <sshnaidm> marios, yeah, just probably need a clarification of what we seek for exactly
16:58 <marios> dviroel: ack am getting out in a few mins
16:58 <sshnaidm> marios, k
16:59 <dviroel> rlandy|ruck: hey, when you have some time, I updated compose-pinning hackmd with a list of tasks
17:00 <dviroel> rlandy|ruck: https://hackmd.io/qV5mC2FoQX6N0iFt-f8p4A?view#Tasks
17:00 <dviroel> this should help us to create tasks in the board
17:02 <rlandy|ruck> thanks
17:04 <jbadiapa> rlandy|ruck, ysandeep: regarding the issue on the manual deployments. https://paste.opendev.org/show/811245/
17:06 *** marios is now known as marios|out
17:07 *** jbadiapa is now known as jbadiapa|out
17:33 *** jpena is now known as jpena|off
17:47 <rlandy|ruck> lunch brb
18:18 * dviroel needs a coffee
19:06 *** amoralej is now known as amoralej|off
19:56 *** dviroel is now known as dviroel|afk
