Thursday, 2022-10-20

*** rlandy is now known as rlandy|out00:53
*** ysandeep|PTO is now known as ysandeep05:07
marioso/05:17
ysandeep\o good morning everyone05:20
mariosbhagyashris|ruck: o/ can you reply to the support ticket sending you pvt now 05:52
mariosbhagyashris|ruck: (better if only one person is replying there)05:52
bhagyashris|ruckmarios, ack05:57
mariosthanks bhagyashris|ruck 06:04
mariosjm1[m]: is the ansible openstack modules session @ ptg today?06:09
mariosjm1[m]: i see os-ansible-modules at 1300 utc https://ptg.opendev.org/ptg.html06:09
*** amoralej|off is now known as amoralej06:19
* bhagyashris|ruck lunch brb06:56
jm1o/07:29
jm1marios: yes, aoc ptg is at 1300 utc today :)07:29
marioso/ 07:34
mariosthx jm1 07:35
mariosbhagyashris|ruck: frenzyfriday|rover: o/ do you want to sync today? if you do ping me/send me invite please? (e.g. in 25 mins as usual is a good time?)07:37
* marios fetch cofffee07:37
frenzyfriday|rovermarios, upstream looks good. Master promoted finally07:37
marios\o/07:37
frenzyfriday|roverI have to check the cix, I'll ping you if I need help on them07:37
frenzyfriday|roverlooks like everything promoted :D thats suspicious 07:38
jm1frenzyfriday|rover: master promoted? oh yeah🥳07:39
frenzyfriday|roverjm1, yep finally! I'll check on your card and add back to criteria if everything works07:39
mariosthanks frenzyfriday|rover 07:45
marios10:38 < frenzyfriday|rover> looks like everything promoted :D thats suspicious 07:45
mariosyup you've been in this team long enough :D ^^^ 07:45
frenzyfriday|roverXD07:46
bhagyashris|ruckmarios, hey nothing from downstream side we are still blocked due to dns issue 07:47
mariosbhagyashris|ruck: ack 07:48
mariosarxcruz: o/ looks good now https://quay.io/organization/tripleozedcentos908:25
mariosarxcruz: after the log fix patch merges we should be OK for the copy script? (those were manual?^^ )08:25
arxcruzmarios yes, it will update the toolbox automatically and keep copying 08:26
mariosthx for help arxcruz 08:27
mariosneed more reviews please when you have time o/ https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45641 Add zed ovb periodic integration jobs for zed criteria08:43
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.08:43
*** ysandeep is now known as ysandeep|lunch09:08
*** amoralej` is now known as amoralej09:09
jpodivinbhagyashris|ruck: Hi. I'm running into problems with one of our component pipeline jobs. Really odd, we are cleaning a log directory before running our tests. Because we make assertions on the contents. But for some reason, even when the task seems to have passed, the original contents are still there. https://logserver.rdoproject.org/openstack-component-validation/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_09:40
jpodivin1comp-featureset001-component-master-validation/80cc670/job-output.txt 09:40
mariosjpodivin: yes!09:41
mariosjpodivin: i was just digging there https://opendev.org/openstack/validations-common/src/commit/b02d478d513a2b35b969ef96f766923714c4a20a/roles/validations/tasks/list_validation_history.yaml#L2709:41
mariosjpodivin: i thought it was zed specific (i see it in some results @ https://review.rdoproject.org/r/c/testproject/+/45451/5#message-6030179ed278fc32a111e88cf407a385454dc29509:42
mariosjpodivin: is it intermittent? it seems some of those jobs were previously green. perhaps a recent commit (last week or so?)09:42
jpodivinmarios: It may be but we have just merged https://review.opendev.org/c/openstack/validations-common/+/861716 and it's still happening. 09:42
jpodivinwhat I don't understand is: How can something be removed and yet be still there. 09:43
jpodivinI mean, unless the module call is wrong or something.09:43
jpodivinBut it does seem straight forward https://docs.ansible.com/ansible/latest/collections/ansible/builtin/file_module.html09:44
mariosjpodivin: for https://review.opendev.org/c/openstack/validations-common/+/861716 then we'll need promotion to get it into the periodic jobs09:45
mariosjpodivin: promotion of https://trunk.rdoproject.org/centos9-master/component/validation/ 09:45
jpodivinmarios: see I would think that as well but when I look at the most recent failures ... I see that task running 09:45
jpodivinand returning "Changed" 09:45
jpodivinhttps://logserver.rdoproject.org/openstack-component-validation/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-component-master-validation/80cc670/job-output.txt09:46
jpodivinsee here ^ 09:46
jpodivin2022-10-19 21:52:06.61334909:46
jpodivinAt that point there should be two files in the log dir, one for undercloud-disabled-services, other for undercloud-disk-space. 09:47
jpodivinAs you can see here: https://logserver.rdoproject.org/openstack-component-validation/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-component-master-validation/80cc670/logs/undercloud/home/zuul/ansible.log.txt.gz 09:48
jpodivinThese validations are executed some time before the log dir is removed. 09:48
mariosjpodivin: yeah but that is validation component job09:49
jm1frenzyfriday|rover, marios: do we have any pending reviews for zed cockpit stuff?09:49
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.09:49
mariosjpodivin: so it fetches component-ci-testing for the validation bits (validation repo is under test here so ci-testing for that and curren-tripleo for other stuff) 09:49
marioshttps://logserver.rdoproject.org/openstack-component-validation/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-component-master-validation/80cc670/logs/undercloud/etc/yum.repos.d/validation-component.repo.txt.gz09:49
mariosjpodivin: ^^^ so it must be new enough to have the recently merged thing09:50
jpodivinit is yes. That's why I don't understand I don't see any results.09:50
mariosjpodivin: yeah b02d478d513a2b35b969ef96f766923714c4a20a via https://trunk.rdoproject.org/centos9-master/component/validation/component-ci-testing/versions.csv09:51
jpodivinmarios: logs show something is going on ... task is executed and returning "changed" 09:51
mariosjpodivin: k so... this hits all branches i guess? well, wallaby was green when i checked eg https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-network-wallaby09:51
mariosjpodivin: so it hits only master? (and zed in my test jobs)09:51
mariosjm1: looking, i know there was one thing pojadhav was looking at 09:52
mariosjm1: https://issues.redhat.com/browse/TRIPLEOCI-1249 09:52
mariospojadhav: can you add review please ^^ 09:52
mariosfrenzyfriday|rover: fyi discussion with jpodivin here we have new blocker for master (and zed) components09:53
jpodivinmarios: it used to hit only master. But VF is essentially putting the same code in master and wallaby ...09:53
jpodivinmarios: I feel like if we could figure out why my mitigation isn't working we would be able to resolve this. 09:54
mariosfrenzyfriday|rover: do you have time to file that please ? otherwise i can let me know09:54
jm1marios: ack, thanks!09:54
mariosfrenzyfriday|rover: giving you some logs in pvt so easier to track09:54
* frenzyfriday|rover checks09:54
jpodivinI have a bug right here https://bugs.launchpad.net/tripleo/+bug/199326209:54
jpodivinfrenzyfriday|rover: ^09:54
mariosah great frenzyfriday|rover ^^^ 09:54
mariosthanks jpodivin 09:54
mariosthat is cix as it blocks components 09:55
frenzyfriday|roverof awesome! thanks09:55
mariosjpodivin: should we disable until we can work it out? is there some smaller hammer than disabling all of it? https://github.com/rdo-infra/rdo-jobs/blob/a1789557651febb6b386a11bfbaabd1199da19cd/zuul.d/component-jobs-master-centos9.yaml#L638 09:56
mariosjpodivin: s/should we/we should ;)09:56
frenzyfriday|rovermarios, jpodivin there is already a fix for that? I see fix released in the Bug09:56
jpodivinmarios: wait .... I'm having a feeling these are two different things 09:56
mariosfrenzyfriday|rover: i think they were hoping https://review.opendev.org/c/openstack/validations-common/+/861716 would help09:56
mariosjpodivin: can you please update with this info ^^^ in the bug? 09:56
frenzyfriday|roverah, okay09:56
jpodivinmarios: I believe the patch is linked automatically09:57
jpodivinmarios, frenzyfriday|rover: yep, I think I'm right. These are two different issues. 09:57
frenzyfriday|roverI added promotion blocker tag, if we later decide to set the bug back to in progress it will show up on cix09:57
mariosjpodivin: https://bugs.launchpad.net/tripleo/+bug/1993262 looks like the thing i am hitting in zed though. what is the 'other issue' you are referring to? 09:59
jpodivinfrenzyfriday|rover: I think we should file a new bug actually. I've checked logs from jobs marios run, and it doesn't seem to be the same error. 09:59
mariosjpodivin: you mean the proposed/merged fix is unrelated to the bug?09:59
marios               12:10:22.112916 | primary | fatal: [undercloud]: FAILED! => {"changed": false, "msg": "The history output length 5 doesn't match the number of 09:59
marios               expected validations runs 3.\n"} 09:59
marioslooks same jpodivin ? ^ 09:59
frenzyfriday|roverjpodivin, ack, /me files a bug09:59
mariosfrenzyfriday|rover: wait09:59
mariosjpodivin: please clarify? 09:59
frenzyfriday|roverhttps://logserver.rdoproject.org/51/45451/5/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-component-zed-validation/eccce54/logs/undercloud/home/zuul/full_validation_history.log.txt.gz looks the same as  https://logserver.rdoproject.org/91/43591/11/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-component-master-validation/8bf10e0/logs/undercloud/home/zuul/full_validation_his10:01
frenzyfriday|rovertory.log.txt.gz10:01
jpodivinmarios: yes, that is true for some, but periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset039-security-zed fails on deploy10:01
jpodivinhttps://review.rdoproject.org/zuul/build/5c7645e36e4d46ef84b90d6e7ae1084210:01
mariosjpodivin: ? i know only some of those hit the issue.10:01
mariosjpodivin: so it is the same issue then 10:01
mariosfrenzyfriday|rover: no need for another bug10:01
mariosjpodivin: i have 3 there 10:01
marios        * https://logserver.rdoproject.org/51/45451/5/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-compute-zed/b7ce58b/job-output.txt10:01
marios        * https://logserver.rdoproject.org/51/45451/5/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-tripleo-zed/c9f0335/job-output.txt10:01
marios        * https://logserver.rdoproject.org/51/45451/5/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-component-zed-validation/eccce54/job-output.txt10:02
jpodivinok those are the same thing.10:02
marios        * https://logserver.rdoproject.org/51/45451/5/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-network-zed/3d29db4/job-output.txt10:02
mariosk 10:02
mariosfour in fact10:02
jpodivinSo coming back to what we were talking about. Can you think of any reason why would the task be ignored? 10:02
mariosjpodivin: you mean in the validation component? it says 'changed'? maybe it is the wrong fix/soemthing more needed? what do you mean task is ignored10:03
jpodivinmarios: that's what I mean. It shows up, but nothing really happens. 10:03
mariosjpodivin: frenzyfriday|rover: i am going to put in a patch to disable validations 10:04
mariosjpodivin: is there some way to disable just this one instead? ^^ 10:04
jpodivinmarios: what do you mean? 10:04
mariosjpodivin: i mean just this validation instead of all of them 10:05
mariosjpodivin: dont think so just looking at the task 10:05
jpodivinmarios: well, there is no one validation doing this. This is an issue of the tests we run on our framework. 10:06
mariosjpodivin: ack so i guess enable_validation: false is the way for now sounds like 10:06
mariosjpodivin: until we find the actual problem/fix10:07
jpodivinmarios: well we know what the problem is ... directory is not getting removed .10:07
marios:)10:08
mariosk then the fix 10:08
jpodivinand that's the problem. I have no idea how to fix ansible ignoring my orders. :D10:08
mariosjpodivin: k but either way it will be easier to work it out without having the pressure of blocking all component lines 10:09
mariosjpodivin: frenzyfriday|rover: added info @ https://bugs.launchpad.net/tripleo/+bug/1993262/comments/410:11
frenzyfriday|roverjm1, hey, did we have a tempest skiplist patch for https://bugs.launchpad.net/tripleo/+bug/1992668? The sc10 kvm internal is passing now.10:14
frenzyfriday|roverjm1, oh, I see the job itself was out of criteria.10:15
frenzyfriday|rovermove sc10 kvm back to criteria - https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4570710:18
*** rlandy|out is now known as rlandy10:29
rlandybhagyashris|ruck: frenzyfriday|rover: hey10:30
bhagyashris|ruckrlandy, Hi, 10:30
rlandybhagyashris|ruck: frenzyfriday|rover: want to sync?10:30
rlandyany progress with OVB on internal?10:30
bhagyashris|ruckcreated service-now ticket for container build issue  https://redhat.service-now.com/help?id=rh_ticket&is_new_order=true&table=incident&sys_id=e14a38db872e999807c9ed3c8bbb35c810:30
rlandybhagyashris|ruck: when was the last container push failure?10:31
rlandyfrenzyfriday|rover'; W+'ed your patch10:31
bhagyashris|ruckyesterday on rhos17.1 on rhel910:32
bhagyashris|ruckhttps://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-17.1-rhel9/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-build-containers-ubi-9-internal-rhel-9-build-push-upload-rhos-17.1/a69a965/job-output.txt10:32
rlandybhagyashris|ruck: so no failures today?10:32
rlandybhagyashris|ruck: I rerun 17.1 on 8 yesterday10:33
rlandywhen things started to come back10:33
rlandyworked ok10:33
bhagyashris|ruckrlandy, today rhos17 on rhel9 ran and container build pass there10:33
bhagyashris|rucklooks like situation is improving 10:34
rlandybhagyashris|ruck: and 16.210:34
rlandyso I suspect it may be better10:34
bhagyashris|ruckrlandy, and jfyi if we see the recent run of rhos17 o rhel9 10:34
bhagyashris|ruckhttps://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/buildset/f87ef31d7a2049b8aad0f6c5c99e4bed10:34
bhagyashris|ruckonly the ovb jobs are failing all the other jobs passed 10:34
bhagyashris|ruckrlandy, yes looks better now10:35
rlandybhagyashris|ruck: I expect baremetal fails as well10:35
rlandywe still have settings to change there10:35
bhagyashris|ruckrlandy, yeah10:35
bhagyashris|ruckand 16-2 is also looking better now https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/buildset/0245f621c18443ae8e735582f49cd9fd10:36
rlandybhagyashris|ruck: ok - so I am pinging dpawlik on https://redhat.service-now.com/help?id=rh_ticket&table=sc_req_item&sys_id=36aeadc78726d9d0d5cc642c8bbb3518&view=ess10:37
rlandyno action there10:37
rlandyin the mean time10:37
rlandythere are two things to investigate/fix10:37
rlandybhagyashris|ruck: the baremetal settings10:37
rlandyand why scenario010 is still failing10:37
rlandyfrenzyfriday|rover: you around?10:38
frenzyfriday|roverrlandy, yeah, reading back10:38
rlandyfrenzyfriday|rover: bhagyashris|ruck: let's sync - quicker10:38
bhagyashris|ruckrlandy, fine. but IMO we should keep that as open for few days just to check the results and if it's consistently passing then we can close the ticket10:38
frenzyfriday|roverrlandy, ack10:38
rlandyhttps://meet.google.com/ngu-joiv-crb?pli=1&authuser=010:39
rlandyfrenzyfriday|rover: bhagyashris|ruck: ^^10:39
mariosjpodivin: frenzyfriday|rover: https://review.rdoproject.org/r/c/rdo-jobs/+/4570810:41
mariosrlandy: fyi ^^ 10:41
*** ysandeep|lunch is now known as ysandeep10:46
pojadhavmarios, ack10:49
jpodivinmarios: I think I've got it. It's wrong dir. 10:51
* jpodivin facepal10:51
* jpodivin m10:51
mariosjpodivin: ok if you have fix happy to abandon that one so.. race ;)10:52
rlandybhagyashris|ruck: remove nodeset: https://code.engineering.redhat.com/gerrit/gitweb?p=openstack/tripleo-ci-internal-jobs.git;a=blob;f=zuul.d/integration-pipeline-rhos-16.2.yaml;h=23faa51783a4f27ca2f4a4e99d8b21e41adda10d;hb=HEAD10:53
rlandybhagyashris|ruck: and here: https://code.engineering.redhat.com/gerrit/gitweb?p=openstack/tripleo-ci-internal-jobs.git;a=blob;f=zuul.d/project-templates-components.yaml;h=d57bd37428caab60202829777425dd5e1f9afca3;hb=HEAD10:59
mariosjpodivin: thinking about it i think we still need the disable. you'll have to merge the fix, then we'll need it promoted through validation component in order to be available to the rest of the components11:04
mariosjpodivin: that will take at least a day so.. 11:04
jpodivinyeah, that makes sense. My fix is up, tests are running here https://review.rdoproject.org/r/c/testproject/+/4359111:06
mariosjpodivin: thanks just saw... can you please ... ah :D was going to ask for test thanks!11:07
marioswell at least i know it is not a zed specific thing so unblocking https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45611/2#message-fc804672407c139e2288ee7bd71b3757534af006 11:08
jpodivinmarios: thanks for the help. :)11:09
mariosneeds reviews please https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45611 Adds rest of component jobs to zed component criteria 11:11
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.11:11
mariosthanks for jumping on that jpodivin 11:11
rlandyarxcruz: hello - pls update this card: https://trello.com/c/3p8i2YdZ/2639-cixlp1982874tripleociproa-testcreateobjectwithtransferencoding-is-failing-on-tripleo-jobs11:11
arxcruzrlandy i will, i'm bumping tempest version on master and doing tests to make sure it is working11:12
arxcruzlet me grab the reviews for you11:12
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.11:12
arxcruzonce i have it i'll update the card11:12
arxcruzrlandy testing here https://review.rdoproject.org/r/c/testproject/+/45706 and bumping version here https://review.rdoproject.org/r/c/rdoinfo/+/45711 11:14
rlandyarxcruz: thanks - pls add those comments to  the card11:16
arxcruzok11:16
mariospojadhav: i don't see the review yet though https://issues.redhat.com/browse/TRIPLEOCI-1249 please add? 11:16
pojadhavmarios, I am working on it11:16
mariospojadhav: ah sorry, i thought i already reviewd that ? 11:17
mariospojadhav: ah maybe i confused it with your 'victoria removal' one. 11:17
pojadhavmarios, the issue was when my victoria patch got marged, promotions data were seen, so patch got reverted. so I also waiting to take help from dasm for cockpit changes for both victoria removal and addition of zed release.11:18
mariospojadhav: i thought you already posted... sorry ... so jm1 just fyi there is no patch yet pojadhav working on it11:18
pojadhavpromotions data were not seen when victoria patch got merged.11:18
mariosyeah i recall that i just thought i had also seen the zed one my bad 11:19
pojadhavmarios, still I am preparing patch for the same. will push in some time.11:19
mariosk 11:20
mariosthanks pojadhav i did not intend to pressure you to finish it quickly. i really thought you already had it and was pressing for you to put into the jira task (i couldn't find it in the reviews ;) now i know why)11:20
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.11:20
pojadhavmarios, its fine :)11:21
ysandeepchandankumar, o/ Do we have a green run after updating ctlplane ip? for this patch: https://review.opendev.org/c/openstack/tripleo-quickstart/+/86174811:22
chandankumarysandeep: yes11:22
ysandeepchandankumar, could you please pass me those logs, I want to check nova-compute logs on compute node11:23
chandankumarysandeep: I have forced failed the latest run https://review.rdoproject.org/r/c/testproject/+/45547/20/.zuul.yaml11:23
chandankumarI have hold the node also11:23
chandankumarhttps://logserver.rdoproject.org/47/45547/20/check/tripleo-ci-centos-9-standalone-external-compute-target-host1/5a777a8/job-output.txt11:23
ysandeepahh, I was checking this testproject but only checked last green run - https://review.rdoproject.org/r/c/testproject/+/45547/20#message-7595b526a061716cfadf00082cda7b30944dc20111:24
chandankumarhttps://logserver.rdoproject.org/47/45547/20/check/tripleo-ci-centos-9-standalone-external-compute-target-host1/5a777a8/logs/subnode-1/var/log/containers/nova/nova-compute.log.txt.gz11:24
chandankumarysandeep: everything is working fine11:24
chandankumarexcept controller to compute communication11:24
chandankumarwill fix later on11:24
*** dviroel|out is now known as dviroel11:25
ysandeepchandankumar, controller to compute communication - I thought changing ctlplane IP was going to solve that issue only11:26
chandankumarysandeep: so https://logserver.rdoproject.org/47/45547/19/check/tripleo-ci-centos-9-standalone-external-compute-target-host1/913dc66/logs/subnode-1/var/log/containers/nova/nova-compute.log.txt.gz is gone now11:26
chandankumarwe are seeing new issue https://logserver.rdoproject.org/47/45547/20/check/tripleo-ci-centos-9-standalone-external-compute-target-host1/5a777a8/logs/subnode-1/var/log/containers/nova/nova-compute.log.txt.gz now11:26
chandankumarsomethign wrong on control plane networking side11:27
chandankumarI have asked yatin to take a look11:27
bhagyashris|ruckrlandy, chandankumar marios scenario010 patch https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/431979 Remove single-rhel-8-4-node nodeset for scenario010 jobs11:27
bhagyashris|ruckscenario010 fix^11:27
ysandeepchandankumar, ack11:27
chandankumarysandeep: we need this patch https://review.opendev.org/c/openstack/tripleo-quickstart/+/861748 11:27
ysandeepI would say lets hold https://review.opendev.org/c/openstack/tripleo-quickstart/+/861748 , till we completely fix compute to standalone communication or atleast know what the issue is11:29
mariosbhagyashris|ruck: ack but please add related-bug>? 11:29
rlandybhagyashris|ruck; thanks  merging11:31
mariosrlandy: can we add bug before merge? 11:32
rlandywe can - I merged  as you +2'ed it11:32
mariosmust be th is one https://bugzilla.redhat.com/210540811:32
rlandyyep11:33
rlandythat's the right one11:33
rlandyysandeep: hey11:33
mariosrlandy: ok whatever its done now. next time i should -1 then ;) (I did add a comment in my defense but point taken)11:33
ysandeeprlandy, hello o/11:33
rlandymarios: I fixed the commit message with the added bug number - all done11:36
mariosrlandy: :) thank you 11:36
rlandyysandeep: hey - have a few minutes to  talk about rr tool and alternative jobs?11:36
rlandyI am thinking how to represent that11:37
ysandeeprlandy: sure, let me grab my earphones11:37
rlandychandankumar: IBM cloud scheduling is soooo slow11:37
rlandychandankumar: have you noticed that?11:38
rlandyysandeep: https://meet.google.com/cwg-jdmw-bxe?pli=1&authuser=011:38
mariosrlandy: bhagyashris|ruck: frenzyfriday|rover: jpodivin: going to merge that FYI https://review.rdoproject.org/r/c/rdo-jobs/+/45708 (disable validations for related-bug)11:38
rlandymarios: ok11:39
chandankumarrlandy: yes, so many jobs are queued11:41
chandankumardpawlik: hello11:41
chandankumardpawlik: so many jobs on ibm cloud in queued state, is it expected? job scheduling is slow11:41
mariospojadhav: i think https://issues.redhat.com/browse/TRIPLEOCI-1247 done there? if you agree please move it done when you have a minute thanks11:43
pojadhavmarios, yes its done11:44
pojadhavi will move it to done11:44
pojadhavmarios, moved to done 1247 task11:45
mariosfrenzyfriday|rover: assigned this to you https://issues.redhat.com/browse/TRIPLEOCI-1250 and moved it done ;) 11:46
mariosthanks pojadhav 11:46
bhagyashris|ruckrlandy, dns nameserver update patch https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/431983 Update dns_nameservers ip11:50
* bhagyashris|ruck tea brb11:50
rlandybhagyashris|ruck: thanks - will check11:50
rlandyysandeep: ^^ can you take a look - and then will merge11:51
rlandybhagyashris|ruck: once that patch is merged - pls testproject the 16.2 bm job11:52
rlandyand let's see if we get by11:52
ysandeepI remember jakob corrected these values last week, looking at related bug on what's the reasoning behind change11:54
dpawlikchandankumar: dunno, will check in few min11:54
dpawlikchandankumar: https://softwarefactory-project.io/grafana/d/CJAaWS3nz/provider-ibm-bm2-nodepool?orgId=111:55
dpawlik9 is in use11:55
rlandyysandeep: jm1 corrected some of them11:57
rlandythe change is needed11:57
rlandyold nameservers do not work11:57
mariosoooci please add to your queue please: criteria patches https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45611 https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42999411:57
ysandeeprlandy, bhagyashris|ruck https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/431983/1#message-5c99cf3ea082bab965e9b60355b3e2e7b36635df comment for discussion12:04
ysandeepbhagyashris|ruck, are bm jobs broken currently, If yes, I can login and crosscheck the dns setting12:05
rlandyssec12:06
ysandeepthere are not clear info in the page i have attached - it is mentioned to use first entry for PSI, LAB's, Enginnering ,and second entry for RDU12:07
chandankumardpawlik: ah ok then it  makes sense12:08
chandankumardpawlik: thank you :-)12:08
chandankumarrlandy: based on this https://softwarefactory-project.io/grafana/d/CJAaWS3nz/provider-ibm-bm2-nodepool?orgId=1 it appears to be slow12:08
frenzyfriday|rovermarios, cool, thanks12:08
ysandeeprlandy, bhagyashris|ruck ^^ my above comment.. As these BM machines are in RDU but these are LAB- I wonder if we should go ahead with first entry or second entry12:09
chandankumarrlandy: on IBM cloud, in one go, at max 10 instance gets spanned12:09
rlandychandankumar: so just loading?12:11
*** amoralej is now known as amoralej|lunch12:13
rlandyysandeep: sorry - need a few 12:13
rlandychandankumar: https://review.opendev.org/c/openstack/tripleo-docs/+/861962 - fallout from PTg?12:14
*** ysandeep is now known as ysandeep|mtg12:15
chandankumarrlandy: yes12:15
rlandychandankumar: yes to which question?12:31
chandankumarrlandy: ovb tag comes from ptg discussion12:31
chandankumarrlandy: regarding ibm, we need to wait instances to available then new nodes to get assigned12:34
rlandyack ok12:35
bhagyashris|ruckysandeep|mtg, yes bm jobs are broken 12:37
rlandyysandeep|mtg: bhagyashris|ruck: looking at dns patch12:40
rlandylet me confirm why they are broken12:40
rlandy2022-10-19 23:21:22.650522 | primary | TASK [fetch-images : Cache image by checksum] **********************************12:41
rlandy2022-10-19 23:21:22.650532 | primary | Wednesday 19 October 2022  23:21:22 -0400 (0:00:00.088)       1:47:24.299 *****12:41
rlandy2022-10-19 23:22:07.639191 | primary | fatal: [undercloud]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Connection timed out during banner exchange", "unreachable": true}12:41
rlandy2022-10-19 00:51:30.689350 | primary | TASK [overcloud-ssl : Create overcloud-create-ssl-cert.sh] *********************12:42
rlandy2022-10-19 00:51:30.689358 | primary | Wednesday 19 October 2022  00:51:30 -0400 (0:00:07.088)       0:11:22.300 *****12:42
rlandy2022-10-19 00:52:14.879502 | primary | fatal: [undercloud]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Connection timed out during banner exchange", "unreachable": true}12:42
rlandyysandeep|mtg: bhagyashris|ruck: so ysandeep|mtg is correct here12:43
rlandydns is not the issue12:43
rlandybhagyashris|ruck: sorry  for the  work ... there is another problem12:44
bhagyashris|ruckrlandy, np12:49
rlandybhagyashris|ruck: I'm going to try rekick both baremetals12:49
rlandylet's see if we can get a consistent failure here or nor12:49
rlandynot12:49
bhagyashris|ruckrlandy, ack12:50
frenzyfriday|roverhey, pls add to your review lists https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45747 (master sc10 kvm vexx in criteria)12:57
reviewbotI have added your review to the Review list12:57
*** amoralej|lunch is now known as amoralej13:09
*** dasm|off is now known as dasm13:13
dasmo/13:13
bhagyashris|ruckmarios, dasm hey should we meet for rr handover at 15 UTC (after PTG)13:16
bhagyashris|ruckrlandy, ^13:16
mariosbhagyashris|ruck: fine for me 13:18
mariosbhagyashris|ruck: if we finish in time - otherwise we can do it tomorrow morning 13:18
bhagyashris|ruckmarios, ok13:20
mariosbhagyashris|ruck: there is company call in 1.5 hours ish 13:22
dasmbhagyashris|ruck: ack13:26
dasmmarios: right, company meeting13:26
dasmarxcruz: hey! I'm sorry, I didn't want to offend you. I appologize for my comment. Marios hit the nail on the head though: you're the only one who's actually involved into containers code, hence no one else is able to help you with changes.13:27
arxcruzdas13:28
arxcruzdasm still, it's not the kind of comment i would expect in gerrit, but i would like to move on of this subject so, all good 13:29
dasmack13:29
*** ysandeep|mtg is now known as ysandeep13:44
chandankumarysandeep: ykarel thank you :-)13:45
rlandybhagyashris|ruck: frenzyfriday|rover; marios: by some miracle OVB stacks are starting now on downstream14:00
mariosrlandy: finally thanks14:00
rlandyidk how - the ticket was never updated14:00
*** dviroel is now known as dviroel|ptg14:08
mariosrlandy: well perhaps the dns issue was resolved? but the container thing was also not consistent so maybe we got lucky and have containers now ... lets see how far ovb gets :)14:10
rlandycontainer push failed on 17.114:10
rlandyI am rerunning 16.2 jobs that failed14:11
rlandythen will rekick 17.114:11
rlandytrying to see what shakes out today14:11
rlandybut if we can clean up before rr take over, that would be nice14:11
rlandycontainer push may still be an issue14:12
mariosrlandy: yeah that conatiner issue was on/off not consistent (remember you closed the ticket at one point and then we saw it again)14:12
*** ysandeep is now known as ysandeep|dinner14:29
rlandyfrenzyfriday|rover: https://logserver.rdoproject.org/openstack-periodic-integration-stable1-cs8/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby/2bcd4ce/job-output.txt14:34
rlandysame error showed up on internal14:34
mariosrlandy: frenzyfriday|rover: is that a new thing? looking at the error wonder if this is related (is it only on wallaby?) https://review.opendev.org/c/openstack/tripleo-image-elements/+/855840 14:36
mariosrlandy: i merged that yesterday ^^ 14:36
rlandymarios: let me check14:38
mariosthink it might be 14:38
rlandyI saw it on the downstream jobs14:38
rlandyand then checked upstream14:38
rlandyI think I saw it on wallaby c9 yesterday14:39
rlandyI didn;t check today14:39
frenzyfriday|roverI can see it from 2022-10-19 17:21:5714:40
rlandymarios: it's in c9 ... https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-wallaby/15798ef/job-output.txt14:40
rlandymaster was fine yesterday14:40
rlandycan we revert the merge?14:40
mariosrlandy: yeah the master for that patch merged in august, that is why i thought it was 'safe' and didn't request tests ... :/ silly me like we haven't learnt our lesson with these kinds of patches 14:41
rlandymarios: well - maybe we are missing a companion patch to make it work14:41
mariosfrenzyfriday|rover: want to do it or should I (click revert on the patch and you can try a testproject? )14:41
mariosrlandy: yeah maybe 14:41
rlandycan you link the patch?14:41
mariosrlandy: 17:36 < marios> rlandy: frenzyfriday|rover: is that a new thing? looking at the error wonder if this is related (is it only on wallaby?) https://review.opendev.org/c/openstack/tripleo-image-elements/+/855840 14:41
rlandyoh14:42
mariosmerged  Yesterday at 1:03 PM 14:42
rlandyhttps://review.opendev.org/q/topic:wallaby-bootpart - al merged14:42
mariosso it seems a likely candidate14:42
frenzyfriday|rovermarios, lemme try a tp with the revert14:42
rlandyok - let me try a revert14:42
mariosfrenzyfriday|rover: thanks 14:42
rlandyfrenzyfriday|rover: oh you're already on that?14:43
frenzyfriday|roverrlandy, yeah, revert is created , trying a tp now14:44
rlandyfrenzyfriday|rover++14:44
rlandyfrenzyfriday|rover: are you on the hardware prov chat?14:44
rlandycan you ping steve there14:44
rlandyI'll try catch him in my afternoom14:44
bhagyashris|ruckrlandy, ohh thanks :)14:47
frenzyfriday|roverrlandy, the google spaces thing right? yep14:48
mariosrlandy: maybe you can sync with dasm a bit during your day? then i will just pickup in the morning and sync with bhagyashris|ruck & frenzyfriday|rover as needed sound good? dasm ok for you? 14:49
marioswe don't have time for a sync now i think and after the company call is past working day for APAC/EU14:49
dasmmarios: sounds good.14:49
mariosrlandy: dasm: bhagyashris|ruck: frenzyfriday|rover: sound good? ^^ 14:49
frenzyfriday|rovermarios, yep, good for me14:49
mariosberb14:50
mariosbrb14:50
frenzyfriday|roverrlandy, revert: https://review.opendev.org/c/openstack/tripleo-image-elements/+/86211214:50
frenzyfriday|roveri am fixing the merge conflict, else it will not let me put depends on14:50
rlandyfrenzyfriday|rover: ack14:54
rlandymarios: sure14:54
rlandydasm: let's touch base in the afternoon14:54
rlandymarios: dasm: do you want us to start the new hackmd or you want to sort that in the morning?14:55
*** ysandeep|dinner is now known as ysandeep14:55
mariosrlandy: dasm: up to you if you start it i will just add there14:56
mariosrlandy: i'll check https://hackmd.io/2hB-P772SqyqDs0KKZzZEQ?view 14:56
marios(the index)14:56
rlandyok14:56
rlandymarios: downstream is still not in amazing shape14:57
rlandytrying to clean up a bit14:57
rlandydasm: we'll have to take stock of that and check in with nhicher later14:58
dasmmarios: i'm reviewing your https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45611 it seems to be pretty reddish over there. Some of jobs failing due to "The history output length"14:58
mariosthanks rlandy its been bad ~ a week now so lets hope we are on the tail end of the infra issues 14:58
rlandyyep - onwards an dupwards14:58
dasmI recall you talked today's morning with someone wrt similar issue. Is it still the case?14:59
mariosdasm: ack thanks for looking. so with my sales hat on: the history output length thing is known/existing (commented @ https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/45611/2#message-fc804672407c139e2288ee7bd71b3757534af006 ) 14:59
dasmchecking14:59
mariosdasm: so we don't have to block on that (not zed specific)14:59
bhagyashris|ruckrlandy, marios dasm frenzyfriday|rover ok for me 14:59
mariosdasm: for all the jobs there is at least one passing example, except fs1/network which is failing on that validations bug14:59
dasmmarios: ack, thx. I'm gonna add the comment15:00
mariosdasm: thx15:00
* bhagyashris|ruck leaving for the day ...15:00
dasmbhagyashris|ruck: o/15:01
rlandybhagyashris|ruck: have a good night15:01
frenzyfriday|roverRun build-images.sh is failing in the components, checking15:01
frenzyfriday|roverrlandy, the tp with image mount's revert: https://review.rdoproject.org/r/c/testproject/+/45405 i will check back in some time15:20
rlandyfrenzyfriday|rover: thanks - will watch that15:20
rlandyfrenzyfriday|rover: marios_: https://review.rdoproject.org/r/c/testproject/+/45405 failed15:49
rlandyrebuild images as well?15:49
rlandyovercloud-hardened-uefi-full.raw15:49
* rlandy tries15:50
*** marios_ is now known as marios15:52
mariosyeah you'll need to build the image for the ovb jobs rlandy frenzyfriday|rover 15:53
rlandyack - adding that15:53
chandankumarsee ya people!16:03
dasmchandankumar: o/16:03
mariosme off too o/ 16:03
frenzyfriday|roverohh, rlandy thanks for updating the tp16:05
*** marios is now known as marios|out16:05
rlandydasm: marios|out: frenzyfriday|rover: nice ... https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4574716:06
*** dviroel|ptg is now known as dviroel|lunch\16:06
*** amoralej is now known as amoralej|off16:07
dasmyup16:15
* ysandeep out, Have a great rest of your day everyone o/16:20
*** ysandeep is now known as ysandeep|out16:20
dasmysandeep|out: o/16:27
*** jpena is now known as jpena|off16:39
*** dviroel|lunch\ is now known as dviroel16:53
* frenzyfriday|rover is leaving for the day17:30
frenzyfriday|roverrlandy, the component jobs are failing Run build-images.sh I will check and file a bug tomorow17:31
rlandyfrenzyfriday|rover: https://review.opendev.org/c/openstack/tripleo-image-elements/+/862112 fails17:58
* rlandy check what else was w+'ed17:58
rlandywe may have to revert the whole series18:01
* rlandy creates bug18:01
rlandydasm: frenzyfriday|rover: ^^18:01
rlandydasm: you around?18:28
dasmrlandy: 'sup?18:28
rlandydasm: let's rr sync18:28
dasmk18:28
rlandydasm: https://meet.google.com/hhb-smxy-mix?pli=1&authuser=018:29
rlandydasm: https://hackmd.io/0Lev7RRlRDCj9hiNUKN1zw18:30
rlandyrcastillo: hey - when you have a moment - pls review dasm's patches on review list19:08
rlandyinfra changes19:08
rcastillorlandy: ack I'll have a look19:20
rlandyrcastillo: thanks - dasm will also talk about these patches a tuesday's community call19:20
dasmthx19:24
* jm1 out for today, have a nice evening folks :)19:55
rlandydasm: making some progress downstream22:23
dasmnice22:23
rlandy16.2 all passed expect baremtal22:23
rlandyrerunning 17.2 ovb jobs22:23
dasmi passed couple components so we're pretty clear22:23
dasmcs8 train network is almost done22:23
dasmcs8 wallaby is fully clear22:24
rlandynice22:24
rlandypls leave notes for marios22:24
dasmi left some already22:24
dasmi'm having issues with cert @downstream with rr script, so that's gonna be my next thing tomorrow22:24
rlandydasm: sending you pct msg22:29
rlandyok - it's been 12 hours22:30
rlandywill bbl to check on OVB22:30
*** rlandy is now known as rlandy|bbl22:30
dasmack22:30
rlandy|bblwill leave notes on hackmd22:31
dasmbhagyashris|ruck: frenzyfriday|rover i created new rr status: https://hackmd.io/wtT4lbOSSeuLcRS2aPTQAQ if you're gonna update anything, it would be nice to do it over there. Thanks22:46
* dasm over and out22:46
*** dasm is now known as dasm|off22:46
*** dviroel is now known as dviroel|out22:47

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!