*** rfolco has quit IRC | 00:32 | |
*** jmasud has quit IRC | 01:15 | |
*** Tengu has quit IRC | 03:07 | |
*** Tengu has joined #oooq | 03:09 | |
*** rlandy|ruck|bbl is now known as rlandy | 03:10 | |
*** Goneri has quit IRC | 03:11 | |
*** sanjayu_ has joined #oooq | 03:14 | |
*** dmellado has quit IRC | 03:59 | |
*** ykarel|away is now known as ykarel | 04:10 | |
*** skramaja has joined #oooq | 04:14 | |
*** aakarsh has quit IRC | 04:25 | |
*** aakarsh has joined #oooq | 04:25 | |
*** jmasud has joined #oooq | 04:33 | |
*** udesale has joined #oooq | 04:41 | |
*** ratailor has joined #oooq | 05:08 | |
*** jmasud has quit IRC | 05:10 | |
*** jmasud has joined #oooq | 05:11 | |
*** dtantsur|afk has quit IRC | 05:14 | |
*** jmasud has quit IRC | 05:17 | |
*** ysandeep|PTO is now known as ysandeep | 05:18 | |
*** jmasud has joined #oooq | 05:19 | |
*** soniya29|afk is now known as soniya29|rover | 05:26 | |
*** marios has joined #oooq | 05:31 | |
*** pojadhav|afk is now known as pojadhav | 05:37 | |
*** soniya29|rover is now known as soniya29|afk | 05:40 | |
*** jtomasek has joined #oooq | 07:01 | |
*** ccamacho has joined #oooq | 07:02 | |
*** jbadiapa has joined #oooq | 07:06 | |
*** jmasud_ has joined #oooq | 07:15 | |
*** jmasud has quit IRC | 07:17 | |
*** jfrancoa has joined #oooq | 07:19 | |
*** soniya29|afk is now known as soniya29|rover | 07:35 | |
*** tosky has joined #oooq | 07:35 | |
*** amoralej|off is now known as amoralej | 07:36 | |
marios | needs reviews please https://review.rdoproject.org/r/#/c/28161 https://review.rdoproject.org/r/#/c/28162 test at https://review.rdoproject.org/r/#/c/28106 | 07:49 |
---|---|---|
*** jpena|off is now known as jpena | 07:54 | |
*** ykarel is now known as ykarel|lunch | 08:23 | |
zbr | morning everyone! | 08:31 |
zbr | panda: ping me when ready to start the joy | 08:31 |
marios | lol zbr | 08:32 |
zbr | marios: fyi, joy = upgrading the promoter | 08:32 |
marios | yeah i imagined it was something related | 08:32 |
marios | does anyone know off-hand if ansible lets you configure itself, like lineinfile to ansible.cfg via an ansible task? :D | 08:33 |
zbr | but i seen the change reverted, due to possible issue (not really convinced). | 08:33 |
marios | i vaguely recall someone trying that and there is some protection against it but i am going to try it | 08:33 |
marios | because i have nothing else left to try | 08:34 |
zbr | marios: you can edit ansible.cfg but it will reload the file when another instance is started, not the same runtime | 08:34 |
zbr | obviously that this does not apply to zuul at all. | 08:34 |
marios | hmm... zbr yeah i was considering this. i wonder though does it apply to 2 different playbook executions... perhaps they still count as the same runtime likely | 08:34 |
marios | zbr: i.e. i am configuring in pre.yaml and want to use it in run-v3.yaml | 08:35 |
zbr | i do not know the details but my guess is that you are using it wrong if you need that | 08:35 |
marios | zbr: i suspect part of our problems stems from no/little collections support before ansible 2.9 | 08:35 |
zbr | and i would be against such practice, mainly because ansible works best if you call it only once (like zuul). | 08:35 |
marios | zbr: e.g. in the default ansible.cfg there is no collections_paths options for ansible 2.8.12 | 08:36 |
zbr | marios: don't try anything collection before 2.9, it does not worth the efforth | 08:36 |
marios | zbr: k thanks | 08:36 |
marios | zbr: i | 08:36 |
zbr | they are broken badly in 2.8, zuul will bump default ansible to 2.9 quite soon, faster than the usual cadence. | 08:36 |
marios | zbr: i'm still gonna try edit ansible.cfg by ansible task in pre, so i can say i tried it and i'm really curious to see what will happen :D | 08:36 |
*** ysandeep is now known as ysandeep|lunch | 08:37 | |
zbr | i would support any move to switch minimal ansible to 2.9 | 08:37 |
*** pojadhav is now known as pojadhav|lunch | 08:40 | |
*** ysandeep|lunch is now known as ysandeep | 08:50 | |
amoralej | hi, i've pushed a minor update of rabbitmq-server, 3.8.2 -> 3.8.3 to centos mirror, i've tested it and it should be fine | 09:04 |
amoralej | let me know if you see anything abnormal | 09:05 |
*** dtantsur has joined #oooq | 09:05 | |
*** owalsh has quit IRC | 09:08 | |
*** owalsh_ has joined #oooq | 09:08 | |
*** jmasud_ has quit IRC | 09:11 | |
*** sshnaidm|afk is now known as sshnaidm|ruck | 09:11 | |
*** jmasud has joined #oooq | 09:14 | |
*** pojadhav|lunch is now known as pojadhav | 09:29 | |
*** derekh has joined #oooq | 09:30 | |
*** udesale_ has joined #oooq | 09:46 | |
*** udesale has quit IRC | 09:49 | |
*** ratailor_ has joined #oooq | 09:51 | |
*** ratailor has quit IRC | 09:54 | |
*** marios has quit IRC | 09:54 | |
*** holser_ has joined #oooq | 10:39 | |
arxcruz | chandankumar: kopecmartin please review https://review.opendev.org/#/c/720434/ | 10:40 |
*** holser has quit IRC | 10:41 | |
*** marios has joined #oooq | 10:46 | |
*** ykarel|lunch is now known as ykarel | 10:58 | |
*** pojadhav is now known as pojadhav|afk | 11:30 | |
*** jpena is now known as jpena|lunch | 11:31 | |
soniya29|rover | sshnaidm|ruck, tripleo-ci-centos-8-standalone job is healthy and the patch https://review.opendev.org/#/c/737001/ has also got merged, still it is shown as failure in cockpit. | 11:49 |
sshnaidm|ruck | soniya29|rover, where is the failure? | 11:50 |
*** amoralej is now known as amoralej|lunch | 11:52 | |
soniya29|rover | sshnaidm|ruck, according to cockpit the above job is failing in gate, but I have looked at patch, the patch got merged by zuul | 11:52 |
*** ysandeep is now known as ysandeep|afk | 12:00 | |
*** rlandy has joined #oooq | 12:00 | |
*** rlandy is now known as rlandy|ruck | 12:01 | |
*** rfolco has joined #oooq | 12:01 | |
rlandy|ruck | sshnaidm|ruck: soniya29|rover: hey ... | 12:03 |
soniya29|rover | rlandy|ruck, hello | 12:03 |
rlandy|ruck | sshnaidm|ruck: soniya29|rover: I think we should promote queens | 12:03 |
soniya29|rover | rlandy|ruck, 7 jobs are failing in upstream gate | 12:04 |
sshnaidm|ruck | rlandy|ruck, how did it finish? | 12:04 |
rlandy|ruck | sshnaidm|ruck: how did what finish? | 12:04 |
sshnaidm|ruck | rlandy|ruck, queens | 12:05 |
rlandy|ruck | sshnaidm|ruck: it's 21 days out ... wes was just looking for the best match results and editing the criteria to promote | 12:06 |
rlandy|ruck | sshnaidm|ruck: you and I have access to the promoter | 12:06 |
rlandy|ruck | promoter server - to do that | 12:06 |
rlandy|ruck | http://38.102.83.109/centos7_queens.log | 12:06 |
rlandy|ruck | 2020-06-23 11:52:27,667 7090 INFO promoter Skipping promotion of centos7-queens from tripleo-ci-testing to current-tripleo, missing successful jobs: [u'periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-queens'] | 12:07 |
rlandy|ruck | ^^ only one missing job | 12:07 |
rlandy|ruck | 2020-06-23 11:52:27,666 7090 DEBUG promoter Successful jobs for commit: 73823d6340f694992933b7853274362a21486c91, distro: 1f2be4ddcc2ef43e98f16d5d3d32309050f6a9f2, timestamp=1592028322 | 12:08 |
* rlandy|ruck checks criteria | 12:08 | |
sshnaidm|ruck | rlandy|ruck, it's from 13 Jun | 12:09 |
rlandy|ruck | sshnaidm|ruck: better than 21 days | 12:10 |
sshnaidm|ruck | rlandy|ruck, how about 7e4577fd10d1579d72037343607f9d66b7ed1948 | 12:11 |
sshnaidm|ruck | rlandy|ruck, it's from 17 Jun | 12:11 |
rlandy|ruck | 2020-06-23 11:52:27,025 7090 INFO promoter Checking hash commit: 73823d6340f694992933b7853274362a21486c91, distro: 1f2be4ddcc2ef43e98f16d5d3d32309050f6a9f2, | 12:12 |
rlandy|ruck | that's ok | 12:12 |
rlandy|ruck | sshnaidm|ruck: the promoter should pick the latest hash - we just need to make sure the failed jobs are excluded | 12:13 |
rlandy|ruck | sshnaidm|ruck: ie: in this case fs020? | 12:13 |
* rlandy|ruck gets on promoter and checks | 12:13 | |
sshnaidm|ruck | rlandy|ruck, well, your 73823d6340f694992933b7853274362a21486c91 is better | 12:13 |
sshnaidm|ruck | rlandy|ruck, yep | 12:14 |
sshnaidm|ruck | rlandy|ruck, preparing a patch? | 12:14 |
rlandy|ruck | sshnaidm|ruck: no - checking promotion criteria on promoter - tmating you in | 12:14 |
chandankumar | rlandy|ruck, Hello, good morning | 12:15 |
chandankumar | rlandy|ruck, here I have tried the container build https://review.rdoproject.org/r/#/c/28194/ | 12:15 |
chandankumar | rlandy|ruck, and here is the base job https://review.rdoproject.org/r/28195 please have a look | 12:16 |
rlandy|ruck | chandankumar: hey - one sec - busy with promoter - will ping in a few | 12:16 |
chandankumar | sure | 12:16 |
rlandy|ruck | sshnaidm|ruck: soniya29|rover: wrt ipa - I think we can merge https://review.opendev.org/#/c/737058/ | 12:19 |
rlandy|ruck | at leats try get rid of one of the issues | 12:19 |
marios | needs reviews please https://review.rdoproject.org/r/#/c/28161 https://review.rdoproject.org/r/#/c/28162 test at https://review.rdoproject.org/r/#/c/28106 | 12:21 |
soniya29|rover | rlandy|ruck, yeah, but RDO Third party CI is failing badly, so will it be suitable in such case to get it merged? | 12:21 |
rlandy|ruck | soniya29|rover: it shouldn't impact third party at all | 12:22 |
rlandy|ruck | but you are welcome to recheck it | 12:22 |
soniya29|rover | rlandy|ruck, If its not gonna impact then we can merge since we need it now | 12:23 |
sshnaidm|ruck | soniya29|rover, 3party is failing because escalated card of ovb, that's fine | 12:23 |
rlandy|ruck | sshnaidm|ruck: ovb still failing though | 12:23 |
sshnaidm|ruck | soniya29|rover, I mean it's not fine, but.. | 12:23 |
rlandy|ruck | soniya29|rover: ipa jobs are failing gates | 12:23 |
rlandy|ruck | so I | 12:23 |
rlandy|ruck | 'm trying to get that fixed | 12:24 |
soniya29|rover | sshnaidm|ruck, rlandy|ruck, yeah, lets get that merged | 12:24 |
rlandy|ruck | sshnaidm|ruck: soniya29|rover: taking another shot at the other patch | 12:24 |
sshnaidm|ruck | soniya29|rover, rlandy|ruck, https://bugs.launchpad.net/tripleo/+bug/1884518 - Harald is on this | 12:24 |
openstack | Launchpad bug 1884518 in tripleo "OVB metalsmith deployment fails: Failed to attach VIF ... to bare metal node, Node ... is locked by host undercloud" [Critical,Triaged] | 12:24 |
rlandy|ruck | sshnaidm|ruck: yeah - good | 12:24 |
rlandy|ruck | that's a crazy failure to debug | 12:24 |
rlandy|ruck | sshnaidm|ruck: lastly - I re-uploaded the ipxe-boot image for downstream | 12:25 |
rlandy|ruck | looks the same to me | 12:25 |
sshnaidm|ruck | rlandy|ruck, yeah? | 12:25 |
rlandy|ruck | sshnaidm|ruck: maybe it's not shutting down | 12:25 |
sshnaidm|ruck | rlandy|ruck, what I saw is ironic fails to set pxe boot on node | 12:25 |
rlandy|ruck | but the last job failed on an unrelated undercloud install | 12:25 |
rlandy|ruck | will try again later today | 12:26 |
sshnaidm|ruck | rlandy|ruck, but I was able to set boot pxe manually | 12:26 |
sshnaidm|ruck | rlandy|ruck, and rerun introspection, and then node shut down :D | 12:26 |
rlandy|ruck | sshnaidm|ruck: in time? | 12:26 |
soniya29|rover | rlandy|ruck, we need workflow +1 for https://review.opendev.org/#/c/737058/ | 12:26 |
sshnaidm|ruck | rlandy|ruck, I think it just was removed by nodepool then | 12:26 |
rlandy|ruck | soniya29|rover: done - thanks | 12:26 |
rlandy|ruck | sshnaidm|ruck: let me kick that again | 12:27 |
rlandy|ruck | I really hope this works now | 12:27 |
rlandy|ruck | I am so done with downstream and ovb | 12:27 |
sshnaidm|ruck | rlandy|ruck, btw, that works pretty good: https://review.opendev.org/#/c/737289/ | 12:27 |
sshnaidm|ruck | rlandy|ruck, you can see example of failure in comments | 12:27 |
rlandy|ruck | sshnaidm|ruck: yes - I have that included in the job | 12:27 |
rlandy|ruck | see depends-on | 12:28 |
* rlandy|ruck votes | 12:28 | |
*** ysandeep|afk is now known as ysandeep | 12:28 | |
rlandy|ruck | sshnaidm|ruck: here we go again - hopefully that works | 12:33 |
*** pojadhav|afk is now known as pojadhav | 12:34 | |
*** jpena|lunch is now known as jpena | 12:35 | |
rlandy|ruck | sshnaidm|ruck: panda: pls see http://38.102.83.109/centos7_queens.log | 12:36 |
rlandy|ruck | error in log | 12:37 |
rlandy|ruck | FAIL\nERROR ========== centos-binary-haproxy IS NOT BUILT! FIX THIS ASAP! | 12:37 |
rlandy|ruck | what is that? | 12:37 |
*** derekh has quit IRC | 12:40 | |
*** derekh has joined #oooq | 12:40 | |
sshnaidm|ruck | rlandy|ruck, it didn't find containers | 12:40 |
rlandy|ruck | sshnaidm|ruck: we may have to punt until tomorrow when that pipeline runs | 12:41 |
rlandy|ruck | and then rerun failed jobs | 12:41 |
rlandy|ruck | promote our best bet then | 12:41 |
rlandy|ruck | thoughts? | 12:41 |
rlandy|ruck | soniya29|rover: https://meet.google.com/jng-htpw-avm | 12:42 |
ykarel | it's likely old container images deleted in rdo registry as not protected by a named symlink | 12:43 |
*** ratailor_ has quit IRC | 12:44 | |
rlandy|ruck | ykarel: probably - from June 17? | 12:45 |
ykarel | rlandy|ruck, iirc only 3 days are kept | 12:45 |
rlandy|ruck | well that explains things | 12:45 |
ykarel | if it's not in tripleo-ci-testing, current-tripleo etc | 12:45 |
rlandy|ruck | then we can't promote older hashed | 12:45 |
rlandy|ruck | hashes | 12:45 |
sshnaidm|ruck | rlandy|ruck, damn, again | 12:46 |
rlandy|ruck | sshnaidm|ruck: ^^ well I guess we'll just hope for a good run tomorrow | 12:46 |
rlandy|ruck | or we can edit the pipeline to run today | 12:46 |
sshnaidm|ruck | ykarel, it IS in tripleo-ci-testing though | 12:46 |
ykarel | sshnaidm|ruck, current tripleo-ci-testing | 12:46 |
ykarel | https://trunk.rdoproject.org/centos7-queens/tripleo-ci-testing/delorean.repo | 12:47 |
sshnaidm|ruck | ykarel, we should keep the previous too | 12:47 |
sshnaidm|ruck | ykarel, oh, no, it's previous-current-tripleo | 12:48 |
sshnaidm|ruck | well, that explains | 12:48 |
rlandy|ruck | tomorrow - otherwise we're out of the water here | 12:51 |
ykarel | rlandy|ruck, sshnaidm|ruck yes it's three days https://review.rdoproject.org/r/gitweb?p=rdo-infra/rdo-infra-playbooks.git;a=blob;f=roles/rdo-infra/registry-image-pruning/defaults/main.yml#l31 | 12:51 |
ykarel | if not in whitelist https://review.rdoproject.org/r/gitweb?p=rdo-infra/rdo-infra-playbooks.git;a=blob;f=roles/rdo-infra/registry-image-pruning/defaults/main.yml#l2 | 12:51 |
sshnaidm|ruck | ykarel, ack, asking if it can be depending on release | 12:52 |
sshnaidm|ruck | no need to delete every 3 days if containers are built once in 4 days | 12:52 |
ykarel | sshnaidm|ruck, for now it's 3 days, yes can be changed | 12:53 |
sshnaidm|ruck | ykarel, I think it should be not by days amount | 12:54 |
sshnaidm|ruck | but on tags amount | 12:54 |
marios | chandankumar: thanks for review i replied at https://review.rdoproject.org/r/#/c/28162/5/zuul.d/jobs-promoter-centos-7.yaml when you next have time thank you | 12:56 |
ykarel | sshnaidm|ruck, ack, u can propose and it can be evaluated | 12:57 |
ykarel | iirc earlier it used to be 10 days | 12:57 |
ykarel | but yes 3 days in this case is low | 12:58 |
ykarel | when there are no regular promotions | 12:58 |
ykarel | is there known blocker in queens? | 12:58 |
ykarel | if not a fresh run can be promoted for now to clear | 12:58 |
sshnaidm|ruck | ykarel, no blockers afaik, but it runs twice a week only | 12:59 |
ykarel | thrice | 12:59 |
sshnaidm|ruck | ykarel, I think today is next time | 13:00 |
rlandy|ruck | chandankumar: let's chat after community meeting | 13:00 |
ykarel | tomorrow, wednesday and weekend | 13:00 |
chandankumar | rlandy|ruck, sure | 13:00 |
ykarel | sshnaidm|ruck, last failure should be fixed by your patches | 13:00 |
ykarel | docker_namespace one | 13:01 |
ykarel | so i think good to trigger it and see if all clear | 13:01 |
sshnaidm|ruck | yeah, I see periodic runs fine | 13:02 |
sshnaidm|ruck | so should be ok | 13:02 |
*** amoralej|lunch is now known as amoralej | 13:05 | |
sshnaidm|ruck | rlandy|ruck, introspection fails on ussuri for 1 node of 4 | 13:27 |
marios | arxcruz, rfolco, zbr, panda, sshnaidm, rlandy, marios, ysandeep, bhagyashris, soniya29, pojadhav, akahat, weshay, chandankumar community call in 3 mins add agenda items https://hackmd.io/IhMCTNMBSF6xtqiEd9Z0Kw?both | 13:27 |
sshnaidm|ruck | rlandy|ruck, did you try larger timeout for introspection on psi? | 13:27 |
rlandy|ruck | sshnaidm|ruck: I did | 13:27 |
rlandy|ruck | still never shut down | 13:28 |
rlandy|ruck | but that was before I uploaded the newer image | 13:28 |
rlandy|ruck | let's see what this run does | 13:28 |
sshnaidm|ruck | rlandy|ruck, ok, let's hold a node also | 13:28 |
rlandy|ruck | sshnaidm|ruck: introspection always fails on one node for ussuri? | 13:28 |
sshnaidm|ruck | I'd like to experiment | 13:28 |
rlandy|ruck | sshnaidm|ruck: oh - I can do that now | 13:28 |
rlandy|ruck | I have a token | 13:28 |
sshnaidm|ruck | rlandy|ruck, from some jobs that I saw | 13:28 |
* rlandy|ruck has to try it out | 13:28 | |
rlandy|ruck | sshnaidm|ruck I didn;t see any easy way to up the timeout | 13:29 |
rlandy|ruck | I landed up editing the introspection playbook | 13:29 |
sshnaidm|ruck | rlandy|ruck, isn't it a parameter?? | 13:29 |
sshnaidm|ruck | rlandy|ruck, lemme check. | 13:29 |
marios | arxcruz, rfolco, zbr, panda, sshnaidm, rlandy, marios, ysandeep, bhagyashris, soniya29, pojadhav, akahat, weshay, chandankumar | 13:32 |
marios | community call | 13:32 |
zbr | o/ | 13:32 |
marios | E.g. the test there https://review.rdoproject.org/r/#/c/28190 is trying to use https://review.rdoproject.org/r/#/c/28189/2/ci-scripts/infra-setup/roles/get_hash/tasks/get_hash.yaml but it is ignored - no ‘sanity check’ in logs https://logserver.rdoproject.org/90/28190/2/check/periodic-tripleo-centos-8-train-component-baremetal-promote-consistent-to-component-ci-testing/bfcb93b/job-output.txt | 13:37 |
marios | sshnaidm|ruck: ^^ | 13:38 |
marios | rlandy|ruck: ^ | 13:40 |
marios | Depends-On: https://review.rdoproject.org/r/28189 | 13:41 |
chandankumar | rlandy|ruck, want to discuss container build stuff right now? | 13:49 |
rlandy|ruck | chandankumar: ack - 5 mins - just need to hold a node | 13:51 |
*** sshnaidm|ruck is now known as sshnaidm|afk | 13:51 | |
chandankumar | rlandy|ruck, sure, let me know when ready | 13:51 |
rlandy|ruck | chandankumar: k - let's meet | 14:04 |
chandankumar | rlandy|ruck, https://meet.google.com/ymj-yetp-muv?pli=1&authuser=0 | 14:04 |
rlandy|ruck | joining | 14:05 |
chandankumar | rlandy|ruck, https://review.rdoproject.org/r/#/c/28195/ | 14:09 |
marios | needs reviews please https://review.rdoproject.org/r/#/c/28161 https://review.rdoproject.org/r/#/c/28162 test at https://review.rdoproject.org/r/#/c/28106 | 14:13 |
marios | promoter staging/molecule periodics https://tree.taiga.io/project/tripleo-ci-board/task/1801 | 14:13 |
marios | thank you | 14:13 |
rlandy|ruck | sshnaidm|afk: introspection coming up | 14:14 |
chandankumar | rlandy|ruck, https://logserver.rdoproject.org/94/28194/1/check/periodic-tripleo-build-containers-ubi-8/4f642b7/logs/containers-successfully-built.log | 14:17 |
chandankumar | rlandy|ruck, https://opendev.org/openstack/python-tripleoclient/src/branch/master/tripleoclient/config/standalone.py#L182 | 14:24 |
*** TrevorV has joined #oooq | 14:24 | |
rlandy|ruck | soniya29|rover: let's touch base out failing scenario 001, 00 10 in gates before you leave | 14:29 |
soniya29|rover | rlandy|ruck, okay | 14:32 |
chandankumar | https://opendev.org/openstack/tripleo-ci/src/branch/master/roles/build-containers/templates/build.sh.j2#L61 | 14:33 |
rlandy|ruck | chandankumar: https://docs.openstack.org/project-deploy-guide/tripleo-docs/latest/deployment/3rd_party.html | 14:36 |
*** skramaja has quit IRC | 14:38 | |
*** sshnaidm|afk is now known as sshnaidm|ruck | 14:39 | |
*** ysandeep is now known as ysandeep|afk | 14:39 | |
soniya29|rover | rlandy|ruck, will be back in 1 hr | 14:43 |
*** ysandeep|afk is now known as ysandeep | 14:46 | |
*** Goneri has joined #oooq | 14:50 | |
rlandy|ruck | sshnaidm|ruck: soniya29|rover: gate is a disaster | 14:52 |
marios | please add to your review queue https://review.rdoproject.org/r/#/c/28161 https://review.rdoproject.org/r/#/c/28162 test at https://review.rdoproject.org/r/#/c/28106 promoter staging/molecule periodics https://tree.taiga.io/project/tripleo-ci-board/task/1801 | 14:53 |
sshnaidm|ruck | marios, https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/playbooks/tripleo-ci-base-promote-hash/run.yaml#L7:L7 | 14:54 |
sshnaidm|ruck | marios, so it's called from trusted repo | 14:54 |
sshnaidm|ruck | marios, bad luck | 14:54 |
marios | sshnaidm|ruck: right | 14:55 |
marios | sshnaidm|ruck: wow | 14:55 |
marios | sshnaidm|ruck: ack thanks for that | 14:55 |
*** marios is now known as marios|call | 14:59 | |
rlandy|ruck | mving here | 15:03 |
rlandy|ruck | sshnaidm|ruck: what do you want to do about promotions that are blocked by OVB? | 15:04 |
rlandy|ruck | just leave ussuri train etc until OVB is fixed? | 15:04 |
sshnaidm|ruck | rlandy|ruck, need to ignore ovb for now | 15:04 |
sshnaidm|ruck | rlandy|ruck, nope, let's promote | 15:04 |
rlandy|ruck | sshnaidm|ruck: so we can edit criteria for ussuri | 15:05 |
sshnaidm|ruck | rlandy|ruck, we already had a big delay because of wrong containers config | 15:05 |
rlandy|ruck | but I think that is the old promoter? | 15:05 |
sshnaidm|ruck | rlandy|ruck, all except master in vexx promoter | 15:05 |
sshnaidm|ruck | at least was recently | 15:05 |
sshnaidm|ruck | rlandy|ruck, well, not sure about ussuri | 15:06 |
rlandy|ruck | sshnaidm|ruck: weshay_pto said only one can deal with componentized promotions | 15:06 |
rlandy|ruck | I think that would be rdocloud then | 15:06 |
rlandy|ruck | same as master | 15:06 |
sshnaidm|ruck | k | 15:06 |
rlandy|ruck | want to jump on that one and edit? | 15:06 |
* rlandy|ruck can tmate | 15:06 | |
sshnaidm|ruck | rlandy|ruck, sure | 15:07 |
rlandy|ruck | k - sec | 15:07 |
*** owalsh_ is now known as owalsh | 15:07 | |
rlandy|ruck | sshnaidm|ruck: side note - introspection failed again on internal - node should be held for when you want to look at it | 15:10 |
rlandy|ruck | sshnaidm|ruck: let's do this - on tamte on rdocloud promoter | 15:12 |
rlandy|ruck | tamte | 15:12 |
chandankumar | rlandy|ruck, we got the testingmaster namespace for testing container push | 15:13 |
rlandy|ruck | chandankumar: cool - go wild | 15:14 |
rlandy|ruck | with --authfile | 15:14 |
chandankumar | let me put patches authfile | 15:14 |
chandankumar | patch coming soon | 15:14 |
rlandy|ruck | will be fine if we just don't collect it | 15:14 |
rlandy|ruck | awesome | 15:14 |
rlandy|ruck | anybody's guess what authfile looks like | 15:15 |
*** ykarel is now known as ykarel|afk | 15:33 | |
rlandy|ruck | sshnaidm|ruck: ugh - so my node didn't hold - will kick again -but it look like the nodes shut down when introspection fails | 15:35 |
rlandy|ruck | at the 20 min mark | 15:35 |
rlandy|ruck | when I increased the timeout, it just shutdown later | 15:35 |
*** ysandeep is now known as ysandeep|away | 15:36 | |
sshnaidm|ruck | rlandy|ruck, I think by default we retry to introspect if some node failed it | 15:36 |
rlandy|ruck | req-cc625c1d-4d47-4fdc-921e-f08db1d6907b Stop June 23, 2020, 2:35 p.m. 08e376a648608779c7d381265e42c2f2633c7fba38987122547abc8e582bfb10 - | 15:36 |
rlandy|ruck | req-ede9728d-67cc-48ac-8bcf-732b8182e3aa Start June 23, 2020, 2:15 p.m. 08e376a648608779c7d381265e42c2f2633c7fba38987122547abc8e582bfb10 | 15:36 |
sshnaidm|ruck | rlandy|ruck, I'd like to see if it really retries.. | 15:36 |
rlandy|ruck | ^^ see the shutdown at exactly 20 mins | 15:37 |
rlandy|ruck | ok - let me kick again | 15:38 |
rlandy|ruck | I'll ditch these nodes | 15:38 |
sshnaidm|ruck | rlandy|ruck, do we have logs from the last job? | 15:40 |
rlandy|ruck | sshnaidm|ruck: https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/build/7d77ca91d9674f69a3f2402bb738ae7f | 15:40 |
*** ykarel|afk is now known as ykarel | 15:55 | |
*** marios|call is now known as marios|out | 15:59 | |
sshnaidm|ruck | rlandy|ruck, where you edit timeout, can you set retries to 0? and make timeout like an hour for example | 15:59 |
rlandy|ruck | sshnaidm|ruck: in my previous try of that, the nodes just shutdown later - when introspection dies - but we'll see | 16:00 |
sshnaidm|ruck | rlandy|ruck, I'd blame retries there | 16:03 |
rlandy|ruck | sshnaidm|ruck: how so? | 16:03 |
sshnaidm|ruck | I think it doesn't wait enough before retrying, I saw only 10 seconds | 16:04 |
rlandy|ruck | interesting | 16:04 |
* rlandy|ruck tries to edit playbook | 16:10 | |
rlandy|ruck | sshnaidm|ruck: I edited the playbook ... | 16:13 |
rlandy|ruck | max_retries: 0 | 16:13 |
rlandy|ruck | node_timeout: 1800 | 16:13 |
rlandy|ruck | let's see what happens | 16:13 |
*** marios|out has quit IRC | 16:13 | |
sshnaidm|ruck | great | 16:14 |
rlandy|ruck | if I ste retries to 0, I don;t think retry_delay will work | 16:26 |
*** dtantsur is now known as dtantsur|afk | 16:29 | |
*** amoralej is now known as amoralej|off | 16:30 | |
*** ykarel is now known as ykarel|away | 16:30 | |
sshnaidm|ruck | rlandy|ruck, yeah, let's keep it 0 | 16:32 |
rlandy|ruck | k | 16:32 |
sshnaidm|ruck | rlandy|ruck, anyway need to work on delays, not the optimal now | 16:33 |
rlandy|ruck | yeah ok | 16:35 |
sshnaidm|ruck | rlandy|ruck, interesting, I see ussuri ovb passed here: https://review.opendev.org/#/c/737289/ | 16:37 |
rlandy|ruck | sshnaidm|ruck: ha - nice change :) | 16:38 |
rlandy|ruck | maybe all nodes need some extra time | 16:38 |
sshnaidm|ruck | rlandy|ruck, yeah, and maybe need to set it as a parameter, may be bigger on real bm | 16:42 |
rlandy|ruck | sshnaidm|ruck: funnily enough real bm does ok | 16:42 |
sshnaidm|ruck | ok) | 16:42 |
soniya29|rover | rlandy|ruck, gate is horrible | 17:03 |
rlandy|ruck | soniya29|rover: no worries- taken care o | 17:04 |
rlandy|ruck | of | 17:04 |
rlandy|ruck | infra | 17:04 |
rlandy|ruck | there is one last job with the yellow post failures | 17:05 |
*** udesale_ has quit IRC | 17:05 | |
rlandy|ruck | sshnaidm|ruck: introspection in action zuul@10.0.111.117 | 17:09 |
sshnaidm|ruck | rlandy|ruck, watching | 17:11 |
*** jpena is now known as jpena|off | 17:11 | |
rlandy|ruck | timeout set to 180 | 17:14 |
rlandy|ruck | 1800 | 17:14 |
*** derekh has quit IRC | 17:15 | |
*** sshnaidm|ruck is now known as sshnaidm|afk | 17:24 | |
chandankumar | rlandy|ruck, around? | 17:39 |
chandankumar | rlandy|ruck, when we do podman login -u -p | 17:39 |
rlandy|ruck | chandankumar: yes- what's up? | 17:39 |
chandankumar | then it generates auth.json file | 17:39 |
rlandy|ruck | chandankumar: before we push' | 17:39 |
chandankumar | which lives under cat /run/user/1000/containers/auth.json | 17:39 |
chandankumar | yes before push | 17:39 |
chandankumar | rlandy|ruck, we can reuse that path during push | 17:39 |
rlandy|ruck | chandankumar: so downstream I auth on every push | 17:39 |
rlandy|ruck | I think you can | 17:39 |
chandankumar | let me propose the patches | 17:40 |
chandankumar | and we are done | 17:40 |
chandankumar | ${XDG_RUNTIME_DIR}/containers/auth.json is the default path | 17:40 |
chandankumar | and will take a look tomorrow | 17:40 |
rlandy|ruck | ok | 17:44 |
rlandy|ruck | chandankumar: hey - can you look at a tempest test for a sec? | 17:49 |
rlandy|ruck | https://d23c12b0aad63e0703e8-47c67d6f96b63324a4b586b546351bb6.ssl.cf5.rackcdn.com/729465/43/check/tripleo-ci-centos-8-scenario010-standalone/ae53ac9/logs/undercloud/var/log/tempest/stestr_results.html | 17:49 |
rlandy|ruck | timing? | 17:49 |
*** jtomasek has quit IRC | 20:06 | |
*** jfrancoa has quit IRC | 20:43 | |
*** TrevorV has quit IRC | 21:35 | |
*** jbadiapa has quit IRC | 21:38 | |
*** Goneri has quit IRC | 21:46 | |
*** rlandy|ruck is now known as rlandy|ruck|bbl | 22:32 | |
*** jschlueter has quit IRC | 22:37 | |
*** tosky has quit IRC | 22:42 | |
*** chem has quit IRC | 22:53 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!