*** ysandeep|away is now known as ysandeep | 00:59 | |
slaweq | chandankumar: hi, how are You? | 05:55 |
---|---|---|
chandankumar | slaweq: Hello, I am good | 05:55 |
slaweq | chandankumar: can You help me undestand why ci in https://review.rdoproject.org/r/c/testproject/+/35120 is failing? | 05:55 |
chandankumar | slaweq: thanks for asking, How are you? | 05:55 |
chandankumar | slaweq: let me check | 05:55 |
slaweq | I wanted to test if one of the previously skipped tests will now be fine but I can't :/ | 05:56 |
slaweq | or maybe You have link to that job which is running perodically tests from skiplist? I can maybe check there if that test is passing | 05:57 |
ykarel | slaweq, may be you looking for jobs similar to https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-standalone-scenario001-skipped-tempest-master | 06:02 |
ykarel | for other release can change "master" | 06:02 |
chandankumar | slaweq: above job failed at undercloud install https://logserver.rdoproject.org/20/35120/1/check/periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-ussuri/5dde5b8/logs/undercloud/home/zuul/undercloud_install.log.txt.gz while uploading the image is givjng 404 | 06:02 |
ykarel | i just see your last two messages so missed context | 06:02 |
chandankumar | slaweq: let me take a look at ussuri skipped job | 06:02 |
slaweq | chandankumar: ykarel thx | 06:05 |
chandankumar | https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-standalone-scenario007-skipped-tempest-ussuri&project=openstack/tripleo-ci | 06:06 |
chandankumar | most of the ussuri jobs are broken | 06:06 |
chandankumar | most of the skipped ussuri jobs are failing before tempest run | 06:06 |
*** ykarel is now known as ykarel|away | 06:07 | |
ykarel|away | me out, not feeling weel | 06:07 |
chandankumar | ykarel|away: take care :-) | 06:07 |
chandankumar | slaweq: https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-stable3 so in last periodic run container build job was failing, https://review.rdoproject.org/r/c/testproject/+/35120 uses force_periodic to true, so the job will pull container with tripleo-ci-testing hash (which was suppose to be container build job which got failed in this run) are also failing | 06:11 |
chandankumar | due to missing containers on registry | 06:11 |
chandankumar | slaweq: https://review.opendev.org/c/openstack/tripleo-ci/+/806645 once this merges, will fix the container build then we can retrigger the job | 06:12 |
slaweq | chandankumar++ ykarel++ thx a lot for help | 06:14 |
slaweq | chandankumar: I can run my test job without force_periodic: true - maybe that will work right now | 06:15 |
chandankumar | slaweq: yes, but we need this https://review.rdoproject.org/r/c/rdoinfo/+/35105/1/tags/ussuri.yml fix | 06:16 |
chandankumar | let me send a patch in tq release file to install it on undercloud | 06:16 |
chandankumar | slaweq: neutron-tempest-plugin1.2.0 is available in current-tripleo | 06:19 |
chandankumar | https://trunk.rdoproject.org/centos8-ussuri/component/network/7d/59/7d5988b29b6eb27b2dc954ea287c85bfa2ec67c4_a3633030/ | 06:19 |
chandankumar | so without force_periodic it will work and we donot need to make changes in tq release file | 06:19 |
slaweq | chandankumar: so we should be good to go with https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/805899 if that test job will pass, right? | 06:23 |
chandankumar | slaweq: yes | 06:24 |
slaweq | thx | 06:24 |
*** jpena|off is now known as jpena | 07:03 | |
*** amoralej|off is now known as amoralej | 07:05 | |
bhagyashris | zbr Hi around? | 07:11 |
zbr | hi! in 30min | 07:11 |
bhagyashris | ok np | 07:12 |
zbr | i am back | 08:05 |
bhagyashris | zbr, https://meet.google.com/xnf-tvdh-pmk?authuser=0 | 08:08 |
*** arxcruz is now known as arxcruz|training | 08:16 | |
ysandeep | marios: welcome back! Hope you enjoyed your PTOs | 08:23 |
*** ysandeep is now known as ysandeep|lunch | 08:26 | |
marios | o/ ysandeep|lunch thanks | 08:29 |
soniya29 | chandankumar, kopecmartin, arxcruz|training, ysandeep|lunch, please add/edit today's agenda for the tempest meeting | 08:56 |
*** ysandeep|lunch is now known as ysandeep | 09:00 | |
*** sshnaidm|afk is now known as sshnaidm | 09:08 | |
bhagyashris | zbr, fyi https://review.opendev.org/c/openstack/tripleo-repos/+/806879 | 09:38 |
zbr | bhagyashris: no1 prio is to make sanity voting. | 09:41 |
bhagyashris | yes doing | 09:41 |
bhagyashris | zbr, Done https://review.opendev.org/c/openstack/tripleo-repos/+/806880 | 09:44 |
bhagyashris | thanks :) | 09:44 |
zbr | in fact you only need to remove the voting, default is true. | 09:45 |
bhagyashris | zbr, ok i wll do that | 09:47 |
zbr | i will be out for ~1.5h today, need to take blood test | 10:25 |
*** pojadhav is now known as pojadhav|brb | 10:46 | |
marios | zbr: chandankumar: o/ hey i am working there (checking in case you were doing something) https://review.opendev.org/c/openstack/tripleo-quickstart/+/791486/47#message-f19da2ac2f0c3aa53aee30a9330390367278e47d | 10:46 |
chandankumar | marios: please go ahead | 10:48 |
zbr | marios: chandankumar am I wrong o patch #46 did pass on both upstream/rdo. If that is true lets merge in that form and look to other changes another time. | 10:50 |
zbr | that patch is already 4mo old... | 10:51 |
zbr | marios: please +2 https://review.opendev.org/c/openstack/tripleo-repos/+/805400 asap | 10:53 |
zbr | i just realised that the fixes to sanity did were not merged. | 10:53 |
marios | zbr: ack adding to reviews | 10:54 |
zbr | i really need to ensure sanity passes as otherwise people will fail to read its output. this morning i had a talk with bhagyashris around that. | 10:54 |
*** jpena is now known as jpena|lunch | 11:30 | |
*** rlandy is now known as rlandy|rover | 11:38 | |
rlandy|rover | dviroel|ruck: hey - how are things today? | 11:39 |
dviroel|ruck | rlandy|rover: hi, the containers build fix merged, but not in time for the last train pipeline run | 11:42 |
rlandy|rover | dviroel|ruck: we can rekick that line | 11:42 |
rlandy|rover | sec | 11:42 |
dviroel|ruck | yep, would be a good idea, next run is 18:00 utc | 11:42 |
*** pojadhav|brb is now known as pojadhav | 11:43 | |
rlandy|rover | dviroel|ruck: does ussuri also need a rekick? | 11:45 |
dviroel|ruck | rlandy|rover: it should start in 15 min (12:00 utc) | 11:46 |
rlandy|rover | great | 11:46 |
rlandy|rover | dviroel|ruck: only the 16.2 tripleo component is old | 11:46 |
rlandy|rover | tripleo in 17 promoted | 11:47 |
dviroel|ruck | what is this refresh fail? | 11:48 |
rlandy|rover | dviroel|ruck: where is a refresh fail? | 11:50 |
dviroel|ruck | tripleo component 16.2 and upstream | 11:51 |
dviroel|ruck | http://tripleo-cockpit.usersys.redhat.com/d/KyHCwLHMk/rhos-16-2-full-component-pipeline?orgId=1 | 11:51 |
dviroel|ruck | like this one here https://logserver.rdoproject.org/openstack-component-tripleo/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-tripleo-train/3e4fc57/logs/] | 11:52 |
rlandy|rover | oh failure log | 11:53 |
rlandy|rover | FATAL | Wait for containers to start for step 3 using paunch | standalone-0 | error={"ansible_job_id": "984259434115.114221", "attempts": 1200, "changed": false, "finished": 0, "started": 1} | 11:54 |
rlandy|rover | https://logserver.rdoproject.org/openstack-component-tripleo/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-tripleo-train/3e4fc57/logs/undercloud/home/zuul/standalone_deploy.log.txt.gz | 11:54 |
rlandy|rover | that is a sova record | 11:54 |
rlandy|rover | for the error | 11:55 |
rlandy|rover | marios: hey - welcome back - how was your vacation? | 11:55 |
rlandy|rover | periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-tripleo-train | 11:58 |
rlandy|rover | dviroel|ruck: ^^ I don;t think that job ever passed | 11:58 |
rlandy|rover | it was an experiment in train | 11:58 |
rlandy|rover | voting afterwards | 11:58 |
marios | rlandy|rover: o/ hi was nice thanks :) | 11:58 |
soniya29 | chandankumar, rlandy|rover, weshay|ruck, arxcruz|training , tempest meeting? | 12:02 |
chandankumar | rlandy|rover: https://review.rdoproject.org/r/c/config/+/35230 please have a look when free | 12:02 |
rlandy|rover | https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/builds?job_name=periodic-tripleo-ci-rhel-8-ovb-3ctlr_1comp-featureset001-internal-tripleo-rhos-16.2&pipeline=openstack-component-tripleo | 12:03 |
rlandy|rover | ^^ that's the issue for tripleo 16.2 | 12:03 |
soniya29 | rlandy|rover, weshay|ruck, arxcruz|training, ^^ | 12:04 |
rlandy|rover | soniya29: I have a clash with the progran call | 12:04 |
ysandeep | kopecmartin: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/806574 | 12:04 |
rlandy|rover | 2021-08-31 20:26:27.331827 | primary | PLAY [Prepare the SSL Configuration for the overcloud deployment] ************** | 12:06 |
kopecmartin | ysandeep: interesting, thanks | 12:06 |
rlandy|rover | dviroel|ruck: ovb on tripleo component | 12:07 |
rlandy|rover | ysandeep: ^^ know about this error? or worth rerun on 16.2 https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-component-tripleo/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-ovb-3ctlr_1comp-featureset001-internal-tripleo-rhos-16.2/583e952/job-output.txt | 12:08 |
ysandeep | rlandy|rover: yes already rerunning it here: https://code.engineering.redhat.com/gerrit/c/testproject/+/211494 | 12:08 |
rlandy|rover | ysandeep: yep - I see thanks | 12:09 |
rlandy|rover | going to rerun periodic-tripleo-ci-rhel-8-multinode-1ctlr-featureset010-tripleo-rhos-16.2 | 12:09 |
rlandy|rover | also failed | 12:09 |
ysandeep | rlandy|rover, we have atleast 1 green run for ^^ above job for current hash in testing . http://pastebin.test.redhat.com/991032 | 12:11 |
rlandy|rover | yeo | 12:12 |
rlandy|rover | yep | 12:12 |
rlandy|rover | ysandeep: and the good news is that it's the only component that got called out | 12:13 |
*** amoralej is now known as amoralej|lunch | 12:14 | |
ysandeep | rlandy|rover: :( dang, we got new content on 27th(friday), That tempest allow bug was blocking fs001 till yesterday | 12:17 |
rlandy|rover | ysandeep: it's ok - we'll get that component cleared shortly | 12:18 |
ysandeep | rlandy|rover, fyi.. reported this one for 17 https://bugzilla.redhat.com/show_bug.cgi?id=2000070 | 12:20 |
soniya29 | rlandy|rover, okay, no problem | 12:20 |
pojadhav | arxcruz, zbr, sshnaidm, rlandy|rover , marios, ysandeep, bhagyashris, soniya29 , akahat, weshay|ruck , chandankumar, frenzy_friday, dviroel|ruck : RETRO in next 5 mins | 12:25 |
pojadhav | https://meet.google.com/kkp-bejs-vvo?authuser=0 | 12:25 |
pojadhav | https://miro.com/app/board/o9J_lz-Gr0g=/ | 12:25 |
*** jpena|lunch is now known as jpena | 12:30 | |
bhagyashris | akahat, chandankumar dviroel|ruck soniya29 sshnaidm weshay|ruck retro time | 12:31 |
soniya29 | bhagyashris, i am in :) | 12:31 |
bhagyashris | chandankumar, akahat ^ | 12:32 |
*** amoralej|lunch is now known as amoralej | 13:08 | |
chandankumar | Thank you pojadhav nice retro :-) | 13:14 |
bhagyashris | Thanks pojadhav :) | 13:15 |
dviroel|ruck | rlandy|rover: fyi https://review.rdoproject.org/r/c/testproject/+/34983/10#message-4479ab3f7f1dd2d83fd0b768a019f9e2f64684d9 | 13:15 |
pojadhav | chandankumar, bhagyashris :-) | 13:15 |
bhagyashris | dviroel|ruck, hey have a sec? just want to update few things | 13:16 |
chandankumar | sshnaidm: please have a look https://review.rdoproject.org/r/c/config/+/35230 thanks! | 13:17 |
sshnaidm | chandankumar, I'll +w, but need to keep eye on it, because you know - no testing | 13:24 |
sshnaidm | chandankumar, if something is wrong, all container jobs will fail | 13:24 |
dviroel|ruck | bhagyashris: hey, yes :) | 13:25 |
bhagyashris | give me 5 mins | 13:42 |
bhagyashris | dviroel|ruck, https://meet.google.com/vue-xagq-pjn?authuser=0 | 13:43 |
rlandy|rover | dviroel|ruck: very nice on scenario001 | 13:57 |
marios | pojadhav: o/ hey when you have time please check https://review.rdoproject.org/r/c/rdo-jobs/+/34927/6#message-8d65ee6fc3377bab74841fef1db5682f35d0da10 -1 is for the removal of the wallaby jobs i think that may be accidental? | 14:08 |
zbr | i am back | 14:31 |
zbr | rlandy|rover: dviroel|ruck a review on https://review.opendev.org/c/openstack/tripleo-repos/+/805400 would be apprecatiated. | 14:32 |
rlandy|rover | zbr: voted - will W+ once dviroel|ruck votes | 14:33 |
pojadhav | marios, thanks for review.. yeah it was accidental.. i will fix the review comments. :) | 14:35 |
dviroel|ruck | zbr: rlandy|rover review done | 14:36 |
rlandy|rover | w+'ed | 14:36 |
chandankumar | sshnaidm: rlandy|rover https://review.rdoproject.org/r/c/config/+/35155 merge merge please | 14:37 |
chandankumar | sshnaidm: found this https://review.opendev.org/c/openstack/tripleo-ansible/+/806428 in our debugging | 14:37 |
chandankumar | lessons donot use latest version of podman | 14:38 |
rlandy|rover | done | 14:39 |
chandankumar | rlandy|rover: jpena sshnaidm thanks :-) | 14:41 |
* dviroel|ruck lunch | 14:48 | |
frenzy_friday | Hey zbr arxcruz|training do you have some time, I need a little help with the ER bot | 15:12 |
*** chem is now known as Guest5989 | 15:16 | |
rlandy|rover | periodic-tripleo-ci-rhel-8-ovb-3ctlr_1comp-featureset001-internal-tripleo-rhos-16.2 (3rd attempt) p assed yay | 15:47 |
rlandy|rover | ysandeep++ | 15:48 |
*** jpena is now known as jpena|off | 16:01 | |
ysandeep | rlandy|rover: ah, nice.. tripleo component should promote soon | 16:02 |
ysandeep | rlandy|rover: we are getting close to fixing 17 bm job, https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/806724 and https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/268893 | 16:03 |
rlandy|rover | very nice | 16:03 |
*** marios is now known as marios|out | 16:04 | |
ysandeep | I hope i am rightly assuming environment_type == 'baremetal' is just set for baremetal deployments, https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/806724/2/roles/overcloud-prep-config/tasks/main.yml#34 | 16:05 |
dviroel|ruck | rlandy|rover: this one is also passing https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/805899 | 16:08 |
ysandeep | rlandy|rover: pojadhav and I have decided to have joint call thrice a week(just me and her) for like half an hour for downstream ruck/rover sync, so that we can divide and conquer the job failures.. | 16:09 |
ysandeep | It took longer than ~2 hours for us today, but hopefully things will be smooth soon | 16:10 |
ysandeep | 17 components are all green | 16:10 |
*** ysandeep is now known as ysandeep|out | 16:15 | |
* ysandeep|out out for the day, see you guys tomorrow o/ | 16:15 | |
*** amoralej is now known as amoralej|off | 16:15 | |
rlandy|rover | dviroel|ruck: ok - so we can merge those reverts | 16:25 |
rlandy|rover | dviroel|ruck: w+'ed https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/805899\ | 16:25 |
dviroel|ruck | no 100% sure that they solved the issue, but we may want to revert this one too https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/806794 and keep watching | 16:28 |
dviroel|ruck | not all succeeded again here https://review.rdoproject.org/r/c/testproject/+/34983/10#message-3994708b13e12ce0ba86f7649f5cc401f65c5166 | 16:29 |
dviroel|ruck | those tests failed in this run https://logserver.rdoproject.org/83/34983/10/check/periodic-tripleo-ci-centos-8-scenario001-standalone-glance-master/4e0d4fa/logs/ | 16:30 |
dviroel|ruck | maybe disabling cephadm in step3 mitigates the issue, but not sure if will complete solve the problem | 16:30 |
rlandy|rover | k - let me look at that one more in a few | 16:33 |
rlandy|rover | just want to get the openvswitch review in | 16:33 |
rlandy|rover | lunch brb | 16:49 |
dviroel|ruck | chasing train here https://review.rdoproject.org/r/c/testproject/+/35132 | 17:03 |
rlandy|rover | oh dear ovb | 17:11 |
rlandy|rover | thanks | 17:11 |
rlandy|rover | deployment issue | 17:12 |
rlandy|rover | 021-09-01 16:06:52 | sed: couldn't open temporary file /home/heat-admin/.ssh/sedkDjOHs: Permission denied | 17:12 |
rlandy|rover | 2021-09-01 16:06:52 | Could not import keys to one of ['192.168.24.8', '192.168.24.19', '192.168.24.15', '192.168.24.29']. Original error message: Command '['ssh', '-o', 'ConnectionAttempts=6', '-o', 'ConnectTimeout=30', '-o', 'StrictHostKeyChecking=no', '-o', 'PasswordAuthentication=no', '-o', 'UserKnownHostsFile=/dev/null', '-i', '/home/zuul/.ssh/id_rsa', '-l', 'heat-admin', '192.168.24.8', "sed -i -e '/TripleO split | 17:12 |
rlandy|rover | stack short term key/d' $HOME/.ssh/authorized_keys"]' returned non-zero exit status 4. | 17:12 |
rlandy|rover | ^^ real issue? | 17:12 |
rlandy|rover | 2021-09-01 15:49:41 | sed: couldn't open temporary file /home/heat-admin/.ssh/sedmapNtx: P | 17:13 |
rlandy|rover | dviroel|ruck: ^^ we have areal problem | 17:14 |
rlandy|rover | will likely reproduce | 17:15 |
rlandy|rover | checking component line | 17:15 |
dviroel|ruck | yeah, all ovb failed | 17:17 |
rlandy|rover | https://logserver.rdoproject.org/openstack-component-tripleo/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-tripleo-train/b80c17c/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz | 17:17 |
rlandy|rover | same problem | 17:17 |
rlandy|rover | dviroel|ruck: creating bug | 17:17 |
rlandy|rover | so we can track debug/log | 17:17 |
dviroel|ruck | ack | 17:19 |
rlandy|rover | dviroel|ruck: https://bugs.launchpad.net/tripleo/+bug/1942356 | 17:23 |
dviroel|ruck | ++ | 17:23 |
rlandy|rover | dviroel|ruck; going to ask on tripleo | 17:43 |
rlandy|rover | #tripleo | 17:43 |
rlandy|rover | no idea here :) | 17:43 |
dviroel|ruck | yeah, was digging here, no idea yet | 17:44 |
rlandy|rover | alex will know in 2 minutes | 17:45 |
rlandy|rover | dviroel|ruck: easiest is to ssh to the one of the nodes where you have a running job right now | 18:01 |
rlandy|rover | hold the node | 18:01 |
rlandy|rover | and dig around | 18:01 |
rlandy|rover | the /ssh folder permissions | 18:01 |
rlandy|rover | I need to prep for an interview candidate | 18:02 |
rlandy|rover | will look back here in a bit | 18:02 |
dviroel|ruck | yes | 18:05 |
dviroel|ruck | rlandy|rover: where should i look for the cloud-init changes? | 18:05 |
dviroel|ruck | maybe i can find something | 18:05 |
rlandy|rover | cloud_config | 18:09 |
rlandy|rover | or at least os_cloud_config is where the trouble often was | 18:09 |
rlandy|rover | cloud-init is available on the os | 18:10 |
rlandy|rover | but could be installed from elsewhere | 18:10 |
rlandy|rover | dviroel|ruck: on the node | 18:12 |
rlandy|rover | http://pastebin.test.redhat.com/991161 | 18:12 |
dviroel|ruck | and /home/heat-admin/.ssh permissions? | 18:13 |
rlandy|rover | dir not created yet | 18:14 |
rlandy|rover | still in undercloud setup | 18:14 |
rlandy|rover | if it's only ovb | 18:14 |
rlandy|rover | could be an images thing | 18:14 |
rlandy|rover | http://mirror.regionone.vexxhost-nodepool-tripleo.rdoproject.org/centos/8-stream/AppStream/x86_64/os/Packages/ | 18:15 |
rlandy|rover | dviroel|ruck: ^^ cloud-init was updated on 08/25 | 18:15 |
rlandy|rover | https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-train | 18:17 |
rlandy|rover | 08/27 was last success there | 18:17 |
rlandy|rover | failed tempest | 18:17 |
rlandy|rover | so it passed deploy | 18:17 |
rlandy|rover | dviroel|ruck: k - really need to go prep | 18:19 |
rlandy|rover | back to look again in a bit | 18:19 |
rlandy|rover | can you get onto the node? | 18:19 |
rlandy|rover | ssh zuul@38.102.83.134 | 18:19 |
dviroel|ruck | let me see | 18:19 |
rlandy|rover | your keys should be on there | 18:19 |
dviroel|ruck | hum, asking password | 18:20 |
dviroel|ruck | may not be there | 18:20 |
rlandy|rover | let me check your keys | 18:21 |
rlandy|rover | hmmm | 18:21 |
rlandy|rover | seems not | 18:21 |
* rlandy|rover gets | 18:21 | |
dviroel|ruck | https://github.com/viroel.keys | 18:21 |
rlandy|rover | try now | 18:22 |
dviroel|ruck | rlandy|rover: working now | 18:22 |
dviroel|ruck | tks | 18:22 |
rlandy|rover | dviroel|ruck: we nede to check into why your keys are not there | 18:23 |
rlandy|rover | dviroel|ruck: this is why we need to lock stream | 18:24 |
dviroel|ruck | yeah =/ | 18:24 |
rlandy|rover | also it's only train | 18:26 |
rlandy|rover | cloud-init should have hit everywhere | 18:26 |
rlandy|rover | let's take this debg as far as we can - we can pull in the DF engineers more | 18:26 |
rlandy|rover | debug | 18:26 |
rlandy|rover | https://logserver.rdoproject.org/openstack-periodic-integration-stable4/opendev.org/openstack/tripleo-ci/9d5ff817ea8d75d837d26cf34d1dd6b949d7b4e0/periodic-tripleo-centos-8-buildimage-overcloud-full-train/3cd93cf/build.log | 18:33 |
rlandy|rover | maybe from image builds | 18:33 |
rlandy|rover | 2021-09-01 12:14:35.652 | cloud-init noarch 21.1-6.el8 appstream 1.0 M | 18:33 |
dviroel|ruck | is this node on hold? 38.102.83.134 | 18:49 |
rlandy|rover | not at the moment | 18:50 |
* rlandy|rover holds | 18:50 | |
rlandy|rover | one sec | 18:50 |
rlandy|rover | dviroel|ruck: should be held now | 18:52 |
dviroel|ruck | great, thanks | 18:52 |
rlandy|rover | comparing cloud init in previous image build logs | 18:52 |
dviroel|ruck | i did compare periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-tripleo-train, passing and failing | 18:54 |
dviroel|ruck | last succeeded: | 18:54 |
dviroel|ruck | cloud-init.noarch 21.1-3.el8 @appstream | 18:54 |
dviroel|ruck | first failure: | 18:54 |
dviroel|ruck | cloud-init.noarch 21.1-6.el8 @appstream | 18:54 |
rlandy|rover | yep | 18:54 |
rlandy|rover | dviroel|ruck: so on the held node | 18:55 |
rlandy|rover | once it fails | 18:55 |
rlandy|rover | downgrade cloud-init and run deploy again | 18:55 |
rlandy|rover | let's see if we can get it to work with downgraded rpm | 18:55 |
rlandy|rover | in fact downgrade now if you can | 18:55 |
rlandy|rover | are both available for install | 18:55 |
rlandy|rover | dviroel|ruck: ^^ was that compare on the undercloud or overcloud? | 18:58 |
dviroel|ruck | overcloud | 18:58 |
dviroel|ruck | https://logserver.rdoproject.org/openstack-component-tripleo/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-tripleo-train/cbc1d80/logs/overcloud-controller-1/var/log/extra/package-list-installed.txt.gz | 18:58 |
rlandy|rover | ok - so image builds | 18:58 |
rlandy|rover | hmmm ... checking current-tripleo | 19:02 |
dviroel|ruck | rlandy|rover: http://pastebin.test.redhat.com/991174 | 19:05 |
rlandy|rover | pls put that in the bug for tracking | 19:06 |
dviroel|ruck | ack | 19:07 |
dviroel|ruck | same thing for other overcloud note | 19:08 |
dviroel|ruck | deploy just failed | 19:08 |
rlandy|rover | https://github.com/openstack/os-net-config/commit/69699cda8ff1deda7c121be7660145a793258766 | 19:12 |
rlandy|rover | maybe | 19:12 |
rlandy|rover | we may need to use old overcloud images | 19:19 |
rlandy|rover | and deploy with those | 19:19 |
dviroel|ruck | cloud-init.log | 19:24 |
dviroel|ruck | https://logserver.rdoproject.org/openstack-component-tripleo/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-tripleo-train/cbc1d80/logs/overcloud-controller-0/var/log/cloud-init.log.txt.gz | 19:24 |
dviroel|ruck | .. | 19:24 |
dviroel|ruck | 2021-08-28 21:57:54,216 - util.py[DEBUG]: Changing the ownership of /home/heat-admin/.ssh to 0:0 | 19:24 |
dviroel|ruck | while in passing jobs is: | 19:24 |
dviroel|ruck | 2021-08-27 21:52:29,295 - util.py[DEBUG]: Changing the ownership of /home/heat-admin/.ssh to 1000:1001 | 19:24 |
dviroel|ruck | need to check overcloud cloud-init scripts | 19:25 |
rlandy|rover | going to chase ussuri | 19:32 |
rlandy|rover | #- periodic-tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset002-ussuri | 19:34 |
rlandy|rover | #- periodic-tripleo-ci-centos-8-scenario010-standalone-ussuri | 19:34 |
rlandy|rover | actually should promote | 19:34 |
rlandy|rover | 2021-09-01 19:04:04,654 3791530 INFO promoter Running: env ANSIBLE_LOG_PATH=/home/promoter/web/promoter_logs/container-push/20210901-190351.log ANSIBLE_DEBUG=False ansible-playbook -v -e @/tmp/tmpjasfarz9.yaml /home/promoter/ci-config/ci-scripts/container-push/container-push.yml | 19:35 |
rlandy|rover | yes it is | 19:35 |
rlandy|rover | dviroel|ruck: pls paste all your debug into the LP s we can hand it over to frenzy_friday in the start of her morning | 19:36 |
rlandy|rover | and see if we should lock cloud-init | 19:36 |
dviroel|ruck | rlandy|rover: ok, will do | 19:40 |
* dviroel|ruck need to take a walk, brb in a couple of min | 19:51 | |
rlandy|rover | sure | 20:01 |
*** rlandy|rover is now known as rlandy|rover|mtg | 20:29 | |
dviroel|ruck | job from master https://logserver.rdoproject.org/26/806926/2/openstack-check/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001/77510da/logs/overcloud-controller-0/var/log/extra/package-list-installed.txt.gz | 20:39 |
dviroel|ruck | cloud-init.noarch 21.1-6.el8 @appstream | 20:39 |
dviroel|ruck | 2021-09-01 18:00:30,325 - util.py[DEBUG]: Changing the ownership of /home/heat-admin/.ssh to 0:0 | 20:40 |
dviroel|ruck | but don't fail | 20:40 |
*** rlandy|rover|mtg is now known as rlandy|rover | 21:19 | |
rlandy|rover | dviroel|ruck: hey | 21:19 |
rlandy|rover | dviroel|ruck: no worries - you can just leave all your notes on the LP | 21:20 |
rlandy|rover | Adding a ping now for frenzy_friday and chandankumar to take a look in their morning | 21:20 |
rlandy|rover | also updating the ussuri standalone upgrades card | 21:20 |
dviroel|ruck | rlandy|rover: doing some tests here too | 21:21 |
dviroel|ruck | in the node | 21:21 |
rlandy|rover | sure | 21:24 |
rlandy|rover | updated the standalone-ussuri-upgrade card | 21:28 |
rlandy|rover | the solution is right but in the wrong place - will have to check with marios about the best place to put it. | 21:28 |
rlandy|rover | dviroel|ruck: ^^ | 21:28 |
dviroel|ruck | rlandy|rover: i have a workaround for this bug, without locking cloud-init version | 21:43 |
dviroel|ruck | in tripleoclient | 21:43 |
rlandy|rover | dviroel|ruck++ very nice | 21:56 |
dviroel|ruck | rlandy|rover: https://review.opendev.org/c/openstack/python-tripleoclient/+/806993 | 22:14 |
dviroel|ruck | only affects stable/train | 22:14 |
* rlandy|rover looks | 22:15 | |
dviroel|ruck | another possible fix is to make cloud-init set the expected permissions on .ssh dir | 22:15 |
rlandy|rover | dviroel|ruck: this seems reasonable to me | 22:16 |
rlandy|rover | are you still testing it? | 22:17 |
rlandy|rover | when ready pls add kevin and alex on the review | 22:17 |
rlandy|rover | maybe cedric | 22:17 |
rlandy|rover | so they can vote on it | 22:17 |
dviroel|ruck | tested in the node hold | 22:18 |
rlandy|rover | dviroel|ruck: also - you can add Closes-Bug: #1942356 | 22:18 |
rlandy|rover | ok | 22:18 |
rlandy|rover | so you can remove the WIP then and put it in for review | 22:18 |
dviroel|ruck | yes, set as wip | 22:18 |
rlandy|rover | dviroel|ruck: thank you for following this all up!! | 22:19 |
rlandy|rover | very nicely done | 22:19 |
dviroel|ruck | will this work in testproject with depends-on? | 22:19 |
rlandy|rover | yes it should | 22:20 |
rlandy|rover | because the change is not in the image build | 22:20 |
rlandy|rover | but the cloud actions | 22:20 |
rlandy|rover | we should get build-test packages on that | 22:20 |
rlandy|rover | dviroel|ruck: stepping away for a few hours (going to volunteer job) - will check back in on the test run later | 22:23 |
*** rlandy|rover is now known as rlandy|rover|bbl | 22:23 | |
dviroel|ruck | ok | 22:24 |
*** dviroel|ruck is now known as dviroel|out | 22:43 | |
dviroel|out | https://review.rdoproject.org/r/c/testproject/+/35235 | 22:43 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!