*** jmasud has joined #oooq | 00:06 | |
*** Goneri has quit IRC | 00:08 | |
*** rlandy has quit IRC | 00:12 | |
*** Goneri has joined #oooq | 00:49 | |
*** jmasud has quit IRC | 00:55 | |
*** Goneri has quit IRC | 01:03 | |
*** saneax_AFK has joined #oooq | 02:14 | |
*** rfolco|rover has joined #oooq | 02:41 | |
*** saneax_AFK has quit IRC | 02:49 | |
*** jmasud has joined #oooq | 02:49 | |
*** jmasud has quit IRC | 03:08 | |
*** jmasud has joined #oooq | 03:25 | |
*** ykarel|away is now known as ykarel | 04:11 | |
*** jmasud has quit IRC | 04:36 | |
*** jmasud has joined #oooq | 04:38 | |
*** jtomasek has joined #oooq | 04:48 | |
*** ratailor has joined #oooq | 04:52 | |
*** udesale has joined #oooq | 04:56 | |
*** jmasud has quit IRC | 05:03 | |
*** marios has joined #oooq | 05:14 | |
*** Tengu has quit IRC | 06:07 | |
*** Tengu has joined #oooq | 06:09 | |
*** whoami-rajat has quit IRC | 06:17 | |
*** whoami-rajat has joined #oooq | 06:18 | |
*** saneax_AFK has joined #oooq | 06:22 | |
chandankumar | ysandeep, Hello | 06:44 |
---|---|---|
chandankumar | ysandeep, it is defined here https://opendev.org/openstack/tripleo-image-elements/src/branch/master/elements/interface-names/install.d/70-clear-net-ifnames#L9 | 06:45 |
chandankumar | I need to check downstream repo what is there | 06:46 |
chandankumar | here it is already sedded | 06:46 |
ysandeep | chandankumar, thank you! yes we need to get rid of it in downstream | 06:48 |
ysandeep | at least for overcloud-full.qcow | 06:48 |
chandankumar | ysandeep, can you check the respective tripleo-image-elements repo downstream | 06:48 |
ysandeep | chandankumar, yes looking currently | 06:49 |
chandankumar | ysandeep, https://code.engineering.redhat.com/gerrit/gitweb?p=openstack-tripleo-image-elements.git;a=blob;f=elements/interface-names/install.d/70-clear-net-ifnames;h=d276decd1b01301006bbbb984871cd1ffc246dad;hb=refs/heads/rhos-17.0-trunk-patches | 06:52 |
*** ratailor has quit IRC | 06:53 | |
*** jmasud has joined #oooq | 06:54 | |
*** ratailor has joined #oooq | 06:56 | |
*** saneax_AFK is now known as saneax_ | 07:00 | |
ysandeep | chandankumar, thank you | 07:07 |
*** skramaja has joined #oooq | 07:13 | |
*** ccamacho has joined #oooq | 07:24 | |
*** tosky has joined #oooq | 07:29 | |
*** amoralej|off is now known as amoralej | 07:51 | |
*** chem has quit IRC | 07:56 | |
*** jpena|off is now known as jpena | 07:56 | |
*** chem has joined #oooq | 07:58 | |
*** jmasud has quit IRC | 07:58 | |
*** dtantsur|afk is now known as dtantsur | 08:32 | |
chandankumar | cgoncalves, Hello | 08:43 |
chandankumar | cgoncalves, I need some help on fixing this issue https://logserver.rdoproject.org/36/27636/9/check/periodic-tripleo-ci-centos-8-scenario011-standalone-master/671c94f/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz | 08:44 |
chandankumar | cgoncalves, tempest.lib.exceptions.SSHTimeout: Connection to the 192.168.24.137 via SSH timed out. | 08:44 |
*** jmasud has joined #oooq | 08:48 | |
*** jbadiapa has joined #oooq | 09:10 | |
*** ykarel is now known as ykarel|lunch | 09:13 | |
cgoncalves | chandankumar, looking. something basic failed on the network side as tempest could not connect to a cirros VM via FIP | 09:14 |
*** pojadhav|ruck is now known as pojadhav|lunch | 09:26 | |
*** ysandeep is now known as ysandeep|lunch | 09:48 | |
*** ykarel|lunch is now known as ykarel | 10:04 | |
*** pojadhav|lunch is now known as pojadhav|ruck | 10:14 | |
*** sshnaidm|afk is now known as sshnaidm | 10:14 | |
*** jfrancoa has joined #oooq | 10:15 | |
*** jmasud has quit IRC | 10:17 | |
*** jmasud has joined #oooq | 10:18 | |
*** ysandeep|lunch is now known as ysandeep | 10:20 | |
ysandeep | marios, o/ Hey , Could you please add https://code.engineering.redhat.com/gerrit/#/c/201168/ to your review list.. just interface name change according to new OS - will be quick . | 10:23 |
ysandeep | patch is in "Ready to Submit" state not sure what that mean. | 10:23 |
marios | ysandeep: for d/stream we need to also click 'submit' for it to merge but i don't have permissions to do that so you'll have to ask ronelle later | 10:24 |
marios | ysandeep: she must have just forgotten | 10:25 |
ysandeep | marios, ack o/ and thank you for sharing what "Ready to Submit" means.. I was wondering about this since morning. | 10:26 |
marios | ysandeep: :) np | 10:26 |
*** derekh has joined #oooq | 10:49 | |
*** jmasud has quit IRC | 11:01 | |
chandankumar | cgoncalves, still we are seeing same issue after removing octavia kvm env file https://logserver.rdoproject.org/36/27636/10/check/periodic-tripleo-ci-centos-8-scenario011-standalone-master/bc5d28e/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz | 11:39 |
*** jpena is now known as jpena|lunch | 11:40 | |
chandankumar | are these tests triggered somewhere? so that we can see what is missing in our config | 11:40 |
cgoncalves | chandankumar, not sure what's causing that. I compared the tripleo vs rdo job definition and they look alike. I removed the octavia-kvm.yaml env just to make sure it was not causing problems | 11:51 |
cgoncalves | chandankumar, is there any job def delta between tripleo and rdo jobs that I am missing? | 11:51 |
chandankumar | cgoncalves, we have this job https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ci-centos-8-scenario010-ovn-provider-standalone where we run the same tests, but there it is also broken | 11:55 |
chandankumar | cgoncalves, the job definitions are the same there also https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_694/719695/5/check/tripleo-ci-centos-8-scenario010-ovn-provider-standalone/694af49/logs/undercloud/var/log/tempest/tempest_run.log | 11:56 |
cgoncalves | chandankumar, ignore ovn-provider. | 11:56 |
weshay|ruck | marios, hey.. help me think of which jobs we can start removing from tripleo-ci check/gate.. SO MANY JOBS :) https://review.opendev.org/#/c/730763/ | 11:57 |
weshay|ruck | that's a large wall to climb each review | 11:57 |
marios | weshay|ruck: o/ yeah that's part of the problem, maybe we can consider that if current run fails? | 12:01 |
marios | weshay|ruck: but i mean, maybe we could remove the multinodes | 12:01 |
marios | weshay|ruck: but the buildimage/containers ones we need cos that's what the change is touching... | 12:02 |
weshay|ruck | marios, ya.. not related your patch specifically.. but have been thinking about cutting down the # of jobs against tripleo-ci | 12:02 |
marios | weshay|ruck: and the scenarios too can go i think | 12:02 |
*** amoralej is now known as amoralej|lunch | 12:03 | |
marios | weshay|ruck: cos of that i guess (I am touching layout.yaml) https://opendev.org/openstack/tripleo-ci/src/branch/master/zuul.d/standalone-jobs.yaml#L145 | 12:03 |
weshay|ruck | agree.. scenarios can probably go unless the featureset file changes | 12:04 |
*** derekh has quit IRC | 12:06 | |
*** derekh has joined #oooq | 12:06 | |
cgoncalves | beagles, re: https://logserver.rdoproject.org/36/27636/10/check/periodic-tripleo-ci-centos-8-scenario011-standalone-master/bc5d28e/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz, tempest failed to connect to a cirros VM via FIP. anything that stands out that we may have tweaked in the scn010 job and need also in the RDO job side? | 12:10 |
rfolco|rover | arxcruz, chandankumar, sc10 keeps failing load balancer tempest test.. can we replace its test or fix it somehow ? https://bugs.launchpad.net/tripleo/+bug/1881584 | 12:11 |
openstack | Launchpad bug 1881584 in tripleo "SC10 train periodic job failing on LoadBalancerScenarioTest consistently" [Critical,Triaged] - Assigned to Rafael Folco (rafaelfolco) | 12:11 |
cgoncalves | beagles, https://review.rdoproject.org/r/27636 | 12:11 |
beagles | cgoncalves: I'll look ... | 12:11 |
cgoncalves | thank you | 12:11 |
rfolco|rover | arxcruz, chandankumar last failure https://review.rdoproject.org/zuul/build/8970cb6d944e465bb35af8ff0c107160 | 12:11 |
*** derekh has quit IRC | 12:12 | |
*** derekh has joined #oooq | 12:12 | |
rfolco|rover | arxcruz, chandankumar ah wes is adding one more test here https://review.opendev.org/#/c/733038/ | 12:13 |
chandankumar | rfolco|rover, I am doing some experiment here https://review.rdoproject.org/r/#/c/27912/ | 12:14 |
rfolco|rover | chandankumar, ok let me know how it goes | 12:14 |
rfolco|rover | chandankumar, let me know if you found anything or have any progress on images | 12:14 |
chandankumar | weshay|ruck, rfolco|rover please remove +w | 12:16 |
chandankumar | https://review.opendev.org/#/c/733038/1 | 12:16 |
weshay|ruck | chandankumar, problem to have both basic-ops and whitelist octavia tests? | 12:17 |
*** rlandy has joined #oooq | 12:18 | |
chandankumar | weshay|ruck, it is meant for octavia testing, there is no problem but better to have octavia tests there | 12:18 |
chandankumar | weshay|ruck, https://logserver.rdoproject.org/12/27912/1/check/periodic-tripleo-ci-centos-8-scenario010-standalone-master/9643be2/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz | 12:18 |
chandankumar | from last experiment | 12:18 |
chandankumar | I am finding out more tests to add there | 12:18 |
weshay|ruck | chandankumar, k.. just patch on top | 12:19 |
weshay|ruck | once you have more tests... you can remove basic-ops | 12:19 |
chandankumar | ok | 12:19 |
beagles | cgoncalves: nothing stands out ... I'll think about it though.. it's a bit weird ... the rdo job pretty closely mirrors everything | 12:22 |
chandankumar | rfolco|rover, I built the image locally successfully | 12:24 |
*** derekh has quit IRC | 12:24 | |
chandankumar | and in the image directories are there | 12:24 |
cgoncalves | beagles, agreed. I also compared and they are relatively the same :/ | 12:24 |
*** derekh has joined #oooq | 12:24 | |
rfolco|rover | chandankumar, that's why I suspect it's an I/O issue.... tried cache=writethrough without luck | 12:24 |
beagles | just wondering if the standalone node is configured in some way that affects the fip behavior | 12:24 |
beagles | like at the quickstart level | 12:25 |
chandankumar | sshnaidm, ^^ | 12:25 |
beagles | maybe instead of running the octavia tempest test, try just running a basic network fip test | 12:26 |
beagles | cgoncalves: does this test check access to the amphora and that's what is failing or is it through access to a user VM? | 12:27 |
cgoncalves | does neutron have such test where the client is *outside* the cloud (i.e. not a Nova VM)? | 12:27 |
beagles | hrm | 12:27 |
cgoncalves | beagles, tempest -> cirros VM SSH access | 12:27 |
*** ratailor has quit IRC | 12:28 | |
beagles | cgoncalves: ack | 12:28 |
arxcruz | rfolco|rover: chandankumar checking the logs, it's a connection issue, and we are using cirros 4, i believe those ssh issues are more stable on cirros 5 image | 12:28 |
beagles | cgoncalves: so if we just tweaked the whitelist to run a simple neutron tempest test we might learn something | 12:29 |
chandankumar | beagles, cgoncalves I will switch to neutron tests and let's see | 12:29 |
beagles | chandankumar++ | 12:29 |
beagles | thanks! | 12:29 |
cgoncalves | beagles, maybe, yes | 12:29 |
chandankumar | any specific neutron tests we want to run or test network basic ops? | 12:29 |
rfolco|rover | arxcruz, ok will re-test w/ new cirros... arxcruz chandankumar pls review this one https://review.opendev.org/#/c/733676/ | 12:30 |
cgoncalves | basic ops sounds enough but I don't know if it tests tempest->cirros via FIP | 12:30 |
chandankumar | it exercises floating ips tests | 12:31 |
cgoncalves | chandankumar, maybe just VM -> VM via FIP. I'm thinking the problem we're seeing is specifically to tempest->Nova VM | 12:31 |
*** jpena|lunch is now known as jpena | 12:38 | |
chandankumar | cgoncalves, ok | 12:40 |
chandankumar | cgoncalves, I have added network basic ops and neutron scenario tests let's see how it goes | 12:40 |
cgoncalves | ack | 12:41 |
ysandeep | rlandy, o/ hello , seem https://code.engineering.redhat.com/gerrit/#/c/201168/ will need push to submit button as well | 12:45 |
rlandy | done | 12:46 |
*** Goneri has joined #oooq | 12:48 | |
ysandeep | rlandy, thank you :) | 12:50 |
rfolco|rover | pojadhav|ruck, chandankumar weshay|ruck: I am testing ykarel's workaround - https://review.rdoproject.org/r/#/c/27998/5/playbooks/tmp.yaml --> here: https://review.rdoproject.org/r/#/c/27986/ | 12:53 |
chandankumar | aye | 12:54 |
*** udesale_ has joined #oooq | 12:54 | |
*** rlandy_ has joined #oooq | 12:54 | |
*** udesale has quit IRC | 12:57 | |
*** rlandy has quit IRC | 12:58 | |
*** rlandy_ is now known as rlandy | 12:59 | |
*** jschlueter has quit IRC | 13:01 | |
rlandy | ysandeep: weshay|ruck: I reworked the patches for baremetal so that they should be mergeable... pls see reviews in card https://tree.taiga.io/project/tripleo-ci-board/task/1762 | 13:02 |
weshay|ruck | k | 13:02 |
weshay|ruck | thanks | 13:02 |
rlandy | I left out adding the work for the actual virthost setup in every job | 13:02 |
rlandy | takes too long | 13:02 |
*** amoralej|lunch is now known as amoralej | 13:02 | |
rlandy | and we do that only on reprovision | 13:02 |
rlandy | the notes are there on how to do that | 13:02 |
rlandy | ysandeep: weshay|ruck: we are now failing on ssh to the undercloud | 13:03 |
rlandy | which I think is cloud related | 13:03 |
rlandy | since we go through the zuul jumbox | 13:03 |
rlandy | jumpbox | 13:03 |
ysandeep | rlandy, yes.. I spent many hours today - troubleshooting undercloud unreachability for Baremetal job (even though undercloud locally was running accessible) - https://sf.hosted.upshift.rdu2.redhat.com/logs/30/201630/6/check/periodic-tripleo-ci-rhel-8-bm_envA-3ctlr_1comp-featureset001-baremetal-rhos-17/873ac5a/job-output.txt but could not get what's wrong there.. apart from n/w range change public to private in zuul-info/host-info.primary.yaml | 13:03 |
rlandy | and it shows up consistently since we changed the network | 13:04 |
ysandeep | later read chat on rhos-ops about - PSI private networks with FIPs are not functional atm | 13:04 |
rlandy | will decide what to do here after the prod chain discussion | 13:04 |
*** holser has quit IRC | 13:05 | |
ysandeep | rlandy, ack, I was trying to rework prework on BM - https://review.opendev.org/#/c/734654/ but later realized you posted a similar patch with a different approach. | 13:05 |
ysandeep | I will abandon mine after few tests | 13:06 |
*** jschlueter has joined #oooq | 13:07 | |
weshay|ruck | rlandy, and now for your moment of mashugina | 13:07 |
weshay|ruck | openstack-tox-linters FAILURE in 7m 11s | 13:07 |
* weshay|ruck runs locally | 13:07 | |
*** holser has joined #oooq | 13:07 | |
rlandy | weshay|ruck: OMG lol - let me fix that | 13:08 |
rlandy | ysandeep: that's fine - good to consider all approaches | 13:09 |
rlandy | Both of them - ugh | 13:09 |
rlandy | weshay|ruck: the linter is correct - fixing | 13:10 |
weshay|ruck | rlandy, aye.. easy ones :) | 13:10 |
weshay|ruck | rlandy, reminder, prod-chain mtg at the bottom of the hour | 13:12 |
rlandy | weshay|ruck: wouldn't miss this fight for the world | 13:12 |
*** jschlueter has quit IRC | 13:15 | |
weshay|ruck | zuul++ | 13:16 |
weshay|ruck | rlandy, did you see that zuul points out the linter errors inline | 13:16 |
rlandy | weshay|ruck: yes - specially for linter idiots like me | 13:17 |
weshay|ruck | so cool | 13:17 |
weshay|ruck | zbr cool stuff :) | 13:17 |
rlandy | on my grave stone, you can write ... she put up a brave fight against linters | 13:17 |
chandankumar | rlandy, did image build work downstream? | 13:23 |
*** skramaja has quit IRC | 13:25 | |
weshay|ruck | rfolco|rover, how did you get this data? | 13:27 |
weshay|ruck | - total of 97 hits since 06-03-2020 | 13:27 |
weshay|ruck | - 75% in master, 25% in ussuri | 13:27 |
weshay|ruck | - 70% on vexxhost, 30% on rdocloud | 13:27 |
rlandy | chandankumar: hey - in what context? | 13:28 |
rlandy | chandankumar: in the job that uploads images or on BM? | 13:28 |
zbr | weshay|ruck: not sure what you refer to, i personally had only ugly discoveries today | 13:28 |
zbr | weshay|ruck: but if you can confirm something to me, it would be great | 13:28 |
zbr | are we allowed to switch to py36 everywhere with ci/deployment code? (even on maintenance branches). | 13:28 |
zbr | because now we have py36 on both c7/c8, so we no longer have excuses | 13:28 |
rfolco|rover | weshay|ruck, kibana | 13:28 |
rlandy | wrt BM, we don't know - the job fails on ssh now :( | 13:28 |
chandankumar | rlandy, on the BM | 13:28 |
rlandy | the other image build seems fine | 13:28 |
weshay|ruck | zbr, just highlighting improvements in zuul that are cool.. no action required | 13:28 |
rlandy | chandankumar: lots of issues on downstream cloud | 13:28 |
chandankumar | ok | 13:28 |
weshay|ruck | rfolco|rover, k.. great | 13:28 |
rlandy | unrelated to image build | 13:28 |
rfolco|rover | weshay|ruck, https://review.rdoproject.org/analytics/app/kibana#/discover?_g=(refreshInterval:(display:Off,pause:!f,value:0),time:(from:now-30d,mode:quick,to:now))&_a=(columns:!(_source),index:AXJvxHvGHHNS04O3aJ0_,interval:auto,query:(query_string:(query:'%22free%20physical%22')),sort:!('@timestamp',desc)) | 13:29 |
rfolco|rover | weshay|ruck, then just click on fields like branch on the left side and you'll see the % charts, so cool | 13:30 |
*** ratailor has joined #oooq | 13:30 | |
*** ykarel is now known as ykarel|afk | 13:31 | |
zbr | weshay|ruck: ahh, yeah, basically small changes that I find useful in daily tripleo work. | 13:32 |
zbr | i was surprised that nobody bothered to fix the "Failed to install some of the specified packages" one. | 13:33 |
zbr | what "some" means was more of a trade secret | 13:33 |
*** ratailor has quit IRC | 13:42 | |
*** TrevorV has joined #oooq | 14:01 | |
*** ykarel|afk is now known as ykarel | 14:02 | |
*** rlandy is now known as rlandy|mtg | 14:08 | |
*** udesale_ has quit IRC | 14:10 | |
*** ccamacho has quit IRC | 14:11 | |
zbr | who knows what leftover goodies we still have under ci-config/jenkins folder? | 14:15 |
zbr | i am asking because I have seen some usage of virtualenv there and i wanted to know if we should migrate these to py3/venv | 14:16 |
chandankumar | zbr, you can check with ykarel | 14:19 |
*** sshnaidm is now known as sshnaidm|bbl | 14:23 | |
chandankumar | cgoncalves, beagles Here is the results from neutron and network basic ops tests https://logserver.rdoproject.org/36/27636/11/check/periodic-tripleo-ci-centos-8-scenario011-standalone-master/ea111b6/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz | 14:25 |
chandankumar | {0} neutron_tempest_plugin.scenario.test_floatingip.FloatingIpSeparateNetwork.test_east_west(SRC with FIP,DEST with FIP) [55.310350s] ... ok | 14:25 |
*** jschlueter has joined #oooq | 14:26 | |
chandankumar | beagles, cgoncalves please review this one https://review.opendev.org/#/c/729518/ also when free | 14:27 |
rlandy|mtg | sshnaidm|bbl: hey .. when you get back .. you said you edited the DNS on the router - can you point me to where you made that change | 14:28 |
rlandy|mtg | DNS Name Servers | 14:30 |
rlandy|mtg | 10.11.5.19 10.5.30.45 | 14:30 |
rlandy|mtg | maybe there? | 14:30 |
*** rlandy|mtg is now known as rlandy | 14:31 | |
chandankumar | rfolco|rover, rlandy weshay|ruck https://review.rdoproject.org/r/#/c/28042/ | 14:35 |
chandankumar | rlandy, marios can we merge this https://code.engineering.redhat.com/gerrit/#/c/201152/ | 14:40 |
chandankumar | build test packages downstream working fine | 14:40 |
rlandy | chandankumar: so we need candidate and trunk deps together? | 14:41 |
rlandy | iirc, we were discussing the diff | 14:41 |
chandankumar | proof : https://sf.hosted.upshift.rdu2.redhat.com/logs/84/200084/30/check/tripleo-ci-rhel-8-standalone-rhos-17/77605b6/logs/undercloud/var/log/extra/package-list-installed.txt | 14:42 |
marios | rlandy: yeah that was my question last time too... otherwise no objection here chandankumar | 14:42 |
chandankumar | rlandy, we need both | 14:42 |
rlandy | I meant to try that with regular standalone | 14:42 |
rlandy | there was some clash with openvswitch versions | 14:42 |
rlandy | when we used used | 14:42 |
marios | chandankumar: (I don't have +2 on tripleo-environments) | 14:42 |
chandankumar | rlandy, marios I have answered in commit message | 14:42 |
rlandy | ok - let me read through | 14:42 |
chandankumar | and comments on the release file also | 14:42 |
chandankumar | marios, thanks! | 14:43 |
rlandy | just in the middle of working on the network problem | 14:43 |
chandankumar | rlandy, no problem take your time | 14:43 |
rlandy | chandankumar: I'm fine with it if it passes standalone | 14:43 |
rlandy | but I want to test project for my sanity | 14:43 |
chandankumar | rlandy, tested here https://code.engineering.redhat.com/gerrit/#/c/200084/ | 14:44 |
rlandy | on afternoon TODO list | 14:44 |
chandankumar | rlandy, aye no problem | 14:44 |
rlandy | chandankumar: ah - I see disabled by default | 14:46 |
rlandy | ok then | 14:46 |
rlandy | np | 14:46 |
rlandy | chandankumar: mind if I rebase that? | 14:47 |
chandankumar | rlandy, nope | 14:47 |
chandankumar | rlandy, go ahead | 14:47 |
rlandy | chandankumar: looks ok - will merge after we fix the network issue | 14:50 |
rlandy | don't want to change multiple things concurrently | 14:50 |
marios | rlandy: Attempt 1 of 20 to get DLRN hash failed to get a response. | 14:51 |
chandankumar | rlandy, weshay|ruck longer patch https://review.opendev.org/#/c/727200/ when free | 14:51 |
marios | rlandy: is that the error per chance? (network error) | 14:51 |
rlandy | marios: probably | 14:51 |
marios | rlandy: i am getting it there https://sf.hosted.upshift.rdu2.redhat.com/logs/43/201543/15/check/periodic-tripleo-ci-rhel-8-standalone-upgrade-rhos-17/dfb944e/logs/emit_releases_file.log | 14:51 |
rlandy | give me a bit | 14:51 |
marios | rlandy: k thanks | 14:51 |
chandankumar | rlandy, do we have any jobs running with rhos-16? | 14:51 |
chandankumar | I will test build test packages on that also | 14:52 |
rlandy | I can put in a workaround to fix the problem but I would prefer to get the source fixed - hence chatting with admins | 14:52 |
rlandy | chandankumar: ack we do - not yet though | 14:52 |
rlandy | we need to get dlrn on 16 | 14:52 |
rlandy | let's talk about that at tomorrow's sync | 14:52 |
chandankumar | yup sure | 14:52 |
rlandy | chandankumar: ^^ pls bring that up | 14:52 |
*** ccamacho has joined #oooq | 14:54 | |
*** ykarel is now known as ykarel|away | 15:01 | |
*** ysandeep is now known as ysandeep|away | 15:30 | |
weshay|ruck | rfolco|rover, pojadhav|ruck thinking about promoting https://trunk.rdoproject.org/api-centos8-ussuri/api/civotes_agg_detail.html?ref_hash=b4caf97568e312eeeaa44f69efea6400 | 15:40 |
weshay|ruck | fs35 passed, 20 failed w/ two tempest errors, fs001 failed on network... 39 passed | 15:40 |
weshay|ruck | full tempest passed in standalone jobs | 15:40 |
*** jaosorior has joined #oooq | 15:40 | |
weshay|ruck | think we have a decent enough build | 15:41 |
rfolco|rover | cool, thanks for covering promotions weshay|ruck | 15:41 |
rlandy | weshay|ruck: https://code.engineering.redhat.com/gerrit/#/c/202943/ - just fyi ... if we can't get the nameserver sorted on a cloud level | 15:44 |
chandankumar | weshay|ruck, rfolco|rover can we merge this one https://review.rdoproject.org/r/#/c/28042/ | 15:45 |
*** marios is now known as marios|out | 15:49 | |
weshay|ruck | rfolco|rover, pojadhav|ruck ok.. ussuri is promoting | 15:59 |
weshay|ruck | chandankumar++ | 16:01 |
weshay|ruck | chandankumar, +2 on dlrn internal.. but I guarantee | 16:05 |
weshay|ruck | pip: | 16:05 |
weshay|ruck | name: rdopkg | 16:05 |
weshay|ruck | virtualenv: "{{ build_repo_dir }}/dlrn-venv" | 16:05 |
weshay|ruck | state: latest | 16:05 |
weshay|ruck | when: not dlrn_pre_installed|bool | 16:05 |
weshay|ruck | using latest will bite us | 16:05 |
chandankumar | weshay|ruck, no it will not unless we get a new release on pypi | 16:14 |
weshay|ruck | right.. which we will, and we'll eventually hit something | 16:14 |
chandankumar | weshay|ruck, I will check with jpena to gate DLRN and rdopkg before releasing new builds | 16:15 |
jpena | chandankumar: what kind of gating would you need? | 16:16 |
weshay|ruck | jpena, standalone job that automatically tries to build a package | 16:16 |
chandankumar | jpena, once there is a new tag of dlrn or rdoinfo it should not break build-test-packages | 16:16 |
weshay|ruck | so has a depends-on injected into it | 16:16 |
chandankumar | *rdopkg | 16:16 |
*** jmasud has joined #oooq | 16:17 | |
weshay|ruck | chandankumar, we could write a playbook that just calls build-test-packages | 16:17 |
weshay|ruck | and injects a change | 16:17 |
jpena | hm, we do have a job for that on every commit, see https://softwarefactory-project.io/r/#/c/18473/ (dlrn-rpmbuild-tripleo-ci-oooq and dlrn-rpmbuild-tripleo-ci-oooq-rhel8) | 16:17 |
weshay|ruck | that would be much quicker | 16:17 |
jpena | maybe we just need to adapt those | 16:17 |
weshay|ruck | hot dam.. you have it | 16:18 |
weshay|ruck | jpena++ | 16:18 |
weshay|ruck | can we make it vote? | 16:19 |
jpena | we could if needed. In general, we never merge anything if one of those jobs fails, even if they are non-voting | 16:20 |
*** amoralej is now known as amoralej|off | 16:30 | |
weshay|ruck | rfolco|rover, upstream gate returning to a more stable version of itself | 16:37 |
rfolco|rover | weshay|ruck, cool, I checked twice today, pass rate is over 96% | 16:37 |
weshay|ruck | PASS | 16:40 |
weshay|ruck | 96.7% | 16:40 |
weshay|ruck | FAIL | 16:40 |
weshay|ruck | 2.9% | 16:40 |
weshay|ruck | TIMED_OUT | 16:40 |
weshay|ruck | 0.5% | 16:40 |
weshay|ruck | Ussuri Branch, Upstream Gate | 16:40 |
weshay|ruck | Last 24 hours | 16:40 |
weshay|ruck | PASS | 16:40 |
weshay|ruck | 98.2% | 16:40 |
weshay|ruck | FAIL | 16:40 |
weshay|ruck | 1.8% | 16:40 |
weshay|ruck | Train Branch, Upstream Gate | 16:40 |
weshay|ruck | Last 24 hours | 16:40 |
weshay|ruck | PASS | 16:40 |
weshay|ruck | 98.7% | 16:40 |
weshay|ruck | FAIL | 16:40 |
weshay|ruck | 1.3% | 16:40 |
weshay|ruck | Stein Branch, Upstream Gate | 16:40 |
weshay|ruck | Last 24 hours | 16:40 |
weshay|ruck | PASS | 16:41 |
weshay|ruck | 100.0% | 16:41 |
weshay|ruck | Rocky Branch, Upstream Gate | 16:41 |
weshay|ruck | Last 3 days | 16:41 |
weshay|ruck | pretty good :) | 16:41 |
rfolco|rover | nice | 16:41 |
rfolco|rover | weshay|ruck, look at this | 16:41 |
rfolco|rover | https://review.rdoproject.org/r/#/c/27986/ | 16:41 |
rfolco|rover | ykarel|away you rock | 16:41 |
weshay|ruck | chandankumar, are you familiar w/ yatins patch ^ | 16:43 |
rfolco|rover | weshay|ruck, no missing files ... maybe we need a better config for /usr/lib/tmpfiles.d/ but this is a good workaround for now | 16:43 |
rfolco|rover | chandankumar is aware weshay|ruck | 16:43 |
chandankumar | weshay|ruck, yes | 16:44 |
chandankumar | weshay|ruck, patch is merged now | 16:44 |
chandankumar | the job will be passing in periodic pipeline | 16:44 |
weshay|ruck | rfolco|rover, chandankumar so can we cover how this fixes the issue in the next comm call? | 16:44 |
rfolco|rover | weshay|ruck, https://review.rdoproject.org/r/#/c/28041 | 16:44 |
chandankumar | weshay|ruck, https://bugs.launchpad.net/tripleo/+bug/1882664/comments/6 explained here | 16:45 |
openstack | Launchpad bug 1882664 in tripleo "error: unpacking of archive failed on file /usr/share/ansible/plugins/modules/pacemaker_cluster.py;5eded785: cpio: open failed - Inappropriate ioctl for device" [High,Triaged] | 16:45 |
chandankumar | there were two issues | 16:45 |
chandankumar | one missing files and corrupted rpms | 16:45 |
chandankumar | it will fix both via workaround | 16:45 |
chandankumar | need to find a proper solution | 16:45 |
rfolco|rover | weshay|ruck, I am still looking on a better fix for py3 on c7 and a less ugly hack on systemd tmpfiles.d | 16:45 |
weshay|ruck | but how does stopping the cleanup in /tmp | 16:45 |
weshay|ruck | fix that? | 16:45 |
rfolco|rover | the cleanup is not in tmp weshay|ruck | 16:46 |
rfolco|rover | tmp.conf could be any name | 16:47 |
rfolco|rover | it is not cleaning up files older than 10d or 30d | 16:47 |
weshay|ruck | where is the src dib to 30d | 16:48 |
chandankumar | weshay|ruck, ykarel|away removed the temp cleaning condition | 16:48 |
chandankumar | from temp.conf | 16:48 |
chandankumar | /usr/lib/tmpfiles.d/tmp.conf is available in any local system generated by systemd | 16:49 |
rfolco|rover | I think some config is added during the image build/ package install, coz my rdo cloud vm does not have any 20d or 30d configured on tmp.conf | 16:50 |
weshay|ruck | chandankumar, OH WAIT | 16:51 |
weshay|ruck | are we saying that systemd is cleaning files WHILE we're building images? | 16:51 |
rfolco|rover | possibly | 16:51 |
weshay|ruck | and that's what is killing us | 16:51 |
rfolco|rover | # Clear tmp directories separately, to make them easier to override | 16:51 |
rfolco|rover | q /tmp 1777 root root 10d | 16:51 |
rfolco|rover | q /var/tmp 1777 root root 30d | 16:51 |
weshay|ruck | so the host system.. nodepool.. runs log cleanup | 16:51 |
rfolco|rover | this is what we see in a fresh vm | 16:51 |
*** marios|out has quit IRC | 16:52 | |
rfolco|rover | shouldn't clean /etc/pki right ? | 16:52 |
rfolco|rover | so some config is being added there... I think ykarel|away knows more since he held a vm and got this conclusion | 16:52 |
weshay|ruck | well.. the image is mounted while the build is going on | 16:52 |
chandankumar | dib creates these dir in /tmp as a chroot | 16:52 |
weshay|ruck | ya | 16:52 |
weshay|ruck | chandankumar, but systemd log cleanup in zuul jobs seems like something that is not needed at all | 16:53 |
weshay|ruck | since the dam job only runs for 3hours max | 16:53 |
weshay|ruck | zbr, ^ | 16:53 |
weshay|ruck | this is nuts.... | 16:53 |
rfolco|rover | private is usually an empty dir | 16:53 |
weshay|ruck | we knew the root cause would be nuts | 16:53 |
weshay|ruck | but DAM | 16:53 |
rfolco|rover | so it might think the file is not needed | 16:53 |
rfolco|rover | unused files | 16:54 |
weshay|ruck | still | 16:54 |
weshay|ruck | fuck cleaning /tmp for a 3 hour lifespan | 16:54 |
weshay|ruck | should be turned off in nodepool conf | 16:54 |
weshay|ruck | imho | 16:54 |
weshay|ruck | rfolco|rover, chandankumar how certain are we this IS the fix | 16:55 |
weshay|ruck | guess we'll see in time | 16:55 |
weshay|ruck | BUT DAAAAMMMMMM | 16:55 |
rfolco|rover | weshay|ruck, I'll respin my testproject, its working on both periodic and check | 16:56 |
rfolco|rover | and... | 16:56 |
weshay|ruck | rfolco|rover, bah.. let's just monitor the periodic jobs | 16:56 |
rfolco|rover | ykarel|away fixes periodic , we need similar fix for check | 16:56 |
weshay|ruck | rfolco|rover, no need to keep spinning | 16:56 |
rfolco|rover | ok ok | 16:56 |
rfolco|rover | I am paranoid now | 16:56 |
rfolco|rover | weshay|ruck, just need a similar fix for check | 16:56 |
weshay|ruck | ykarel|away, rfolco|rover I would think we would want the cleaning of tmp files OFF in the nodepool image | 16:56 |
rfolco|rover | weshay|ruck, ok, let me search for this how to completely disable systemd delete service | 16:57 |
*** derekh has quit IRC | 17:00 | |
*** jpena is now known as jpena|off | 17:01 | |
chandankumar | https://questions.wizardzines.com/ is very nice to learn new things | 17:09 |
*** dtantsur is now known as dtantsur|afk | 17:10 | |
*** TrevorV has quit IRC | 17:25 | |
*** TrevorV has joined #oooq | 17:29 | |
*** jbadiapa has quit IRC | 17:30 | |
*** jmasud has quit IRC | 17:36 | |
*** sshnaidm|bbl is now known as sshnaidm | 17:40 | |
sshnaidm | rlandy, it was dns in private-subnet settings | 17:40 |
rlandy | git it - thanks | 17:40 |
rlandy | got | 17:40 |
sshnaidm | rlandy, didn't help much though | 17:40 |
rlandy | k - we got it sorted | 17:41 |
rlandy | weshay|ruck: https://bugzilla.redhat.com/show_bug.cgi?id=1545850 | 17:51 |
openstack | bugzilla.redhat.com bug 1545850 in rhel-guest-image "rhel-guest-image contains a resolv.conf with an address" [High,Closed: duplicate] - Assigned to nobody | 17:51 |
rlandy | https://bugzilla.redhat.com/show_bug.cgi?id=1545842 | 17:52 |
openstack | bugzilla.redhat.com bug 1545842 in rhosp-director-images "resolv.conf contains nameserver 192.168.122.1 on rhel7.5" [High,Closed: errata] - Assigned to aschultz | 17:52 |
*** jmasud has joined #oooq | 17:54 | |
*** jbadiapa has joined #oooq | 18:06 | |
*** jmasud has quit IRC | 18:16 | |
*** rlandy is now known as rlandy|mtg | 19:05 | |
*** sshnaidm is now known as sshnaidm|afk | 19:05 | |
*** jmasud has joined #oooq | 19:54 | |
*** jmasud has quit IRC | 20:04 | |
weshay|ruck | rlandy|mtg, when you have a sec https://review.rdoproject.org/r/#/c/28044/ | 20:11 |
rlandy|mtg | looking | 20:23 |
*** rlandy|mtg is now known as rlandy | 20:23 | |
rlandy | weshay|ruck: sshnaidm|afk: Alex is right ... | 20:24 |
rlandy | periodic-tripleo-ci-rhel-8-standalone-swift-rhos-17 openstack/tripleo-ci-internal-jobs master check 202956,3 1 hr 38 mins 14 secs 2020-06-10 18:45:19 SUCCESS | 20:24 |
weshay|ruck | nice | 20:24 |
rlandy | that took the job time back down again ... | 20:25 |
rlandy | periodic-tripleo-ci-rhel-8-standalone-swift-rhos-17 openstack/tripleo-ci-internal-jobs master check 202956,3 1 hr 38 mins 14 secs 2020-06-10 18:45:19 SUCCESS | 20:25 |
rlandy | periodic-tripleo-ci-rhel-8-standalone-swift-rhos-17 openstack/tripleo-ci master openstack-component-swift master 2 hrs 1 min 21 secs 2020-06-10 17:34:09 SUCCESS | 20:25 |
rlandy | weshay|ruck: hmm ... how does this work? https://review.rdoproject.org/r/#/c/28044/1/zuul.d/project-templates-components.yaml | 20:36 |
rlandy | where the name of the pipeline is repeated? | 20:36 |
rlandy | zuul joins them all together? | 20:36 |
* rlandy checks rdocloud | 20:37 | |
weshay|ruck | the name of the pipeline or component template? | 20:37 |
weshay|ruck | that's just the project template | 20:37 |
weshay|ruck | not the zuul queue | 20:37 |
rlandy | whatever it is, it works :) | 20:38 |
weshay|ruck | https://review.rdoproject.org/r/gitweb?p=config.git;a=blob;f=zuul.d/tripleo.yaml#l18 | 20:41 |
*** rfolco|rover has quit IRC | 21:23 | |
*** jfrancoa has quit IRC | 22:07 | |
*** jmasud has joined #oooq | 22:21 | |
*** TrevorV has quit IRC | 22:30 | |
*** tosky has quit IRC | 23:15 | |
*** rlandy has quit IRC | 23:39 | |
*** jmasud has quit IRC | 23:44 | |
*** jmasud has joined #oooq | 23:59 |