Wednesday, 2020-01-29

*** yolanda has quit IRC00:06
*** tosky has quit IRC00:08
*** yolanda has joined #oooq00:09
*** matbu has quit IRC00:09
*** matbu has joined #oooq00:10
*** ysandeep has joined #oooq01:02
*** jmasud has joined #oooq01:11
*** hamzy has quit IRC01:56
*** hamzy has joined #oooq01:56
*** jrist has quit IRC01:57
*** jpena|off has quit IRC01:57
*** weshay has joined #oooq01:57
*** kopecmartin has quit IRC01:57
*** weshay|ruck has quit IRC01:57
*** migi has quit IRC01:58
*** kopecmartin has joined #oooq02:00
*** jpena|off has joined #oooq02:05
*** rfolco has joined #oooq02:31
*** rfolco has quit IRC02:36
*** rlandy|bbl is now known as rlandy03:29
*** rlandy has quit IRC03:42
*** jmasud has quit IRC04:12
*** jmasud has joined #oooq04:13
*** udesale has joined #oooq04:13
*** skramaja has joined #oooq04:17
*** ykarel|away is now known as ykarel04:35
weshayzbr, upstream logs are still busted.. please check it out... zipped logs :(05:00
*** raukadah is now known as chkumar|rover05:13
*** saneax has joined #oooq05:19
*** marios has joined #oooq06:17
*** dsneddon has quit IRC06:45
*** ratailor has joined #oooq06:45
*** dsneddon has joined #oooq06:45
*** jfrancoa has joined #oooq07:00
*** soniya29 has joined #oooq07:08
*** ykarel is now known as ykarel|lunch07:42
*** dtantsur|afk is now known as dtantsur07:54
*** saneax has quit IRC07:55
zbryep, i've seen it07:59
*** chem has quit IRC08:00
*** chem has joined #oooq08:01
*** tesseract has joined #oooq08:03
*** jmasud has quit IRC08:11
*** jtomasek has joined #oooq08:12
*** jmasud has joined #oooq08:14
*** holser has joined #oooq08:14
*** holser has quit IRC08:19
*** holser has joined #oooq08:20
*** bogdando has joined #oooq08:25
*** holser has quit IRC08:26
*** soniya29 has quit IRC08:31
*** soniya29 has joined #oooq08:31
mariosbiab08:32
*** marios has quit IRC08:32
*** tosky has joined #oooq08:35
*** soniya29 has quit IRC08:50
*** soniya29 has joined #oooq08:50
*** jpena|off is now known as jpena08:54
*** amoralej|off is now known as amoralej08:58
*** ykarel|lunch is now known as ykarel09:04
chkumar|rovermatbu, hello, please have a look at this review https://review.rdoproject.org/r/#/c/24719/ it requires in tht spec file, thanks :-)09:20
*** jbadiapa has joined #oooq09:27
*** holser has joined #oooq09:36
*** marios has joined #oooq09:41
matbuchkumar|rover: yep i will change that asap thx :)09:48
zbrpanda: I am doing some cleanups in very old tq* repos, i hope you do not mind me abandoning stuff like https://review.opendev.org/#/c/583916/09:49
zbri would assume that any review >1y that is unmergeable, broken or with unanswered questions is safe to abandon. (abandon != delete )09:50
zbrlucky for us, infra disabled delete09:50
pandazbr: Noooooo that's absolutely a critical patch !09:53
pandazbr: another couple of years and I surely can get it merged.09:54
chkumar|roverykarel, https://bugs.launchpad.net/tripleo/+bug/1832166/comments/1009:54
openstackLaunchpad bug 1832166 in tripleo "[FUll Tempest job][master][periodic]3 Tempest tests related to DNS failing " [Critical,Fix released] - Assigned to Ronelle Landy (rlandy)09:54
zbrpanda: right after openstack moves from irc to something else.09:55
ykarelchkumar|rover, ack09:57
pandazbr: unix talk ?09:59
*** ysandeep has quit IRC10:02
sshnaidmzbr, I think you can abandon only if it has -1, either from CI or people10:08
sshnaidmzbr, not sure about patches with +210:08
zbri think we need to use common sense, making a decision based on all factors. we should either aim to workflow it or abandon it. clearly that +2 pushes towards merging.10:11
zbrbut a +2 added two years ago on a change that needs rebase may not be a real +2 :D10:11
zbrfew months back I was inclined to support typo fixes, but since some people tried to game the system and do a single typo per patch, I decided to abandon them.10:12
zbri suspect they even used bots to create them10:13
mariossshnaidm: zbr: imho if they are that old/untouched... to be clear zbr IMO 'untouched six months' means from the latest comment there, not from the patch submission date right?10:13
*** derekh has joined #oooq10:13
mariossshnaidm: zbr: if you disagree just click restore. this is too much wasted effort on typing for discussion, imho ;D10:13
zbrmarios: true, last touch matters, not age.10:13
chkumar|roverarxcruz, testing ssh tempest patch here https://review.rdoproject.org/r/2472310:14
zbrwe all know we age like wine10:14
sshnaidmmarios, yeah, untouched, but not mergeable10:14
sshnaidmmarios, if you see untouched patch with +2 you can also +w it :)10:14
mariossshnaidm: well, not really. at least you would have to recheck it first10:14
mariossshnaidm: and maybe ping them say hey wassup with that? should we merge it?10:15
sshnaidmyes, and then +w after recheck10:15
mariossshnaidm: s/would/should recheck it first, at least everyone on this team should10:15
mariossshnaidm: yeah thats fair. i mean if in six months it didn't need rebase for conflict, and it has green ci run. then its like the jesus patch10:15
marios:D10:15
mariossshnaidm: like we HAVE to merge it!10:16
sshnaidmI like these patches with changing one letter in comment10:17
sshnaidmso like that I'd -2 them all..10:17
mariossshnaidm: yeah i'm conflicted sometimes on those ones. like where is the line. on one hand it could be folks looking to get involved. on the other hand it could also just be folks looking to say 'i have a commit in openstack'10:18
sshnaidmmarios, yeah, seems like that10:19
mariossshnaidm: usually i like to make them work more first. Like if its fixing a typo ... then 'challenge accepted' its like auto -1. you can easily find more typos in that file (assuming it's not just 5 line function in one file)10:19
mariossshnaidm: or maybe fix lots of typos/minor things across the whole repo ;)10:20
sshnaidmmarios, like "If you started..." :D10:20
mariossshnaidm: sometimes they don't even bother responding to that request ^^ so then you can just ignore them and they go away10:20
mariossshnaidm: yeah try it10:20
mariossshnaidm: it at least gives you some satisfaction and the result is it becomes more of a contribution ;)10:20
*** holser has quit IRC10:21
*** holser has joined #oooq10:28
*** ykarel is now known as ykarel|away10:31
*** soniya29 has quit IRC10:36
mariosreviews please when you next have time thank you "Refresh start_named_hashes after promotion to prevent false positive" https://review.rdoproject.org/r/#/c/24665/11:01
*** udesale has quit IRC11:01
*** saneax has joined #oooq11:18
*** saneax has quit IRC11:19
*** saneax has joined #oooq11:20
*** migi has joined #oooq11:21
*** saneax has quit IRC11:25
migiowalsh++11:29
*** soniya29 has joined #oooq11:36
mariosarxcruz: fyi cos you had the same question looks like the regex works at tempest white/black https://review.opendev.org/#/c/701016/10/config/general_config/featureset020.yml11:38
*** ykarel|away is now known as ykarel11:49
arxcruzmarios: you already told me that iirc11:52
mariosarxcruz: k11:59
*** amoralej is now known as amoralej|lunch12:05
*** rfolco has joined #oooq12:10
*** jmasud has quit IRC12:12
*** jmasud has joined #oooq12:13
*** sshnaidm is now known as sshnaidm|afk12:15
marioschkumar|rover: arxcruz: kopecmartin: do you folks recall what happens if black/white list clash? what wins? /me goes looking but maybe you already know12:17
kopecmartinthat's a good question, i don't know, I'd need to check too12:18
marioskopecmartin: ack thx12:18
kopecmartinmarios: i think that black list wins12:19
marioskopecmartin: cool thanks i hope so, can't quickly find but i can more easily just try it and see12:20
kopecmartinif we're talking about tempest parameters12:20
marioskopecmartin: yah tempest_test_whitelist and tempest_test_blacklist12:20
kopecmartinok, because it wouldn't make sense otherwise, the white list can contain regexes .. in case the regex is not very specific I might want to exclude some specific tests .. therefore i think that black list wins12:21
marioskopecmartin: sec i'll point to what i mean . in my case they both contain regex but the blacklist has a bigger one and i hope it wins. i'll work it out if its the other way around but sec i'll post the update and point12:24
kopecmartinmarios: https://stestr.readthedocs.io/en/latest/MANUAL.html#test-selection12:25
kopecmartin$ stestr run --black-regex 'slow_tests|bad_tests' ui\.interface12:25
kopecmartin'Here first we selected all tests which matches to ui\.interface, then we are dropping all test which matches slow_tests|bad_tests from the final list.'12:25
arxcruzmarios: iirc blacklist wins12:27
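The "blacklist wins" behaviour quoted from the stestr manual above (select first, then drop exclusions) can be sketched in a few lines of Python. This is only an illustration of the ordering, not stestr's or tempest's actual implementation; the function name `select_tests` and the sample test ids are made up for the example.

```python
import re


def select_tests(test_ids, whitelist_regex=None, blacklist_regex=None):
    """Keep ids matching the whitelist, then drop ids matching the blacklist.

    Hypothetical helper: it only mirrors the select-then-exclude order
    described in the stestr manual, it is not stestr code.
    """
    include = re.compile(whitelist_regex) if whitelist_regex else None
    exclude = re.compile(blacklist_regex) if blacklist_regex else None
    # Step 1: selection (no whitelist means "everything is selected").
    selected = [t for t in test_ids if include is None or include.search(t)]
    # Step 2: exclusion is applied last, so it wins on any clash.
    return [t for t in selected if exclude is None or not exclude.search(t)]


# Made-up test ids, shaped after the manual's example regexes.
tests = [
    "ui.interface.test_login",
    "ui.interface.slow_tests.test_render",
    "ui.interface.bad_tests.test_legacy",
    "api.test_status",
]
result = select_tests(tests, r"ui\.interface", r"slow_tests|bad_tests")
print(result)
```

Because the exclusion pass runs on the already selected list, a test matched by both regexes never survives, which is the outcome kopecmartin and arxcruz expect.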
*** holser has quit IRC12:29
*** holser has joined #oooq12:31
*** jpena is now known as jpena|lunch12:31
mariosarxcruz: thanks arxcruz12:33
mariosthank you for digging kopecmartin12:34
chkumar|roverowalsh, migi, Thank you for finding the issue, preparing the bug report :-)12:39
weshayzbr, how we doing w/ logs?12:58
weshayzbr, upstream unzipped.. 3rd party check and periodic zipped is what we're after12:59
zbrweshay: use gzip topic to see progress, https://review.opendev.org/#/q/topic:gzip+(status:open+OR+status:merged)12:59
zbrweshay: https://review.opendev.org/#/c/704738/ needs to be wf asap,13:01
zbrthat one brings back unzip to upstream, and not affecting rdo, at least on the clicks i did. better if you can double check13:02
*** rlandy has joined #oooq13:02
chkumar|roverweshay, zbr few of the jobs are timing out, having no logs collected13:03
chkumar|roverbasically fs02113:03
weshaychkumar|rover, yup... zbr not all the jobs were getting gzipped13:03
weshaythe log server was full13:03
weshaychkumar|rover, /me looks again13:03
chkumar|roverweshay, last night train promoted na13:04
zbri am now polishing the optimization on collection, which should bring its duration to ~50%.13:04
weshayu'@/home/zuul/workspace/.quickstart/config/release/tripleo-ci/CentOS-7/promotion-testing-hash-master.yml', u'@/home/zuul/src/opendev.org/openstack/tripleo-quickstart/config/general_config/featureset001.yml', u'@/home/zuul/src/opendev.org/openstack/tripleo-ci/toci-quickstart/config/testenv/ovb.yml', u'@/home/zuul/src/opendev.org/openstack/tripleo-ci/toci-quickstart/config/testenv/ovb-rdocloud.yml', u'@/home/zuul/workspace/logs/role-vars.yaml',13:06
weshayu'local_working_dir=/home/zuul/workspace/.quickstart', u'virthost=undercloud', u'tripleo_root=/home/zuul/src/opendev.org/openstack', u'working_dir=/home/zuul', u'@/home/zuul/src/opendev.org/openstack/tripleo-ci/toci-quickstart/config/collect-logs.yml', u'artcl_collect_dir=/home/zuul/workspace/logs', u'@/home/zuul/workspace/logs/zuul-variables.yaml', u'@/home/zuul/workspace/logs/hostvars-variables.yaml')13:06
weshaythat should zip it13:06
chkumar|roverDoes anyone know how role_networks got set in tripleo?13:07
chkumar|roverweshay, regarding server tempest tests failure like parallel migration or ssh issue due to missing of internal api in compute0 node13:08
chkumar|roverweshay, I will open a bug soon may be separate one13:09
migiowalsh: ^^13:09
chkumar|roverweshay, owalsh and migi were looking in to the issue13:09
weshayreally?13:09
owalshinternal api missing in ssh_known_hosts13:09
owalshweshay: ^^^13:09
chkumar|rovercomparing with https://b73800ba39ea3e53b9d3-9ff934c0eeb3c69295fc2c5d6afc2bc8.ssl.cf5.rackcdn.com/649418/12/check/tripleo-ci-centos-7-containers-multinode/2463fd9/logs/subnode-1/etc/hosts and https://b73800ba39ea3e53b9d3-9ff934c0eeb3c69295fc2c5d6afc2bc8.ssl.cf5.rackcdn.com/649418/12/check/tripleo-ci-centos-7-containers-multinode/2463fd9/logs/subnode-1/etc/ssh/ssh_known_hosts13:10
chkumar|roverand check bm logs13:10
weshaychkumar|rover, rock on13:10
chkumar|roverhttps://sf.hosted.upshift.rdu2.redhat.com/logs/36/189436/2/check/periodic-tripleo-ci-centos-7-bm_envD-1ctlr_2comp-featureset021-master/19aef03/logs/overcloud-novacompute-0/etc/hosts and https://sf.hosted.upshift.rdu2.redhat.com/logs/36/189436/2/check/periodic-tripleo-ci-centos-7-bm_envD-1ctlr_2comp-featureset021-master/19aef03/logs/overcloud-novacompute-0/etc/ssh/ssh_known_hosts13:10
weshaymigi++, chkumar|rover++13:10
owalshI suspect it's a Camelcase vs lowercase issue for network names - running a quick test in https://review.opendev.org/#/c/704781/1/tripleo_ansible/roles/tripleo_ssh_known_hosts/tasks/main.yml13:11
chkumar|roverit is for ssh issue and there are other issues there also13:11
*** amoralej|lunch is now known as amoralej13:11
chkumar|roverweshay, we can see those error in fs021 job also http://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-master/7c6d1fc/logs/overcloud-novacompute-0/var/log/containers/nova/nova-compute.log13:14
chkumar|roverowalsh, testing this patch in rdoside13:15
owalshweshay, chkumar|rover: could we add something to catch this in CI? E.g try sshing from the nova_compute container to the same hosts internal_api ip13:16
chkumar|roverowalsh, https://review.rdoproject.org/r/2472613:18
chkumar|roverowalsh, you mean from podman side where we collect data related to containers that time?13:19
*** holser has quit IRC13:22
owalshchkumar|rover: not sure where yet... need to look into it13:23
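One cheap way to catch the missing internal_api entries in CI, short of actually sshing between containers, would be to scan the collected ssh_known_hosts file for the names that are expected to be there. The sketch below is an assumption about how such a check could look, not an existing tripleo validation; the helper names, the sample file content and the host/IP values are all hypothetical.

```python
def known_host_names(known_hosts_text):
    """Return the plain (non-hashed) host names and IPs listed in known_hosts text."""
    names = set()
    for line in known_hosts_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split()
        if fields[0].startswith("@"):
            # Optional marker such as @cert-authority precedes the host list.
            fields = fields[1:]
        if not fields or fields[0].startswith("|"):
            # Hashed entries cannot be mapped back to a name, skip them.
            continue
        names.update(fields[0].split(","))
    return names


def missing_entries(known_hosts_text, required):
    """List the required names or IPs that have no known_hosts entry."""
    return sorted(set(required) - known_host_names(known_hosts_text))


# Hypothetical sample: only the ctlplane names made it into the file.
sample = """\
# generated file (made-up content)
192.0.2.10,overcloud-novacompute-0.ctlplane,overcloud-novacompute-0 ssh-rsa AAAAB3Nza
"""
required = [
    "192.0.2.10",
    "overcloud-novacompute-0.ctlplane",
    "172.16.2.5",
    "overcloud-novacompute-0.internalapi",
]
missing = missing_entries(sample, required)
print(missing)
```

A non-empty result would flag the job before tempest hits "Host key verification failed". Bracketed `[host]:port` entries are not handled in this sketch.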
rlandyweshay: do you want https://code.engineering.redhat.com/gerrit/#/c/190564 merged or was it a once off?13:24
*** skramaja has quit IRC13:24
tosky*cough* downstream13:28
*** jpena|lunch is now known as jpena13:29
chkumar|roverweshay, rlandy sshnaidm|afk https://review.opendev.org/#/q/topic:cellv2+(status:open+OR+status:merged) will be good to go today13:29
chkumar|roverwe need a promotion version of that job13:29
*** ratailor has quit IRC13:32
dtantsurheya, folks! does quickstart currently work on RHEL/CentOS 8 as both virthost and VM OS?13:32
*** holser has joined #oooq13:35
*** ykarel is now known as ykarel|away13:36
migiowalsh: weshay: chkumar|rover: I am thinking aloud here, is the health-check of container the right place to add such test - it's deployment issue imo13:37
migikind of connectivity between containers that have services13:38
owalshmigi: don't think that's the right place to check the config isn't junk13:38
chkumar|roverweshay, heading home see you in some hours13:39
weshaychkumar|rover++++13:40
weshaychkumar|rover, thank you sir13:40
migiowalsh: well it's not the config imo, it's the connectivity between the services that are broken13:41
*** soniya29 has quit IRC13:42
migithe config is the cause of problem13:42
owalshyup13:42
migibut maybe other time it's different cause of connectivity issue13:42
dtantsurhttps://opendev.org/openstack/tripleo-quickstart/commit/b8b85b348da761906a28e74995a8c4ccb92785ae gives me some hope13:42
owalshmigi: don't see how it could be a container healthcheck - 1 sshd container running, 2 potential client containers - which runs the healthcheck?13:43
*** ratailor has joined #oooq13:44
*** saneax has joined #oooq13:44
migifrom each container that uses external api services health check to ensure that service IP(s) is/are accessible and ssh-able?13:45
migiran from overcloud where container is running13:45
weshaypanda, want to go early?13:46
owalshmigi: client is a compute node nova_compute or nova_libvirt container, service is another compute node nova_migration_target container...13:46
migiowalsh: nova_compute one that speaks to another nova_compute, but yeah maybe it's just this particular case13:47
owalshnope, nova_migration_target runs sshd13:47
*** ratailor has quit IRC13:48
owalshnova_compute or nova_libvirt ssh to it13:48
migiowalsh: whatever IP is specified as internalapi from the nova_compute client13:49
*** ratailor has joined #oooq13:50
weshaypanda, paging panda13:51
owalshmigi: don't failure to ssh back to the same host should fail a healthcheck. migration would be to a different host and we can't realistically ssh to N compute nodes in a healthcheck13:55
owalshdon't think13:55
migiowalsh: ok that makes sense13:55
migiowalsh: config check then13:56
owalshmigi: also last time I looked failing healthchecks were ignored during deployment13:56
pandaweshay: joining13:56
owalshmigi: a validation task might be the correct place, but do we run any validations in CI jobs?13:57
*** ratailor has quit IRC13:59
pandapanda: sorry need a couple minutes more ...13:59
*** ykarel|away is now known as ykarel14:02
migiowalsh: dunno14:02
migiowalsh: I saw some in the past, but maybe they are ignored as you say14:02
arxcruzmarios: the fs020 was timing out with how much time ?14:03
mariosarxcruz: fs20 had tempest fail on last run14:06
mariosarxcruz:         * http://logs.rdoproject.org/39/24339/8/check/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-master/3364398/job-output.txt14:06
marios                * 2020-01-28 01:37:50.715515 | primary | TASK [os_tempest : Execute tempest tests] **************************************14:06
marios2020-01-28 01:37:50.739220 | primary | Tuesday 28 January 2020  01:37:50 +0000 (0:00:00.090)       0:02:38.544 *******14:06
mariosarxcruz: didn't check what/where14:06
arxcruzshit, even changing the bmc_flavor still have 8 cpus the controller14:08
arxcruzrlandy: how can i change the flavor of a ovb job?14:09
arxcruzi tried bmc_flavor and baremetal_flavor, i see it change from medium to xlarge, but still have 8 cpus14:09
rlandyarxcruz: yep - there are multiple nodes with different flavors14:09
arxcruzrlandy: any node with a ci.m1.xlarge flavor ?14:10
rlandythe undercloud is the largest node we use so far14:10
rlandychecking what's the latest14:10
rlandyhttps://github.com/rdo-infra/review.rdoproject.org-config/blob/master/roles/ovb-manage/defaults/main.yml#L4014:12
rlandyarxcruz: ^^14:12
rlandyif it's not overridden elsewhere14:12
rlandyarxcruz: which node do you need a larger flavor for?14:13
arxcruzrlandy: i've override these and vm created still with 8 cpu and 16gb ram14:13
rlandyarxcruz: we need to clarify which vm you want to change ...14:14
rlandythe undercloud comes from the zuul nodes14:14
rlandythe overcloud nodes you can change the settings14:14
rlandyie: baremetal_flavor: ci.m1.xlarge14:14
rlandyor the bmc flavor14:14
arxcruzrlandy: https://review.rdoproject.org/r/#/c/24709/14:14
arxcruzI can see setting ci.m1.xlarge14:15
arxcruzbut it doesn't seems to be working14:15
arxcruzrlandy: https://logs.rdoproject.org/09/24709/7/check/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-master/c717e89/job-output.txt14:15
chkumar|roverarxcruz, it was a config change na?14:17
rlandy2020-01-29 09:03:16.544746 | primary |       "disk": 160,14:17
rlandy2020-01-29 09:03:16.544768 | primary |       "arch": "x86_64",14:17
rlandy2020-01-29 09:03:16.544787 | primary |       "cpu": 8,14:17
rlandy2020-01-29 09:03:16.544809 | primary |       "pm_user": "admin"14:17
rlandy2020-01-29 09:03:16.544822 | primary |     },14:17
chkumar|roverwithout merging it does not work14:17
arxcruzso xlarge still uses 8 cpus ?14:18
arxcruzrlandy: i can see the amount of memory is higher14:18
arxcruzi want to increase the cpu14:18
rlandyarxcruz: well - there are two parts here ...14:18
rlandythe node that gets created14:18
rlandyand the node that is deployed14:18
arxcruzi want to increase cpu on compute and controller nodes14:19
owalshchkumar|rover, migi: https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_287/704781/1/check/tripleo-ci-centos-7-containers-multinode/287e000/logs/subnode-1/etc/ssh/ssh_known_hosts14:19
rlandyarxcruz: bmc node at medium should be more than big enough14:19
rlandyso don't worry about that14:19
owalshchkumar|rover, migi: so it's not that they are using different cases... but for some reason the ips/hostnames for these networks were not added14:19
rlandyarxcruz: the only one I would guess you want to change is the baremetal flavor14:20
rlandytake a look at ... 2020-01-29 09:03:05.743145 | TASK [ovb-manage : Build nodes.json file to be used as instackenv.json]14:20
*** TrevorV has joined #oooq14:21
rlandyconfirming flavor attributes14:21
rlandyarxcruz: so a general baremetal instance - uses ci.m1.large ... 4vcpus 8GB RAM 80 GB disk ...14:24
rlandycomparing with your instackenv.json14:24
rlandy  "cpu": 8,   "disk": 160, "memory": 16384,14:24
rlandyarxcruz: ^^ so you must have sized up14:25
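The comparison rlandy does by eye here (ci.m1.large at 4 vcpus / 8GB RAM / 80 GB disk versus the values in the generated nodes file) can be written as a small check. This is a sketch under assumptions: the top-level `"nodes"` list layout and memory being expressed in MB are assumed, and `flavor_diff` is a hypothetical helper, not part of ovb-manage.

```python
import json

# ci.m1.large as quoted in the channel: 4 vcpus, 8GB RAM, 80 GB disk.
# Memory is written in MB here, on the assumption that the nodes file uses MB.
EXPECTED_LARGE = {"cpu": 4, "memory": 8192, "disk": 80}


def flavor_diff(node, expected):
    """Return {field: (expected, actual)} for every field that differs."""
    diff = {}
    for key, want in expected.items():
        got = int(node.get(key, 0))  # values may be stored as strings
        if got != want:
            diff[key] = (want, got)
    return diff


# Node entry using the values pasted from the job output; the surrounding
# {"nodes": [...]} structure is an assumption about the file layout.
nodes_json = (
    '{"nodes": [{"cpu": 8, "disk": 160, "memory": 16384, '
    '"arch": "x86_64", "pm_user": "admin"}]}'
)
nodes = json.loads(nodes_json)["nodes"]
diffs = [flavor_diff(node, EXPECTED_LARGE) for node in nodes]
print(diffs)
```

Every field comes out doubled relative to ci.m1.large, which matches rlandy's conclusion that the baremetal flavor override did take effect.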
arxcruzrlandy: yeah, it seems doesn't matter for tempest :/14:25
rlandyarxcruz: drop the bmc and undercloud flavors back ... they won't help you14:25
arxcruzi was betting it would be faster...14:25
rlandyarxcruz: I guess if the original node was big enough, so no diff14:25
rlandyarxcruz: best suggestion would be to deploy the same on your own tenant14:26
rlandyget on the nodes and try14:26
rlandyarxcruz: you can see the difference on real bm14:26
rlandythere the nodes are HUGE14:26
owalshmigi, chkumar|rover: ah, it's a jinja scoping issue - https://github.com/pallets/jinja/issues/64114:26
rlandytempest is not speedy either14:26
owalshchkumar|rover: do you have an LP for this yet? I'll push a fix14:27
chkumar|roverowalsh, getting lp, give me 10 mins14:27
owalshack, thanks14:27
*** holser__ has joined #oooq14:29
*** holser__ has quit IRC14:29
*** holser has quit IRC14:29
*** holser has joined #oooq14:30
arxcruzrlandy: thanks for the tips14:31
arxcruzi need to go buy a gift to my daughter, it's her birthday today, brb14:31
rlandyarxcruz: happy birthday!!14:32
rfolcoarxcruz, happy bday !! :)14:32
mariosarxcruz: all the best :D mm cake14:33
*** jmasud has quit IRC14:39
chkumar|rovermigi, owalsh https://bugs.launchpad.net/tripleo/+bug/186129614:40
openstackLaunchpad bug 1861296 in tripleo "multiple nova server related test failure due to Host key verification failed on compute node" [Critical,Confirmed]14:40
*** saneax has quit IRC14:43
chkumar|roverweshay, ^^14:44
weshaychkumar|rover, NICE14:45
chkumar|roverweshay, https://review.opendev.org/#/c/703953/3 to fix fs039 master error14:47
chkumar|roverweshay, almost all cards updated for prodchain call14:48
chkumar|roverrlandy, around?14:54
chkumar|roverrlandy, what to do with podman downstream work?14:54
chkumar|roverdo we want to merge and add the job?14:54
*** Trevor_V has joined #oooq14:55
rlandychkumar|rover: we should merge the tq patch first14:55
rlandychkumar|rover: once that merges, we can merge the downstream work - but not as is14:56
rlandyI need to fix the projects file14:56
rlandyso that we add the job to the periodic pipeline14:56
rlandychkumar|rover: so the job will run once a day14:56
rlandyis that enough?14:56
*** sshnaidm|afk is now known as sshnaidm14:56
*** Trevor__V has joined #oooq14:57
rlandychkumar|rover: will make those edits now - give me a few14:57
chkumar|roverrlandy, yes14:57
chkumar|roverrlandy, https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/config-errors what about these one14:57
chkumar|roverrlandy, we may need to work with delivery team on how to gate podman from rhel stream14:57
rlandychkumar|rover: so the config-errors are a diff problem14:58
rlandyone at a time14:58
*** TrevorV has quit IRC14:58
rlandythe config errors come from the fact that we include zuul-jobs jobs14:58
rlandyper inheritance14:58
rlandyone sec - updating review14:59
chkumar|roverok14:59
*** Trevor_V has quit IRC15:00
rlandymarios: just before you go ... left some notes in the osp containers card wrt osp-17 builds - can discuss at the meeting tomorrow15:01
weshayzbr, can you join #sf-ops internal please15:03
weshayrlandy, migi fyi.. first pass http://tripleo-cockpit.usersys.redhat.com/d/Wj-ao_sWz/full-component-pipeline?orgId=115:03
mariosrlandy: i saw those thanks, and i checked https://code.engineering.redhat.com/gerrit/#/q/topic:17-standup+status:open15:03
mariosrlandy: thanks for that15:04
mariosrlandy: i moved us to hack md finally had enough of taiga and markdown15:04
rlandymarios++15:04
rlandywe outran taiga usage15:04
migiweshay++++++15:04
mariosrlandy: heh, just copy paste. but honestly the bug with the removal of http from the shell stuff was really really annoying it was the final straw15:05
rlandychkumar|rover: https://code.engineering.redhat.com/gerrit/#/c/190384/ - pls review15:07
rlandythat should be mergeable after tq patch merges15:07
rlandymarios: yep - markdown was leading to personal breakdown15:08
mariosrlandy: just to demonstrate the extent of the issue, and warning language follows you may want to ensure young children are not watching your screen, from my notes today15:09
marios*=* 15:50:57 *=*=*= "fucking move to hackmd...  "15:09
marios:D15:09
mariosweshay: hr violation! ^15:09
rlandychkumar|rover: before we work on gate podman from rhel stream, we should look at roles - otherwise we will have a ton of duplication = debug/maintenance nightmare15:09
chkumar|roverrlandy, yes15:10
rlandylol15:10
chkumar|roverrlandy, small typo rest looks ok15:11
*** TrevorV has joined #oooq15:12
rlandymarios: with kolla-build.conf, that has to be diff for downstream15:12
* rlandy will comment on hackmd15:12
mariosrlandy: different registry?15:12
mariosrlandy: cool thank you15:12
rlandyyep15:12
mariosrlandy: just update it15:12
mariosrlandy: or comment if you aren't sure needs more digging15:12
rlandytag15:12
rlandyetc.15:12
rlandynot with my changes, I had any success or anything ;(15:13
*** chem has quit IRC15:13
mariosrlandy: k for tag i just went to15:13
marios* https://trunk.registry.rdoproject.org:8443/oapi/v1/namespaces/tripleomaster/imagestreamtags/15:13
mariosrlandy: and found a recent one15:13
rlandyosp containers are not there15:13
mariosrlandy: but anyway it seems we are missing lots of patches15:13
chkumar|roverrfolco, marios etcd is available now15:13
rlandy?15:13
mariosrlandy: according to the comments15:13
chkumar|roverin centos 815:13
rlandychkumar|rover: looking15:13
mariosrlandy: k, i probably looked for         *         "name": "rhel-binary-base:f2d10ed08cd893efa29d4d2610f5abcc4b0682b1_73e77d2b",15:14
marios        *         "name": "rhel-binary-base:cbfcf18d489feeba5e78e0f8b2fd3eaebec3f504_b7487084",15:14
mariosthat one i used i think15:14
mariosrlandy: but fine anyway please go ahead15:14
rlandyrhel-binary for rdo?15:14
mariosthanks chkumar|rover15:14
rfolcochkumar|rover, thx, anyone else?15:14
*** chem has joined #oooq15:15
*** Trevor__V has quit IRC15:15
chkumar|roverrfolco, uwsgi also15:16
chkumar|roverwe are waiting on gfidente for ceph repos15:16
rlandychkumar|rover: we have a linter issue: https://sf.hosted.upshift.rdu2.redhat.com/logs/84/190384/18/check/tox-linters/811addf/job-output.txt15:17
rfolcochkumar|rover, you know if shellinabox is really required ?15:17
rfolcofor ironic-conductor15:17
rfolcoand the last one is collectd-ping15:18
chkumar|roverrfolco, not required15:18
chkumar|roverrfolco, https://opendev.org/openstack/tripleo-common/src/branch/master/container-images/tripleo_kolla_template_overrides.j2#L17615:19
rfolcook so maybe only ceph15:19
chkumar|roverrfolco, for collectd mathias runge is the person15:19
rlandyzuul_log_path should include all that15:20
rlandy zuul_log_url: "https://sf.hosted.upshift.rdu2.redhat.com/logs"15:22
chkumar|roverrlandy, will fix that tomorrow, if no urgency15:25
rlandyno rush15:26
chkumar|roverweshay, my home network not working properly please take care of escalation call15:35
*** Trevor_V has joined #oooq15:38
*** dpawlik has joined #oooq15:41
dpawlikHello15:41
zbro/15:41
*** TrevorV has quit IRC15:41
dpawlikI'm a member of RDO team. We would like to switch images server (images.rdoproject.org) to new provider (name will be still same, but during the migration, you can run jobs using other address: images-vexxhost.rdoproject.org)15:44
dpawlikquestion is: when can we do that15:45
dpawlikI saw in few points, that you are using in CI scripts hardcoded value (https://review.rdoproject.org/r/#/c/24725/) and Im trying to change that to be set as a variable15:45
dpawlikbecause in the meantime of switching servers, there can be situation that some hosts can upload image(old server),some download(new server) because of DNS propagation15:47
rlandydpawlik: the images server is referenced in a lot of places15:48
rlandypromotion server, release files etc.15:48
rlandyif you are asking for a suitable date when you can switch the server, the ruck, rovers can advise you best15:49
rlandychkumar|rover and weshay15:49
rlandygetting references15:49
weshaydpawlik, what did you find out re: the rhel protected image server?15:50
zbras the server is in a new geo location, it would be worth testing the effects of the change before doing the DNS flip15:50
dpawlikcc jpena ^^15:50
chkumar|roverweshay, https://review.rdoproject.org/r/#/c/24726/15:50
dpawlikand cc tristanC15:50
dpawlikweshay: AFAIK rhel image server is a separated server15:52
weshayit is.. but it's the same subject15:53
dpawlikweshay, could you be more verbose?15:55
*** dmsimard has joined #oooq15:58
dmsimarddpawlik: o/ can I help15:58
dmsimard(I don't have backlog history for this channel)15:58
dpawliklet me paste you on priv dmsimard15:59
weshaydpawlik, before moving the images.rdoproject.. my expectation is there is at least a plan for the RHEL protected server as well16:00
weshaydpawlik, until I see that.... I'm nacking16:00
dmsimardweshay: the rhel protected server ?16:00
weshaythe.. server zoli put up16:01
* weshay forgets the name you guys used16:01
*** nhicher has joined #oooq16:02
jpenaweshay: is that the rcm share ?16:02
* jpena tries to find16:02
chkumar|roversee ya people tomorrow16:04
chkumar|roverhave a nice day ahead16:04
*** chkumar|rover is now known as raukadah16:04
jpenadmsimard, dpawlik: I think I found it. That's https://github.com/rdo-infra/rdo-infra-playbooks/blob/master/doc/source/rcm-share_about.rst16:04
weshayjpena, yup rcm16:04
dmsimardjpena: what tenant is that in ? I don't see it16:05
jpenaSo... this is an interesting one, I had forgotten about it. It seems to be located in another tenant, which we don't have access to (just zoli it seems)16:05
dmsimardT_T16:06
dpawlikxD16:06
jpenalet's sync about it internally, we'll need some more info about the server16:07
dmsimardyup16:07
weshay:)16:07
weshaythanks guys!!16:08
dpawlikweshay, so I will come back with the topic later ;)16:08
weshaydpawlik, sounds fine.. thank you :)16:08
mariosreviews please when you next have time thank you "Refresh start_named_hashes after promotion to prevent false positive" https://review.rdoproject.org/r/#/c/24665/16:08
weshayzbr, can I borrow you for 5 min?  re: logs16:09
zbrsure16:09
*** ykarel is now known as ykarel|away16:09
arxcruzback16:09
weshayzbr, thanks.. /me sends meet link16:10
weshayzbr, https://meet.google.com/eqr-rkpn-upt?authuser=116:10
weshayarxcruz, need you for a sec too :)16:14
weshayre: logging16:14
weshayhttps://meet.google.com/eqr-rkpn-upt?authuser=116:15
arxcruzweshay: logging16:16
weshayzbr, https://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/tasks/publish.yml#L81-L10816:17
marioskopecmartin: raukadah: arxcruz: rlandy: sshnaidm: please check update in https://review.opendev.org/#/c/701016/ fs20 split when you next have some time thanks. I made it only fs1 per discussions.16:19
rlandyk16:20
arxcruzmarios: sure, i'm doing some tests, and i'll update as soon as I get it, in like 3 hours16:20
arxcruzprobably tomorrow morning i'll do it16:20
mariosarxcruz: ack whenever you next have reviews time thank you16:20
sshnaidmmarios, I think you want to do it only for periodic fs001, right?16:21
marioshttps://review.rdoproject.org/r/#/c/24665/ reviews please when you next have time thank you "Refresh start_named_hashes after promotion to prevent false positive"16:21
mariossshnaidm: yes i updated https://review.opendev.org/#/c/701016/ I made it only fs1 per discussions.16:21
mariossshnaidm: i mean fs1 and fs20 split removed all the fs10 stuffs16:22
sshnaidmmarios, would be great to see list of tests from old 020 and from new jobs16:22
mariossshnaidm: and needed a new depends-on for fs1 timeout bump please check there too when you have time for reviews16:22
mariossshnaidm: there is from the old one in the taiga16:22
mariossshnaidm: and i attached it in https://tree.taiga.io/project/tripleo-ci-board/task/138316:22
mariossshnaidm: and also your script for generating the ordered list16:22
mariossshnaidm: we can re-run once we get green one on test review https://review.rdoproject.org/r/#/c/2433916:23
sshnaidmmarios, is there periodic fs001 running with this patch?16:23
mariossshnaidm: on test review https://review.rdoproject.org/r/#/c/2433916:23
weshayzbr, one more odd thing..16:23
weshayhttp://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-master/a642ca9/logs/quickstart_files/16:23
sshnaidmmarios, ack, thanks16:23
mariossshnaidm: all the things are linked in taiga16:23
weshaydoesn't have the log file sizes in it16:23
mariossshnaidm: welcome thanks for checking16:23
sshnaidmmarios, cool16:23
weshayzbr, where http://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-scenario004-standalone-master/b14b570/logs/quickstart_files/log-size.txt16:24
weshaydoes16:24
weshayI think again an ovb thing or setting16:24
zbrlower prio, let me do them one by one16:24
zbrmaybe someone can explain to me why there is a "not artcl_collect|bool" on publish task from main..16:26
weshayzbr, aye.. thanks16:26
zbrit seems that collect and publish are exclusive16:27
sshnaidmweshay, I see downstream dashboard is doing well now16:27
weshaysshnaidm,  :)16:28
weshaysshnaidm++16:28
weshaysshnaidm, really nice work there!!16:28
weshaymakes me happy16:28
weshayrlandy, baremetal jobs visible \0/16:28
rlandyweshay: yay!!16:29
rlandyI see16:29
sshnaidmcharts and dashboard always make management happy16:29
rlandysshnaidm: lol - you forgot spreadsheets16:29
sshnaidmrlandy, oh, this is the hardcore16:30
rlandyweshay: do you want to merge the unpinning of rhos-15? https://code.engineering.redhat.com/gerrit/#/c/190564/16:30
zbri am not sure where the move of tempest report should be done, in collect or in publish.16:31
weshayrlandy, I think it may be a good test use case for doing so... osp-15 is very low impact / priority16:31
rlandyweshay: fine - merging16:31
weshayrlandy, the other interesting bit... is that the issue w/ the osp job was real16:31
rlandymeaning?16:31
weshayrlandy, /me wonders if we could run fs010 containers-multinode16:31
zbri am inclined to believe that publish should not do any "post processing" like moving tempest files16:31
weshayrlandy, meaning changing the pin will have no real consequence16:32
weshayso let's do it..16:32
weshayzbr++16:32
rlandyit's done16:32
rlandyyou want fs10 downstream periodic?16:32
weshayrlandy, rock.. I'll put up a multinode job there and see if it recreates the overcloud deploy issue16:32
weshayrlandy, ya.. I'll have a go at it16:32
rlandysure16:32
weshayrlandy, no need for to ruck / rover16:33
raukadahweshay, rlandy may be instead of fs10, I would suggest fs00116:33
weshayuntil I'm stuck again16:33
* weshay runs16:33
rlandyyou may hit a inheritance issue - just ping if you dp16:33
weshayraukadah, so.. there is that16:33
rlandydo16:33
weshaybut.. let's see if multinode gets the job done first16:33
weshayraukadah, but that is what I'm wondering... fs001 vs. fs01016:33
weshayto discover terrible overcloud deployment issues16:34
raukadahfs010 more stable but fs001 has given us more hidden issues16:34
raukadahbetter run both and see16:34
*** marios is now known as marios|out16:45
raukadahzbr, weshay, rlandy https://review.rdoproject.org/zuul/builds?result=TIMED_OUT%20 timeout is growing please have a look17:04
rlandyoh dear17:04
rlandyok17:05
rlandywill look in a bit17:05
weshayneed ara17:08
weshayrlandy, don't worry17:08
rlandyweshay: k - np17:15
*** dtantsur is now known as dtantsur|afk17:19
rlandyweshay: zuul quickstart works great17:21
rlandyjust need to update git17:21
weshayrlandy, ah .. very cool17:21
rlandythink it's a worthwhile exercise17:21
rlandyI'm going to run through it in training17:22
rlandyit's setup on my hardware box17:22
*** bogdando has quit IRC17:23
*** marios|out has quit IRC17:24
raukadahrlandy, zbr weshay panda rfolco more python standards https://github.com/rednafi/py-sanity17:25
*** jmasud has joined #oooq17:25
raukadahzbr, we can enforce isort also with black changes17:25
zbrnot yet. there is a bug they need to fix17:25
zbralso we need to pace it, first let's get used to black17:26
zbrisort bug prevented pip from adopting black17:26
raukadahwe have lots of stuff in town pyproject.toml poetry and many more17:27
*** holser has quit IRC17:29
*** jmasud has quit IRC17:30
*** amoralej is now known as amoralej|off17:34
*** jmasud has joined #oooq17:34
*** jfrancoa has quit IRC17:38
*** holser has joined #oooq17:38
*** jmasud has quit IRC17:41
*** jmasud has joined #oooq17:46
*** jmasud has quit IRC17:52
*** jfrancoa has joined #oooq17:53
*** jmasud has joined #oooq17:57
*** jpena is now known as jpena|off17:58
raukadahrlandy, https://review.opendev.org/#/c/704583/ sslverify patch merged now, I will setup a call tomorrow for next course of action on downstream, weshay do you want to join the party?18:02
raukadahregarding podman18:02
*** derekh has quit IRC18:03
raukadahrlandy, invite sent18:05
raukadahplease modify based on availability18:05
*** jmasud has quit IRC18:06
weshayraukadah, was that related to a bug?18:06
raukadahweshay, nope, related to podman downstream gating with all ideas captured on prod chain council board18:07
weshayah.. thought it may be that18:09
raukadahweshay, will discuss one bug also18:10
raukadahwill file tomorrow18:10
raukadahand debug18:10
weshayraukadah, k.. claim a 1/2 hour my cal18:10
weshayplease18:10
weshayraukadah, can be early18:10
raukadahweshay, feel free to change that18:10
raukadahI just need 30 mins18:10
raukadahmade it to 30 mins18:12
*** holser has quit IRC18:15
weshaythanks18:15
*** jmasud has joined #oooq18:18
rlandyraukadah: sure18:19
*** tosky has quit IRC18:22
*** jmasud has quit IRC18:27
*** jmasud has joined #oooq18:30
zbrwho has time for a quick explanation about... re-collection?18:31
zbrmy https://review.opendev.org/#/c/703586/ produced the desired output https://review.opendev.org/#/c/703586/18:31
rlandyzbr: unrelated to ^^ ... I see you commented on this error in other reports ... is there  a way around this error: https://sf.hosted.upshift.rdu2.redhat.com/logs/84/190384/19/check/tox-linters/133f8da/job-output.txt18:32
rlandysafe to just ignore?18:33
zbrrlandy: not for a long time, i can fix, too complex to explain, but you can see the CR18:34
zbrrlandy: weird, unable to reproduce locally on master18:36
rlandyhttps://code.engineering.redhat.com/gerrit/#/c/190384/20/playbooks/podman/run.yaml18:37
rlandycode is legit - can we just add something to ignore that?18:37
rlandyit won't resolve18:37
rlandyw/o the dependent job18:38
zbrrlandy: give me few minutes, i will try to fix it18:38
rlandyzbr: thanks - no rush18:38
zbrit does reproduce with that patch18:38
rlandyack  three times so far18:38
zbri know how to fix it18:38
rlandyawesome18:41
*** jmasud has quit IRC18:41
zbrmainly the linter fails to find zuul_return ansible module. yep you can ignore but is not a good practice18:42
zbrmaking it find, is a little bit tricky but doable.18:42
zbrsomething like here https://opendev.org/opendev/base-jobs/src/branch/master/tox.ini#L2218:42
*** jmasud has joined #oooq18:44
*** jmasud has quit IRC18:49
zbrweshay: IMHO updates/upgrades should never gate.18:52
zbrthis is not an acceptable failure rate18:53
zbrhttps://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates&pipeline=gate18:53
weshayzbr, well.. luckily I also have opinions :)18:53
weshayzbr, I respectfully disagree w/ you :)18:54
zbrthe never ending negotiation about what goes where...18:54
weshayya18:54
weshaywell that is true18:54
zbron the other hand, happy that i do not see more tripleo on https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates&pipeline=gate18:56
zbrwrong link, correct https://zuul.opendev.org/t/openstack/builds?pipeline=gate&result=failure#18:56
zbrwho fails at the gate, we need a big shaming dashboard for that :D18:57
zbrrlandy: ignore it for the moment. watch https://github.com/ansible/ansible-lint/issues/372#issuecomment-579903645 for details18:59
*** tesseract has quit IRC18:59
zbris not a bug in the linter, is a missing module, one that is not easy to make available.18:59
rlandyok19:02
weshayzbr, in other news https://review.rdoproject.org/zuul/stream/fb2e24e897d54373b9885c9f41778821?logfile=console.log19:04
weshaycheck the time stamp on collect logs there19:05
weshayle yikes19:05
weshayrlandy, qq.. doesn't look like we have an internal-base job for multinode.. in your humble opinion which would be better..19:08
weshaymaybe easier19:08
weshaystanding up ovb or multinode19:08
rlandyweshay: multinode19:08
rlandyeasier on the resources19:08
weshayand psi still is flaky I think19:08
rlandyworks fairly well for us19:09
rlandyas opposed to what other rock solid cloud???19:09
weshayheh19:09
weshayrlandy, seen issues w/ ovb19:10
weshayin psi19:10
weshayrlandy, hence https://code.engineering.redhat.com/gerrit/#/c/185863/719:10
rlandyyeah - so multinode may be easier to go with19:10
*** holser has joined #oooq19:10
rlandyweshay: so here's the funny thing19:11
rlandyhttp://git.app.eng.bos.redhat.com/git/openstack/tripleo-ci-internal-config.git/tree/zuul.d/jobs.yaml#n25 for example19:12
rlandyparents off19:12
rlandyhttps://github.com/openstack/tripleo-ci/blob/master/zuul.d/base.yaml#L719:12
rlandywhich parents off multinode19:12
rlandyso if you look at standalone19:12
rlandyand make the same modifications on an internal job parenting off https://github.com/openstack/tripleo-ci/blob/master/zuul.d/base.yaml#L6819:13
weshayah.. right19:13
weshaythat's right.19:13
rlandyit should be ok19:13
weshayso I just need to define the node19:13
rlandybe careful of the nodeset19:13
weshayor look if it already is19:13
rlandyyou need one that supports multinode19:13
*** jmasud has joined #oooq19:15
rlandyzbr: um .. how do I find which rule number this error is to skip?19:17
*** jmasud has quit IRC19:20
zbrrlandy: you cannot use a skip rule on that.19:24
*** jmasud has joined #oooq19:24
zbris not a rule, is an internal failure, at this moment.19:24
weshayrfolco, ping19:24
rfolcoweshay, o/19:24
rlandyk - suggestion on how to ignore this?19:24
weshayrfolco, you have 5min?19:25
rfolcoweshay, yep19:25
weshayhttps://meet.google.com/erg-zoyg-jnz?authuser=119:25
rlandytrying the tag19:28
rlandy2020-01-29 19:32:27.543362 | upstream-centos-7 |   linters: commands succeeded19:32
rlandy2020-01-29 19:32:27.543393 | upstream-centos-7 |   congratulations :)19:32
rlandywoohoo19:32
rlandyzbr: ^^ thanks19:33
rlandyraukadah: yay ... merging podman jobs19:34
weshayrfolco, https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/roles/promote-hash/tasks/get-hash.yaml19:34
*** jmasud has quit IRC19:48
weshayrlandy, have you ever seen a bm job get a node failure?19:59
weshayrlandy, https://code.engineering.redhat.com/gerrit/#/c/189436/19:59
rlandyweshay: yes for the nodepool node20:00
weshayoh.. right.. it runs as the slave20:00
weshaydam them20:00
rlandyweshay: yeah ... we still rely on a nodepool 'jumpbox'20:00
weshayrlandy, you know we should start socializing that as an option for infrared folks20:01
weshayto use all their hardware.. but move to zuul20:01
weshaymeh20:01
rlandyweshay: yeah - that was suggested on the call today, no?20:02
rlandyI'm all for it20:02
*** holser has quit IRC20:02
rlandyremember when we tried to update our tests still using jenkins? shoot me20:02
weshayrlandy, looks like it's nothing we did20:57
* rlandy is not so sure20:59
rlandythere is something up with that floating ip allocation20:59
weshayomg.. I just learned something new about tmux21:03
weshayand it's the feature I've been waiting for..21:03
weshay\0/21:03
rlandyweshay: that's almost as good as my getting by the linter error earlier21:04
weshayrlandy, https://review.opendev.org/#/c/680571/21:07
rlandyweshay: let's give that a run on real bm21:09
weshaygithub sucks21:25
weshayrfolco, help me find this "Pull ppc64le tagged container images from trunk.registry.rdoproject.org registry" task21:28
weshayrfolco, nvrmind21:30
weshaycontainers-push..21:30
weshaynot promoter21:30
weshayduh21:30
*** apetrich has quit IRC21:43
*** jmasud has joined #oooq21:55
*** jmasud has quit IRC22:06
*** jtomasek has quit IRC22:06
*** jmasud has joined #oooq22:09
*** tosky has joined #oooq22:10
*** holser has joined #oooq22:12
*** jmasud has quit IRC22:16
*** jfrancoa has quit IRC22:30
*** Trevor_V has quit IRC22:31
*** rfolco has quit IRC22:33
*** rlandy is now known as rlandy|bbl23:14
owalshweshay: hey, ok if i just change the depends on for https://review.rdoproject.org/r/#/c/24726/ to check if the fix is good?23:30
*** ysandeep has joined #oooq23:37
owalshmeh, no logs from that job anyway but if it passes I'll assume it's all good23:37
*** sshnaidm is now known as sshnaidm|afk23:45
*** jmasud has joined #oooq23:52
*** tosky has quit IRC23:52
