Thursday, 2021-08-26

*** ysandeep|away is now known as ysandeep05:39
zbrvote on https://review.opendev.org/c/openstack/project-config/+/80601206:38
*** amoralej|off is now known as amoralej07:02
*** jpena|off is now known as jpena07:37
*** ykarel is now known as ykarel|lunch08:28
frenzy_fridayHey zbr, https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/787502/ is failing in the linters but is passing locally. Do you have an idea why?09:17
*** sshnaidm|afk is now known as sshnaidm09:38
*** ykarel|lunch is now known as ykarel10:18
chandankumarsshnaidm: ysandeep arxcruz for interview coordination, how we want to coordinate?10:19
chandankumarAre we planning to time box it?10:19
sshnaidmchandankumar, I think better in that channel10:19
sshnaidmarxcruz, ysandeep ^^10:21
*** rlandy is now known as rlandy|rover10:38
*** ysandeep is now known as ysandeep|mtg10:57
*** rlandy is now known as rlandy_testbox10:58
*** dviroel|out is now known as dviroel|ruck11:14
*** jpena is now known as jpena|lunch11:42
* dviroel|ruck brb in a couple of mins11:58
*** dviroel|ruck is now known as dviroel|ruck|brb11:58
*** dviroel|ruck|brb is now known as dviroel|ruck12:20
*** jpena|lunch is now known as jpena12:38
arxcruzbhagyashris: i need to take my daughter at school, i'll be late for the scrum 12:47
bhagyashrisarxcruz, ack12:47
chandankumarrlandy|rover: weshay|ruck just wanted to update on container build https://review.rdoproject.org/zuul/builds?job_name=tripleo-build-containers-stream9-development&project=testproject without ceph12:49
chandankumartrying now with ceph12:49
weshay|ruckchandankumar, w/ ceph.. re: a deploy or image build... we don't build ceph12:54
chandankumar*ceph-common12:56
*** amoralej is now known as amoralej|lunch12:59
bhagyashrisakahat, chandankumar rlandy_testbox zbr sshnaidm scrum time13:00
bhagyashrisrlandy|rover, ^13:01
bhagyashrisReview list 13:04
bhagyashrishttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34706/13:04
bhagyashrishttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3499513:04
bhagyashrishttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3500813:05
rlandy|roverhttps://review.opendev.org/c/openstack/tripleo-quickstart/+/80439913:06
frenzy_fridayhttps://review.opendev.org/c/openstack/ansible-role-collect-logs/+/787502/ - need some help on the linetrs - it passes on local13:09
bhagyashrishttps://review.opendev.org/c/openstack/ansible-role-collect-logs/+/78750213:11
bhagyashrishttps://review.opendev.org/c/openstack/tripleo-quickstart/+/79148613:26
*** ysandeep|mtg is now known as ysandeep13:29
zbris "Failed to store expired repos cache" a redherring?13:31
sshnaidmprobably13:31
ysandeepakahat, hey o/ do you have fs001 results for train branch for your operator patches? I want to compare them with 16.2 results13:49
*** amoralej|lunch is now known as amoralej13:50
ysandeepakahat: found it! https://logserver.rdoproject.org/17/804117/7/openstack-experimental/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-train-branch/bb5c657/job-output.txt 13:52
akahatysandeep, i've for fs02 and fs35: https://review.rdoproject.org/r/c/testproject/+/3161213:57
akahatysandeep, well network roles are skipped in fs01-train. 13:58
ysandeepakahat: thanks! i got now - network_provision is w+ and baremetal_provision is ussuri+ 13:58
ysandeephttps://github.com/openstack/tripleo-quickstart/blob/master/config/general_config/featureset001.yml#L216-L22513:58
akahat:)13:59
ysandeepakahat: you have a suggestion from sagi on https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/804117 14:01
rlandy|roverysandeep; anything required for downstream today?14:06
rlandy|roverif not will just look into the couple failing on 17 line14:06
rlandy|roverI know Pooja is out for a bit14:07
rlandy|roverdviroel|ruck: woohoo master promotion14:07
dviroel|ruckyes :)14:08
rlandy|roverdviroel|ruck: we should check the component line - I can split that task with you14:08
rlandy|roverdviroel|ruck: know how to check the dates on those lines?14:09
dviroel|ruckrlandy|rover: ok, yeah, i think so, was just looking glance errors here https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-scenario001-standalone-glance-master14:09
rlandy|roverdviroel|ruck: k - I'll take train, ussuri, victoria14:10
chandankumarrlandy|rover: arxcruz kopecmartin we need to bring back the tempest container for operator work14:10
rlandy|roveryou good with master/wallaby?14:10
dviroel|ruckrlandy|rover: yes 14:10
rlandy|rovercool14:10
chandankumarrlandy|rover: arxcruz kopecmartin https://review.opendev.org/c/openstack/tripleo-common/+/80198714:11
ysandeeprlandy|rover: Didn't get a chance to ruck/rove downstream today, I was neck down trying to figure out bm deployment templates, which I finally did today..14:11
ysandeeprlandy|rover, fyi:- made some improvements14:12
ysandeep* While switching 17 jobs to use networkdatav2, Noticed we are not deleting redis/ovndb ports during "openstack overcloud delete overcloud":-14:12
ysandeep* Spoke with rabi, he cherrypicked the fix to wallaby https://review.opendev.org/c/openstack/tripleo-ansible/+/806083, I tested the change in my environemnt it works.14:12
ysandeep* Backported a feature for wallaby "node unprovisioning during overcloud delete in wallaby" https://review.opendev.org/c/openstack/python-tripleoclient/+/80607914:12
weshay|ruckysandeep++14:15
rlandy|roversec14:15
kopecmartinchandankumar: then we need to figure out who's gonna maintain it, how is it tested, built etc .. because at this moment i don't know anything about it14:15
arxcruzchandankumar: why god... why?14:18
chandankumararxcruz: it is required for operator work14:19
chandankumarto run tempest14:19
chandankumaropenstack deployment using https://github.com/openstack-k8s-operators/osp-director-operator14:19
chandankumarit is totally new work14:19
rlandy|roverysandeep: looks good - will check it out14:28
rlandy|roverchandankumar: k - we will be going down the path of testing that soon enough14:36
dviroel|ruckrlandy|rover: will testproject all components that run scenario001, they have been failing a lot, for different reasons. It is possible to see the same error in different components too, but not more than once in the component.14:51
rlandy|roverdviroel|ruck: didn't we skip some scenario001 tests?14:54
rlandy|roverwe may need to change the jobs to []14:55
rlandy|roverto include all14:55
rlandy|roverin skiplist14:55
rlandy|roverrather than having you add every component test14:55
rlandy|roverif it's the same failure14:55
dviroel|ruckisn't the same failure. actually, there is differents failures happening on scenario001, we just skip some tempest test.14:56
sshnaidmzbr, I don't see install_ansible_collections functions is executed anywhere, I think it's just defined, but not executed14:57
sshnaidmzbr, in install-deps.sh14:57
dviroel|ruckwill continue investigate after lunch, brb14:58
* dviroel|ruck lunch14:58
zbrinstall deps itself is sourced, it only has functions inside not direct actions, is like a bash library.15:02
zbrsshnaidm: installation of collections happens inside qs boostrap(), which is executed only when qs is called with --boostrap.15:04
zbrwe should either ensure we do this, or call install_ansible_collections from another appropiate place.15:05
sshnaidmzbr, I don't think we use quickstart.sh in ovb jobs, only install_deps.sh, so it's not executed15:06
rlandy|roverdviroel|ruck: k - ping if you need help with components15:06
sshnaidmzbr, can you see in logs where it's executed? Maybe worth to add -v to ansible-galaxy15:07
zbrany problem adding install_ansible_collections at the end after install_package_deps_via_bindep? it should work unless ansible is missing.15:07
*** amoralej is now known as amoralej|off15:10
chandankumarrlandy|rover: ysandeep meeting time15:29
ysandeepchandankumar, joining in a minute, sry jumping from another mtg15:31
*** jpena is now known as jpena|off15:39
chandankumarysandeep: rlandy|rover frenzy_friday https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/infra-setup/roles/promoter/templates/dlrn-promoter-service.sh.j215:46
* dviroel|ruck back15:48
rlandy|roverhttps://zuul.opendev.org/t/openstack/builds?job_name=puppet-openstack-unit-6.21-centos-8-stream16:13
rlandy|roverweshay|ruck: ^^ we just ping on #tripleo for this?16:13
rlandy|roverpuppet-tripleo failures in gate16:13
weshay|ruckcan email takashi16:15
rlandy|rovertrain component line is good16:16
rlandy|rover6 days agotripleopromoted-components9 hours ago - trouble16:33
ysandeeprlandy|rover: with 16.2 bm envd - looks like there is some hardware issue, not because of operator work16:38
ysandeephttps://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-16.2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-bm_envD-3ctlr_1comp-featureset035-rhos-16.2/5231268/logs/undercloud/home/zuul/overcloud_import_nodes.log16:38
rlandy|roverk - thanks16:38
ysandeepI will look tomorrow o/ 16:39
*** ysandeep is now known as ysandeep|away16:39
chandankumarrlandy|rover: please use the older one promoter16:39
chandankumarrlandy|rover: it needs some fixes16:39
rlandy|roverchandankumar: np - thanks16:42
rlandy|roverchandankumar: fyi - I am out tomorrow16:42
rlandy|roverso no rush16:42
dviroel|ruckrlandy|rover: scenario001 seems to be a real issue in master components16:45
dviroel|ruckrlandy|rover: but not all issues are the same16:46
rlandy|roverdviroel|ruck: k16:53
rlandy|roverlunch - brb16:53
dviroel|ruckrlandy|rover: here https://review.rdoproject.org/r/c/testproject/+/34983, common component seems to have a similar failure to https://bugs.launchpad.net/tripleo/+bug/194086617:46
dviroel|ruckadding as a comment17:46
rlandy|roverok17:47
rlandy|roverso add to skiplist?17:47
dviroel|ruckyeah, probably. There are other runs that says that failed in tempest execution, but there is no log outputs: https://logserver.rdoproject.org/83/34983/8/check/periodic-tripleo-ci-centos-8-scenario001-standalone-cinder-master/3c12df2/logs/17:56
dviroel|ruckmaybe if we skip those tests, we might see these other ones passing too, maybe17:56
*** rlandy is now known as rlandy_testbox17:57
dviroel|ruckrlandy|rover: adding components to skiplist https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/80626118:39
rlandy|roverdviroel|ruck: sorry - getting a lesson on cockpit editing18:45
* rlandy|rover review18:45
rlandy|roverdviroel|ruck: you'll probably need to add all of wallaby18:45
rlandy|roverif it's too much of a pain you can skip all jobs with []18:46
rlandy|roverbut approving this for now18:46
rlandy|roverlet's merge and see if it clears the lines18:46
rlandy|roverthanks18:46
dviroel|ruckok :)18:51
rlandy|rover https://code.engineering.redhat.com/gerrit/c/openstack/rrcockpit/+/268195 Add promotions in descending order19:17
rlandy|roverweshay|ruck: ^^19:18
*** sshnaidm is now known as sshnaidm|afk19:32
rlandy|roverdviroel|ruck: hi - fyi - I am PTO tomorrow20:46
rlandy|roverweshay|ruck will be around though20:47
weshay|ruck:)20:47
weshay|ruckenjoy it20:47
dviroel|ruckrlandy|rover: ok20:47
rlandy|roverthanks - mom says hi20:47
dviroel|ruckhi o/20:48
dviroel|ruckrlandy|rover: https://logserver.rdoproject.org/83/34983/9/check/periodic-tripleo-ci-centos-8-scenario001-standalone-cloudops-master/0e503f9/logs/20:50
dviroel|ruckseen this mysql errors more often20:50
dviroel|ruckseems to be another issue with scenario00120:51
rlandy|roverwow scenario001 is a true winner20:52
rlandy|rover69ca146fc05d  192.168.24.1:8787/tripleomaster/openstack-mariadb:64e0281eb4c358d842dfdd238d9486dd-updated-20210826192259                     /container_puppet...  27 minutes ago      Exited (6) 17 minutes ago          mysql_wait_bundle20:53
rlandy|roveryeah  we were tracking that somewhere 20:54
rlandy|roverlooking at a tripleo line failure20:54
rlandy|rovercompares20:54
rlandy|rover22 days agotraincurrent-tripleocentos-7 - will also check that out20:55
rlandy|rover6 days agotripleopromoted-components14 hours ago20:56
weshay|ruckrlandy|rover, I think c7-train was all passing when we looked earlier in the week20:56
rlandy|roverweshay|ruck: it was20:56
rlandy|roverwhich is why its strange it did not promote20:57
rlandy|roverthat was wednesday20:57
rlandy|roverweshay|ruck; line is green, green, green https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-stable4-centos720:58
rlandy|rovermust be criteria mismatch20:58
rlandy|roveror the promoter for c7 not running20:58
rlandy|roverweshay|ruck: if 10.0.148.59 is still valid as the c7 promoter, I can't access it21:01
rlandy|roverdoesn't ping21:02
weshay|ruckrlandy|rover, it's not valid.. c7 should be on promoter.rdo21:02
rlandy|roverk - checking there21:02
weshay|ruckrlandy|rover, probably not even configured to run21:03
weshay|rucklolz21:03
rlandy|roverit's nowhere21:03
* rlandy|rover gets on promoter21:04
weshay|ruckya.. see if the script is kicking it off21:04
rlandy|roverlet's see what's supposed to run there21:04
rlandy|rover22 long days21:04
rlandy|roverwith bright green runs :)21:04
rlandy|roverweshay|ruck: yep not in ci-scripts/dlrnapi_promoter/dlrn-promoter.sh on the actual promoter box21:09
rlandy|roveradding and adding config and restarting21:09
rlandy|roverdviroel|ruck: weshay|ruck; added train c7 to rdo promoter and restarted21:14
rlandy|roverin case you see any issues tomorrow21:14
weshay|ruckk21:14
weshay|ruckthanks21:14
weshay|ruckwould love c7 to die21:14
dviroel|ruckok21:14
rlandy|roverye - should not require much love at this point - hopefully it just runs21:15
rlandy|rovercriteria seemed up to date21:15
rlandy|roverhttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-standalone-upgrade-tripleo-ussuri&pipeline=openstack-component-tripleo21:16
rlandy|roverha ok - that's why it's not promoting21:17
rlandy|rover2021-08-21 21:19:43.949813 | primary | TASK [os_tempest : Ping router ip address] *************************************21:18
rlandy|roverinteresting - we had this issue in 1721:18
rlandy|roverchecking if it's the same21:18
rlandy|roveropenvswitch ovn/ovs21:18
dviroel|rucksome weird outputs here: https://logserver.rdoproject.org/83/34983/9/check/periodic-tripleo-ci-centos-8-scenario001-standalone-common-master/1a28102/logs/undercloud/var/log/containers/stdouts/rabbitmq-bundle.log.txt.gz21:42
dviroel|ruckLooks like this is not happening in jobs that succeeded ^21:49
rlandy|roverdviroel|ruck: going to log a bug for failing  periodic-tripleo-ci-centos-8-standalone-upgrade-tripleo-ussuri21:52
rlandy|roverlooks similar to a bugzilla we raised 21:53
rlandy|rovercan you ping ykarel with that bug21:53
rlandy|roverI think he updated OVN21:53
dviroel|ruckok, sure21:53
rlandy|roverthanks21:53
rlandy|roverwill post bug here - juts collecting info21:53
dviroel|ruckok, I will ping tomorrow21:54
rlandy|roverdviroel|ruck: https://bugs.launchpad.net/tripleo/+bug/1941802 - we fixed the bugzilla referenced by upgrading openvswitch22:11
dviroel|ruckack22:13
* dviroel|ruck dinner and out 22:18
*** dviroel|ruck is now known as dviroel|out22:18
*** rlandy|rover is now known as rlandy|rover|bbl22:19

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!