Friday, 2022-06-24

*** rlandy|bbl is now known as rlandy01:33
rlandyjm1[m]: akahat:pls check hackmd - and work with fmount to get the following merged:01:39
rlandyhttps://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/84623101:39
rlandyhttps://review.opendev.org/q/I137e335abeedccad801cdc03feee654c3e42a0e201:40
*** rlandy is now known as rlandy|out01:40
*** ysandeep|out is now known as ysandeep01:59
ysandeepGood morning team o/01:59
*** undefined_ is now known as Guest311204:09
*** akahat is now known as akahat|ruck04:39
chandankumarysandeep: good morning \o04:56
akahat|ruckchandankumar, ysandeep o/05:19
akahat|ruckchandankumar, ysandeep could you please vote on this: 05:19
akahat|ruck<frenzyfriday|PTO> yeah :D just came back to check the promoter patch05:19
akahat|ruckhttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4370805:19
akahat|ruckthis ^^05:19
akahat|ruckso i can start promoter again.05:19
* ysandeep looking05:26
ysandeepakahat|ruck: +wed05:28
akahat|ruckysandeep, thanks!05:29
ysandeepakahat|ruck, jm1[m] fyi.. new bug https://bugs.launchpad.net/tripleo/+bug/1979707 , Rabi has already proposed a fix.05:33
ysandeepfeel free to add promotion-blocker flag in case this is blocking check/gate05:33
akahat|ruckysandeep, ack05:34
*** undefined_ is now known as Guest311505:51
akahat|ruckchandankumar, ysandeep  ping about registry: is the quay registry compatible with docker-py?06:34
chandankumarakahat|ruck: don't know, never tried that06:34
akahat|ruckI'm trying to log in to the registry using docker-py but it's throwing an error.06:35
ysandeepakahat|ruck, I have no idea about that.06:35
chandankumarakahat|ruck: what is the error?06:35
akahat|ruckIt is the case for the registry login.. docker is not able to connect to quay.. 06:35
akahat|ruckpromoter blocker06:35
akahat|ruckchandankumar, docker.errors.APIError: 500 Server Error for http+docker://localhost/v1.41/auth: Internal Server Error ("Get "https://quay.rdoproject.org/v2/": unauthorized: Could not find robot with specified username")06:36
akahat|ruckit is always using /v2/ 06:36
ysandeepmarios, fyi.. replied to your query here: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/403826/18..20#message-dd72d77b589f23c8b85b188b34dc3ad4acfc36b006:37
akahat|ruckregistry login works using docker login but not using docker-py06:37
ysandeepbhagyashris, could you please check https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/403826/18..20#message-21c9393813564cccf515a731e9e8a88f44044dd3 06:37
chandankumarakahat|ruck: "https://index.{{ registry_host }}/v1/"06:39
bhagyashrisysandeep, yup checking 06:39
chandankumarakahat|ruck: check the container-push.yml playbook?06:39
chandankumarakahat|ruck: can you change it from v2 to v106:41
akahat|ruckchandankumar, tried.. but not working.. 06:44
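For reference, a minimal sketch of the kind of docker-py login call discussed above. It assumes the promoter uses docker-py's `login()` method; the helper name `login_to_registry` and the way credentials are passed in are hypothetical, and this is not the promoter's actual code.

```python
def login_to_registry(client, registry, username, password):
    """Log in to `registry` through a docker-py style client.

    `client` is expected to expose docker-py's `login()` method,
    for example the object returned by `docker.from_env()`.
    """
    # reauth=True makes docker-py send the credentials to the daemon again
    # rather than returning a cached entry from an earlier login.
    return client.login(
        username=username,
        password=password,
        registry=registry,
        reauth=True,
    )
```

With docker-py installed, this could be called as `login_to_registry(docker.from_env(), "quay.rdoproject.org", user, token)`, where `user` and `token` stand in for whatever credentials the job provides.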
marioso/ ysandeep thanks will check 06:48
mariosysandeep: ack thanks ok then 06:48
*** arxcruz|rover is now known as arxcruz06:55
*** jm1|ruck is now known as jm1|rover06:56
jm1good morning, ci :)06:58
jm1akahat|ruck: what is going on with ci? what should i have a look at? 👀06:58
akahat|ruckjm1, o/ morning.. will have call in few mins.. chasing with promoter.. 06:59
akahat|ruckjm1, meanwhile can you take look at downstream?07:00
jm1akahat|ruck: never worked on downstream before 🙈 let me google that..07:01
jm1btw nice rr doc07:02
*** amoralej|off is now known as amoralej07:03
*** jpena|off is now known as jpena07:06
*** ysandeep is now known as ysandeep|afk07:11
akahat|ruckjm1, https://meet.google.com/ius-qipc-odj07:47
akahat|ruckjm1,  joining?07:48
jm1akahat|ruck: yep07:49
amoralejchandankumar, how is the issue with redis?07:50
amoralejstill failing?07:50
akahat|ruckjm1, https://hackmd.io/9hv3vTNlST2rw014LSDqcg07:51
chandankumaramoralej: we worked around it by downgrading pacemaker and resource-agents https://review.opendev.org/c/openstack/tripleo-common/+/84628707:53
amoralejit's still failing in rdo jobs07:53
amoralejlemme check why07:53
chandankumarok07:55
chandankumarjm1: akahat|ruck's place has a power cut right now, so he may have gone offline08:05
chandankumarhe just informed me08:05
jm1chandankumar: thanks for the heads up! are power cuts usual in india?08:06
chandankumarjm1: nope08:06
chandankumarjm1: sometimes bad weather and hot summer might cause it08:07
jm1chandankumar: was just wondering because it is not the first time that happens while we are in a call ^^08:07
chandankumarit varies from place to place, each place has different infrastructure and backup situations08:08
jm1chandankumar: you have backups? (we dont, causing a blackout for several days last year 🙈)08:11
chandankumarjm1: I have backups which will run for 1 or 2 days08:11
jm1chandankumar: wow ok08:12
chandankumarsometimes infra issues or coal shortages cause power cuts across india, also seen in neighbouring countries08:12
chandankumarNow i think it is fixed08:12
amoralejchandankumar, i see new builds in repos so the downgrade is installing the bad version?08:50
amoralejhaven't you seen that in your jobs?08:50
*** ysandeep|afk is now known as ysandeep09:00
ysandeepmarios, just to confirm have we left deprecated-jobs.yaml on purpose? https://review.opendev.org/c/openstack/tripleo-ci/+/847241/1#message-b338e0f6909c98417495c74561500ba4cbe7f28e 09:02
ysandeepI wonder when zuul will remove declaring queue at pipeline level.. will it complain because we didn't update deprecated-jobs.yaml09:03
mariosysandeep: looking 09:07
mariosysandeep: thanks replied... it's a good point but i think it's fine to leave it; those templates more likely need removing altogether 09:11
mariosysandeep: there should not be any zuul explosions or other complaints ;) according to the mails about it it would mean jobs aren't run in the layouts that use the queue in the 'old' way 09:11
ysandeepmarios: ack09:16
jm1reviewbot: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4371409:16
jm1akahat|ruck: still no power? 😳10:16
akahat|ruckjm1, hey.. 10:17
akahat|rucki'm back a while ago.10:17
akahat|ruckchasing some weird issues on promoter.10:17
jm1akahat|ruck: ah, thought we had something to discuss?! anyway, rr doc is updated10:18
jm1akahat|ruck: went through all bugs and checked whether they are still open or not. havent added new bugs yet10:19
akahat|ruckjm1, okay.. 10:19
*** rlandy|out is now known as rlandy10:21
rlandyakahat|ruck: jm1: hello10:22
akahat|ruckrlandy, hello10:22
rlandyakahat|ruck; jm1|rover:let's sync10:22
akahat|ruckrlandy, okay10:22
rlandyhttps://meet.google.com/ots-qfcp-uzf?pli=1&authuser=010:23
chandankumaramoralej: checking10:30
ysandeepakahat|ruck, jm1 many jobs in post failure: https://zuul.opendev.org/t/openstack/builds?result=POST_FAILURE&skip=010:38
ysandeepjm1|rover, ^^10:38
ysandeepyou have so many nicks :)10:38
akahat|ruckysandeep, yes. saw that.. but it does not show the logs10:38
jm1|roverysandeep: just choose whatever nick you want ^^10:39
chandankumaramoralej: you are correct, now it is installing pacemaker.x86_64                               2.1.4-1.el9                         @quickstart-centos-highavailability and the resource-agents version has not changed 10:41
ysandeepakahat|ruck, ack.. looks like noonedeadpunk was talking about that issue on #opendev  few mins back.10:41
chandankumaramoralej: https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_d75/847329/2/check/tripleo-ci-centos-9-scenario001-standalone/d75fb7c/logs/undercloud/var/log/extra/podman/containers/redis-bundle-podman-0/podman_info.log10:42
amoralejchandankumar, so jobs are not failing?10:43
chandankumaramoralej: for wallaby it is failing10:43
amoralejmmm and why not in master?10:43
chandankumaramoralej: since the resource-agents version has not changed https://review.opendev.org/c/openstack/tripleo-common/+/84743710:43
amoralejno, i don't understand10:44
amoralejmmm10:44
amoralejwhat i'm hitting in rdo10:44
amoralejis that10:44
chandankumarand it is downgraded in master so master is working fine10:44
amoralejjobs keep failing with downgrade10:44
amoralejthat's what i see in my env10:44
chandankumarand for wallaby https://review.opendev.org/c/openstack/tripleo-common/+/847437 downgrade patch is not merged that's why it is failing10:45
chandankumaramoralej: https://bugs.launchpad.net/tripleo/+bug/1978997/comments/1010:46
amoralejchandankumar, https://logserver.rdoproject.org/73/42273/10/check/rdoinfo-tripleo-master-testing-centos-9-scenario001-standalone/f96bad2/logs/10:46
amoralejthat's from yesterday, with the downgrade10:46
amoralejyes i get that10:47
amoralejhttps://logserver.rdoproject.org/73/42273/10/check/rdoinfo-tripleo-master-testing-centos-9-scenario001-standalone/f96bad2/logs/undercloud/home/zuul/container-builds/302ade5d-3634-4635-987c-a03ff94a7a5d/base/redis/redis-build.log.txt.gz10:47
amoralejsee the downgrade there installed the bad versions10:48
amoralejpacemaker                x86_64  2.1.3-2.el910:48
amoralejas there is a more recent one10:48
amoralejpacemaker-2.1.4-1.el9.x86_6410:49
amoraleji'm not sure why you are not hitting that10:49
chandankumarlet me check the logs from periodic line also to confirm10:50
chandankumaramoralej: reason we are not seeing it in periodic line https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-9-scenario001-standalone-master&project=openstack/tripleo-ci - sc01 is busted already due to ceph failure https://review.opendev.org/c/openstack/tripleo-ansible/+/84732310:54
chandankumarit has not reached the standalone deployment10:54
amoralejand in check pipeline with provider jobs?10:54
amoralejyou rebuild the redis container, right?10:54
amoralejor use redis from last promoted?10:55
chandankumaramoralej: https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ci-centos-9-scenario001-standalone&branch=master&skip=010:56
chandankumaramoralej: we rebuild the redis container10:57
amoralejlet me check there10:57
amoralejthanks10:57
chandankumarone of the passing job from today https://b82ba900a06c8c7cbdc7-ff1e6572e024e404a82f69f5d976dc9d.ssl.cf5.rackcdn.com/846231/10/gate/tripleo-ci-centos-9-scenario001-standalone/a5dcf51/logs/undercloud/var/log/extra/podman/containers/redis-bundle-podman-0/podman_info.log10:58
chandankumarpacemaker.x86_64                               2.1.2-4.el9                         @quickstart-centos-highavailability10:58
chandankumarpacemaker-cli.x86_64                           2.1.2-4.el9                         @quickstart-centos-highavailability10:58
chandankumarpacemaker-cluster-libs.x86_64                  2.1.2-4.el9                         @quickstart-centos-highavailability10:58
chandankumarpacemaker-libs.x86_64                          2.1.2-4.el9                         @quickstart-centos-highavailability10:58
chandankumarpacemaker-remote.x86_64                        2.1.2-4.el9                         @quickstart-centos-highavailability10:58
chandankumarpacemaker-schemas.noarch                       2.1.2-4.el9                         @quickstart-centos-highavailability10:58
chandankumarand10:58
chandankumarresource-agents.x86_64                         4.10.0-17.el9                       @quickstart-centos-highavailability10:58
amoralejwhere are the container build logs?10:59
*** ysandeep is now known as ysandeep|brb11:01
chandankumarbuildset history https://zuul.opendev.org/t/openstack/buildset/e34cbeae93694af88b4fe64d2257398d and https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_6ef/846231/10/gate/tripleo-ci-centos-9-content-provider/6efa83c/logs/container-builds/4dfa92a4-c0a3-4404-b684-450781385526/base/redis/redis-build.log11:01
amoraleji got it11:02
amoralejchandankumar, there is specific nvr there11:03
amoralejdnf -y downgrade pacemaker-2.1.2-4.el9 pacemaker-remote-2.1.2-4.el9 resource-agents-4.10.0-17.el911:04
amoralejthat's what is missing in my patch ...11:04
chandankumaramoralej:  i think downgrading resource-agents is enough11:04
amoralejmaybe11:04
amoralejbut the problem is not specifying the version11:05
amoralejhttps://review.opendev.org/c/openstack/tripleo-common/+/84722211:05
chandankumaramoralej: shall i update the workaround with nvr to avoid further issues?11:05
amoraleji didn't have that when i ran the jobs11:05
amoralejshould be fine now11:05
amoralejrechecking ...11:05
chandankumarok11:06
amoralejchandankumar, https://review.opendev.org/c/openstack/tripleo-common/+/847222 you need that in wallaby11:06
amoralejmake sure you include exact nvr11:06
amoralejunless the latest release in centos is fixed ...11:06
amoralejactually, see the commit message11:07
amoralejSo the problem resurfaced when11:07
amoralejthese rpms upgraded yet again.11:07
chandankumaramoralej: https://review.opendev.org/c/openstack/tripleo-common/+/847437 for wallaby backport11:08
chandankumarlet me fix it11:08
amoralejactually that looks fine11:09
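The pinning point in this thread can be illustrated with a small sketch. It assumes that an unpinned `dnf downgrade` moves to the highest available build below the installed one, which is why 2.1.3-2.el9 was picked once 2.1.4-1.el9 appeared. The helper names are hypothetical and the version comparison is a simplification, not RPM's real algorithm.

```python
def parse_evr(evr):
    """Split 'version-release' into a tuple of ints for a rough comparison."""
    version, release = evr.split("-")
    # Keep only the leading numeric part of the release, e.g. '4' from '4.el9'.
    release_num = release.split(".")[0]
    return tuple(int(part) for part in version.split(".")) + (int(release_num),)


def unpinned_downgrade_target(installed, available):
    """Return the highest available build that is lower than the installed one."""
    lower = [evr for evr in available if parse_evr(evr) < parse_evr(installed)]
    return max(lower, key=parse_evr) if lower else None


def pinned_downgrade_command(pins):
    """Build a dnf command that names the exact version-release of each package."""
    return ["dnf", "-y", "downgrade"] + [f"{name}-{evr}" for name, evr in pins.items()]
```

Naming the exact version-release keeps the target fixed even when a newer build lands in the repository, which is the behaviour the workaround relies on.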
* jm1 lunch11:14
*** ysandeep|brb is now known as ysandeep11:18
ysandeepanyone want to join review time?11:19
mariosysandeep: i was alone and dropped 11:19
mariosysandeep: after couple mins11:19
*** dviroel|out is now known as dviroel11:19
mariosysandeep: you have something to discuss? otherwise i think you and me have already reviewed so we can skip looks like?11:19
ysandeepmarios, rlandy and dviroel joined if you want to join back11:20
mariosysandeep: k rejoining11:20
rlandyakahat|ruck: jm1: https://zuul.opendev.org/t/openstack/builds?result=POST_FAILURE&skip=0 - pls follow #opendev on that11:26
akahat|ruckrlandy, ack11:27
ysandeepreviewbot, please add in review list: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4371411:39
reviewbotI have added your review to the Review list11:39
ysandeepdviroel, hey o/11:40
ysandeepdviroel, do you have few mins to sync?11:40
dviroelysandeep: yep11:40
ysandeepdviroel, https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4371411:41
ysandeepdviroel, oops https://meet.google.com/vzm-nrah-qqf?authuser=011:41
dviroelysandeep: trying to join, it is slow to me11:42
ysandeepokay.. no worries11:43
*** rlandy is now known as rlandy|dr_appt11:48
ysandeepdviroel, https://github.com/scality/openstack-actions-runner11:49
chandankumar#oooq, anyone seen this issue: https://9220ef844f68ff46b4db-cd7f53ffbb0b86c69deae453da021fe8.ssl.cf1.rackcdn.com/839688/82/check/tripleo-ansible-centos-stream-molecule-tripleo_collectd/3cabfb4/job-output.txt 11:58
chandankumarThe task includes an option with an undefined variable. The error was: 'nslookup_target' is undefined11:58
chandankumarthe job is failing with retry_limit11:58
dviroelchandankumar: yes12:02
chandankumardviroel: how to fix it?12:02
dviroelchandankumar: actually, I think that this job is running with fips enabled12:03
dviroelchandankumar: this is something that ade_lee added in fips role12:04
chandankumardviroel: is there a way to disable it?12:04
dviroel TASK [include_role : enable-fips]12:04
chandankumarit is a molecule job and only collectd one is failing 12:04
dviroelchandankumar: fips mode only runs if you parent from multinode-fips12:05
dviroelyou should parent from multinode only12:05
dviroelchandankumar: pre-run: zuul.d/playbooks/enable-fips.yml12:06
dviroelin job definition, don't know why12:06
dviroelnot related to parent job12:06
chandankumaradding  enable_fips: false in the job var to disable it12:07
dviroelif the idea is to run with fips enabled, you can set 'nslookup_target' var to opendev.org12:07
dviroelnslookup_target: 'opendev.org'12:08
chandankumardviroel: currently it is not needed12:09
dviroelchandankumar: ok, remove the pre-run then12:09
chandankumardviroel: thank you for the tip, i hope this time it will work12:09
dviroelchandankumar: np o/12:10
chandankumardviroel: the job is inherited from base 12:11
chandankumarhttps://opendev.org/zuul/zuul-jobs/src/branch/master/playbooks/enable-fips/pre.yaml#L912:11
chandankumarenable_fips | default(true) is causing that12:11
*** ysandeep is now known as ysandeep|afk12:12
*** amoralej is now known as amoralej|lunch12:19
dviroelchandankumar: yeah, just remove the pre-run playbook12:31
dviroelchandankumar: doesn't make sense to keep and have it disabled12:31
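A rough sketch of why the job failed, assuming the pre-run behaves as described in this thread: `enable_fips | default(true)` treats a missing `enable_fips` as true, and the FIPS path then needs `nslookup_target`. The function names below are hypothetical and only model the variable logic, not the actual role.

```python
def fips_pre_run_enabled(job_vars):
    """Mirror the Jinja expression `enable_fips | default(true)`."""
    # A missing variable falls back to True; an explicit False disables FIPS.
    return bool(job_vars.get("enable_fips", True))


def missing_fips_vars(job_vars):
    """List variables the FIPS path would need but the job does not define."""
    if fips_pre_run_enabled(job_vars) and "nslookup_target" not in job_vars:
        return ["nslookup_target"]
    return []
```

Under this model a job with no vars hits the undefined variable, while setting `enable_fips: false` or `nslookup_target: 'opendev.org'` avoids it; removing the pre-run playbook sidesteps the check entirely.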
akahat|ruckreviewbot, please add in review list: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4371912:42
reviewbotI have added your review to the Review list12:42
chandankumardviroel: ok12:43
akahat|ruckjm1, chandankumar dviroel ysandeep|afk ^^ please take a look when you are free.12:43
*** rlandy|dr_appt is now known as rlandy12:51
jm1akahat|ruck, rlandy: updated rr doc with rerun testprojects and info for c9 wallaby and rhel9 osp1712:52
rlandyjm1: great - thank you12:52
* jm1 mtg12:57
*** ysandeep|afk is now known as ysandeep12:58
*** undefined__ is now known as rcastillo13:08
*** undefined_ is now known as rcastillo13:09
rlandyakahat|ruck; thanks for fixing promoter13:13
*** amoralej|lunch is now known as amroalej13:16
*** amroalej is now known as amoralej13:16
amoralejrlandy, i'm updating the versions of puppet and facter in master https://review.rdoproject.org/r/c/rdoinfo/+/4227313:16
amoraleji expect it will just work fine13:16
rlandychandankumar: dviroel: operator meeting13:16
amoralejjust in case, you prefer me to do it on monday?13:16
rlandyamoralej: probably13:17
rlandywe're very behind promoting master13:17
amoralejok, i'll do it on monday13:17
amoralejnp13:17
rlandyso monday should be better13:17
rlandythanks13:17
*** dasm|off is now known as dasm13:33
akahat|ruckC8 train promoted13:39
rlandyakahat|ruck++13:40
rlandyjm1: hey - where are your pictures???14:03
jm1rlandy: mtg, joining later14:03
ysandeepHave a superb weekend everyone! See you on Monday o/15:00
*** ysandeep is now known as ysandeep|out15:00
chandankumarsee ya!15:17
*** dviroel is now known as dviroel|lunch15:19
rlandyjm1: network jobs are still failing wallaby c8 and c915:27
rlandyhttps://logserver.rdoproject.org/openstack-component-network/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-standalone-network-wallaby/eb551b6/job-output.txt15:27
rlandywe should have ykarel's fix merged15:28
rlandyjm1: hey  can we also get a testproject downstream for periodic-tripleo-ci-rhel-8-bm_envD-3ctlr_1comp-featureset035-rhos-16.215:42
jm1rlandy: so we wait until periodic job runs again or should we trigger it again?15:42
jm1rlandy: ack15:42
rlandy        - periodic-tripleo-ci-rhel-8-bm_envD-3ctlr_1comp-featureset035-rhos-16.2:15:42
rlandy            vars:15:42
rlandy              force_periodic: true15:42
rlandy              featureset_override:15:42
rlandy                dlrn_hash_tag: e944f162e1f8a310431480d222279a9915:42
rlandytrigger it now pls15:43
rlandypromoted last on 06/2015:43
jm1rlandy: ack, then i will be eod15:43
rlandysure15:44
jm1rlandy: first one https://code.engineering.redhat.com/gerrit/c/testproject/+/41677815:45
jm1rlandy: c9 wallaby network component https://review.rdoproject.org/r/c/testproject/+/4372715:47
akahat|ruckrlandy, jm1 https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4371915:47
rlandyjm1: thank you15:48
jm1rlandy: added it to rr notes in section "Investigations and Reruns"15:49
jm1rlandy, akahat|ruck: have an appointment with mother-in-law now, eod 🙈15:51
jm1have a nice weekend, oooci team :)15:52
rlandyjm1[m]: thanks - have a good weekend15:53
*** jpena is now known as jpena|off16:02
*** amoralej is now known as amoralej|off16:09
*** dviroel|lunch is now known as dviroel16:18
akahat|ruckrlandy, about POST_FAILURE https://zuul.opendev.org/t/openstack/builds?result=POST_FAILURE&skip=018:02
akahat|ruckrlandy, uploading logs to the fileserver is taking too much time.. 18:03
akahat|ruckit is causing timeouts in some jobs. 18:03
akahat|ruckAs per discussion with fungi on #opendev, he suggested a few points:18:04
akahat|ruck- Uploading too many logs to the fileserver is causing the timeouts.18:04
akahat|ruck- Each dir/nested dir is treated as a swift object.. if possible remove nested logs18:04
akahat|ruck- Lots of duplicate dirs like: https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_614/842073/10/gate/tripleo-ci-centos-9-undercloud-upgrade/6142441/logs/undercloud/home/zuul/src/opendev.org/openstack/tripleo-quickstart-extras/roles/validate-tempest/files/tempestmail/tests/fixtures/index.html18:05
akahat|ruckhttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_614/842073/10/gate/tripleo-ci-centos-9-undercloud-upgrade/6142441/logs/undercloud/home/zuul/workspace/.quickstart/usr/local/share/ansible/roles/validate-tempest/files/tempestmail/tests/fixtures/index.html18:05
akahat|ruckWe need to take a look around what files we really need. So we can remove unnecessary files.18:06
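One way to start that review is sketched below, assuming a collected log tree is available on local disk. It counts directories and files (each of which becomes an object on upload, per the points above) and groups files with identical content, such as fixture files copied under two paths. The function name `summarize_log_tree` is hypothetical.

```python
import hashlib
import os
from collections import defaultdict


def summarize_log_tree(root):
    """Count dirs and files under `root` and group files with identical content."""
    dir_count = 0
    file_count = 0
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        dir_count += 1  # the root directory itself is counted too
        for name in filenames:
            path = os.path.join(dirpath, name)
            file_count += 1
            with open(path, "rb") as handle:
                digest = hashlib.sha256(handle.read()).hexdigest()
            by_digest[digest].append(os.path.relpath(path, root))
    # Keep only content that appears at more than one path.
    duplicates = sorted(sorted(paths) for paths in by_digest.values() if len(paths) > 1)
    return {"dirs": dir_count, "files": file_count, "duplicates": duplicates}
```

The sketch reads each file fully into memory, so it suits a quick survey of one job's logs rather than very large trees.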
rlandyakahat|ruck: ah ok18:24
* akahat|ruck off18:58
rlandyysandeep|out: https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/builds?job_name=periodic-tripleo-ci-rhel-8-bm_envD-3ctlr_1comp-featureset035-rhos-16.2&skip=0 - think we broke bm with your review19:07
rlandygetting there 20:25
rlandy4 jobs away from promoting master20:26
dasmneverending story :)20:26
dasmi already hear Limahl in my head ^^20:26
dviroelhave a great weekend team - enjoy o/20:48
dasmdviroel: o/20:48
*** dviroel is now known as dviroel|out20:50
* dasm leaves for the week.21:49
dasmhappy weekend y'all21:49
