Wednesday, 2021-10-13

*** rlandy|ruck|bbl is now known as rlandy|ruck00:20
*** ysandeep|out is now known as ysandeep04:04
ysandeepchandankumar, fyi.. https://bugs.launchpad.net/tripleo/+bug/1946822 we need to cix c9 blocker, right?04:12
ysandeepykarel|away: hey good morning o/ I am working on c9, I don't see extras repos here: https://composes.stream.centos.org/production/latest-CentOS-Stream/compose/extras/x86_64/os/ , do you know if we have moved content of that repo to other repos?04:22
*** ykarel|away is now known as ykarel04:32
ykarelysandeep, it will be there when c9 gets released04:32
ykarelas that contains release rpms for SIG04:33
ysandeepykarel, ack, thanks!04:33
ysandeepykarel, fyi.. we are hitting this dependency issue https://bugs.launchpad.net/tripleo/+bug/194682204:33
ykarelysandeep, ack will check in some time, currently in a meeting04:34
ysandeepykarel: sure, no hurry04:34
chandankumarysandeep: yes, all cs9 issues need to be cixed04:42
ysandeepchandankumar: ack, thanks! adding promotion-blocker flag04:44
chandankumarysandeep: you can find few centos repos here also https://trunk.rdoproject.org/centos9-master/delorean-deps.repo coming from deps04:45
ysandeepchandankumar: ack04:46
chandankumarysandeep: container-tools module comes from appstream 04:47
chandankumarso we do not need powertools repos04:47
chandankumarin future there will be no container-tools modules for cs9 also04:47
ysandeepchandankumar, I didn't find powertools here: https://composes.stream.centos.org/production/latest-CentOS-Stream/compose/ but I found it here https://composes.centos.org/latest-CentOS-Stream-8/compose/PowerTools/x86_64/os/ , If we don't need that, I can remove it .04:50
chandankumarysandeep: https://wiki.centos.org/AdditionalResources/Repositories 04:51
chandankumar    PowerTools - Available only for CentOS8, the PowerTools repository provides most of the developer tools. Disabled by default. 04:51
chandankumarbut I will confirm with amoralej|off regarding powertools (if in future needed for any package)04:52
*** bhagyashris is now known as bhagyashris|rover05:21
chandankumarysandeep: you can find the container build logs here https://logserver.rdoproject.org/53/18953/106/check/tripleo-build-containers-stream9-development/75f400e/logs/build.log05:31
ysandeepchandankumar, I was talking about individual container build log, it was missing in the failing job, i think that was failing at an earlier stage, I can see logs in good run https://logserver.rdoproject.org/53/18953/106/check/tripleo-build-containers-stream9-development/9379f76/logs/container-builds/83cef063-7a2b-4fe7-b27d-29863f349dcd/base/os/os-build.log05:34
chandankumarysandeep: yes, it was failing in the earlier stage while evaluating the jinja05:35
chandankumarthanks for checking in by the way :-)05:36
ysandeepmy pleasure :D05:42
mariosbhagyashris|rover: arxcruz|ruck: o/ any fallout observed from https://review.opendev.org/c/openstack/tripleo-ci/+/810261 ? (is 3rd party/rdo check/gate broken?)06:10
*** amoralej|off is now known as amoralej06:19
amoralejysandeep, chandankumar in centos9 the equivalent to powertools is CRB06:20
amoralejwe need powertools in CS8 and CRB in CS906:21
ysandeepamoralej, ack thanks and congratulations! 06:21
amoralejthanks ysandeep :)06:21
ysandeepamoralej, fyi.. we are hitting this issue https://bugs.launchpad.net/tripleo/+bug/1946822 because of missing genisoimage  package06:22
amoralejmmm we are waiting for a new release to merge the review, iirc06:23
amoralejhttps://review.rdoproject.org/r/c/openstack/ironicclient-distgit/+/3567106:23
ysandeepamoralej, thanks! added myself as a follower on that review06:25
bhagyashris|rovermarios, not yet. will check06:37
mariosbhagyashris|rover: thanks i looked around a bit i saw some running 3rd party jobs after that merged .06:38
mariosbhagyashris|rover: if you see something ping we will need to revert immediately 06:38
mariosbhagyashris|rover: or find some other solution anyway 06:38
bhagyashris|rovermarios, ack thanks06:44
mariosbhagyashris|rover: arxcruz|ruck: o/ whats the story on tox molecule job https://zuul.opendev.org/t/openstack/builds?job_name=openstack-tox-molecule&project=openstack/ansible-role-collect-logs 07:22
mariosseems to be broken for a while since september 07:23
mariosah i see https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/809979 07:25
bhagyashris|rovermarios, ysandeep chandankumar kindly add in your review list fix mol-get-hash-centos7 job https://review.opendev.org/c/zuul/zuul-jobs/+/813749 thanks 07:26
bhagyashris|roverhere it's tested https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3623107:28
*** jpena|off is now known as jpena07:29
*** poojajadhav is now known as pojadhav07:29
mariosbhagyashris|rover: ack but you will have to find some zuul-jobs cores 07:29
bhagyashris|roveryup07:30
mariosbhagyashris|rover: https://review.opendev.org/admin/groups/339d2fc70a5268571c130371e7501193d9ce7e86,members 07:30
bhagyashris|rovermarios, thanks 07:31
bhagyashris|roveradded07:32
Tengutosky, marios https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/81377708:17
mariosTengu: ack thanks adding to reviews 08:22
Tengu:)08:23
Tengutrying my best08:23
arxcruzbhagyashris|rover: ping 08:47
arxcruzok, i'm back 08:47
*** arxcruz is now known as arxcruz|rover08:48
Tenguarxcruz|rover: I see you're trying to correct the molecule tests on the collect-logs role - is this one already known? https://6bccc71dcd109068b189-1635a47266c72ea76f240926bd5c4cc0.ssl.cf2.rackcdn.com/813777/3/check/openstack-tox-molecule/6d68348/tox/reports.html09:21
TenguI think I've already seen something related to that error...09:22
*** ykarel is now known as ykarel|lunch09:23
*** ysandeep is now known as ysandeep|mtg09:25
arxcruz|roverTengu: oh, is this back ?09:25
Tenguarxcruz|rover: apparently? I'm up-to-date against master I think09:25
arxcruz|roverfrenzy_friday: did you change the elastic again ?09:25
Tenguyep. up-to-date against master.09:25
frenzy_fridayarxcruz|rover, the nova mag? Nope09:26
arxcruz|roverTengu: yeah, this is a bit hard to properly fix, and i'm ruck rovering 09:26
* frenzy_friday reading ^09:26
arxcruz|roverfrenzy_friday: i think the molecule job is getting a different nova error 09:26
Tenguyay :)09:27
arxcruz|roverTengu: i mean, it's simulating a nova error, not a "real" error :) 09:27
Tenguarxcruz|rover: ah :). better!09:28
Tengubut since it's not working.....09:28
frenzy_fridaychecking if there are any more "No_valid_host_was_found" in the queries09:28
frenzy_fridayarxcruz|rover, I thought we already changed this https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/src/data/queries.yml#L578 in patch  https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/809577/2/src/data/queries.yml#57809:33
frenzy_fridaynot sure what happened, adding again09:33
arxcruz|roverfrenzy_friday: https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/src/data/queries.yml#L3309:33
arxcruz|roverthere's more 09:34
arxcruz|rovernot sure if it's related, need to check09:34
arxcruz|roverafter the meeting today i can take a look09:34
frenzy_fridayyes, but what happened to patch 809577 that merged?09:34
arxcruz|roveri have German class now 09:34
arxcruz|roverfrenzy_friday: no idea 09:34
arxcruz|roverfrenzy_friday: ci works in mysterious ways 09:34
TenguSchroedinger Patch? :)09:37
frenzy_friday^ lol09:40
Tengu^^09:41
frenzy_fridayhttps://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791 should fix09:42
Tengufrenzy_friday: hmmmm the name seems different? _No_valid_host_was_found.log  is wanted in the molecule run for my patch... Or is it not linked?09:43
Tengui.e. there seem to be a space prefix, and a capital N..09:44
frenzy_fridaychecking09:44
soniya29chandankumar, kopecmartin, arxcruz|rover, ysandeep|mtg, please add/edit today's agenda for the tempest meeting - https://hackmd.io/fIOKlEBHQfeTZjZmrUaEYQ09:51
Tengufrenzy_friday: can I depends-on my patch on yours?09:52
frenzy_fridayTengu, no, sova/ansible collect logs directly downloads the file https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/output/sova-pattern-generated.json -  so depends-on doesn't work right now09:54
Tengufrenzy_friday: ok! no problem09:54
frenzy_fridayThat is what arx was trying to fix before his ruck/rover sprint09:54
Tenguok09:54
Tenguso no actual way to test things then09:54
frenzy_fridayTengu, right now I am comparing https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/2/output/sova-pattern-generated.json and https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/output/sova-pattern-generated.json 09:55
frenzy_fridayChecking if the script adds _ prefix and capitalises the first letter or that has to be present09:56
Tenguok09:56
frenzy_fridayHey dpawlik, is https://review.rdoproject.org/analytics/app/ down?09:58
dpawlik>.< yes.... 10:04
dpawlikfrenzy_friday: restarting...10:08
frenzy_fridaythanks10:08
dpawlikfrenzy_friday: could you make a note of which query the elastic dies after?10:09
dpawlikfrenzy_friday: we increased the xmx and xms flags twice from the default values that were running for a few months10:09
dpawlikfrenzy_friday: and it's still dying 10:09
dpawlikfrenzy_friday: maybe I can try to help with query or set some limitation...?10:10
dpawlikif it needs to be like it is, I will set the current values as default10:11
frenzy_fridaydpawlik, yes sure, lemme rerun the last query that I ran and see if it goes down again10:11
dpawlikthanks frenzy_friday !10:13
frenzy_fridayok, its running now10:15
frenzy_fridaydpawlik, when a patch is submitted to this repo the check job validates the query files (https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/output/elastic-recheck) by querying ES with each of them10:16
frenzy_friday^ this was the last thing I ran10:17
frenzy_fridayonce on my local and once by zuul10:17
dpawlikfrenzy_friday: hmm, could we add some time limitation like "last 7 days"? so far we have retention set to 14 days AFAIR10:19
frenzy_fridaydpawlik, ack, lemme change it to 7 days in elastic recheck10:20
dpawlikit would be huge improvement10:20
dpawlik<elasticsearch says thank you :) >10:20
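The "last 7 days" limit discussed above can be sketched as an Elasticsearch query body. This is a minimal illustration only: the helper name `with_time_window` and the `@timestamp` field name are assumptions, not taken from the tripleo-ci-health-queries or elastic-recheck code.

```python
import json


def with_time_window(query_string, days=7, time_field="@timestamp"):
    """Wrap a Lucene query string in a bool query limited to the last `days` days.

    `time_field` is an assumed field name; the real index may use another one.
    """
    return {
        "query": {
            "bool": {
                # The original free-text query is kept unchanged.
                "must": [{"query_string": {"query": query_string}}],
                # The range filter bounds how much data the cluster must scan.
                "filter": [{"range": {time_field: {"gte": f"now-{days}d"}}}],
            }
        }
    }


if __name__ == "__main__":
    body = with_time_window('message:"No valid host was found"')
    print(json.dumps(body, indent=2))
```

Putting the time bound in a `filter` clause rather than in `must` keeps it out of relevance scoring, which is usually what is wanted for a pure retention-style cut-off.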
akahatysandeep|mtg, chandankumar, arxcruz|rover bhagyashris|rover ping, I'm stopping downstream promoter server.. i'll let you know once issue is fixed.10:20
bhagyashris|roverakahat, ack10:21
frenzy_fridayTengu, arxcruz|rover looks like there were some mismatch btw the original sova file and the ones we generate - https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791 should fix it hopefully10:28
*** pojadhav is now known as pojadhav|brb10:31
*** rlandy is now known as rlandy|ruck10:31
rlandy|ruckarxcruz|rover: bhagyashris|rover: hi - how are things today?10:35
arxcruz|roverrlandy|ruck: a few failures10:37
* arxcruz|rover needs 30 min10:37
*** ykarel|lunch is now known as ykarel10:37
rlandy|ruckarxcruz|rover: there are content-provider failures due to ceph related containers10:38
rlandy|ruckdid you have a patch for those issues?10:39
rlandy|ruckarxcruz|rover: other failures you are looking at?10:39
arxcruz|roverrlandy|ruck: i thought chandankumar was working on that 10:40
rlandy|ruck2021-10-12 21:29:56.758376 | primary |     "cmd": "buildah images; buildah push --format=v2s2 --tls-verify=False --log-level debug quay.ceph.io/prometheus/node-exporter:v0.17.0  docker://127.0.0.1:5001/tripleomaster/node-exporter:v0.17.0;\n",10:40
rlandy|ruckchandankumar: ^^ do we have a bug for this10:40
chandankumarrlandy|ruck: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/78236210:40
chandankumararxcruz|rover: I got busy with cs9 work sorry for that10:40
chandankumarplease take over the review10:40
rlandy|ruckchandankumar: thank you10:41
rlandy|ruck10:41
rlandy|ruckFix Released10:41
rlandy|ruckbug is ^^10:41
bhagyashris|roverrlandy|ruck, the issue you pointed got resolved 10:41
bhagyashris|roverpromoting the first hash one 10:42
rlandy|ruckakahat: hi - any update on internal promoter?10:42
rlandy|ruckbhagyashris|rover: on rdo promotions?10:42
bhagyashris|roverrlandy|ruck, yes, 10:42
akahatrlandy|ruck, hello.. i'm looking in to it.10:42
rlandy|ruckbhagyashris|rover: also - thank you for posting a patch for mol-710:42
bhagyashris|roverrlandy|ruck, :)10:43
rlandy|ruckakahat: should we promote 17 with the old promoter?10:43
rlandy|ruckor do you want to use this to test the current one?10:43
akahatrlandy|ruck, i want to use the current one for testing. so we can have smooth promotions.10:44
rlandy|ruckakahat: ok10:44
akahatrlandy|ruck, for internal promoter issue with name resolution. Paramiko is not able to resolve the hostname. Replacing it with ip is working.10:49
rlandy|ruckakahat: ok - has 17 promoted?10:50
rlandy|ruckis there an associated patch?10:50
akahatrlandy|ruck, no. not yet. I'm trying to promote it now.10:50
rlandy|ruckakahat: great  - thanks10:51
rlandy|ruckbhagyashris|rover: can you look into https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/78236210:51
bhagyashris|roverrlandy|ruck, will check10:54
rlandy|ruckbhagyashris|rover: thanks - see yatin's comment10:54
akahatrlandy|ruck, bhagyashris|rover arxcruz|rover ysandeep|mtg RHOS-17 is promoted: http://10.0.110.143/promoter_logs/redhat8_osp17.log10:56
akahatsending patch to fix it.10:57
ysandeep|mtgakahat++ nice, thank you!10:57
*** ysandeep|mtg is now known as ysandeep10:57
rlandy|ruckakahat: good news- thanks for fixing that10:58
Tengufrenzy_friday: great, thanks!11:04
frenzy_fridayTengu, arxcruz|rover when you get some time pls add to your review list https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/81379111:05
Tengufrenzy_friday: on it!11:05
Tenguthough... is "Nova_failure_no_valid_host_was_found" correct in https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/4/output/sova-pattern-generated.json#695 ? I'm not used to sova tool, so that's probably a stupid question11:05
Tenguhttps://logserver.rdoproject.org/77/813777/3/openstack-check/tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset001/1f31dcc/logs/undercloud/var/log/extra/selinux_consolidated_avc.txt.gz  \o/11:12
Tenguleading to: https://logserver.rdoproject.org/77/813777/3/openstack-check/tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset001/1f31dcc/logs/undercloud/var/log/extra/selinux_denials_detail.txt.gz11:13
Tenguhmmm. wait. I should also use the "what" in the matcher.11:13
frenzy_fridayTengu, I think so, because in the original sova file we have https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/output/sova-patterns.json#L982 -> "ResourceInError: resources.NovaCompute: Went to statu..."11:15
frenzy_fridayIn the generated one https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/4/output/sova-pattern-generated.json#695 -> "id:Nova_failure_no_valid_host_was_found" -> https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/4/output/sova-pattern-generated.json#128111:16
frenzy_fridaySo "ResourceInError: resources.NovaCompute: Went to status ERROR due to \"Message: No valid host was found. , Code: 500" in both files are called "Nova_failure_no_valid_host_was_found"11:16
rlandy|ruckakahat: hey - chandankumar and I would like to merge the log rotation patch - pls can you  look at marios comments on https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/36130/3/ci-scripts/infra-setup/roles/promoter/tasks/main.yml11:16
akahatrlandy|ruck, ysandeep https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/28164411:17
*** dviroel|out is now known as dviroel11:17
akahatrlandy|ruck, okay looking in it.11:17
frenzy_fridayTengu, and msg:"No valid host was found." in the original file corresponds to patterns "No valid host was found. There are not enough hosts" and "Went to status ERROR due to \"Message: No valid host was found" (lines 1108 and 172). 11:19
rlandy|ruckakahat: hi - https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/281644/1/ci-scripts/dlrnapi_promoter/config_environments/rdo/RedHat-8/rhos-16.2.yaml11:19
rlandy|ruckpls see question in ^^11:19
rlandy|ruckimages is in there twice11:19
rlandy|ruckok to use ip if we really need to11:19
rlandy|rucklet me know and will merge11:19
frenzy_fridayIn the generated file msg:"No valid host was found." -> https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/4/output/sova-pattern-generated.json#65 and https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/4/output/sova-pattern-generated.json#80611:20
ysandeepakahat, rlandy|ruck instead of replacing host with ip, I think would be good if we add internal dns in resolc.conf11:20
ysandeepresolv.conf*11:20
rlandy|ruckysandeep: ack11:20
rlandy|ruckit should resolve11:20
akahatysandeep, yes that will also work.11:21
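Whether the promoter host can resolve a name at all, independently of Paramiko, can be checked with the standard library. A minimal sketch, assuming a hypothetical helper `resolve_or_none`; the hostname in the usage line is a placeholder, not the real internal host.

```python
import socket


def resolve_or_none(hostname, resolver=socket.getaddrinfo):
    """Return the sorted list of addresses for `hostname`, or None if lookup fails.

    `resolver` is injectable so the function can be exercised without DNS.
    """
    try:
        infos = resolver(hostname, None)
    except socket.gaierror:
        # Same class of failure an SSH client hits when resolv.conf
        # lacks a nameserver that knows the internal name.
        return None
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the address string.
    return sorted({info[4][0] for info in infos})


if __name__ == "__main__":
    # Placeholder name; substitute the host the promoter connects to.
    print(resolve_or_none("images.example.internal"))
```

If this returns None for the internal name but an address once the internal DNS server is listed in resolv.conf, the IP workaround is not needed.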
rlandy|ruckysandeep: https://review.opendev.org/c/openstack/tripleo-quickstart/+/813624 - has that been tested with downstream?11:22
rlandy|ruckif  no problems there, we can merge11:22
Tengufrenzy_friday: ok. well. let's get it in then!11:22
ysandeeprlandy|ruck, chandankumar left a comment there.. I need to look at that. 11:23
ysandeep+ test with downstream + c7 jobs11:23
rlandy|ruckysandeep: ok - pls ping when we have that confirmation and we'll merge11:23
ysandeepack11:24
*** jpena is now known as jpena|lunch11:24
frenzy_friday*fingers crossed*11:24
akahatrlandy|ruck, marios chandankumar there is one more question related to logrotate: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/36130/3/ci-scripts/infra-setup/roles/promoter/tasks/main.yml11:26
rlandy|ruckok - let's close that out tomorrow11:26
rlandy|ruckfinish the discussion11:26
rlandy|ruckfrenzy_friday: https://review.opendev.org/c/opendev/elastic-recheck/+/805638 - ready for review?11:27
rlandy|ruckzuul -1 on that11:27
frenzy_fridayrlandy|ruck, yes, fixing the tox and addressing a comment from Sagi11:28
*** pojadhav|brb is now known as pojadhav11:29
rlandy|ruckfrenzy_friday: k - np- just checking11:31
*** ysandeep is now known as ysandeep|afk11:31
mariosakahat: i think you misunderstood my comment the rotate 60 means that after it is rotated 60 times it will be removed11:33
mariosakahat: at least according to the docs11:33
* marios food biab11:35
rlandy|ruckarxcruz|rover: hey - do you have a patch for https://trello.com/c/ptIv41kA/2139-cixlp1946659tripleociproa-update-upgrades-jobs-are-failing-in-check-gate-after-promotions-not-found-image11:37
rlandy|ruckhttps://bugs.launchpad.net/tripleo/+bug/194665911:37
rlandy|ruckto report to CIX11:37
arxcruz|roverrlandy|ruck: yes, just a sec11:37
rlandy|ruckchandankumar: can we close this out? https://trello.com/c/uxNdFLUp/2134-cixlp1946461tripleociproa-centos-stream-9-missing-packages-tracker-for-container-build11:37
chandankumarrlandy|ruck: nope, we need to wait for qrouted and collectd container11:38
akahatmarios, okay. i got your point. I think we don't need maxage.11:38
Tengucool. just that molecule thing, and we're good with my selinux consolidation script for sealert!11:39
arxcruz|roverrlandy|ruck: https://review.opendev.org/c/openstack/tripleo-ci/+/813629 i update the card on cix 11:39
akahatmarios, so we should reduce the value to 15 ?11:39
rlandy|ruckarxcruz|rover: great - thank you11:39
mariosakahat: yeah maybe, or even 30 i guess we should discuss that but 60 days seems high 11:56
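The `rotate N` behaviour marios describes (a log is removed once it has been rotated N times) can be illustrated with a small simulation. This is a sketch only, assuming one rotation per day; `simulate` is a hypothetical helper and does not model `maxage` or compression.

```python
def simulate(rotations, count):
    """Return the days whose rotated logs survive, newest first.

    Each rotation turns the current log into archive .1 and shifts the
    others up by one; anything past position `count` is deleted.
    """
    kept = []
    for day in range(1, rotations + 1):
        kept.insert(0, day)   # today's log becomes archive .1
        del kept[count:]      # archives beyond `count` are removed
    return kept


if __name__ == "__main__":
    for count in (15, 30, 60):
        survivors = simulate(100, count)
        print(f"rotate {count}: {len(survivors)} archives, oldest from day {survivors[-1]}")
```

With daily rotation the retention window in days equals the `rotate` value, which is why lowering it from 60 to 30 or 15 bounds the age of kept logs without a separate `maxage` setting.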
sshnaidmarxcruz|rover, do we still use tempest-sendmail.tripleo.org ?11:59
arxcruz|roversshnaidm: nope 11:59
soniya29chandankumar, kopecmartin, rlandy|ruck, ysandeep|afk, tempest meeting?12:00
*** ysandeep|afk is now known as ysandeep12:01
soniya29chandankumar, ^^12:01
*** amoralej is now known as amoralej|lunch12:02
rlandy|rucksoniya29: ^^ on program call12:02
bhagyashris|roverdviroel, hey i have updated the invite by providing the miro board link 12:04
dviroelbhagyashris|rover: ah, thanks :)12:04
marioslol dviroel i love the happy/sad icons for the retro 12:14
marios:D12:14
dviroelmarios: \o/ 12:15
soniya29rlandy|ruck, no problem12:15
rlandy|ruck Check Default IPv4 Gateway availability | overcloud-controller-0 | error={"changed": false, "cmd": ["ping", "-w", "10", "-c", "1", "10.0.0.1"], "delta": "0:00:03.078052", "end": "2021-10-13 10:32:10.263464", "msg": "non-zero return code", "rc": 1, "start": "2021-10-13 10:32:07.185412", "stderr": "", "stderr_lines": [], "stdout": "PING 10.0.0.1 (10.0.0.1) 56(84) bytes of data.\nFrom 10.0.0.48 icmp_seq=1 Destination Host 12:17
rlandy|ruckUnreachable\nFrom 10.0.0.48 icmp_seq=2 Destination Host Unreachable\nFrom 10.0.0.48 icmp_seq=3 Destination Host Unreachable\n\n--- 10.0.0.1 ping statistics ---\n3 packets transmitted, 0 received, +3 errors, 100% packet loss, time 2047ms\npipe 3", "stdout_lines": ["PING 10.0.0.1 (10.0.0.1) 56(84) bytes of data.", "From 10.0.0.48 icmp_seq=1 Destination Host Unreachable", "From 10.0.0.48 icmp_seq=2 Destination Host 12:17
rlandy|ruckUnreachable", "From 10.0.0.48 icmp_seq=3 Destination Host Unreachable", "", "--- 10.0.0.1 ping statistics ---", "3 packets transmitted, 0 received, +3 errors, 100% packet loss, time 2047ms", "pipe 3"]}12:17
rlandy|ruckoops12:17
rlandy|ruckbhagyashris|rover: arxcruz|rover: ^^ fyi - OVB failures12:17
rlandy|rucktrying rerun to check12:19
bhagyashris|roverrlandy|ruck, ack12:19
rlandy|ruckysandeep: akahat: merging https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/28164412:20
akahatrlandy|ruck, ack12:20
*** jpena|lunch is now known as jpena12:24
rlandy|ruckysandeep: chandankumar: https://review.rdoproject.org/r/c/rdo-jobs/+/3591212:24
rlandy|ruckgoing to merge that12:24
rlandy|ruckok to go?12:24
ysandeeprlandy|ruck, yes12:24
ysandeepI already voted there, Only have +1 on rdo-jobs repo12:25
dviroelakahat: bhagyashris|rover arxcruz|rover frenzy_friday marios rlandy|ruck soniya29 chandankumar pojadhav sshnaidm  ysandeep retro meeting will start in 5 min12:25
arxcruz|roverdviroel: still stuck in the program call meeting 12:25
arxcruz|roveri'll be late 12:25
rlandy|ruckyep12:26
dviroelok12:26
rlandy|rucksame12:26
chandankumarrlandy|ruck: good to go12:26
rlandy|ruckarxcruz|rover: you reporting on the call?12:26
arxcruz|roverrlandy|ruck: yes12:26
dviroelboard link: https://miro.com/app/board/o9J_lrP1x4k=/?invite_link_id=18283331761912:26
rlandy|ruckarxcruz|rover: dropped call - you can as well12:28
rlandy|ruckdviroel; very cute board12:28
dviroel:)12:28
*** ykarel__ is now known as ykarel12:37
arxcruz|roverdviroel: put on some sertanejo 12:41
arxcruz|roverxD 12:42
dviroellolz12:42
*** amoralej|lunch is now known as amoralej13:04
pojadhavReview Request : https://review.rdoproject.org/r/c/rdo-jobs/+/36134 and https://review.rdoproject.org/r/c/rdo-jobs/+/3613313:12
chandankumardviroel: thank you for making awesome retro :-)13:32
*** pojadhav is now known as pojadhav|ruck13:32
dviroelchandankumar: \o/ thank, and thank you all for joining and participating13:33
*** dviroel is now known as dviroel|rover13:33
bhagyashris|roverdviroel, thanks for the retro :)13:33
frenzy_fridayrlandy|ruck, arxcruz|rover can we pls merge https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/813791/ ? Hope it fixes the collect logs failure13:34
pojadhav|ruckbhagyashris|rover, arxcruz|rover, rlandy|ruck : when we are planning for RR hand off.. ?13:34
bhagyashris|roverdviroel|rover, pojadhav|ruck tell your free time...13:34
rlandy|ruckmarios: ysandeep: forwarding the doodle to you13:34
rlandy|ruckif you want to join13:34
bhagyashris|roverit would be great if we do today13:35
dviroel|roverbhagyashris|rover: i'm available in the next 90 min, after that lunch13:35
ysandeeprlandy|ruck, thanks! 13:36
bhagyashris|roverdviroel|rover, ack13:36
bhagyashris|roverrlandy|ruck, dviroel|rover arxcruz|rover pojadhav|ruck  fyi i will take day off tomorrow for some personal work... but i will be available for planning meeting 13:37
*** arxcruz|rover is now known as arxcruz13:37
bhagyashris|roverso that's why it would be great if we do it today13:37
arxcruzbhagyashris|rover: you're the boss, just say the time :) 13:38
rlandy|ruckdviroel|rover: will touch base with you this afternoon to start ruck/rover13:38
dviroel|roverrlandy|ruck: ok13:38
rlandy|ruckpojadhav|ruck: pls touch base with arxcruz 13:38
rlandy|ruckas what to pick up13:38
pojadhav|ruckrlandy|ruck, ack13:39
bhagyashris|roverrlandy|ruck, or else we will meet now and in case if we missed some thing then dviroel|rover will ask you13:39
*** bhagyashris|rover is now known as bhagyashris13:39
rlandy|ruckarxcruz: are you done with the updates/upgrades hash work or you want to hand that on?13:39
rlandy|ruckbhagyashris: I am on prod chain - feel free to meet with other ruck/rovers now w/o me13:40
arxcruzrlandy|ruck: i'm still testing, the job variable wasn't available, not sure if it's because it fails on the content-provider job or not 13:40
bhagyashrisrlandy|ruck, sure thanks13:40
arxcruzrlandy|ruck: i would like to continue with that at least until monday 13:40
rlandy|ruckarxcruz: k- pls do13:40
bhagyashrispojadhav|ruck, dviroel|rover arxcruz let's meet https://meet.google.com/xnf-tvdh-pmk?authuser=013:40
bhagyashrispojadhav|ruck, dviroel|rover https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ansible-centos-8-molecule-tripleo_sshd13:46
Tengutosky, marios still working a bit on the regexp - I have to make it a bit looser, the content of an AVC line might change and include (or not) some details.13:58
mariosTengu: thanks i have it in my list for tomorrow morning reviews 14:00
Tengumarios: trying to get it stable before my long weekend14:01
Tenguthere's a good job example directly in the project CI, just perfect.14:02
rlandy|ruckbhagyashris: hey - just following up on https://review.opendev.org/c/zuul/zuul-jobs/+/81374914:11
rlandy|ruckbhagyashris: have you pinged anyone in openstack-infra to review it?14:12
rlandy|ruckdviroel|rover: pojadhav|ruck: you all set?14:22
dviroel|roverrlandy|ruck: yep, w14:25
dviroel|roverwill follow up next cix call 14:25
Tengufrenzy_friday: is there anything to do/wait now that the "no valid host" patch merged?14:27
Tengufrenzy_friday: I apparently still hit it now: https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_c9f/813777/5/check/openstack-tox-molecule/c9f6a8d/tox/reports.html - but maybe there's some automated things that will kick later?14:27
rlandy|ruckok14:27
rlandy|ruckdviroel|rover: just testproject'ed a bunch of failing promotion tests14:28
rlandy|ruckall except master14:28
rlandy|ruckOVB was a bit funky - we need to keep an eye on that14:28
dviroel|roverlet me check master then14:29
dviroel|roverack14:29
pojadhav|ruckrlandy|ruck, yup14:48
pojadhav|ruckrlandy|ruck, leaving for the day, pls leave some notes for me if I need to keep eye on anything. will catch up from tomorrow morning.14:51
marioschandankumar: rlandy|ruck: ** tripleo-ci please add to reviews list https://review.opendev.org/c/openstack/tripleo-ci/+/808177 14:52
ysandeepmarios, ack14:54
*** ysandeep is now known as ysandeep|dinner14:55
frenzy_fridayTengu, hm.. looks like something else is wrong in the sova file. It should have worked right after merging. Lemme check14:58
* dviroel|rover lunch, brbr15:01
chandankumardviroel|rover: rlandy|ruck please get these two https://review.opendev.org/c/openstack/tripleo-common/+/800580/68 and https://review.opendev.org/c/openstack/tripleo-common/+/813745 merged15:05
chandankumarmarios: may be tomorrow15:06
chandankumarsee ya!15:06
marioschandankumar: ack adding to list15:08
*** ykarel is now known as ykarel|away15:12
bhagyashrisrlandy|ruck, nope i have just added the infra core in the reviewers list15:19
rlandy|ruckbhagyashris: np - I pinged fungi15:20
bhagyashrisrlandy|ruck, ack thanks 15:20
rlandy|ruckmarios: is https://review.opendev.org/c/openstack/diskimage-builder/+/806819 ready for merge again?15:21
rlandy|ruckif so, will ping ianw and stevebaker15:22
rlandy|ruckwhen they come on line15:22
mariosrlandy|ruck: looking15:24
mariosah i didn't revisit after this morning 15:24
mariosrlandy|ruck: job is failing i mean the one that ianw wanted to check 15:24
rlandy|ruckmarios: no worries - it's just a depends_on on the patch you wanted merge15:25
rlandy|ruckk15:25
mariosrlandy|ruck: right15:25
rlandy|ruckmarios: story of our lives, right :) 15:25
mariosrlandy|ruck: ack i'll check it again in the morning see whats up15:26
rlandy|ruckmarios: cool - thanks15:26
frenzy_fridayTengu, found the issue: The file it looks for is generated by sova as "_No_valid_host_was_found__pip_conflicting_dependencies.log" This is probably because in the files that sova looks into (https://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/roles/collect_logs/tasks/sova.yml#L17-L21) there is indeed an error related to pip15:31
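One way the observed file name could arise is if every matched failure reason contributes to the name. The sketch below is a guess for illustration, not sova's actual code: `report_name` and the exact reason strings (including their leading space, which Tengu noticed earlier) are assumptions chosen to reproduce the two names quoted in this log.

```python
def report_name(reasons):
    """Build a log file name from the matched failure reasons.

    Hypothetical naming rule: spaces become underscores and the reasons
    are joined with a single underscore.
    """
    return "_".join(reason.replace(" ", "_") for reason in reasons) + ".log"


if __name__ == "__main__":
    # Only the nova pattern matches: the name the molecule test expects.
    print(report_name([" No valid host was found"]))
    # A second, unrelated pattern also matches: the name actually produced.
    print(report_name([" No valid host was found", " pip conflicting dependencies"]))
```

Under this rule a test that looks for the single-reason file fails as soon as any other pattern matches in the scanned files, even though the nova pattern itself was detected.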
*** marios is now known as marios|out15:33
frenzy_friday^ looking for a way to solve it https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/81385315:40
rlandy|ruckysandeep|dinner: chandankumar: fyi ... breakages on downstream with new container build code https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-16.2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-build-containers-ubi-8-internal-rhel-8-build-push-upload-rhos-16.2/e12144f/logs/build.log15:46
rlandy|ruck openstack tripleo container image build: error: unrecognized arguments: --tcib-extra tcib_release=8 --tcib-extra tcib_python_version=3.615:46
rlandy|ruckbest fix there?15:46
rlandy|ruckdviroel|rover: pojadhav|ruck: ^^ fyi15:47
rlandy|ruckwallaby still failing network provision15:50
rlandy|ruckNetwork configuration file does not exist: /usr/share/openstack-tripleo-heat-templates/network-data-samples/default-network-isolation.yaml15:50
rlandy|ruckjenkins jobs15:50
frenzy_fridayrlandy|ruck, http://ci-health.tripleo.org/16:07
frenzy_fridayhttp://ci-health.tripleo.org/:500116:07
*** ysandeep|dinner is now known as ysandeep16:10
ysandeeprlandy|ruck, reading back16:10
rlandy|ruckin meeting16:10
rlandy|ruckpls see logs16:10
* ysandeep looking16:13
ysandeeprlandy|ruck: proposing fix16:17
*** amoralej is now known as amoralej|off16:22
ysandeepchandankumar, rlandy|ruck fix is up: https://review.opendev.org/c/openstack/tripleo-ci/+/813863 16:28
ysandeepi had tested 17 container build with chandan patch here.. https://code.engineering.redhat.com/gerrit/c/testproject/+/211643 .. I should have included 16.2 as well in my test16:29
Tengufrenzy_friday: ah cool, hopefully things will be good soon then :)16:30
*** jpena is now known as jpena|off16:30
ysandeeprlandy|ruck, when you get free, what's the issue in wallaby, which job is failing?16:31
rlandy|ruckysandeep: Network configuration file does not exist: /usr/share/openstack-tripleo-heat-templates/network-data-samples/default-network-isolation.yaml16:31
rlandy|ruckwill ping when off meeting16:31
ysandeepack16:31
* ysandeep running testproject in 16.2 in the meantime to test the fix16:31
rlandy|ruckysandeep: hey - off meeting16:37
ysandeeprlandy|ruck, which job failing for wallaby, minimal?16:37
rlandy|ruckysandeep: ack16:38
rlandy|ruckI haven't checked yet  if master is failing the same way16:38
rlandy|rucklooking now16:38
rlandy|ruck+ /tmp/workspace/tripleo-quickstart-promote-master-current-tripleo-delorean-minimal/bin/cico node get --arch x86_64 --release 8-stream --count 1 --retry-count 6 --retry-interval 60 -f csv16:40
rlandy|ruck+ sed 1d16:40
rlandy|ruckstring indices must be integers16:40
rlandy|ruckmaster may be infra failure16:40
rlandy|ruck+ echo 'FATAL: no nodes were provisioned'16:41
rlandy|ruckFATAL: no nodes were provisioned16:41
rlandy|ruck+ exit 116:41
ysandeeprlandy|ruck, testing 16.2 container build fix here: https://code.engineering.redhat.com/gerrit/c/testproject/+/19067216:43
rlandy|ruckysandeep: master looks like infra stuff16:43
* ysandeep looking at minimal jobs now16:43
rlandy|ruckpinged on rdo16:43
rlandy|ruckysandeep: for minimal ...16:44
rlandy|ruckhttps://github.com/openstack/tripleo-heat-templates/blob/stable/wallaby/network-data-samples/default-network-isolation.yaml16:45
rlandy|ruckshould be there16:45
rlandy|ruckhttps://artifacts.ci.centos.org/rdo/jenkins-tripleo-quickstart-promote-wallaby-current-tripleo-delorean-minimal-60/undercloud/usr/share/openstack-tripleo-heat-templates/16:46
rlandy|ruckha16:46
rlandy|rucknot16:46
rlandy|ruckysandeep: ^^ not there16:46
rlandy|ruckcorrect16:46
ysandeeprlandy|ruck, should be there: https://github.com/openstack/tripleo-heat-templates/blob/stable/wallaby/network-data-samples/default-network-isolation.yaml 16:47
rlandy|ruckysandeep: not installed16:48
ysandeeplet me check what version of tht is installed there16:48
rlandy|ruckysandeep: installed in master jobs16:52
rlandy|ruckhttps://artifacts.ci.centos.org/rdo/jenkins-tripleo-quickstart-promote-master-current-tripleo-delorean-minimal-50/undercloud/usr/share/openstack-tripleo-heat-templates/16:52
ysandeepinstalled tht on wallaby is quite recent: openstack-tripleo-heat-templates.noarch       14.3.1-0.20211010015303.f556f24.el8       @delorean-component-tripleo16:53
rlandy|ruckright - but it's not copying that dir16:53
ysandeepyeah... and patch was merged in march: https://review.opendev.org/c/openstack/tripleo-heat-templates/+/765218 16:54
ysandeeplet me compare that dir from one of our wallaby job16:55
rlandy|ruckysandeep: we don't collect /usr/share there :(16:56
rlandy|ruckI looked16:57
rlandy|ruckysandeep: whatever is doing the copying is not copying that folder16:57
rlandy|ruckysandeep: creating bug - need to roll to next meeting16:58
ysandeeprlandy|ruck, let me troubleshoot for a while.. i will write a bug if i cannot figure out16:58
rlandy|ruckhttps://bugs.launchpad.net/tripleo/+bug/194701517:03
rlandy|ruckysandeep: ^^ to track17:03
ysandeeprlandy|ruck, thanks! I extracted the rpm locally.. indeed that dir is not present in rpm itself17:03
ysandeeprlandy|ruck, fix merged in wallaby 2 days ago https://review.rdoproject.org/r/c/openstack/tripleo-heat-templates-distgit/+/35974 17:18
ysandeeprpm distgit issue... that directory was not packaged17:18
ysandeepconfirmed issue is now resolved in latest tht rpm (tripleo component currently at consistent hash)17:22
rlandy|ruckysandeep: nice17:25
rlandy|ruckwe should see that resolved with next run then17:25
ysandeepyes17:25
dviroel|rovernice debug :)17:29
rlandy|ruckdviroel|rover: leaving https://bugs.launchpad.net/tripleo/+bug/1947015 open17:31
rlandy|ruckuntil we see a clean run17:31
dviroel|roverack17:31
rlandy|ruckI didn't mark it promotion-blocker17:32
rlandy|ruckas it may resolve in next run17:32
rlandy|ruckdviroel|rover: https://review.opendev.org/c/zuul/zuul-jobs/+/813749 got w+'ed so that's good17:50
rlandy|ruckwe can revert the patch to make the c7 jobs non-voting after that merges17:50
dviroel|rovergreat17:52
rlandy|ruckdviroel|rover: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/78236217:52
rlandy|ruck^^ we need to get that working17:52
rlandy|ruckfor the content-provider failures pulling non-tripleo containers17:53
* dviroel|rover looking17:56
* ysandeep out for the day, see you tomorrow o/18:18
*** ysandeep is now known as ysandeep|out18:19
* rlandy|ruck lunch - brb18:19
dviroel|roverrlandy|ruck: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3597519:18
rlandy|ruckdviroel|rover: nice - merging19:18
dviroel|roverrlandy|ruck: https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-train19:27
dviroel|roverfailed twice in "openstack overcloud node introspect"19:27
dviroel|roverbut in the end, introspection was a success https://logserver.rdoproject.org/95/24995/118/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-train/c61f37b/logs/undercloud/home/zuul/overcloud_introspect.log.txt.gz19:28
rlandy|ruckhttps://logserver.rdoproject.org/95/24995/118/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-train/c61f37b/logs/undercloud/home/zuul/overcloud_introspect.log.txt.gz19:29
dviroel|roverjob that passes https://logserver.rdoproject.org/openstack-periodic-integration-stable4/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-train/a3f0078/logs/undercloud/home/zuul/overcloud_introspect.log.txt.gz19:29
rlandy|ruckIntrospection of node completed:de85b2cd-8ba3-4769-a76f-76466b612142. Status:FAILED. Errors:Introspection timeout19:31
rlandy|ruckright ^^19:31
rlandy|ruckso - you're asking if we should write that up yet as a bug?19:32
rlandy|ruckor debug/fix it?19:32
rlandy|ruckdviroel|rover: ^^?19:32
rlandy|ruckit's running again now in testproject19:33
rlandy|ruckand in the line19:33
rlandy|ruckif either of them fails, yes19:33
dviroel|roveryes, if we should give another try19:33
dviroel|roveryeah, there is an extra 'timeout' error in the failing job, we should try again 19:35
rlandy|ruckdviroel|rover: yeah if current runs fail - we'll bug and investigate19:52
rlandy|ruckdviroel|rover: pls vote on https://review.opendev.org/c/openstack/tripleo-ci/+/81386320:25
rlandy|ruckand then we can merge that20:25
dviroel|roverdone20:37
dviroel|roverrlandy|ruck: https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-standalone-full-tempest-api-master20:49
dviroel|roverlots of tempest tests failing20:50
rlandy|ruckone sec - just resubmitting patch20:51
dviroel|roverah, mysqld daemon is restarting20:52
dviroel|roverhttps://logserver.rdoproject.org/32/35132/11/check/periodic-tripleo-ci-centos-8-standalone-full-tempest-api-master/94ee759/logs/undercloud/var/log/containers/mysql/mysqld.log.txt.gz20:54
dviroel|rover2021-10-13 16:36:02 3038 [ERROR] InnoDB: WSREP: referenced FK check fail: Lock wait index `PRIMARY` table `ovs_neutron`.`securitygroupportbindings`20:54
dviroel|rover211013 16:42:17 mysqld_safe Starting mysqld daemon with databases from /var/lib/mysql20:54
rlandy|ruckdviroel|rover: looking20:55
dviroel|roverhttps://logserver.rdoproject.org/32/35132/11/check/periodic-tripleo-ci-centos-8-standalone-full-tempest-api-master/94ee759/logs/undercloud/var/log/containers/keystone/keystone.log.txt.gz20:56
dviroel|rover2021-10-13 16:40:11.186 210 ERROR keystone.server.flask.request_processing.middleware.auth_context [req-17ded4ab-4d62-46a8-b09f-6a83afa5fe9a 8a0a462af25248b99f6b44cf33b2c205 08812bde6bc74d3ba52eb6d5ff8b46ef - default default] (pymysql.err.OperationalError) (2013, 'Lost connection to MySQL server during query')20:56
rlandy|ruckdviroel|rover: oh wow - we got some failures there20:57
rlandy|ruckyep - let's get a bug on that20:57
dviroel|rover\o/20:57
rlandy|ruckyou can mark it promotion-blocker20:58
dviroel|roverack20:58
rlandy|ruckwe'll close it if the current run does not show that20:58
rlandy|ruckset up issue20:58
* rlandy|ruck checking previous runs20:59
rlandy|ruckhmmm ...21:00
rlandy|ruckchecking if we have diff tests failing here21:00
rlandy|ruckdviroel|rover: so the interesting thing here is ....21:01
rlandy|ruckwe have diff setup tests failing each time:21:02
rlandy|rucksetUpClass (tempest.api.compute.servers.test_instance_actions21:02
rlandy|rucksetUpClass (keystone_tempest_plugin.tests.api.identity.v3.test_mapping_rules21:02
rlandy|rucksetUpClass (cinder_tempest_plugin.api.volume.test_create_from_image21:02
rlandy|ruckin each of the three failing runs21:02
rlandy|rucknevertheless, let's get a tracking bug on this21:03
dviroel|roverrlandy|ruck: https://logserver.rdoproject.org/32/35132/11/check/periodic-tripleo-ci-centos-8-standalone-full-tempest-api-master/94ee759/logs/undercloud/var/log/containers/mysql/mysqld.log.txt.gz 21:03
dviroel|roveri think the mysqld errors are the same21:03
dviroel|roverneutron related21:03
rlandy|ruckyeah, possible it hits a different test21:04
* dviroel|rover not sure if I checked all 3 jobs21:04
rlandy|ruckeither way21:04
dviroel|roverwe need new milestones in LP21:10
dviroel|roverrlandy|ruck: https://bugs.launchpad.net/tripleo/+bug/194705021:13
rlandy|ruckdviroel|rover: ack - you are right21:14
rlandy|ruckasking slagle21:14
* dviroel|rover going out in a min, but will be around21:19
rlandy|ruckdviroel|rover: nice catch!21:33
*** rlandy|ruck is now known as rlandy|ruck|bbl22:28
-opendevstatus- NOTICE: Both Gerrit and Zuul services are being restarted briefly for minor updates, and should return to service momentarily; all previously running builds will be reenqueued once Zuul is fully started again22:49
