Friday, 2024-03-22

<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [00:44]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [01:35]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [02:05]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [02:05]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [03:09]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [03:21]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [04:02]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [04:30]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [04:30]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [05:20]
<opendevreview> Verification of a change to openstack/kolla-ansible stable/2023.2 failed: CI: Increase galera node timeouts https://review.opendev.org/c/openstack/kolla-ansible/+/913810 [07:40]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [09:26]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [09:26]
<opendevreview> Ivan Halomi proposed openstack/kolla-ansible master: Refactor of kolla_container_facts https://review.opendev.org/c/openstack/kolla-ansible/+/911417 [09:32]
<opendevreview> Ivan Halomi proposed openstack/kolla-ansible master: Move actions to kolla_container_facts https://review.opendev.org/c/openstack/kolla-ansible/+/911505 [09:33]
<SvenKieske> kevko: what is wrong with parted that we need to loop there? oO https://review.opendev.org/c/openstack/kolla-ansible/+/913942/8/tests/setup_disks.sh [09:33]
<kevko> SvenKieske: i don't know ..i was trying to wait ...because there was a fail that it's used ... [09:46]
<kevko> SvenKieske: /dev/* is used ..try to restart ..blablabla [09:46]
<kevko> SvenKieske: but this is very weird ... /tmp/zuul-logs.QmYhf3/primary/logs/kolla/all-WARNING.txt:2024-03-22 05:55:41.558 7 WARNING nova.compute.manager [None req-a9925dbc-4810-4354-b54c-269e484e048b - - - - - -] [instance: fda729ca-e859-45e0-b02d-72c657b0469e] Instance shutdown by itself. Calling the stop API. Current vm_state: active, current [09:46]
<kevko> task_state: None, original DB power_state: 1, current VM power_state: 4 [09:46]
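The numeric power states in the warning above can be decoded with a small sketch like the following. The mapping mirrors the constants in nova.compute.power_state; the regular expression and the helper name describe_power_change are assumptions made for illustration and are not part of nova or kolla-ansible.

```python
import re

# Numeric values as defined in nova.compute.power_state.
POWER_STATES = {
    0: "NOSTATE",
    1: "RUNNING",
    3: "PAUSED",
    4: "SHUTDOWN",
    6: "CRASHED",
    7: "SUSPENDED",
}

# Hypothetical pattern, written against the wording of the warning quoted above.
STATE_RE = re.compile(
    r"original DB power_state: (\d+), current VM power_state: (\d+)"
)


def describe_power_change(message):
    """Translate the two power_state numbers in a nova warning into names."""
    match = STATE_RE.search(message)
    if match is None:
        return None
    db_state, vm_state = (int(value) for value in match.groups())
    return "{} -> {}".format(
        POWER_STATES.get(db_state, "UNKNOWN"),
        POWER_STATES.get(vm_state, "UNKNOWN"),
    )


# The warning is split over two IRC messages in the log; it is joined here.
message = (
    "Instance shutdown by itself. Calling the stop API. "
    "Current vm_state: active, current task_state: None, "
    "original DB power_state: 1, current VM power_state: 4"
)
print(describe_power_change(message))  # RUNNING -> SHUTDOWN
```

Read this way, the warning says that the database still recorded the instance as running while the hypervisor reported it as shut down.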
<opendevreview> Ivan Halomi proposed openstack/kolla-ansible master: Merge of container_facts modules https://review.opendev.org/c/openstack/kolla-ansible/+/912460 [09:47]
<kevko> SvenKieske: there is something weird in upgrade process [09:49]
<kevko> SvenKieske: https://paste.openstack.org/show/b7yV9dJXuLYl6wUaJrAt/ [09:49]
<SvenKieske> I'm also currently trying to figure out if our CI is green or if there is something newly broken again :D [09:50]
<SvenKieske> your ubuntu-ovn job also fails, interesting [09:50]
<kevko> SvenKieske: there is tempest directory ...tempests are passing ... [09:51]
<SvenKieske> + openstack server create --wait --image cirros --flavor m1.tiny --key-name mykey --network demo-net lb_member --wait [09:51]
<kevko> (most of the time) [09:51]
<SvenKieske> Request to http://192.0.2.10:8774/v2.1/servers/7951da63-e0f6-4198-80a4-85e8b7caa2e9 timed out [09:51]
<SvenKieske> https://zuul.opendev.org/t/openstack/build/affe567db343421c8bcf6d32fa649499/log/primary/logs/ansible/test-ovn [09:52]
<kevko> now im looking into https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_f92/913728/24/check/kolla-ansible-debian-upgrade/f9279f1/primary/logs/ansible/test-core-openstack-upgrade <<< [09:54]
<kevko> kolla-upgrade-test instance is shut down after upgrade .... i reaaaaallly don't know why [09:55]
<kevko> even if I've already slept ...i can't get it :D [09:55]
<SvenKieske> looking at the IRC history you couldn't have slept that much? :D [09:59]
<opendevreview> Merged openstack/kolla-ansible stable/2023.2: CI: Increase galera node timeouts https://review.opendev.org/c/openstack/kolla-ansible/+/913810 [10:02]
<kevko> SvenKieske: Nope, I'm not used to going to sleep until I solve something, I fell asleep involuntarily :D [10:03]
<kevko> SvenKieske: can u give some advice ? hint ? [10:03]
<SvenKieske> not really, are there any CI related automatisms that shut down VMs? was there some maintenance ongoing (I don't think so, that would be announced). Maybe the host itself had a problem. I never have seen this: [10:06]
<SvenKieske> Instance shutdown by itself. Calling the stop API [10:06]
<kevko> aaaa [10:06]
<kevko> this is another job ...debian classic [10:06]
<kevko> https://7becaa41bf65a7eaafce-2bb61388e3d961ebec04f637f3a6e452.ssl.cf5.rackcdn.com/913728/24/check/kolla-ansible-debian/a0739a6/primary/logs/tempest/reports/tempest-smoke.1.html [10:06]
<kevko> tempest.lib.exceptions.IdentityError: Got identity error [10:07]
<kevko> Details: Unexpected status code 503 [10:07]
<SvenKieske> why print a nice error message containing the error code when you also can just produce a stacktrace? :D [10:07]
<SvenKieske> 503 so there's clearly something wrong with the webserver or the backend app [10:08]
<kevko> SvenKieske: something with neutron maybe ? .. I can see some errors .. [10:09]
<kevko> SvenKieske: there are also bunch of keystone not found errors ....trust ...domain ... role ...user .... [10:09]
<kevko> SvenKieske: wondering ..if some of them can't cause problems [10:10]
<SvenKieske> there is afaik still my patch missing..wait [10:12]
<SvenKieske> mhm but that was only on 2023.2 branch: https://review.opendev.org/c/openstack/kolla-ansible/+/912452 [10:13]
<SvenKieske> Fl1nt had some other keystone related patches [10:13]
<SvenKieske> Fl1nt: sorry btw, I still had not the time to look into your patches! [10:14]
<frickler> for stopped instances, check whether qemu got oom killed? [10:14]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [10:14]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [10:14]
<SvenKieske> frickler: I was about to suggest the same, but afaik that doesn't log "instance shutdown itself"? or does it? [10:15]
<frickler> I think it could [10:15]
<SvenKieske> I have seen many oom kills, admittedly I never logged at those logs then, because from an ops perspective one of the first things I check is dmesg/kernel and then you don't need to look any further. [10:16]
<SvenKieske> looked* [10:16]
<frickler> because from nova perspective this cannot be distinguished from doing poweroff from within the instance [10:16]
<kevko> frickler: please, If you can check -> https://zuul.opendev.org/t/openstack/build/f9279f19ca5249a7918946131827ba1c <- [10:17]
<SvenKieske> well, strictly speaking it could. because qemu should get ENOMEM and could bubble up that to nova, no? :) [10:17]
<SvenKieske> well, in overcommit scenarios (so the usual scenario) I guess ENOMEM might not be reliable [10:18]
<frickler> bingo: Mar 22 05:55:26 primary kernel: oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=user.slice,mems_allowed=0,global_oom,task_memcg=/machine/qemu-12-instance-0000000a.libvirt-qemu,task=qemu-system-x86,pid=113588,uid=42436 [10:18]
<frickler> that's in syslog.txt from kevko's upgrade job above [10:18]
<frickler> but it's not qemu that triggered it, so really difficult to handle: Mar 22 05:55:26 primary kernel: python invoked oom-killer: gfp_mask=0x100cca(GFP_HIGHUSER_MOVABLE), order=0, oom_score_adj=0 [10:20]
<SvenKieske> mhm [10:20]
<frickler> anyway, the solution is run less stuff in test jobs or only in multinode jobs [10:20]
<frickler> oh, wait, is that for the tempest test? does it run with --serial? else I would not be surprised by this happening [10:21]
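As a rough illustration of the syslog check described above, the following sketch pulls the victim's details out of a kernel oom-kill line. The field names come from the line frickler quoted; the regular expressions and the helper name parse_oom_kill are hypothetical, and the assumption that the cgroup path embeds the libvirt domain name is based only on that one quoted line.

```python
import re

# Hypothetical pattern built from the fields visible in the quoted kernel line.
OOM_KILL_RE = re.compile(
    r"oom-kill:.*?task_memcg=(?P<memcg>[^,]+),"
    r"task=(?P<task>[^,]+),pid=(?P<pid>\d+),uid=(?P<uid>\d+)"
)


def parse_oom_kill(line):
    """Return the killed task's details from one oom-kill line, or None."""
    match = OOM_KILL_RE.search(line)
    if match is None:
        return None
    memcg = match.group("memcg")
    # Assumption: the cgroup path contains the libvirt domain name,
    # as in /machine/qemu-12-instance-0000000a.libvirt-qemu.
    domain = re.search(r"instance-[0-9a-f]+", memcg)
    return {
        "task": match.group("task"),
        "pid": int(match.group("pid")),
        "uid": int(match.group("uid")),
        "memcg": memcg,
        "instance": domain.group(0) if domain else None,
    }


line = (
    "Mar 22 05:55:26 primary kernel: oom-kill:constraint=CONSTRAINT_NONE,"
    "nodemask=(null),cpuset=user.slice,mems_allowed=0,global_oom,"
    "task_memcg=/machine/qemu-12-instance-0000000a.libvirt-qemu,"
    "task=qemu-system-x86,pid=113588,uid=42436"
)
victim = parse_oom_kill(line)
print(victim["task"], victim["pid"], victim["instance"])
# qemu-system-x86 113588 instance-0000000a
```

This only identifies the process that was killed. As noted in the discussion, the process that invoked the OOM killer appears on a separate "invoked oom-killer" line and may be unrelated to qemu.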
<SvenKieske> did anybody talk btw about the recent python CVEs? I believe there was a memory corruption/leak also in there, maybe someone already triggered it? :D [10:21]
<frickler> I haven't seen those yet, do you have a link? [10:22]
<SvenKieske> ok, tempest tests running in parallel might be the culprit. [10:22]
<SvenKieske> they are on oss-sec, since 2 days I think, wait.. [10:22]
<SvenKieske> https://seclists.org/oss-sec/2024/q1/240 [10:22]
<SvenKieske> 2 days ago [10:22]
<SvenKieske> with regular updates everybody should be secure. everybody does regular updates, right? lol (sorry I'm cynical when it comes to infosec) [10:23]
<SvenKieske> I also did no analysis how grave these are, I usually read them when lying in bed :D [10:24]
<SvenKieske> bundled libexpat, zipfile and a crash in socket-if_indextoname and the ssl stuff look most promising for exploits I guess. the windows only stuff should be boring [10:26]
<SvenKieske> dunno about the codecs, their names are cryptic and I don't know them by name. [10:28]
<frickler> at least ubuntu doesn't have new pkgs yet, also no reference to those cves, hmm [10:29]
<SvenKieske> is python community maintained on ubuntu? I wouldn't be surprised. [10:32]
<SvenKieske> no, it is maintained by "Ubuntu Core Developers" https://packages.ubuntu.com/jammy/python3 [10:34]
<SvenKieske> I just asked over in #ubuntu-security on libera chat if anybody cares :) they have really not even a public launchpad bug [10:37]
<SvenKieske> I bet they have some private stuff about it somewhere. [10:37]
<SvenKieske> this is another datapoint to not use distro packages @kevko ;) seems they are not up to speed when it comes to security updates. [10:40]
<SvenKieske> and I write that as someone who actually likes the distro dev model itself. :( [10:40]
<opendevreview> Michal Nasiadka proposed openstack/kolla-ansible master: common: Stop using transitional labels https://review.opendev.org/c/openstack/kolla-ansible/+/913960 [10:46]
<SvenKieske> mhm, found a bug, nice :D [10:52]
<opendevreview> Maksim Malchuk proposed openstack/kayobe master: Fix the glob for the custom RabbitMQ configuration https://review.opendev.org/c/openstack/kayobe/+/909113 [10:53]
<SvenKieske> it's a bug in a shellscript ;) but no CI would've caught that I guess :D [10:53]
<opendevreview> Sven Kieske proposed openstack/kolla-ansible master: CI: fix check-failure.sh sudo missing https://review.opendev.org/c/openstack/kolla-ansible/+/913961 [10:54]
<opendevreview> Sven Kieske proposed openstack/kolla-ansible master: Add ovn-exporter https://review.opendev.org/c/openstack/kolla-ansible/+/855498 [10:58]
<kevko> frickler: no it's not tempest ..it's regular bash instance ...tempests ran all OK [10:59]
<SvenKieske> well scratch that, CI would've found this if we would have tests for our tests :D but no linter would've caught it, was what I meant. [10:59]
<mnasiadka> master complainers those Kolla developers [11:13]
<mnasiadka> :) [11:13]
<opendevreview> Matt Crees proposed openstack/kayobe master: CI: rework RMQ steps for queue migrations https://review.opendev.org/c/openstack/kayobe/+/913878 [11:15]
<opendevreview> Matt Crees proposed openstack/kayobe master: Bump KA Ansible versions to match new defaults https://review.opendev.org/c/openstack/kayobe/+/913571 [11:15]
<opendevreview> Matt Crees proposed openstack/kayobe master: Bump up Ansible supported versions to 8.x/9.x https://review.opendev.org/c/openstack/kayobe/+/910513 [11:15]
<opendevreview> Matt Crees proposed openstack/kayobe master: Use new collections in Kayobe https://review.opendev.org/c/openstack/kayobe/+/910742 [11:15]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [11:24]
<kevko> btw, why we are stopping this in upgrade jobs ? [11:39]
<kevko> kolla-ansible -i /etc/kolla/inventory -vvv stop --tags heat,keystone,neutron,nova --yes-i-really-really-mean-it [11:39]
<kevko> If i will stop neutron and nova on production upgrade ..i am fired :D [11:39]
<SvenKieske> follow the git? :D [11:41]
<SvenKieske> I always look at git blame and then find an unrelated commit ;) [11:41]
<kevko> SvenKieske: I do that too, but today I need more to calm down and distract myself rather than to get even more stressed out :D [11:43]
<SvenKieske> xD I can relate to that [11:43]
<kevko> :D [11:44]
<kevko> https://github.com/openstack/kolla-ansible/blob/80253410234cc74aab25af0f9bb90f1cbb764295/tests/upgrade.sh#L17-L19 it would probably be good to add a condition that checks if they are turned on or not... and based on that, at least write an echo... so I don't have to look for it and it's in the test log :( [11:46]
<SvenKieske> sounds like a good idea [11:47]
<kevko> SvenKieske I know you're not a fan of bash, neither am I, however... if you've seen any of my bash scripts in kolla or kolla-ansible... then you know that I do it... and it's fine to know what script is doing ...especially if it fails ... :) [11:49]
<kevko> did we have btw swift scenario broken for a longer time right ? [11:50]
<opendevreview> Michal Nasiadka proposed openstack/kolla-ansible master: common: Stop using transitional labels https://review.opendev.org/c/openstack/kolla-ansible/+/913960 [11:54]
<opendevreview> Verification of a change to openstack/kolla-ansible stable/2023.2 failed: fix creation of keystone service role on upgrade https://review.opendev.org/c/openstack/kolla-ansible/+/912452 [11:55]
<kevko> anyone for approve ? https://review.opendev.org/c/openstack/kolla-ansible/+/913653 [12:15]
<opendevreview> Michal Nasiadka proposed openstack/kolla-ansible master: common: Stop using transitional labels https://review.opendev.org/c/openstack/kolla-ansible/+/913960 [12:57]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [12:59]
<opendevreview> Merged openstack/kolla-ansible master: Add conditionals for IPv6 sysctl settings https://review.opendev.org/c/openstack/kolla-ansible/+/905831 [13:26]
<opendevreview> Verification of a change to openstack/kolla-ansible stable/2023.2 failed: fix creation of keystone service role on upgrade https://review.opendev.org/c/openstack/kolla-ansible/+/912452 [13:31]
<opendevreview> Pierre Riteau proposed openstack/kayobe master: Bump cirros images to 0.5.3 https://review.opendev.org/c/openstack/kayobe/+/913976 [13:59]
<opendevreview> Pierre Riteau proposed openstack/kayobe master: CI: Bump cirros images to 0.5.3 https://review.opendev.org/c/openstack/kayobe/+/913976 [14:00]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [14:02]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [14:23]
<kevko> SvenKieske: do we need this also in master ? [14:23]
<kevko> https://review.opendev.org/c/openstack/kolla-ansible/+/912452 [14:23]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [14:29]
<SvenKieske> kevko: see the linked bug report: https://bugs.launchpad.net/kolla-ansible/+bug/2056761 "when upgrading from older releases the service role introduced by [1] in kolla-ansible might not be there." [14:33]
<SvenKieske> so that should only be an issue for old releases, so it depends how many releases you jump into the future during upgrade [14:34]
<SvenKieske> I guess the short answer is: no :) [14:35]
<opendevreview> Maksim Malchuk proposed openstack/kolla-ansible stable/2023.2: Add conditionals for IPv6 sysctl settings https://review.opendev.org/c/openstack/kolla-ansible/+/913818 [14:38]
<opendevreview> Maksim Malchuk proposed openstack/kolla-ansible stable/2023.1: Add conditionals for IPv6 sysctl settings https://review.opendev.org/c/openstack/kolla-ansible/+/913819 [14:39]
<opendevreview> Maksim Malchuk proposed openstack/kolla-ansible stable/zed: Add conditionals for IPv6 sysctl settings https://review.opendev.org/c/openstack/kolla-ansible/+/913820 [14:40]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [14:41]
<opendevreview> Merged openstack/kolla-ansible master: CI: fix check-failure.sh sudo missing https://review.opendev.org/c/openstack/kolla-ansible/+/913961 [15:12]
<opendevreview> Olivier Delhomme proposed openstack/kolla-ansible master: kolla-ansible now uses default inventory file https://review.opendev.org/c/openstack/kolla-ansible/+/913993 [15:31]
<kevko> tempest looks better and better :) [15:58]
<kevko> okay, why I am failing here ? https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_6ac/913728/27/check/kolla-ansible-rocky9-multinode-ipv6/6acfa58/primary/logs/tempest/reports/index.html [15:59]
<kevko> >> https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_6ac/913728/27/check/kolla-ansible-rocky9-multinode-ipv6/6acfa58/ [15:59]
<frickler> Bad floatingip request: Network 9fc7cc0e-c40c-44e5-b9a3-72f028b3a28c does not contain any IPv4 subnet. hmm [16:00]
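The error above is about the address family of the subnets on the network. A minimal sketch of that kind of check, using only Python's standard ipaddress module, could look like this. The function name has_ipv4_subnet and the example CIDRs (documentation ranges) are hypothetical and are not taken from the job or from Neutron's code.

```python
import ipaddress


def has_ipv4_subnet(cidrs):
    """Return True if at least one of the given subnet CIDRs is IPv4."""
    return any(ipaddress.ip_network(cidr).version == 4 for cidr in cidrs)


# Hypothetical subnet lists for an IPv6-only and a dual-stack network.
ipv6_only = ["2001:db8::/64"]
dual_stack = ["2001:db8::/64", "192.0.2.0/24"]

print(has_ipv4_subnet(ipv6_only))   # False
print(has_ipv4_subnet(dual_stack))  # True
```

In an IPv6-only job, a test that requests a floating IP on a network like the first one would be expected to fail in the way the message describes.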
<kevko> frickler: interesting no ? :) [16:07]
<kevko> frickler: let me rework little bit the script as i have one idea ... [16:07]
<kevko> frickler: btw, did we have swift scenario working ? :D [16:07]
<frickler> kevko: I have no idea, I have no interest in swift, ceph it is for me [16:08]
<kevko> frickler: for me also ... i don't have interest also ..but if i am writing testing ..i really need to fix swift :D ..to test if it is really working :D [16:08]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [16:20]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [16:22]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [16:23]
<opendevreview> Merged openstack/kolla-ansible stable/2023.2: fix creation of keystone service role on upgrade https://review.opendev.org/c/openstack/kolla-ansible/+/912452 [16:28]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [16:33]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Swift test https://review.opendev.org/c/openstack/kolla-ansible/+/913942 [16:44]
<opendevreview> Michal Arbet proposed openstack/kolla-ansible master: Just test rally baby https://review.opendev.org/c/openstack/kolla-ansible/+/913728 [16:44]
<opendevreview> Olivier Delhomme proposed openstack/kolla-ansible master: kolla-ansible now uses default inventory file https://review.opendev.org/c/openstack/kolla-ansible/+/913993 [16:51]
<kevko> Hmm, pulls are not working :( mirror.bhs1.ovh.opendev.org:4447/openstack.kolla/fluentd:master-debian-bookworm <<< [17:14]
<kevko> https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_657/913728/31/check/kolla-ansible-debian/657d19d/primary/logs/ansible/pull [17:15]
<frickler> that's some issue with quay.io, then. wait and retry I guess [17:20]
<kevko> it looks like I win a friday to do something else ;) :D [17:23]
<kevko> status.redhat.com [17:24]
