Friday, 2021-10-29

opendevreviewDanil proposed openstack/nova master: Check if the InstanceNUMACell.pcpuset attribute exist prior to relying on it. This change enables support for the VM instances with with pinned CPUs (hw:cpu_policy=dedicated) that were created before upgrading the OpenStack Nova to Victoria release.  https://review.opendev.org/c/openstack/nova/+/81592300:06
opendevreviewmelanie witt proposed openstack/nova stable/train: Clear rebalanced compute nodes from resource tracker  https://review.opendev.org/c/openstack/nova/+/81182201:48
opendevreviewmelanie witt proposed openstack/nova stable/train: Invalidate provider tree when compute node disappears  https://review.opendev.org/c/openstack/nova/+/81182301:48
opendevreviewmelanie witt proposed openstack/nova stable/train: Prevent deletion of a compute node belonging to another host  https://review.opendev.org/c/openstack/nova/+/81182401:48
opendevreviewmelanie witt proposed openstack/nova stable/train: Fix inactive session error in compute node creation  https://review.opendev.org/c/openstack/nova/+/81182501:48
brinzhang_gibi: ack, wenpingsong__ will propose it later06:09
*** brinzhang_ is now known as brinzhang06:12
*** songwenping__ is now known as songwenping06:12
gibibrinzhang: OK, cool, let me know if I can help06:59
brinzhanggibi: ack07:02
brinzhangwenpingsong: ^^07:02
gibiprometheanfire: ack, I remember looking at that failure and then got distracted, let me see if I can quickly fix it07:05
*** ianw is now known as ianw_pto07:13
opendevreviewBalazs Gibizer proposed openstack/nova master: Fix unit test for oslo.concurrency 4.5  https://review.opendev.org/c/openstack/nova/+/81594007:40
gibiprometheanfire: ^^07:41
gibibauzas: a small unit test fix ^^ to unblock the requirement bump07:43
bauzasgibi : apologies but I'm off until Tuesday07:57
bauzas(heading to Paris)07:58
gibibauzas: ohh, I remember now. Enjoy08:12
em_is there any way to add /remove/change the ssh-key of an cloud-init instance after the creation08:31
em_openstack server set --property?08:31
gibiem_: I think you can only use rebuild to inject a new keypair09:32
em_does rebuild white the database?09:40
em_sorry disk09:40
gibiyes it will rewrite the root disk of the VM from the original image10:00
em_so the concept is to never write data on the root disk, only on ephi?10:15
sean-k-mooney[m]rebuild willl erase the ephemeral disks too10:16
sean-k-mooney[m]rebuild is a distructive operation that should only be used to erase all data in the vm that is no stored in cinder volumes10:17
gibisean-k-mooney[m]: ohh, will rebuild replace ephemerals with an empty disk?10:17
sean-k-mooney[m]cloud init by default does not run on each boot so there is no reliable way to change the ssh key pair10:17
gibi(make sense though as the name is _ephemeral_10:17
gibi)10:17
sean-k-mooney[m]even if we updated it in the db cloud-init would not rerun on the next boot unless you altered its default behavior10:18
sean-k-mooney[m]if you need to rotate an ssh key or add a new one you really need to do that by sshing in wiht the orignal and adding it manually10:19
sean-k-mooney[m]gibi there used to be an api extention tat allowed you to ask for it to no be erased pre microversions10:20
opendevreviewBalazs Gibizer proposed openstack/nova master: [nova-manage]support extended resource request  https://review.opendev.org/c/openstack/nova/+/80206010:20
sean-k-mooney[m]but we never actully made that part of the api when we removed extentions10:21
sean-k-mooney[m]and it was never supported by libvirt if i understand corectly10:21
gibiohh, OK, I learned new things today \o/10:22
sean-k-mooney[m]https://github.com/openstack/nova/blob/liberty-eol/nova/api/openstack/compute/legacy_v2/contrib/preserve_ephemeral_rebuild.py10:23
*** dpawlik5 is now known as dpawlik10:25
sean-k-mooney[m]oh i guess we still support tht10:28
sean-k-mooney[m]its not documented anywhere10:28
sean-k-mooney[m]it looks like you set preserve_ephemeral=true in the server metadata10:28
sean-k-mooney[m]https://github.com/openstack/nova/blob/50fdbc752a9ca9c31488140ef2997ed59d861a41/nova/api/openstack/compute/helpers.py#L96-L10310:29
sean-k-mooney[m]i think this was only supported by ironic however10:29
sean-k-mooney[m]yep its not supported int he default implemetation https://github.com/openstack/nova/blob/50fdbc752a9ca9c31488140ef2997ed59d861a41/nova/compute/manager.py#L3234-L323810:33
sean-k-mooney[m]so libvirt does not support it10:33
opendevreviewBalazs Gibizer proposed openstack/nova master: Reno for qos-minimum-guaranteed-packet-rate  https://review.opendev.org/c/openstack/nova/+/80504610:34
opendevreviewBalazs Gibizer proposed openstack/nova master: DNM: Test nova-manage placement heal_allocations with new Neutron  https://review.opendev.org/c/openstack/nova/+/81595210:47
*** lbragstad4 is now known as lbragstad11:07
opendevreviewFederico Ressi proposed openstack/nova master: Debug Nova APIs call failures  https://review.opendev.org/c/openstack/nova/+/80668311:10
gibigmann: are you around?14:41
gmanngibi: hi14:41
gibihi14:41
gibiI'm trying to figure out the issue in the bandwidth tempest tests https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_9ab/802060/18/check/nova-next/9ab4682/testr_results.html14:42
gibiit seems that the tempest code now thinks that the new neutron API extension is enabled14:42
gibibut that is not merged yet https://review.opendev.org/c/openstack/neutron/+/80563714:42
gmanngibi: ah, got it. 14:44
gibithere is 14:44
gibiapi_extensions = all14:44
gibiin the tempest conf14:44
gmanngibi: I think we should make depends-on in otherway around14:44
gmanntempest tests should be depends-on on neutron? or neutron patch fail on existing test?14:44
gibithe neutron patch is OK to land as that adds the extension that makes the test passing14:45
gibiI have another nova patch the fails now until the neutron patch lands14:45
gibiand also the newer tempest patches adding pps coverage are failing unit the neutron patch lands14:45
gmanngibi: you mean tempest existing test (before https://review.opendev.org/c/openstack/tempest/+/809168) fail ? 14:45
gibiI mean the test we adapeted to the new neutron API in https://review.opendev.org/c/openstack/tempest/+/809168 now fails for other patches as the neutron patch haven't landed yet14:46
gibiwhich is due to the fact that tempest is configured that all the API extensions are available even the one that is not merged yet14:47
gmanngibi: yeah. but if we revert 809168 and merge the series after neutron patch then is it fine?14:47
gmanntempest seriesw14:47
gibiI would keep the 809168 and wait for https://review.opendev.org/c/openstack/neutron/+/805637 to land clearing up the situation14:48
gmannor existing assert which was modified in 809168 will fail if neutron patch merged as it change the response?14:48
gibiyepp we need 809168 before we land the neutron API change as that would break existing tempest14:48
gibiso I think we are OK as is now14:48
gibijust cannot land the nova change https://review.opendev.org/c/openstack/nova/+/802060 until the neutron API extension lands14:49
gmanngibi: I am wondering why neutron gate is not blocked as neutron-ovs-tempest-multinode-full fail after 80916814:49
gibimaybe it is :)14:49
gibilet me check14:49
gmannyeah because my DNM patch was just on neutron master https://review.opendev.org/c/openstack/neutron/+/81589814:49
gmannhttps://zuul.openstack.org/builds?job_name=neutron-ovs-tempest-multinode-full14:50
gibiyepp your DNM patch shows that the ovs job is blocked14:51
gibithis is a deadlock, we cannot have 809168 before the neutron API extension as that block everythin except the API extension patch, but we cannot have the neutron API extension before 809168, because that also blocks everything except the tempest patch14:53
gibi /o\14:53
gibiand this is basically because tempest + devstack declares a neutron API extension exists before it is really exists14:54
gmanngibi: but i am confused as test modified/merged in 809168 is passing in few pass of same job https://zuul.openstack.org/builds?job_name=neutron-ovs-tempest-multinode-full14:55
gmannand failing instances of that job does now run that tests14:55
gmannso I am not clear if master gate is blocked and if not then why14:55
gmanngibi: we have the way to unblock this type of deadlock in tempest which is common way when API change. but test is passing on current master after 809168 is not clear to me14:56
gibiI think the test is not passing on current master, except in https://review.opendev.org/c/openstack/neutron/+/80563714:57
gibiour patches top that 14:57
gibis/our/or/14:57
gmanngibi: ah yeah, its in the series patches only. let me check failing one then14:58
gibiyour DNM patch is a good example how the neutron master fails today14:59
gmannyeah14:59
prometheanfiregibi: thanks15:01
gmanngibi: for deadlock we can do: step1. revert the 809168 to unblock the gate and for neutron/nova change to land -> step 1. skip the test in tempest which is allow neutron change to merge 2. unskip the tempest test along with revert of revert of809168 15:02
gmannthat is how we land bug fixes in projects if tempest test block that15:02
gibisounds complicated15:03
gmanngibi: ah this is +A so we can just wait for this? https://review.opendev.org/c/openstack/neutron/+/805637/1215:03
gibigmann: yeah, if I know that the current recheck of 805637 will be successfully then yes15:04
gibiit is friday, I can recheck it during the weekend15:04
gibiso probably we not loosing much if we keep the gate broken for the weekend15:04
gmanngibi: +1, sure. I will also keep monitoring it.15:05
gibigmann: OK, thanks15:05
gmanngibi: just for future ref if needed. this is process in tempest side https://docs.openstack.org/tempest/latest/HACKING.html#bug-fix-on-core-project-needing-tempest-changes15:05
gibithen fingers crossed15:05
* gibi clicks15:05
gmannmy bad that i did not realize the full flow 15:05
gibiOK, so we all the new neutron API extension that breaks existing tempest test we need to temporary disable the test, merge the API extension, merge the tempest adaptation, and then re-enable the test15:07
gibis/we/with/15:07
gmannyeah, and for any type of bug fix in project if that is blocked by tempest tests15:08
gibiack15:08
gibimelwitt: hi! I've fixed and replied to your comments in https://review.opendev.org/c/openstack/nova/+/802060 please ignore the nova-next failure for now, it is due and ordering issue, we need to land https://review.opendev.org/c/openstack/neutron/+/805637 (which is on the gate) first. See our discussion with gmann above ^^15:19
opendevreviewMerged openstack/nova stable/xena: Reproduce bug 1944759  https://review.opendev.org/c/openstack/nova/+/81091016:00
spatelsean-k-mooney[m] morning17:03
gibigmann: so far so good, 805637 cleared the check queue and now running in the gate queue18:35
gmann+118:53
gmannseems ovs multinode job again failing20:04

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!