Wednesday, 2021-11-10

opendevreviewWenping Song proposed openstack/nova master: Fill the AcceleratorRequestBindingFailed exception msg info  https://review.opendev.org/c/openstack/nova/+/81732602:19
opendevreviewWenping Song proposed openstack/nova master: Fill the AcceleratorRequestBindingFailed exception msg info  https://review.opendev.org/c/openstack/nova/+/81732602:52
opendevreviewlikui proposed openstack/osc-placement master: Replace deprecated assertRaisesRegexp  https://review.opendev.org/c/openstack/osc-placement/+/81736509:09
stephenfinlyarwood: the py36 tox environment is failing for me locally (F34). I proposed this to fix it https://github.com/powervm/pypowervm/pull/1709:52
stephenfinWe'll need that to merge and a new release of pypowervm be cut before we can get things running again. I suspect we're not seeing issues in the gate yet because an older dependency resolver that doesn't care about these things is being used there09:53
stephenfinJust FYI since you're also using Fedora :)09:53
gibielodilles, lyarwood: hi! re: kernel panic in stable/victoria with volume detach: https://bugs.launchpad.net/nova/+bug/1950310 yesterday I backported the libvirt event based detach series to victoria but top of that series the kernel panic still happens (links in the bug for the recent failures) 10:07
gibiI'm out of ideas now10:08
*** melwitt is now known as Guest550810:12
sean-k-mooney[m]gibi i could maybe try to revive my attempts to add support for alpine to dib so that we could use it instead of cirros10:24
sean-k-mooney[m]i started that orignally due to kernel bugs in the cirros kernel10:25
gibisean-k-mooney[m]: I'm not sure if it is really cirros. i) it happens with both cirros 0.5.1 and 0.5.2 but it does not happen on master just on stable/victoria10:26
gibis/i)//10:26
sean-k-mooney[m]i see10:26
sean-k-mooney[m]if its still happening with you event based detach patch  the only other thing i  can think of is changing the qemu version we are using10:27
sean-k-mooney[m]i used to maintain a devstack plugin to compile qemu form git but i have not updated it to support the mason build systme wich they now use.  we could try changing the qemu we use either via a ppa or compiling it10:29
gibihm interesting, I don't know how different the qemu version between master and stable/victoria but I can check that10:29
sean-k-mooney[m]master would be focal is stable victoria on bionic?10:29
fricklergibi: sean-k-mooney[m]: if you want newer qemu, could you try running on bullseye instead ubuntu?10:33
sean-k-mooney[m]frickler: bullseye and centos would be options. its not really about newer it could be older. if the panics are caused by a kvm or qemu bug then just using a diffent disto with different verions might fix the problem10:34
lyarwoodgibi: yeah I didn't think it would help tbh, you're still firing device_del down to QEMU and the ACPI requests to the guestOS, are we pinning tempest on stable/victoria? There was an effort recently to force tempest to wait until the guestOS was up before attaching and detaching volumes, I wonder if that logic is there?10:36
lyarwoodgibi: I've had my head downstream on something since yesterday but I'll try to help out some more for the rest of the week10:36
sean-k-mooney[m]you mean adding the validation so that we ssh into the vm before atching/detaching10:37
sean-k-mooney[m]or something else10:37
lyarwoodsean-k-mooney: yeah that's it10:37
lyarwoodstephenfin: ack not seen that personally but will keep it in mind10:38
sean-k-mooney[m]lyarwood has anyone submited patches for that10:39
lyarwoodI had thought it already landed in master tbh10:39
* lyarwood goes and checks10:39
lyarwoodokay no it hasn't10:41
lyarwoodI was sure I saw a change land for this or maybe my dreams are really becoming boring now I'm half way through my 30's10:41
sean-k-mooney[m]we spoke about it in out inernal meeting last week but unless you wrote it i dont think it happened.10:42
sean-k-mooney[m]if i remember correctly we just need to set validation=true on the create server line in the test10:43
gibilyarwood: do you have a link for the patch that adds that validation? I can add a depends-on somewhere in nova to see if that helps11:30
lyarwoodgibi: no sorry I couldn't find it, I was sure I saw something in the scenario manager for attempted to do it, I'll look again after lunch and throw something up on gerrit if I can't find it12:10
gibiack thanks12:11
opendevreviewBalazs Gibizer proposed openstack/nova stable/xena: Define new functional test tox env for placement gate to run  https://review.opendev.org/c/openstack/nova/+/81725613:33
opendevreviewBalazs Gibizer proposed openstack/placement stable/xena: Use 'functional-without-sample-db-tests' tox env for placement nova job  https://review.opendev.org/c/openstack/placement/+/81725513:35
gibigmann: ^^ added the missing cherry-pick lines in these ^^13:35
*** whoami-rajat__ is now known as whoami-rajat14:00
gmanngibi: thanks 14:25
opendevreviewBalazs Gibizer proposed openstack/nova stable/xena: Fix interference in db unit test  https://review.opendev.org/c/openstack/nova/+/81724015:05
opendevreviewStephen Finucane proposed openstack/nova master: Use unittest.mock instead of third party mock  https://review.opendev.org/c/openstack/nova/+/71467615:25
lyarwoodJust when you thought the nova codebase was safe from stephenfin ;)15:43
lyarwoodCan we give you the Friday nick of SirChangeALot?15:44
stephenfinsomeone complained about us still using it recently. blame that person15:44
stephenfinwhose name I can't remember15:44
lyarwood^_^15:44
lyarwoodI'll queue that up later in the week15:44
dasm"it's them, not us" :)15:44
sean-k-mooneylyarwood: for what its worth its really pretty tirvial15:49
lyarwoodyeah I know I was only jesting15:49
sean-k-mooneymost of it was  find replace and one or two other changes which are ease to fined by the change size beside the file15:49
sean-k-mooneyit made me sad you cant sort the list of files in the gerrit review by changed lines15:50
sean-k-mooneyit would be a nice feature for that type of change15:50
EugenMayerAh nova backup process was queued, but it was queue since i had not enough space. I removed the queued images in the ui (glance images), but now instances show up a task `image backup` and i cannot see how to remove it. Is that a nove tasks i can list (cannot find anything with the nova cli) or is that rather a glance task? A little confused who15:53
EugenMayerholds that data15:53
EugenMayer `glance task-list` does not show up those tassks15:54
EugenMayeris `nova reset-state --active` the way to solve this16:00
opendevreviewAlexey Stupnikov proposed openstack/nova master: Test aborting queued live migration  https://review.opendev.org/c/openstack/nova/+/77625016:41
sean-k-mooneygibi: good to see the neutron sepc is apporved ill try to rereview https://review.opendev.org/c/openstack/nova-specs/+/787458 this week but i belive its largely the same as v9 and i was ok with the overall design a that point.17:37
sean-k-mooneyby the way when was/is the spec review day17:37
sean-k-mooneyah the 16th17:38
sean-k-mooneyi tought i migth have missed it ill be looking at specs tomorrow in anycase17:38
sean-k-mooneyEugenMayer: reste-state shoudl be used with caution generally as a last resort before do ing a direct db modification17:40
sean-k-mooneyif its stuck in the image backup ste a hard reboot might be a better first attempt17:40
sean-k-mooneyreststate may leave the instnace in an unclean state17:40
opendevreviewMerged openstack/nova master: db: Remove models that were moved to the API database  https://review.opendev.org/c/openstack/nova/+/81214917:56
EugenMayersean-k-mooney that is what happened, reset state was clearing the ui, but it stuck nevertheless. Tried soft reboot, was not able to, so i rebooted the compute17:57
EugenMayeri cannot say that openstack shines in this areas to be honest. Not used to have such undefined, oncontrolled states created by UI interactions alone17:58
EugenMayerin general, tasks cannot be stopped, cannot be seen and cannot be controlled, while they can stuck. This is the one area i'am kind of frustrated about17:58
sean-k-mooneyEugenMayer: that is because they are not independent tasks for the most poart17:58
sean-k-mooneythere are a limit set of operation that are modeled as task like live migration that can be aborted17:59
sean-k-mooneybut most api action are not premetable once issued17:59
sean-k-mooneyEugenMayer: creating a backup for example is not really a task its an imperitive action you have taken that shoudl either complete or put the vm in an error state18:02
sean-k-mooneythat error state can be then recovered form vai a hard-reboot or other operation18:02
sean-k-mooneywhere we can recover form a failed operation nova shoudl and mark the instnace event in error18:02
opendevreviewMerged openstack/nova master: objects: Remove 'bandwidth' fields from notifications  https://review.opendev.org/c/openstack/nova/+/81214818:02
opendevreviewMerged openstack/nova master: db: Remove models for removed services, features  https://review.opendev.org/c/openstack/nova/+/81215018:03
EugenMayerit's what happens if 'microservice and self contained' is the main principal as an arch without controller the echo that comes with it. In an image backup, you invoke via nova, which then invokes a glance task, set some second state or whatever somewhere. So we have a glance task and state, a second task and state for the server and nothing that is18:03
EugenMayeractually controlling it. And if the task creating that image never reports back and most probably moves the server task/state further, it is stuck forever18:03
opendevreviewDan Smith proposed openstack/nova master: Log instance event wait times  https://review.opendev.org/c/openstack/nova/+/81703018:04
opendevreviewMerged openstack/nova master: db: Remove nova-network models  https://review.opendev.org/c/openstack/nova/+/81215118:04
EugenMayerif you decouple that much, the hard thing becomes cascading and controlling events and tasks which involve several subsystem. And for this part, it seems that is where openstack has nothing in place. Nova tasks in nove - great. if it involves other parts, things get stuck very fast. Be it neutron, the storage or similar.18:04
EugenMayerAnyway, sorry. A little frustrated. I get over it :)18:05
sean-k-mooneywe do have some recovery logic but asa an operator you really should not be managing any of that internal state18:05
sean-k-mooneyhttps://docs.openstack.org/api-ref/compute/#servers-actions-servers-os-instance-actions is where you can list the action performed on an instance by the way18:06
EugenMayer(i would love to not manage it, but it becomes my duty if queued stuck because it waits to claim storage space, which does not happen and then it rolls forever)18:06
sean-k-mooneyit should actully fail if there is not enouch space. is it failing on the image upload 18:07
sean-k-mooneyor the creation of the inital snapshot18:07
EugenMayersean-k-mooney interesting, i have see that API but assumed this was rather 'list the actions one can do'. I guess 'nova instance-action-list' is not that, but 'openstack server event list esxi`18:08
EugenMayerwell i can assure you, it did not fail but has been queued. The task was started using `nova backup ...` 18:09
sean-k-mooneyqued in glance 18:10
EugenMayerYes, but 'instance backup' in nova/compute18:10
EugenMayeri deleted the image in glance (not stopping the task, if that is possible at all), which then stuck the compute instance in 'instance backup' forever18:11
EugenMayerA different thing, trying to host an ESXi as a VM on openstack KVM. The sole purpose of that VM is using ESXi native tools to convert qcow2 disks to _proper_ vmdks18:12
sean-k-mooneyinstance backup... so its no in image uploading or image pendign upload18:12
sean-k-mooneyEugenMayer: right we would not expect deleting the image in glance to stop the task18:13
sean-k-mooneythat could infact break our cleanup code18:13
EugenMayerI have run this under proxmox with ethernet e1000 but under openstack it starts with virtio, which seems not to be supported by ESXi thus the VM has no interface when booted. Can i switch the interface type?18:13
EugenMayersean-k-mooney that i understood - after that :)18:13
sean-k-mooneyyou can specify e100018:13
EugenMayervia xml only?18:13
sean-k-mooneyyou set hw_vif_model=e1000 in the glance image metadata18:13
sean-k-mooneyhttps://github.com/openstack/glance/blob/master/etc/metadefs/compute-libvirt-image.json#L11118:14
EugenMayerthat is pure meta-data,right?18:14
sean-k-mooneyit will change the xml that is generated18:15
sean-k-mooneythat is how you change the xml in a supported way18:15
sean-k-mooneythat and flaovr extra specs but devicve models are genrealy only tunable via the image18:15
EugenMayerah the flavor can stear that? That's fine, i create a dedicated flavor via terraform anyway18:16
sean-k-mooneya flaovr cannot change that option18:16
sean-k-mooneyit can change some others 18:16
EugenMayerso it must be an image option only, ok18:16
sean-k-mooneyeverything in that file is setable via the glance image only18:16
sean-k-mooneyyes the image used for the root disk18:16
sean-k-mooneyso your esxi image in this case18:16
sean-k-mooneyopenstack is a cloud plathform rather then virtualisation solution18:17
opendevreviewJulia Kreger proposed openstack/nova master: Ironic - Handle instance host on rebalance  https://review.opendev.org/c/openstack/nova/+/81389718:17
sean-k-mooneyso you create high levle templatees via the flavor and imave that you then instnaciate many copies of ranter then creatin indivicual instances18:17
EugenMayeri see, i try to adopt that. Can i manipualte an existing glance image or how would i introduce this?18:18
sean-k-mooneymodifying an exsiting image will not affect instnace already using it18:18
sean-k-mooneybut you can update it18:18
EugenMayersure, i will redo the instance. That is what i do with terraform anyhhow18:18
sean-k-mooneygenreeally if it widely used in production you will not modify it but its really a case by case decision18:18
EugenMayerok so it is API only i guess18:18
EugenMayeris this an md-property?18:19
sean-k-mooneywhen we create an image we "snapshot" or copy the exra specs form the flavor and metadta form the image and store a perinstance copy 18:19
sean-k-mooneythat copy is then used for the lifetime of the instnace18:20
mlozahi folks, i have two kind of computes, first one is configured to store boot disk in ceph and second is configured to store boot disk locally on the host. Both works as intendend but second one can be bypass if the user tries to boot the instance from a volume18:20
sean-k-mooneyyes these are image metadata properties18:20
sean-k-mooneymloza: correct that is expect18:21
sean-k-mooneyif the user boots from a cinder volume that storage will always be used18:21
sean-k-mooneyso if you have a cinier providing storage via say iscsi18:21
sean-k-mooneythen the boot form volume instnace can boot on either host by default18:22
EugenMayersean-k-mooney: glance image-update bbcb16b8-8c84-4b01-92e8-6d66938460bf --property hw_vif_model=e1000 .. is taht what you mean by updating the image?18:22
sean-k-mooneyEugenMayer: yes exactly18:22
sean-k-mooneyif you create a new vm with that image then it should have e1000 nics instead of virtio18:22
mlozasean-k-mooney: seems what I expected. Thanks for the info18:23
EugenMayerok learning. Well needless to say, but still much appretiated. Thank you so much!18:23
EugenMayeri guess saving https://github.com/openstack/glance/blob/master/etc/metadefs/compute-libvirt-image.json#L111 and having this file as lookup makes sense. That link i store :)18:24
sean-k-mooneyEugenMayer: there is a nova doc for this too18:24
sean-k-mooneyim currently looking for it18:24
sean-k-mooneythat file is basicaly where horizon and heat get there info18:25
sean-k-mooneyto generate uis18:25
sean-k-mooneyhorizone quiery glance for the set of know image properties18:25
sean-k-mooneyvia the metadefs api18:25
sean-k-mooneyEugenMayer: this is glances docs on the image porperties https://docs.openstack.org/glance/latest/admin/useful-image-properties.html18:25
EugenMayerinteresting. Kind of unusual to me to have all this in a map, not a typed object. But i guess that is due to the gazzilion different names/options different hypervisors would need18:26
EugenMayerwow that is a nice one, thanks18:26
sean-k-mooneythe actual values are defiend in a concreate object in nova18:30
sean-k-mooneyhttps://github.com/openstack/nova/blob/master/nova/objects/image_meta.py#L15818:30
EugenMayersean-k-mooney next time when and glance operation is queued and i want to cancel, should i use `glance image-tasks` and then? i cannot see anything like stop/kill/cancel tasks when i have the ID18:31
sean-k-mooneyEugenMayer:  theses are the concreate field definiton https://github.com/openstack/nova/blob/master/nova/objects/image_meta.py#L293-L56318:31
EugenMayerThis all got straight into my docs, thank you for digging those gems out18:33
EugenMayerinteresting, the interface is now available in ESXi but somehow DHCP does not work. That is unexpected18:33
opendevreviewMerged openstack/nova master: Apply common irrelevant_files for centos 8 job  https://review.opendev.org/c/openstack/nova/+/81727818:35
EugenMayersean-k-mooney up up and away to dinner. Thank you big times. Not sure why DHCP is not working on that ESXi, i will fiddle with that tomorrow. Have a nice evening/moring/whatever :)18:42
opendevreviewJulia Kreger proposed openstack/nova master: WIP Ironic - Reconcile compute host records on rebalance  https://review.opendev.org/c/openstack/nova/+/81746718:42
sean-k-mooneyEugenMayer: you too enjoy dinner18:49
opendevreviewAlexey Stupnikov proposed openstack/nova master: Test aborting queued live migration  https://review.opendev.org/c/openstack/nova/+/77625020:19
opendevreviewHang Yang proposed openstack/nova master: Support creating servers with RBAC SGs  https://review.opendev.org/c/openstack/nova/+/81152121:50
opendevreviewMerged openstack/nova stable/victoria: Store old_flavor already on source host during resize  https://review.opendev.org/c/openstack/nova/+/81091522:58

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!