Wednesday, 2022-08-03

opendevreviewOpenStack Proposal Bot proposed openstack/nova master: Imported Translations from Zanata  https://review.opendev.org/c/openstack/nova/+/85133704:02
opendevreviewAmit Uniyal proposed openstack/nova master: For evacuation, ignore if task_state is not None  https://review.opendev.org/c/openstack/nova/+/84888604:53
gibigood morning07:22
Ugglagibi, o/07:33
opendevreviewMerged openstack/nova master: Updated Suspend definition in server concepts doc  https://review.opendev.org/c/openstack/nova/+/85151107:33
gibi\o/ the unittest.mock series landed \o/07:35
gibi(hope nobody too mad about the merge conflicts we generated with that :D)07:35
gibisean-k-mooney[m]: left feedback in the vdpa patch https://review.opendev.org/c/openstack/nova/+/83233007:47
gibisean-k-mooney[m]: could you look at https://review.opendev.org/c/openstack/nova/+/851909 it is needed for the sdk 0.100 release here https://review.opendev.org/c/openstack/requirements/+/84998608:04
sean-k-mooney[m]im just about to have a quick call downstream but ill look at it shortly thanks for reviewing the vdpa patch ill adress your feedback when im adressing stephens08:12
sean-k-mooney[m]if i can get the first patch at least to a point where it can merge today that would be ideall then i can focus on just the ones that wont be backported and thos can hopefully merge seperately as a group08:13
gibisean-k-mooney[m]: sure. ping me and I will re-review the vdpa patch08:15
gibiif stephenfin is also available the we can land that today08:16
opendevreviewBalazs Gibizer proposed openstack/placement master: Func test for os-traits and os-resource-classes lib sync  https://review.opendev.org/c/openstack/placement/+/85196608:36
gibisean-k-mooney[m], melwitt: as we agreed on the Zed PTG I change the how placement tests for the lib sync so that we don't need to do the test disable/enable dance any more at a lib release  ^^08:37
sean-k-mooney[m]ack almost done with internal call08:41
sean-k-mooney[m]ill look at that next08:42
sean-k-mooney[m]+2 on the safeconnect fix08:42
*** tosky_ is now known as tosky08:47
gibithanks08:58
sean-k-mooney[m]i was going to ask for the placement change to be done slightly differntly09:04
sean-k-mooney[m]but i realise now that you are loading the tratis/resouce classes directly form the lib and assertign the api returns the same content09:04
sean-k-mooney[m]which should always be in sync09:04
sean-k-mooney[m]i was going to ask that you check that the api respocne contains the lib content and the could was greater or equal09:05
sean-k-mooney[m]but the exact comparison should work as ultimately they have the same data source09:05
sean-k-mooney[m]the lib09:05
sean-k-mooney[m]the only test coverage we currntly loose by not doing the >= check is if we acidentally delete a trait09:09
sean-k-mooney[m]but we are aware that that is not allowed so im not really concerned by that09:09
gibido you mean accidentally deleting a trait from os-traits? yeah that is something we need to cover elsewhere09:14
gibitoday we also not checking for that. Also a >= would not cover that as a os-traits lib version bump migh add 10 new traits and remove 2 but the change is net positive so >= would pass09:15
sean-k-mooney[m]we were checking for it indirectly by asserting the exact number 09:16
sean-k-mooney[m]but ya09:16
sean-k-mooney[m]we could have added 2 and removed 209:16
sean-k-mooney[m]i think core review is enough to catch that honestly09:16
gibiyes, I hope so09:16
sean-k-mooney[m]we all know that we cant ever remove traits09:16
gibiif not then we need a test in os-traits for it09:17
sean-k-mooney[m]if we need too i would put a hacking test in or similar09:17
gibias the placement test would be too late to catch it09:17
gibiyeah, hacking would work too09:17
sean-k-mooney[m]ok ill be back in about 10 mins09:19
gibiack09:19
sean-k-mooney[m]just going to check on fryea and what she is barking at09:19
* gibi needs a cat09:20
amorinhello nova team, when shelving an instance, the port binding in neutron is "staying" on the host, I was expecting it to be unbound09:34
amorinam I wrong?09:34
sean-k-mooney[m]amorin: no your are not wrong its a know issue09:34
amorinnice!09:35
sean-k-mooney[m]your expectation alines with mine but we have never actully done it09:35
amorindo you have a launchpad bug already reported for this?09:35
sean-k-mooney[m]https://review.opendev.org/c/openstack/nova/+/832330/7/nova/tests/functional/libvirt/test_pci_sriov_servers.py#128609:36
sean-k-mooney[m]amorin no not currently 09:36
sean-k-mooney[m]i just came across it whlie wrting functional tests for something else09:36
sean-k-mooney[m]so if you want to file one and or adress it please do09:36
sean-k-mooney[m]amorin: this should not break anything today09:36
amorinok, I will create the bug report at least09:37
sean-k-mooney[m]ack09:37
amorindo you know if neutron already implement such API?09:37
amorinthat would allow nova to unbound the port?09:37
sean-k-mooney[m]yes it does09:37
sean-k-mooney[m]and nova has some code to do it too09:38
amorinin the binding extension probably09:38
sean-k-mooney[m]it currently however does too much09:38
sean-k-mooney[m]well unbinding is jsut doing 2 things09:38
amorinoh nice, any hint where it's located in nova code?09:38
sean-k-mooney[m]one settign bind-host=None09:38
sean-k-mooney[m]and second clearing the files we set in the binding profile09:38
sean-k-mooney[m]what we shoudl not do is clear the device id09:38
sean-k-mooney[m]that is the bit our current unbind code is doing that it should not be09:39
sean-k-mooney[m]our current unbind is only used for detaching a port or when deleteing a vm09:39
amorinack, yes, the device_id should stay09:39
sean-k-mooney[m]https://github.com/openstack/nova/blob/master/nova/network/neutron.py#L61609:40
amorinperfect, that where I was also looking :)09:41
sean-k-mooney[m]so this should not be part of unbind https://github.com/openstack/nova/blob/master/nova/network/neutron.py#L64209:41
sean-k-mooney[m]         'device_id': '',09:41
sean-k-mooney[m]                    'device_owner': '',09:41
sean-k-mooney[m]is acttully detach09:41
amorinack, we should maybe split that in 2 separate function09:42
amorinwhere detach would call unbind09:42
sean-k-mooney[m]that or add a detach kwarg09:42
amorinok, maybe easier09:42
amorinI think I have all info, I'll do some tests, open the bug, and come back09:43
sean-k-mooney[m]did this cause you any issues or did you just notice it and find it ood09:43
amorinit's causing us issues because we are monitoring the ports attached to a compute, and we found some bound ports without instances09:43
amorinand then, digging into that, we were thinking that it's odd to keep the binding :)09:44
sean-k-mooney[m]that should not result in the ports being bound on the compute node09:44
sean-k-mooney[m]sorry not bound09:44
sean-k-mooney[m]but present on ovs09:44
amorinit's not anymore09:44
amorinso i'ts good09:44
sean-k-mooney[m]and or "plugged"09:44
sean-k-mooney[m]ya okj09:45
sean-k-mooney[m]just making sure we were not leaking the neutron ports09:45
sean-k-mooney[m]we acctuly do call unplug_vifs09:45
sean-k-mooney[m]during shelve offload09:45
amorinyes, everything is actually fine on the compute09:45
sean-k-mooney[m]incidentally shelve offload is where it should be unbound09:46
sean-k-mooney[m]not shelve itslef09:46
amorinit's just that the database (ml2_port_binding) does not reflect what is on the compute09:46
amorinyes09:46
sean-k-mooney[m]by defualt we automaticaly shelve offload with a time out of 0 seconds09:46
sean-k-mooney[m]yep that makes sense09:46
amorintrue, we kept this parameter to 0 here09:46
sean-k-mooney[m]personally i would prefer to eventurlaly remove the config option and always just offload but that proably wont happen09:47
amorinwhat's the purpose of delaying the offload?09:48
sean-k-mooney[m]in rare cases if you unshleve quickly enough it can sometimes be useful to delay. the use case was for bustable instances09:48
sean-k-mooney[m]basically in some cases where shelve/unshele is used to scale in/out09:49
sean-k-mooney[m]its nice to have say a 15min delay incase the load spike09:49
sean-k-mooney[m]as its faster to unshleve if the vms is still in place09:49
sean-k-mooney[m]in partice that was only ever useful for vms without ceph or boot form volume09:50
amorinok, that's sound like a weird usecase to me09:50
amorinshelve / unshelve is not suppose to happen a quick way09:50
sean-k-mooney[m]well that depends09:50
sean-k-mooney[m]shelve should be very very quick09:50
sean-k-mooney[m]if you are using rbd or boot form volume09:50
sean-k-mooney[m]sicne there is no data to copy09:50
sean-k-mooney[m]but ya it was a pretty niche usecase which is why i would prefer to simplify the code09:51
amorinack09:51
sean-k-mooney[m]that or remove the config option but make it an api parmater09:51
sean-k-mooney[m]so the user can say delay for up to x time09:51
sean-k-mooney[m]i do not like config driven api behavior shich this currently is09:52
opendevreviewMerged openstack/nova master: Remove the PowerVM driver  https://review.opendev.org/c/openstack/nova/+/85034610:47
*** bhagyashris is now known as bhagyashris|ruck11:11
sean-k-mooney[m]stephenfin:  your lines of code stats will never not be negitive ^11:23
gibidon't encourage him, he will find a way to replace the content of the nova repo with a simple readme file :D11:23
sean-k-mooney[m]cells v1, nova networks xen and power vm. 11:23
sean-k-mooney[m]whats next on the list :)11:24
sean-k-mooney[m]oh i for got the docs project :) 11:24
sean-k-mooney[m]@gibi: that would be one way to close out all the bugs11:25
gibiwhat an elegant way :)11:25
sean-k-mooney[m]@gibi speaking of which i marked the custom device_owner bug as invalid11:25
gibiheh, I guessed you would11:26
gibiand I agree11:26
gibitheir use case can be solved without the requested change11:28
sean-k-mooney[m]if we wanted to support it we should do it properly as a minor feature with the filtering of events on the neutron side and filtering of port on the nova side so taht they are never stored in our netrowk info cache ectra11:30
opendevreviewMerged openstack/nova master: [trivial] Simplify dict get call by removing unused default  https://review.opendev.org/c/openstack/nova/+/85045011:30
gibisean-k-mooney[m]: I agree11:37
gibiduring live migration what triggers the creation of the domain on the dest host? Is it the call to migrateToURI3 from the source host? or nova somehow pre-create a domain on the dest host?12:12
sean-k-mooney[m]yes12:12
sean-k-mooney[m]its created by libvirt as a result of our call to libvirt on the source node12:13
gibithanks12:13
sean-k-mooney[m]via the migrateToURI3 function12:13
gibiso even though our func test env simulate libvirt it does not simulate that 14:12 < gibi> during live migration what triggers the creation of the domain on the dest host? Is it the call to migrateToURI3 from the source host? or nova somehow pre-create a domain on the dest host?12:13
gibi14:12 < sean-k-mooney[m]> yes12:13
gibiagain12:13
gibisorry12:13
sean-k-mooney[m]correct12:13
sean-k-mooney[m]we just mock out that call12:14
gibiso even though our func test env simulate libvirt it does not simulate that migrateToURI3 triggers the domain creation on the dest12:14
gibiack12:14
gibithat explains some func test failures I see with https://review.opendev.org/c/openstack/nova/+/851832/312:14
sean-k-mooney[m]we dont really simulate the domins in general in the libvirt fixture12:14
sean-k-mooney[m]at least not for the move ops like that12:14
sean-k-mooney[m]we proably could mock it differntly adn have it do that to be fair12:15
gibithree is some vm tracking in the fixture but yes, that seems to be incomplete12:15
opendevreviewBalazs Gibizer proposed openstack/nova master: Do not mutate migration context for rollback_live_migration_at_destination  https://review.opendev.org/c/openstack/nova/+/85183213:13
opendevreviewBalazs Gibizer proposed openstack/nova master: Prevent instance.save() under mutated migration context  https://review.opendev.org/c/openstack/nova/+/85074613:13
*** dasm|off is now known as dasm13:18
opendevreviewBalazs Gibizer proposed openstack/placement master: Clarify trait filtering in the API doc  https://review.opendev.org/c/openstack/placement/+/82550113:22
opendevreviewBalazs Gibizer proposed openstack/nova stable/ussuri: Reproduce bug 1944759  https://review.opendev.org/c/openstack/nova/+/83673313:25
opendevreviewBalazs Gibizer proposed openstack/nova stable/ussuri: Store old_flavor already on source host during resize  https://review.opendev.org/c/openstack/nova/+/83673413:25
opendevreviewBalazs Gibizer proposed openstack/nova master: Poison /sys access via various calls in test  https://review.opendev.org/c/openstack/nova/+/84462713:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Add compute restart capability for libvirt func tests  https://review.opendev.org/c/openstack/nova/+/85051013:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Rename [pci]passthrough_whitelist to device_spec  https://review.opendev.org/c/openstack/nova/+/84383413:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Rename exception.PciConfigInvalidWhitelist to PciConfigInvalidSpec  https://review.opendev.org/c/openstack/nova/+/84386113:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Rename whitelist in tests  https://review.opendev.org/c/openstack/nova/+/84386213:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Basics for PCI Placement reporting  https://review.opendev.org/c/openstack/nova/+/84618713:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Extend device_spec with resource_class and traits  https://review.opendev.org/c/openstack/nova/+/84621813:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Reject PCI dependent device config  https://review.opendev.org/c/openstack/nova/+/84643513:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Reject mixed VF rc and trait config  https://review.opendev.org/c/openstack/nova/+/84643613:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Ignore PCI devs with physical_network tag  https://review.opendev.org/c/openstack/nova/+/84621913:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Reject devname based device_spec config  https://review.opendev.org/c/openstack/nova/+/84646613:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Support [pci]device_spec reconfiguration  https://review.opendev.org/c/openstack/nova/+/84647013:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Stop if tracking is disable after it was enabled before  https://review.opendev.org/c/openstack/nova/+/84700913:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Move provider_tree RP creation to PciResourceProvider  https://review.opendev.org/c/openstack/nova/+/85054613:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Retry /reshape at provider generation conflict  https://review.opendev.org/c/openstack/nova/+/85135813:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Heal missing PCI allocation in the resource tracker  https://review.opendev.org/c/openstack/nova/+/85135913:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Allow enabling PCI tracking in Placement  https://review.opendev.org/c/openstack/nova/+/85046813:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Add more test coverage for devname base dev spec  https://review.opendev.org/c/openstack/nova/+/84462513:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Extra tests for remote managed dev spec  https://review.opendev.org/c/openstack/nova/+/84462613:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Unparent PciDeviceSpec from PciAddressSpec  https://review.opendev.org/c/openstack/nova/+/84449113:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Fix PciAddressSpec descendants to call super.__init__  https://review.opendev.org/c/openstack/nova/+/84456513:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Remove dead code from PhysicalPciAddress  https://review.opendev.org/c/openstack/nova/+/84462813:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Clean up mapping input to address spec types  https://review.opendev.org/c/openstack/nova/+/84576513:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Remove unused PF checking from get_function_by_ifname  https://review.opendev.org/c/openstack/nova/+/84577513:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Fix type annotation of pci.Whitelist class  https://review.opendev.org/c/openstack/nova/+/84578013:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Move __str__ to the PciAddressSpec base class  https://review.opendev.org/c/openstack/nova/+/84578113:38
gibipheww rebased out the pci serieses from the merge conflict13:39
*** lifeless_ is now known as lifeless13:51
opendevreviewBalazs Gibizer proposed openstack/nova master: Reproducer for bug 1982497  https://review.opendev.org/c/openstack/nova/+/85067214:41
opendevreviewBalazs Gibizer proposed openstack/nova master: Do not mutate migration context for rollback_live_migration_at_destination  https://review.opendev.org/c/openstack/nova/+/85183214:41
opendevreviewBalazs Gibizer proposed openstack/nova master: Prevent instance.save() under mutated migration context  https://review.opendev.org/c/openstack/nova/+/85074614:41
melwittgibi: ack, will look15:29
gibithanks15:31
sean-k-mooney[m]gibi are you going with dans suggetion to just remove all the save or proceeding with the decorator for now15:51
sean-k-mooney[m]or both15:51
sean-k-mooney[m]backport decorator adn then remove saves on master only?15:51
gibisean-k-mooney[m]: I proposed a decorator now and fixed the rebuild case by hand with apply and revert15:51
sean-k-mooney[m]ok15:52
gibiI feel that changing the contract between the compute manager and the virt drivers is a bit much for me for now15:52
sean-k-mooney[m]ack15:52
gibithe decorator will catch all the problematic cases15:52
gibiand we have two of those one that is fixed by the bugfix and the other was as simple one with rebuild15:53
sean-k-mooney[m]ill review tomorrow but i assume you are just logging the issue for now and not saving15:53
gibiwait15:53
gibiso for the rollback live migration15:53
gibiI was able to remove the mutated migration context and keep the save15:53
sean-k-mooney[m]ok15:54
gibithe vpmem cleanup that needed the mutation contex can be done based on the domain and not based on the instance.resources storage15:54
gibithis solves our current bug I believe15:54
gibithe decorator is added in a followup to catch new cases by raising at save() if it is called under a mutated context15:54
gibiand that revealed a second place where we do this, at rebuild15:55
gibibut that was fairly simple to fix15:55
sean-k-mooney[m]oh ok so you have resolved the current incorrect uses15:56
sean-k-mooney[m]and new incorrect uses will raise15:56
gibiyes15:56
gibiyes15:56
sean-k-mooney[m]so we wont merge it15:57
sean-k-mooney[m]ok15:57
sean-k-mooney[m]that a liit more context then im prepared to digest to review today15:57
sean-k-mooney[m]but ill look back on this tomorrow15:57
gibiack15:58
gibiartom and dansmith are also on it to keep me honest :)15:58
dansmithyeah will try to look a bit later15:58
sean-k-mooney[m]ok if dansmith  and melwitt  are able to review it i would prefer not to add another context switch to my current list15:59
sean-k-mooney[m]but if this is urgent i can16:00
melwittsean-k-mooney[m]: sure, I can look16:01
gibiI don't think that it is super urgent16:01
gibithank you all16:03
fricklerdansmith: melwitt: could you also have a look at https://review.opendev.org/c/openstack/nova/+/851909 pls? would be good to be able to have wider testing of latest sdk, currently this is blocking getting it into u-c16:37
dansmithhmm, I guess that's needed because of when we talk to placement16:39
dansmithfrickler: +W16:42
fricklerthx16:50
opendevreviewJan Hartkopf proposed openstack/nova master: add support for updating server's user_data  https://review.opendev.org/c/openstack/nova/+/81615717:26
opendevreviewMerged openstack/placement master: Func test for os-traits and os-resource-classes lib sync  https://review.opendev.org/c/openstack/placement/+/85196617:44
opendevreviewMerged openstack/nova master: Fix mocking SafeConnectedTestCase  https://review.opendev.org/c/openstack/nova/+/85190917:44
stephenfingibi: Is that PCI series ready for review?17:49
spateldoes nova support sound adaptor ? 18:59
wolsen[m]gibi: bauzas: sorry I've been slow to respond on that patch set (the mdev devices libvirt 7.x one - https://review.opendev.org/c/openstack/nova/+/838976) - I 'm reading the backscroll now and see you commented that you need something for me. I'm jugging various things at the moment but will review feedback shortly19:19
wolsen[m]looks like an update around the FUP patch that bauzas had posted19:21
*** EugenMayer4 is now known as EugenMayer21:39
*** dasm is now known as dasm|off23:26

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!