Wednesday, 2019-11-13

*** brinzhang has quit IRC00:01
openstackgerritEric Fried proposed openstack/nova master: WIP: Add emulated TPM support to Nova  https://review.opendev.org/63136300:12
openstackgerritEric Fried proposed openstack/nova master: Add support for resize and cold migration of emulated TPM files  https://review.opendev.org/63993400:12
openstackgerritEric Fried proposed openstack/nova master: WIP: vTPM request_filter  https://review.opendev.org/67832500:12
*** ociuhandu has joined #openstack-nova00:13
*** ociuhandu has quit IRC00:18
*** TxGirlGeek has quit IRC00:24
*** mlavalle has quit IRC00:32
*** macz has quit IRC00:33
*** brinzhang has joined #openstack-nova00:53
*** brinzhang_ has quit IRC00:57
*** Liang__ has joined #openstack-nova01:01
*** brinzhang_ has joined #openstack-nova01:02
*** brinzhang has quit IRC01:05
*** brinzhang has joined #openstack-nova01:08
*** dswebb has quit IRC01:09
*** brinzhang has quit IRC01:10
*** brinzhang has joined #openstack-nova01:10
*** ociuhandu has joined #openstack-nova01:16
melwittgmann: johnthetubaguy has updated the spec for your comments, if you could take another look when you get a chance https://review.opendev.org/60220101:21
*** ociuhandu has quit IRC01:24
*** ociuhandu has joined #openstack-nova01:36
*** gyee has quit IRC01:41
*** ociuhandu has quit IRC01:46
*** ileixe has quit IRC01:57
*** ileixe has joined #openstack-nova01:59
*** ileixe has quit IRC02:00
*** ileixe has joined #openstack-nova02:00
*** macz has joined #openstack-nova02:01
*** macz has quit IRC02:06
*** takashin has joined #openstack-nova02:08
*** TxGirlGeek has joined #openstack-nova02:29
*** brault has quit IRC02:32
*** brault has joined #openstack-nova02:32
*** mkrai has joined #openstack-nova02:51
*** abaindur has joined #openstack-nova02:58
*** brinzhang has quit IRC02:59
*** brinzhang_ has quit IRC03:00
*** ccamacho has quit IRC03:13
*** ociuhandu has joined #openstack-nova03:17
*** ociuhandu has quit IRC03:22
*** tbachman has quit IRC03:25
*** zhanglong has joined #openstack-nova03:31
*** bhagyashris has joined #openstack-nova03:42
*** udesale has joined #openstack-nova03:42
*** liuyulong has quit IRC03:46
*** ileixe has quit IRC03:46
*** ileixe has joined #openstack-nova03:50
*** awalende has joined #openstack-nova03:54
*** zhubx has quit IRC03:59
*** awalende has quit IRC03:59
*** boxiang has joined #openstack-nova03:59
*** dave-mccowan has quit IRC04:02
*** dave-mccowan has joined #openstack-nova04:12
*** yaawang has quit IRC04:15
*** yaawang has joined #openstack-nova04:16
*** tkajinam has quit IRC04:26
*** tkajinam has joined #openstack-nova04:33
*** TxGirlGeek has quit IRC04:36
*** ociuhandu has joined #openstack-nova04:43
*** bhagyashris has quit IRC04:44
*** ociuhandu has quit IRC04:48
*** takashin has left #openstack-nova04:51
*** dave-mccowan has quit IRC05:05
openstackgerritya.wang proposed openstack/nova-specs master: Add "live migration without performance impact" spec.  https://review.opendev.org/69365505:06
*** tkajinam_ has joined #openstack-nova05:08
*** tkajinam has quit IRC05:11
*** bhagyashris has joined #openstack-nova05:18
*** ociuhandu has joined #openstack-nova05:31
*** tkajinam_ has quit IRC05:34
*** ociuhandu has quit IRC05:35
*** tkajinam has joined #openstack-nova05:36
*** tkajinam has quit IRC06:03
*** tkajinam has joined #openstack-nova06:05
*** abaindur has quit IRC06:16
*** tkajinam has quit IRC06:30
*** tkajinam has joined #openstack-nova06:31
*** tkajinam has quit IRC06:31
*** tbachman has joined #openstack-nova06:34
*** dtantsur|afk is now known as dtantsur06:34
*** tbachman_ has joined #openstack-nova06:35
*** tbachman has quit IRC06:39
*** tbachman_ is now known as tbachman06:39
*** Luzi has joined #openstack-nova06:40
*** slaweq has quit IRC06:45
*** sridharg has joined #openstack-nova06:46
*** tbachman has quit IRC06:53
*** zhanglong has quit IRC07:00
*** do3meli has joined #openstack-nova07:01
*** tkajinam has joined #openstack-nova07:01
*** jangutter has joined #openstack-nova07:02
*** dpawlik has joined #openstack-nova07:05
*** chenhaw has joined #openstack-nova07:07
*** dpawlik has quit IRC07:10
*** dklyle has quit IRC07:19
*** dklyle has joined #openstack-nova07:19
*** dpawlik has joined #openstack-nova07:20
*** igordc has quit IRC07:25
*** do3meli has left #openstack-nova07:27
*** bhagyashris has quit IRC07:28
*** ociuhandu has joined #openstack-nova07:35
*** ociuhandu has quit IRC07:41
*** damien_r has joined #openstack-nova07:52
*** mmethot has quit IRC07:56
*** trident has quit IRC07:57
*** tesseract has joined #openstack-nova08:00
*** maciejjozefczyk has joined #openstack-nova08:02
*** trident has joined #openstack-nova08:06
*** bhagyashris has joined #openstack-nova08:15
*** awalende has joined #openstack-nova08:16
*** rpittau|afk is now known as rpittau08:17
*** ivve has joined #openstack-nova08:19
*** yaawang has quit IRC08:24
*** yaawang has joined #openstack-nova08:24
*** ralonsoh has joined #openstack-nova08:29
*** dpawlik has quit IRC08:32
*** tkajinam has quit IRC08:35
*** xek_ has joined #openstack-nova08:38
*** links has joined #openstack-nova08:40
*** yan0s has joined #openstack-nova08:40
*** zhanglong has joined #openstack-nova08:44
*** dpawlik has joined #openstack-nova08:47
*** dswebb has joined #openstack-nova08:49
*** dswebb has left #openstack-nova08:49
*** slaweq has joined #openstack-nova08:52
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Wire up a force disconnect_volume flag  https://review.opendev.org/58484908:53
*** damien_r has quit IRC08:54
*** ociuhandu has joined #openstack-nova09:03
openstackgerritBalazs Gibizer proposed openstack/nova stable/train: Use admin neutron client to query ports for binding  https://review.opendev.org/69401309:06
openstackgerritBalazs Gibizer proposed openstack/nova stable/train: Use admin neutron client to gather port resource requests  https://review.opendev.org/69401509:07
*** ociuhandu has quit IRC09:09
openstackgerritBalazs Gibizer proposed openstack/nova stable/train: Use admin neutron client to gather port resource requests  https://review.opendev.org/69401509:10
*** ociuhandu has joined #openstack-nova09:10
*** ccamacho has joined #openstack-nova09:12
*** jistr has quit IRC09:19
*** jistr has joined #openstack-nova09:20
openstackgerritBalazs Gibizer proposed openstack/nova stable/train: Use admin neutron client to see if instance has qos ports  https://review.opendev.org/69401809:21
*** ociuhandu has quit IRC09:25
*** ociuhandu has joined #openstack-nova09:26
stephenfinsean-k-mooney: Approved https://review.opendev.org/#/c/683174 FYI09:27
*** ociuhandu has quit IRC09:29
*** jistr has quit IRC09:29
*** ociuhandu has joined #openstack-nova09:30
*** tssurya has joined #openstack-nova09:30
*** ociuhandu has quit IRC09:31
*** ociuhandu has joined #openstack-nova09:31
*** ccamacho has quit IRC09:32
*** jistr has joined #openstack-nova09:36
*** ccamacho has joined #openstack-nova09:36
openstackgerritMerged openstack/nova-specs master: Add spec for VM-scoped SR-IOV NUMA affinity  https://review.opendev.org/68317409:37
*** ociuhandu has quit IRC09:37
*** ociuhandu has joined #openstack-nova09:37
*** ileixe has quit IRC09:47
*** ileixe has joined #openstack-nova09:48
*** zhanglong has quit IRC09:50
*** Liang__ has quit IRC09:52
*** priteau has joined #openstack-nova10:05
*** shilpasd has joined #openstack-nova10:05
*** gibi is now known as gibi_off10:11
* gibi_off is on two days of internal conference10:11
*** ociuhandu has quit IRC10:13
*** ociuhandu has joined #openstack-nova10:14
*** ileixe has quit IRC10:16
*** ociuhandu has quit IRC10:19
*** zbr has quit IRC10:21
*** brinzhang has joined #openstack-nova10:24
openstackgerritLee Yarwood proposed openstack/os-traits master: WIP Add COMPUTE_RESCUE_STABLE_DEVICES and COMPUTE_RESCUE_BFV traits  https://review.opendev.org/69403310:25
*** brinzhang_ has joined #openstack-nova10:29
*** brinzhang has quit IRC10:31
*** brinzhang has joined #openstack-nova10:42
*** chenhaw has quit IRC10:44
*** brinzhang_ has quit IRC10:45
*** zbr has joined #openstack-nova10:48
*** ralonsoh has quit IRC11:07
*** ralonsoh has joined #openstack-nova11:09
*** ociuhandu has joined #openstack-nova11:14
*** udesale has quit IRC11:15
*** ociuhandu has quit IRC11:18
*** purplerbot has quit IRC11:23
*** brinzhang_ has joined #openstack-nova11:29
*** brinzhang has quit IRC11:32
*** ociuhandu has joined #openstack-nova11:36
*** bhagyashris has quit IRC11:59
sean-k-mooneystephenfin: when you wrote the original alias-based NUMA affinity feature did you add docs? I don't see it referenced here https://docs.openstack.org/nova/train/admin/pci-passthrough.html12:01
sean-k-mooneythere is https://docs.openstack.org/nova/train/configuration/config.html#pci.alias12:02
sean-k-mooneywhich mentions the numa_policy field but does not explain its values12:02
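For readers unfamiliar with the numa_policy field discussed here, a minimal sketch of an alias value that sets it. It assumes the three policy names listed in the Train [pci]alias config docs (required, legacy, preferred). The helper name check_alias and the example vendor/product IDs and alias name are hypothetical; this is not nova's own parser.

```python
import json

# Policy names as listed in the Train [pci]alias config docs (assumption).
VALID_NUMA_POLICIES = {"required", "legacy", "preferred"}


def check_alias(raw_alias):
    """Parse one alias JSON string and return (name, numa_policy).

    Hypothetical helper for illustration; this is not nova's parser.
    numa_policy is None when the alias does not set it.
    """
    spec = json.loads(raw_alias)
    policy = spec.get("numa_policy")
    if policy is not None and policy not in VALID_NUMA_POLICIES:
        raise ValueError("unknown numa_policy: %s" % policy)
    return spec["name"], policy


# Example alias value; the vendor/product IDs and the name are made up.
alias = ('{"vendor_id": "8086", "product_id": "154d", '
         '"device_type": "type-PF", "name": "a1", '
         '"numa_policy": "preferred"}')
print(check_alias(alias))
```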
*** kaisers has quit IRC12:02
*** henriqueof1 has quit IRC12:04
*** kaisers has joined #openstack-nova12:05
*** dpawlik has quit IRC12:06
*** dpawlik has joined #openstack-nova12:12
*** dpawlik has quit IRC12:14
*** purplerbot has joined #openstack-nova12:14
*** dpawlik has joined #openstack-nova12:17
*** dpawlik has quit IRC12:17
*** priteau has quit IRC12:19
*** mkrai has quit IRC12:21
*** mkrai has joined #openstack-nova12:21
*** priteau has joined #openstack-nova12:22
*** dpawlik has joined #openstack-nova12:22
*** brinzhang has joined #openstack-nova12:25
*** dpawlik has quit IRC12:26
*** dpawlik has joined #openstack-nova12:27
*** brinzhang_ has quit IRC12:28
*** ociuhandu has quit IRC12:35
*** ociuhandu has joined #openstack-nova12:35
*** ociuhandu has quit IRC12:36
*** ociuhandu has joined #openstack-nova12:37
openstackgerritBalazs Gibizer proposed openstack/nova stable/train: Use admin neutron client to see if instance has qos ports  https://review.opendev.org/69401812:45
gibi_offelod: ^^ additional diff was needed to make the backport work properly due to feature merged in ussuri12:47
*** shilpasd has quit IRC12:48
openstackgerritBalazs Gibizer proposed openstack/nova master: Mask the token used to allow access to consoles  https://review.opendev.org/22062212:50
*** brinzhang_ has joined #openstack-nova12:53
*** damien_r has joined #openstack-nova12:53
*** rcernin has quit IRC12:54
openstackgerritLee Yarwood proposed openstack/nova-specs master: Virtual instance rescue with stable disk devices  https://review.opendev.org/69384912:55
openstackgerritLee Yarwood proposed openstack/nova-specs master: Boot from volume instance rescue  https://review.opendev.org/69406312:55
*** brinzhang has quit IRC12:55
*** dtantsur is now known as dtantsur|afk12:57
elodgibi_off: thanks, i'm not there yet, but will look into it :)13:01
stephenfinsean-k-mooney: Think I just documented it in the config option?13:02
stephenfinthough we should really have it documented in doc/source/admin/pci-passthrough.rst13:02
stephenfinI can do that now13:02
stephenfingood way to get used to having working internet again13:03
*** tbachman has joined #openstack-nova13:03
* stephenfin has no idea how alex_xu et al manage :O13:04
*** tbachman has quit IRC13:08
*** ociuhandu has quit IRC13:08
*** ociuhandu has joined #openstack-nova13:09
*** brinzhang has joined #openstack-nova13:09
*** brinzhang has quit IRC13:10
*** brinzhang_ has quit IRC13:11
openstackgerritBalazs Gibizer proposed openstack/nova master: Remove functional test specific nova code  https://review.opendev.org/68360913:12
*** tbachman has joined #openstack-nova13:14
*** ociuhandu has quit IRC13:14
*** dpawlik has quit IRC13:24
*** mkrai has quit IRC13:28
*** mkrai_ has joined #openstack-nova13:28
*** tbachman has quit IRC13:30
openstackgerritBalazs Gibizer proposed openstack/nova master: Remove functional test specific nova code  https://review.opendev.org/68360913:31
efriedstephenfin: alex_xu already has to deal with Intel's firewall, the gfwoc is nbd.13:40
efriedI work around it by using a personal computer for real work.13:41
bauzasI wonder whether we should propose some Chinese IRC server13:41
efriedbauzas: isn't that called WeChat?13:41
bauzasefried: sure, but WeChat doesn't support IRC13:41
efriedoh, you mean a server for actual IRC, got it.13:41
bauzashaving the same client would be nice13:41
efriedI'm not sure the technology is the problem; it's the connectivity.13:42
bauzaslike, I could use this Chinese server plus the Freenode one13:42
efriedisn't the problem that the bits can't get in & out?13:42
bauzasof course, but I'm pretty sure an IRC server would be supported by the Chinese government :)13:42
bauzasI mean, when running in a Chinese cloud ;)13:42
bauzasalex_xu: FWIW, I'll still continue to use WeChat13:43
*** nweinber has joined #openstack-nova13:47
*** mdbooth has joined #openstack-nova13:52
*** mriedem has joined #openstack-nova13:53
kashyapefried: Hi, when you get a moment, is a Blueprint required for this: https://bugs.launchpad.net/nova/+bug/1852437 (Allow ability to disable individual CPU features via `cpu_model_extra_flags`)13:54
openstackLaunchpad bug 1852437 in OpenStack Compute (nova) "Allow ability to disable individual CPU features via `cpu_model_extra_flags`" [Undecided,New]13:54
*** dpawlik has joined #openstack-nova13:54
*** davee_ has joined #openstack-nova13:55
efriedkashyap: at a glance, a blueprint seems entirely necessary. A spec, not as sure. Perhaps put it on tomorrow's meeting agenda for discussion?13:55
kashyapefried: Spec / BP seems like overkill, IMHO.  But sure, can discuss tomorrow13:56
*** tbachman has joined #openstack-nova13:56
efriedwe're talking about enhancing syntax of a conf option in what seems like a nontrivial way.13:56
kashyapIt essentially aims to implement what I stated in parentheses at the end of this commit message: https://opendev.org/openstack/nova/commit/cc27a2007f31413:56
*** eharney has joined #openstack-nova13:56
kashyapefried: Hmm, phrased that way...13:56
kashyapefried: But what do you think of it?  Is there a better way you can think of than the +/- notation?13:57
*** mmethot has joined #openstack-nova13:57
efriedkashyap: no, that seems reasonable, just seems like a thing that ought to have a "design" somewhere written down that we can agree on. Just having it in a RFE bug might be sufficient, but really that's what blueprints are for. And it's the kind of thing I would not expect to backport.13:59
*** tbachman has quit IRC14:00
*** tbachman has joined #openstack-nova14:00
kashyapefried: Sure, can file a simple blueprint14:01
efriedthanks kashyap.14:01
kashyapI first wanted to file it, but went the bug route, thinking it is a "simple idea".  :D14:02
*** tbachman has quit IRC14:09
kashyapDone: https://blueprints.launchpad.net/nova/+spec/allow-disabling-cpu-flags14:09
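The +/- notation discussed above could be sketched roughly as follows. This is only an illustration of the idea, not the design in the blueprint: the function name split_cpu_flags is hypothetical, and treating an unprefixed flag as "enable" is an assumption made here for the example.

```python
def split_cpu_flags(flags):
    """Split flag names into (enabled, disabled) lists.

    Hypothetical sketch of the proposed +/- notation: a leading '-'
    disables a flag; a leading '+' or no prefix enables it (the
    no-prefix case is an assumption of this sketch).
    """
    enabled, disabled = [], []
    for flag in flags:
        flag = flag.strip().lower()
        if not flag:
            continue
        if flag.startswith("-"):
            disabled.append(flag[1:])
        else:
            enabled.append(flag.lstrip("+"))
    return enabled, disabled


# Example input as it might appear in cpu_model_extra_flags.
print(split_cpu_flags(["+pcid", "-mpx", "ssbd"]))
```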
*** dviroel has joined #openstack-nova14:13
*** amodi has quit IRC14:14
*** mkrai_ has quit IRC14:18
kashyapmriedem: Maybe we can just close this, as we'll be tracking it in the Blueprint? - https://bugs.launchpad.net/nova/+bug/185243714:19
openstackLaunchpad bug 1852437 in OpenStack Compute (nova) "Allow ability to disable individual CPU features via `cpu_model_extra_flags`" [Wishlist,New]14:19
* kashyap marked it as "Invalid" / "Wishlist"14:20
mriedemwfm14:20
*** links has quit IRC14:30
*** tbachman has joined #openstack-nova14:36
efriedmriedem: what's the easiest/best way to "discover" a compute node's UUID (the one that'll match the placement root RP)?14:43
efried(I'm trying to middleman here, not actually sure if we know anything about the node beforehand)14:44
*** ociuhandu has joined #openstack-nova14:44
*** dpawlik has quit IRC14:48
mriedemyou mean to get the ComputeNode object?14:48
mriedemfor non-ironic nodes the compute node uuid is randomly generated when the record is created the first time14:49
mriedemto look up the computenode record, you want to use the host/nodename14:49
mriedemif you're on compute and it's not ironic, you can just use CONF.host14:49
mriedemhost == node for non-ironic14:49
mriedemotherwise you get the nodenames from the driver14:49
mriedemsee ComputeManager.update_available_resource14:50
mriedemor you're trying to do something outside of nova to try to expose a vulnerability?14:50
bauzasefried: yeah, you need to know the (host, node) tuple14:51
bauzasmriedem: and no, AFAICR, for some virt drivers, node != host14:51
mriedembauzas: yeah, ironic14:52
mriedemi said that14:52
bauzasmriedem: not only for ironic14:52
mriedemwhich ones?14:52
mriedemif you're thinking vcenter, you're thinking of kilo era14:52
bauzasI don't remember, lemme look14:52
*** Luzi has quit IRC14:52
*** tesseract has quit IRC14:52
efriedI think I can work with this, thanks.14:53
bauzasI do wonder for HyperV14:53
mriedemhyperv has only 1 node per host14:53
mriedemhttp://cloudbase-ci.com/nova/693937/1/windows/logs/n-h2-693937-1/nova-compute.log.gz14:54
mriedem2019-11-13 05:46:51.769 5012 103574784 GreenThread-1 INFO nova.compute.resource_tracker [req-320b7c3d-8917-4c50-b48e-18dd405c7877 - - - - -] Compute node record created for n-h2-693937-1:n-h2-693937-1 with uuid: 77e9e8d8-bba6-4aba-81f9-5e9f69ca1db914:54
*** boxiang has quit IRC14:54
*** tssurya has quit IRC14:54
*** boxiang has joined #openstack-nova14:55
mriedemironic is the only weirdo14:55
*** tesseract has joined #openstack-nova14:55
bauzasanyway, looks like you're right14:56
bauzashttps://github.com/openstack/nova/blob/master/nova/compute/manager.py#L918914:57
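The (host, node) lookup described above can be modelled with a small in-memory sketch. It uses stand-in records rather than nova's ComputeNode object, and find_node_uuids is a hypothetical name. The first record reuses the host, node and UUID from the Hyper-V CI log line quoted above; the ironic-style records are made up.

```python
from collections import namedtuple

# Stand-in for a compute node record; not nova's ComputeNode object.
NodeRecord = namedtuple("NodeRecord", ["host", "nodename", "uuid"])


def find_node_uuids(records, host, nodenames=None):
    """Return {nodename: uuid} for the nodes managed by one host.

    For non-ironic drivers there is one node whose name equals the
    host, so nodenames defaults to [host]. For ironic the caller
    passes the node names reported by the driver.
    """
    wanted = set(nodenames or [host])
    return {r.nodename: r.uuid for r in records
            if r.host == host and r.nodename in wanted}


records = [
    # Values taken from the Hyper-V CI log line in this discussion.
    NodeRecord("n-h2-693937-1", "n-h2-693937-1",
               "77e9e8d8-bba6-4aba-81f9-5e9f69ca1db9"),
    # Made-up ironic-style records: one host, several nodes.
    NodeRecord("ironic-host", "node-a",
               "00000000-0000-0000-0000-00000000000a"),
    NodeRecord("ironic-host", "node-b",
               "00000000-0000-0000-0000-00000000000b"),
]
print(find_node_uuids(records, "n-h2-693937-1"))
print(find_node_uuids(records, "ironic-host", ["node-a", "node-b"]))
```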
*** liuyulong has joined #openstack-nova14:58
*** lpetrut has joined #openstack-nova14:58
*** usr2033 has joined #openstack-nova14:58
usr2033Hi, can I PXE boot nova/kvm instances?14:59
mriedemnote that https://docs.openstack.org/nova/latest/admin/configuration/hypervisors.html doesn't have a subpage for ironic as a compute driver in nova - that's probably a decent sized gap given all of the edge cases we could describe with ironic in nova as a compute driver14:59
openstackgerritMerged openstack/python-novaclient master: Add minor version [21] to the test_versions  https://review.opendev.org/68859914:59
mriedemnot to mention scaling issues when using the ironic driver to manage lots of nodes from a single compute service, re the ML threads on RT perf issues15:00
mriedemusr2033: not natively, no15:02
mriedemsee https://serverfault.com/questions/469479/does-nova-support-pxe-boot15:02
mriedemusr2033: you may be interested in https://openstack-virtual-baremetal.readthedocs.io/en/latest/index.html15:03
*** ociuhandu has quit IRC15:04
efriedI may be choppy today, going to be trying to work remotely (like, more remotely than usual).15:06
*** ociuhandu has joined #openstack-nova15:06
* efried travels...15:06
*** efried has quit IRC15:06
openstackgerritStephen Finucane proposed openstack/nova master: Remove 'os-consoles' API  https://review.opendev.org/68790715:09
openstackgerritStephen Finucane proposed openstack/nova master: Remove 'nova-console' service, 'os-consoles' API  https://review.opendev.org/68790815:09
openstackgerritStephen Finucane proposed openstack/nova master: Remove 'nova-xvpvncproxy'  https://review.opendev.org/68790915:09
*** ociuhandu has quit IRC15:11
*** liuyulong has quit IRC15:11
openstackgerritMerged openstack/nova master: Remove TODOs around claim_resources_on_destination  https://review.opendev.org/69363515:15
mriedembauzas: https://bugs.launchpad.net/nova/+bug/185244615:15
openstackLaunchpad bug 1852446 in OpenStack Compute (nova) "Hypervisors in nova - no subpage details for ironic" [Undecided,New]15:15
bauzasack, good point15:16
* bauzas needs to disappear for 45 mins15:16
stephenfinmriedem: It's not urgent, but if you can rebase https://review.opendev.org/#/c/693425/ today I'm happy to push it through15:18
mriedemack, let me rebase the entire cross-cell series first quick so i can destroy the gate15:19
openstackgerritMatt Riedemann proposed openstack/nova master: Make API always RPC cast to conductor for resize/migrate  https://review.opendev.org/69393715:21
openstackgerritMatt Riedemann proposed openstack/nova master: Filter duplicates from compute API get_migrations_sorted()  https://review.opendev.org/63622415:21
openstackgerritMatt Riedemann proposed openstack/nova master: Start functional testing for cross-cell resize  https://review.opendev.org/63625315:21
openstackgerritMatt Riedemann proposed openstack/nova master: Handle target host cross-cell cold migration in conductor  https://review.opendev.org/64259115:21
openstackgerritMatt Riedemann proposed openstack/nova master: Validate image/create during cross-cell resize functional testing  https://review.opendev.org/64259215:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add zones wrinkle to TestMultiCellMigrate  https://review.opendev.org/64345015:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add negative test for cross-cell finish_resize failing  https://review.opendev.org/64345115:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add negative test for prep_snapshot_based_resize_at_source failing  https://review.opendev.org/66901315:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add confirm_snapshot_based_resize_at_source compute method  https://review.opendev.org/63705815:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add ConfirmResizeTask  https://review.opendev.org/63707015:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add confirm_snapshot_based_resize conductor RPC method  https://review.opendev.org/63707515:21
openstackgerritMatt Riedemann proposed openstack/nova master: Confirm cross-cell resize from the API  https://review.opendev.org/63731615:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add revert_snapshot_based_resize_at_dest compute method  https://review.opendev.org/63763015:21
openstackgerritMatt Riedemann proposed openstack/nova master: Deal with cross-cell resize in _remove_deleted_instances_allocations  https://review.opendev.org/63945315:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add finish_revert_snapshot_based_resize_at_source compute method  https://review.opendev.org/63764715:21
openstackgerritMatt Riedemann proposed openstack/nova master: WIP: Add RevertResizeTask  https://review.opendev.org/63804615:21
openstackgerritMatt Riedemann proposed openstack/nova master: Add revert_snapshot_based_resize conductor RPC method  https://review.opendev.org/63804715:21
openstackgerritMatt Riedemann proposed openstack/nova master: Revert cross-cell resize from the API  https://review.opendev.org/63804815:21
mriedemdansmith: gibi_off: https://review.opendev.org/#/c/693937/ is fixed now15:22
mriedempy2 vs py3 weirdness15:22
*** mkrai_ has joined #openstack-nova15:26
openstackgerritMerged openstack/nova master: Remove now invalid TODO from ComputeManager._confirm_resize  https://review.opendev.org/69342715:27
openstackgerritMerged openstack/nova master: Use ListOfUUIDField from oslo.versionedobjects  https://review.opendev.org/69325815:27
openstackgerritMerged openstack/nova master: Add known limitation about resize not resizing ephemeral disks  https://review.opendev.org/69191515:27
openstackgerritMerged openstack/nova master: api-ref: re-work resize action post-conditions  https://review.opendev.org/69191815:27
openstackgerritMerged openstack/nova master: Provide a better error when _verify_response hits a TypeError  https://review.opendev.org/69304215:27
openstackgerritMatt Riedemann proposed openstack/nova master: Use named kwargs in compute.API.resize  https://review.opendev.org/69342515:30
mriedemstephenfin: done ^15:30
*** awalende has quit IRC15:33
*** awalende has joined #openstack-nova15:33
openstackgerritElod Illes proposed openstack/nova stable/pike: cleanup evacuated instances not on hypervisor  https://review.opendev.org/68791215:36
mdboothlyarwood: Re https://review.opendev.org/#/c/694033/ What did you think about combining these 2 traits?15:36
mdboothIIUC their window of usefulness is limited to the period between stable disks and BFV rescue landing, right?15:37
mdboothCOMPUTE_RESCUE_BFV implies COMPUTE_RESCUE_STABLE_DEVICES15:37
*** awalende has quit IRC15:38
lyarwoodmdbooth: for the libvirt driver implementation yeah15:39
* mdbooth suggests: 1) nobody else is going to implement this 2) if they did, there's no reason they couldn't have the same semantics.15:40
sean-k-mooneyi think having two traits is fine15:40
sean-k-mooneyglad to see you put them under compute :)15:41
mdboothlyarwood: As I said in the spec, though, if this discussion gets in the way of getting this done, it's not worth it.15:41
* mdbooth is +1 on this with or without 2 traits.15:41
mdboothBut I think 1 trait would be better.15:41
*** JamesBenson has joined #openstack-nova15:41
sean-k-mooneyit's only better if we support rescue for BFV in the same step15:42
sean-k-mooneyor release15:42
mdboothsean-k-mooney: Right, but the change from stable disks to rescue BFV is trivial.15:42
mdboothIIRC it's just removing a check in the api which prevents it.15:42
sean-k-mooneysure, although one point: you can't assume that just using a USB device will mean it won't reorder the disks in all cases15:43
sean-k-mooneyit's going to be true 99% of the time15:43
mdboothsean-k-mooney: That's unrelated.15:43
mdbooth(True, but unrelated)15:44
sean-k-mooneywell, it's part of the premise of the stable-device rescue spec15:44
sean-k-mooneyya15:44
*** TxGirlGeek has joined #openstack-nova15:44
lyarwoodit's not going to reorder the physical layout, that's all we can guarantee15:44
sean-k-mooneyjust said I would mention it since you can already set hw_disk_bus to usb https://github.com/openstack/glance/blob/master/etc/metadefs/compute-libvirt-image.json#L3915:44
lyarwoodeverything else within the guestOS is out of our control15:44
mdboothRight. It's definitely way better than what we do now in all cases.15:45
mdboothlyarwood: Don't suppose you still have a link to the old patches kicking about, do you?15:45
*** jangutter has quit IRC15:45
lyarwoodyeah I'm working through a rebase now15:45
lyarwoodmdbooth: https://review.opendev.org/#/q/topic:bp/virt-rescue-stable-disk-devices15:45
lyarwoodmdbooth: hope to have it posted later this evening15:46
mdboothlyarwood: Cool. Did we discuss switching it on unconditionally in a new microversion, btw?15:47
mdboothIIRC there's a new microversion involved anyway.15:47
lyarwoodmdbooth: that's what I'm suggesting in the follow up spec at the moment15:48
lyarwoodmdbooth: well, with the trait15:48
lyarwoodmdbooth: so it's not unconditional15:48
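The one-trait versus two-trait question above can be illustrated with a small sketch. The trait names are the ones from the os-traits review under discussion; rescue_support and its single_trait switch are hypothetical and only model the "COMPUTE_RESCUE_BFV implies COMPUTE_RESCUE_STABLE_DEVICES" semantics mdbooth suggests, not how nova would actually check them.

```python
STABLE = "COMPUTE_RESCUE_STABLE_DEVICES"
BFV = "COMPUTE_RESCUE_BFV"


def rescue_support(host_traits, single_trait=False):
    """Return (stable_devices, bfv_rescue) for a host's trait set.

    Hypothetical helper, not nova code. With single_trait=True the BFV
    trait alone stands for both capabilities (the combined-trait idea).
    """
    traits = set(host_traits)
    bfv = BFV in traits
    stable = STABLE in traits or (single_trait and bfv)
    return stable, bfv


print(rescue_support([STABLE]))
print(rescue_support([BFV], single_trait=True))
print(rescue_support([BFV]))
```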
openstackgerritMatt Riedemann proposed openstack/nova master: api-ref: re-work migrate action post-conditions  https://review.opendev.org/69410315:53
openstackgerritStephen Finucane proposed openstack/nova master: Fix review link.  https://review.opendev.org/68961215:54
openstackgerritElod Illes proposed openstack/nova stable/pike: Add functional test for resize crash compute restart revert  https://review.opendev.org/68791315:59
openstackgerritMerged openstack/nova master: ItemsMatcher: mock call list arg in any order  https://review.opendev.org/68948716:04
artomsean-k-mooney, I know you're respinning https://review.opendev.org/#/c/674072/4 soon, but there are a couple more typos I found16:04
mriedemmelwitt: dansmith: this might have come up before and i'm just not remembering, but an instance that fails initial scheduling and is buried in cell0 doesn't have a 'create' instance action because we apparently don't create that in cell016:04
dansmithmriedem: not sure it has come up before16:05
dansmithmriedem: did it have one prior to the bury-in-cell0 behavior?16:05
melwittI don't recall talking about this before either16:05
*** ccamacho has quit IRC16:06
mriedemlooking back at mitaka, the api would create the 'create' action in the 'nova' cell db https://github.com/openstack/nova/blob/mitaka-eol/nova/compute/api.py#L1180 before casting off to conductor16:07
*** lpetrut has quit IRC16:08
mriedemi don't see anything in conductor which would "complete" that action with a fail even if scheduling failed https://github.com/openstack/nova/blob/mitaka-eol/nova/conductor/manager.py#L37416:08
mriedembut at least the action would exist16:08
dansmithack16:09
dansmithwell, makes sense16:09
openstackgerritElod Illes proposed openstack/nova stable/pike: Prevent init_host test to interfere with other tests  https://review.opendev.org/68791616:11
*** mlavalle has joined #openstack-nova16:12
mriedemmakes sense that we wouldn't create the instance action in cell0?16:13
sean-k-mooneyartom: cool i ran a spell check on it since i figured out how to do that with emacs16:13
sean-k-mooneyartom: im just running the tests currently16:14
artomsean-k-mooney, it was just capitalization and "use case" into 2 words16:14
openstackgerritMerged openstack/nova master: Fix ItemMatcher to avoid false positives  https://review.opendev.org/68969016:15
mriedemsean-k-mooney: are you planning on writing functional tests for https://review.opendev.org/#/c/674072/ ?16:15
dansmithmriedem: no makes sense that we missed doing that when we created bury-in-cell0, and makes sense that we should fix that16:17
mriedemdansmith: ah ok16:17
mriedemi'll open a bug in a bit16:17
*** gyee has joined #openstack-nova16:18
*** JamesBen_ has joined #openstack-nova16:18
*** ociuhandu has joined #openstack-nova16:20
sean-k-mooneyam i can if you would like them16:20
*** JamesBenson has quit IRC16:21
mriedemanything involving image meta / flavor extra specs + pci + affinity + scheduling + compute likely means unit tests aren't sufficient, yeah16:21
mriedem?16:21
mriedemmaybe that's just me16:21
openstackgerritAlexandre arents proposed openstack/nova master: Abort live-migration during instance_init  https://review.opendev.org/67801616:21
sean-k-mooneymriedem: the code change is just reusing the existing support for numa policies16:23
sean-k-mooneyso I'm just passing that policy via the flavor instead of the alias16:23
sean-k-mooneythat said, we may not have existing functional tests for that so I will look16:24
*** TxGirlGeek has quit IRC16:24
*** ivve has quit IRC16:26
sean-k-mooneymriedem: I was kind of assuming the existing functional tests for the numa policies would be sufficient but I will check.16:27
mriedemi never assume anything relating to numa test coverage is sufficient for new code dealing with numa16:30
sean-k-mooneyfair16:30
mriedembut i'm a cranky old troll16:30
*** ociuhandu has quit IRC16:30
*** ociuhandu has joined #openstack-nova16:31
*** efried has joined #openstack-nova16:36
openstackgerritElod Illes proposed openstack/nova stable/pike: Functional reproduce for bug 1833581  https://review.opendev.org/68791716:37
openstackbug 1833581 in OpenStack Compute (nova) train "instance stuck in BUILD state if nova-compute is restarted" [Low,Fix committed] https://launchpad.net/bugs/1833581 - Assigned to Balazs Gibizer (balazs-gibizer)16:37
mriedemmelwitt: can you hit https://review.opendev.org/#/c/693554/ to keep amodi's doc bug fix backports moving?16:38
*** ociuhandu has quit IRC16:44
melwittmriedem: yup, thanks16:45
stephenfinmriedem: You're not that old16:45
stephenfin🙃16:45
sean-k-mooneyartom: looks like I have already fixed most of your comments locally. I'm going to fix the last few then I'll push it up.16:47
sean-k-mooneystephenfin: do you recall if you wrote functional tests when you implemented the original numa policies? if so I'll extend them, if not I guess I'll write tests for both.16:48
stephenfinI very much doubt it. I've only started writing those for the last two cycles or so16:49
artomsean-k-mooney, I know I added https://review.opendev.org/#/c/682941/ recently16:49
sean-k-mooneyok16:50
* bauzas is back from town hall16:50
sean-k-mooneyartom: ya I remember16:50
sean-k-mooneyso those cover the alias based policies i think16:50
sean-k-mooneyyou added the missing one16:50
sean-k-mooneyso I was more or less relying on those but it should be easy to extend those to use the policy from the flavor16:51
artomsean-k-mooney, well yeah, because we only have the alias based policies for now :)16:51
*** jaosorior has joined #openstack-nova16:51
sean-k-mooneyok ill push the version i have now after i address your comments and ill start working on the func test after16:52
*** TxGirlGeek has joined #openstack-nova16:52
*** yan0s has quit IRC16:53
openstackgerritsean mooney proposed openstack/nova master: support pci numa affinity policies in flavor and image  https://review.opendev.org/67407216:54
*** TxGirlGeek has quit IRC16:54
*** nweinber_ has joined #openstack-nova16:56
*** mgariepy has quit IRC16:56
*** nweinber has quit IRC16:58
*** TxGirlGeek has joined #openstack-nova17:00
*** ociuhandu has joined #openstack-nova17:01
*** igordc has joined #openstack-nova17:02
*** rpittau is now known as rpittau|afk17:08
*** mkrai_ has quit IRC17:08
openstackgerritEric Fried proposed openstack/nova master: Consolidate [image_cache] conf options  https://review.opendev.org/69072317:09
openstackgerritEric Fried proposed openstack/nova master: Add image caching to the support matrix  https://review.opendev.org/69074817:09
efriedmriedem, stephenfin: Rebased and nit-fixed ^17:09
*** TxGirlGeek has quit IRC17:10
*** jaosorior has quit IRC17:12
openstackgerritStephen Finucane proposed openstack/nova master: "SUSPENDED" description changed in server_concepts guide and API REF  https://review.opendev.org/66359017:12
openstackgerritElod Illes proposed openstack/nova stable/pike: Error out interrupted builds  https://review.opendev.org/68791817:13
artomstephenfin, oh, friendly reminder that the NUMA LM func test stack is ready for another look: https://review.opendev.org/#/c/687404/17:22
artom(I know it's not so much a learning curve as a learning brick wall)17:22
stephenfinartom: Sure thing, but that's too much work for 5:20pm when jetlagged. I'll grab it in the morning :)17:24
artomstephenfin, ack, thank you!17:24
openstackgerritEric Fried proposed openstack/nova master: Remove functional test specific nova code  https://review.opendev.org/68360917:25
bauzasmriedem: dumb question but what does the user see from the API when we error out some instance because of NoValidHost?17:31
bauzasby a nova show I mean17:31
bauzasmriedem: was about to say +1 to your ML thread, but  was wondering the API result for the same17:32
*** igordc has quit IRC17:32
* bauzas looks at https://docs.openstack.org/api-ref/compute/?expanded=show-server-details-detail#list-servers-detailed17:33
mriedembauzas: the non-admin user? they see this:17:33
mriedem$ openstack server show build-fail1 -f value -c fault {u'message': u'No valid host was found. ', u'code': 500, u'created': u'2019-11-13T15:57:13Z'}17:33
mriedemthe fault is only shown if the server status is ERROR or DELETED17:33
bauzasmriedem: so the instance action event show should do the same17:33
mriedemtbc, i wasn't proposing that the action event exception type is only shown for an ERROR or DELETED status server17:34
mriedemsince as i said in the thread, you can fail a resize and the server status doesn't go to ERROR17:35
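[Editor's sketch] The rule mriedem describes (the fault is only exposed when the server status is ERROR or DELETED) can be illustrated roughly as below. This is a minimal sketch using plain dicts, not nova's actual view code; `server_view`, `FAULT_VISIBLE_STATUSES` and the sample servers are hypothetical names made up for illustration.

```python
# Hypothetical sketch, NOT nova's real API view builder.
# Assumption: a server is a plain dict with "name", "status" and an
# optional recorded "fault".
FAULT_VISIBLE_STATUSES = {"ERROR", "DELETED"}


def server_view(server):
    """Build a simplified 'server show' body from a plain dict."""
    body = {"name": server["name"], "status": server["status"]}
    fault = server.get("fault")
    # The fault is only included for ERROR or DELETED servers.
    if fault is not None and server["status"] in FAULT_VISIBLE_STATUSES:
        body["fault"] = fault
    return body


failed_build = {
    "name": "build-fail1",
    "status": "ERROR",
    "fault": {"message": "No valid host was found. ", "code": 500},
}
# A failed resize can leave the server in a non-ERROR status, so the
# fault (if any was recorded) would not be shown by this rule.
failed_resize = {
    "name": "resize-fail1",
    "status": "ACTIVE",
    "fault": {"message": "No valid host was found. ", "code": 500},
}

build_body = server_view(failed_build)
resize_body = server_view(failed_resize)
```

This is why the failed-resize case needs the instance action events: the server body alone gives the user nothing to go on.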
*** efried has quit IRC17:35
*** igordc has joined #openstack-nova17:35
bauzasmriedem: yeah, I understood this17:35
stephenfinefried: Think you could hold your nose on https://review.opendev.org/#/c/684345/16/nova/network/neutronv2/api.py to keep this moving, given I'm removing it again shortly after?17:36
*** nweinber_ has quit IRC17:36
bauzasmriedem: that's why I think we should *also* do it17:36
stephenfindrat, just missed him17:36
bauzasand yeah, of course, when a user calls some ops, we can ask 'do a nova show'17:36
bauzasbut sometimes we need to look at the instance actions (like for resize) and that's why I'd love your spec :)17:36
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Add TODOs for remaining nova-network functional tests  https://review.opendev.org/68434517:53
openstackgerritStephen Finucane proposed openstack/nova master: Remove 'os-security-group-default-rules' REST API  https://review.opendev.org/68680717:53
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Remove unused '*_default_rules' security group DB APIs  https://review.opendev.org/68680817:53
openstackgerritStephen Finucane proposed openstack/nova master: Remove (most) '/os-networks' REST APIs  https://review.opendev.org/68680917:53
openstackgerritStephen Finucane proposed openstack/nova master: Remove '/os-tenant-networks' REST API  https://review.opendev.org/68681017:53
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Remove 'USE_NEUTRON' from functional tests  https://review.opendev.org/68681117:53
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Remove 'networks' quota  https://review.opendev.org/68681217:53
openstackgerritStephen Finucane proposed openstack/nova master: Remove nova-manage network, floating commands  https://review.opendev.org/68681317:53
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Remove associate, disassociate network APIs  https://review.opendev.org/68681417:53
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Remove 'nova-dhcpbridge' binary  https://review.opendev.org/68681517:53
openstackgerritStephen Finucane proposed openstack/nova master: nova-net: Remove 'nova-network' binary  https://review.opendev.org/68681617:53
openstackgerritStephen Finucane proposed openstack/nova master: docs: Blast most references to nova-network  https://review.opendev.org/68681717:53
openstackgerritStephen Finucane proposed openstack/nova master: WIP  https://review.opendev.org/68681817:53
*** damien_r has quit IRC17:56
*** maciejjozefczyk has quit IRC17:58
*** JamesBen_ has quit IRC18:01
*** JamesBenson has joined #openstack-nova18:01
*** ociuhandu has quit IRC18:17
*** ociuhandu has joined #openstack-nova18:18
*** sridharg has quit IRC18:20
*** ociuhandu has quit IRC18:23
*** slaweq has quit IRC18:25
*** nweinber_ has joined #openstack-nova18:28
*** priteau has quit IRC18:32
*** tesseract has quit IRC18:35
*** jkulik has quit IRC18:40
*** gbarros has quit IRC18:49
*** JamesBen_ has joined #openstack-nova19:02
*** JamesBenson has quit IRC19:04
*** ralonsoh has quit IRC19:25
*** ociuhandu has joined #openstack-nova19:50
*** ociuhandu has quit IRC19:56
*** ociuhandu has joined #openstack-nova19:59
*** slaweq has joined #openstack-nova20:01
*** efried has joined #openstack-nova20:02
*** JamesBen_ has quit IRC20:02
*** JamesBenson has joined #openstack-nova20:03
*** ociuhandu has quit IRC20:06
openstackgerritDan Smith proposed openstack/nova-specs master: Virtual instance rescue with stable disk devices  https://review.opendev.org/69384920:08
openstackgerritMatt Riedemann proposed openstack/nova master: Create instance action when burying in cell0  https://review.opendev.org/69416520:11
*** abaindur has joined #openstack-nova20:14
*** ociuhandu has joined #openstack-nova20:16
*** dklyle has quit IRC20:20
*** dklyle has joined #openstack-nova20:23
*** ociuhandu has quit IRC20:25
*** jaosorior has joined #openstack-nova20:26
openstackgerritMatt Riedemann proposed openstack/nova master: Create instance action when burying in cell0  https://review.opendev.org/69416520:27
openstackgerritMatt Riedemann proposed openstack/nova master: doc: add troubleshooting guide for cleaning up orphaned allocations  https://review.opendev.org/69142720:31
*** abaindur has quit IRC20:40
*** abaindur has joined #openstack-nova20:40
*** abaindur has quit IRC20:41
*** abaindur has joined #openstack-nova20:42
*** CeeMac has quit IRC20:44
*** abaindur has quit IRC20:45
*** ociuhandu has joined #openstack-nova21:00
*** ociuhandu has quit IRC21:06
*** priteau has joined #openstack-nova21:17
*** priteau has quit IRC21:18
*** priteau has joined #openstack-nova21:20
*** priteau has quit IRC21:25
*** nweinber_ has quit IRC21:33
*** gshippey has quit IRC21:33
openstackgerritFrançois Palin proposed openstack/nova master: Add retry to cinder api calls related to volume detach  https://review.opendev.org/66967421:41
openstackgerritMerged openstack/nova master: Rename Claims resources to compute_node  https://review.opendev.org/67947021:44
openstackgerritMerged openstack/nova master: Clear instance.launched_on when build fails  https://review.opendev.org/68372521:45
*** takashin has joined #openstack-nova21:47
melwittso... after helping a colleague unwedge a failed resize for a customer, I've learned that we intentionally don't roll back port bindings to the source when finish_resize fails https://github.com/openstack/nova/blob/master/nova/compute/manager.py#L527421:49
melwittand the reasoning is because the assumption is "everything is fine" on the dest and that only the virt guest needs to be created21:49
melwittbut, when finish_resize fails because update of volume attachments fails (error had to do with "duplicate connectors") it doesn't seem like the dest should really be considered an ok place for the instance to be21:50
*** dviroel has quit IRC21:52
*** eharney has quit IRC21:54
melwittI wonder if fixing this might be as simple as moving the volume attach update before the network setup21:57
melwitton the dest21:57
mriedemi feel like i recently added that comment but maybe not21:58
mriedemwe have a lot of "cleanup networking and cleanup volumes" in live and cold migrate/resize where none of it is really atomic21:58
mriedemi made a note of that in the revert code for cross-cell resize here https://review.opendev.org/#/c/637647/51/nova/compute/manager.py@469021:59
mriedemi'm not sure what that duplicate connectors error is22:00
mriedemah that comment is relatively new https://review.opendev.org/#/c/635349/22:00
melwittyeah we're going to be investigating the root cause of the duplicate connectors thing, never seen that before22:01
melwitterror came back from cinder22:01
mriedemwhat was the volume type?22:01
melwittI dunno yet. we only focused on recovery for now, it's late at night for those who were fixing the wedged resize22:02
mriedemso migrate_instance_finish updated the port bindings to point at the dest host and then _update_volume_attachments blew up right?22:03
melwittwe (they) moved the port bindings back to the source manually and then we hard rebooted the instance to get it going for the customer again. they had only wanted to resize to a bigger flavor22:03
melwittyes22:03
mriedemstill, if you get to finish_resize the instance.host already points at the dest22:04
mriedemso they would have had to do more than change port bindings22:04
mriedemto get the guest to reboot on the source22:04
mriedemresize_instance on the source updates the instance.host/node before casting to finish_resize on the dest22:04
melwittand it started up fine with the larger flavor once the port binding was put back on the source. and prior to that yes I had them change the host/node back to the source22:04
melwittyeah sorry22:04
melwittthey'll have to fix the allocations too but the host is full so they're gonna move other stuff off and then fix once there's room22:05
*** slaweq has quit IRC22:06
mriedemso even if we swapped migrate_instance_finish and _update_volume_attachments for this very specific fail case, the operator still has manual stuff they have to do, like updating the instance to point back at the source host22:06
mriedemand if you do swap those, you likely should revert the instance host/node so hard rebooting on the source host works without manual db surgery22:07
melwittyeah, fair, but doing that is far easier than the port binding update. that was pretty involved and I personally didn't know how to do it. involved updating the virtual_interfaces table too I think22:07
mriedemhell, even if finish_migration i'm not sure why we wouldn't roll everything back to the source host, but we've already updated port bindings and volume attachments by then which is likely why we say "you're on the dest now"22:07
mriedemthe virtual_interfaces table shouldn't have anything to do with it22:08
mriedemunless they created new ports22:08
melwittok, maybe they just looked for a uuid or something. I wasn't really understanding22:09
melwittlooks like the volume type is "null" which would be the default22:10
melwittand I don't remember what the default is22:10
mriedemwell, if we changed the resize/cold migrate flow to use the multi-port binding api like we do for live migration since rocky, things would be a bit simpler to clean up, because you'd have 2 port binding resources in neutron rather than one, one is active and one is inactive22:10
mriedembut that's not a backportable change22:11
mriedemand trying to rollback automatically from everything that could go wrong during finish_resize is likely to be pretty hairy22:11
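[Editor's sketch] The appeal of having two port bindings (one active, one inactive) is that rolling back becomes a matter of flipping which binding is active, rather than rewriting a single binding by hand. A toy model of that idea follows, assuming bindings are plain dicts; `activate_binding` is a hypothetical helper and this does not call or mirror the neutron API.

```python
# Hypothetical model of per-host port bindings; not the neutron client.
def activate_binding(bindings, host):
    """Mark the binding for `host` ACTIVE and every other one INACTIVE."""
    if host not in {b["host"] for b in bindings}:
        raise ValueError("no binding for host %s" % host)
    for binding in bindings:
        binding["status"] = "ACTIVE" if binding["host"] == host else "INACTIVE"
    return bindings


bindings = [
    {"host": "source-host", "status": "ACTIVE"},
    {"host": "dest-host", "status": "INACTIVE"},
]

# Resize moves the port to the dest...
activate_binding(bindings, "dest-host")
after_move = [dict(b) for b in bindings]

# ...and if something later fails, the source binding still exists and
# can simply be reactivated.
activate_binding(bindings, "source-host")
```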
melwittyeah, if port bindings AND volume attach succeed then saying you're on the dest now is fine22:11
melwittbut if volumes fails, I don't see how dest could be ok22:11
melwittI was thinking of swapping the network setup and volume update only, in _finish_resize22:12
melwittso that if volume update fails, set the host/node back to the source like you said and bail. leave the port bindings alone22:13
mriedemi don't think that gets you out of the woods,22:14
mriedembecause resize_instance on the source, before casting to finish_resize on the dest, deletes the old volume attachment with the source host connector and creates a new 'empty' volume attachment that gets updated on the dest with the dest host connector22:14
mriedemsee _terminate_volume_connections22:14
melwittO.o22:14
melwittdammit22:14
mriedemso even if you go back to the source, the volume attachment has to be updated with the source host connector to re-connect the volumes on the source host22:15
melwittgah, this is over my head. I let them know they might have more to do to fix that VM22:16
mriedemi want to say that at some point booth had a patch which would automatically call revert_resize (dest) from finish_resize (dest) if finish_resize failed22:16
mriedembut i very much doubt that revert_resize is idempotent22:17
melwittthat sounds familiar22:17
mriedemi.e. i would not be surprised if just calling revert_resize from finish_resize fails in some weird way because it has some implicit preconditions that aren't setup when it's called to rollback22:17
melwittyeah22:18
mriedemnow if this all used task flow tasks with built in rollbacks...then we'd be cooking!22:18
mriedemi kid, but that's why i used granular conductor tasks for the cross-cell stuff so we can rollback at certain points in the flow22:19
melwitttask flow did come to mind22:20
mriedemi was hoping efried had it as a keyword and he'd just appear22:20
efried*poof*22:20
efriedyou should ask johnsom, shiny new taskflow core. Been a while since I touched it.22:21
efriedthough it's pretty simple when used simply.22:21
efriedand you're already doing most of the work, defining `execute` and `rollback` methods in your classes. Make those inherit from Task and do a little plumbing on the engine side, and presto.22:22
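[Editor's sketch] The execute/rollback pattern efried refers to can be sketched in plain Python as below. This deliberately does not use the taskflow library; `Step`, `run`, and the two step classes are hypothetical, and the failure is simulated with a flag. In this sketch only steps that completed are undone, in reverse order.

```python
# Plain-Python illustration of tasks with rollbacks. This is NOT taskflow
# and NOT nova code; all names here are made up for illustration.
class Step:
    """A unit of work that knows how to undo itself."""

    def execute(self, state):
        raise NotImplementedError

    def rollback(self, state):
        pass


class UpdatePortBindings(Step):
    def execute(self, state):
        state["port_host"] = state["dest"]

    def rollback(self, state):
        state["port_host"] = state["source"]


class UpdateVolumeAttachments(Step):
    def execute(self, state):
        if state.get("cinder_fails"):
            # Simulated failure, standing in for the error from cinder.
            raise RuntimeError("duplicate connectors")
        state["volume_host"] = state["dest"]

    def rollback(self, state):
        state["volume_host"] = state["source"]


def run(steps, state):
    """Run steps in order; on failure undo completed steps in reverse."""
    done = []
    try:
        for step in steps:
            step.execute(state)
            done.append(step)
    except Exception:
        for step in reversed(done):
            step.rollback(state)
        raise


state = {
    "source": "host-a",
    "dest": "host-b",
    "port_host": "host-a",
    "volume_host": "host-a",
    "cinder_fails": True,
}
try:
    run([UpdatePortBindings(), UpdateVolumeAttachments()], state)
except RuntimeError:
    pass  # the port binding has been moved back to host-a by rollback
```

The catch raised earlier in the discussion still applies: a rollback is only as good as its preconditions, so each step's undo has to be safe to call from the exact state its own execute left behind.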
johnsomHi, happy to chat about taskflow, but I'm in a meeting right now. Ping me in 30 if you need something22:22
efriedjroll: There's generally a master switch to enable a feature like swtpm. I'm just defining that. I was starting to make it boolean, but it occurred to me that someone might care to say "enable vtpm {1.2 and/or 2.0}". Do you?22:22
jrollefried: I'm not sure if we care22:23
jrollthat seems reasonable, though, 2.0 isn't backward compat22:23
jroll(from an application perspective)22:23
efriedyeah, rn there's support for both or neither, conf opt notwithstanding.22:23
efriedso really it would just allow you to disable one or the other.22:24
melwittmriedem: how do we find the source host connector? is it in our db or somewhere else?22:24
efriedjroll: Easier for me if you don't care. Since you didn't immediately, I'll code it up bool, and we can discuss it in review if necessary.22:24
jrollefried: in the 'both' case, how is it decided which version is presented to a vm?22:24
efriedjroll: flavor22:25
jrollah22:25
efriedyou are required to ask for one version or the other22:25
jrollyeah, +1 for discuss in review22:25
efriedbecause, as you say, they're not compat.22:25
mriedemmelwitt: the host connector is retrieved from the driver22:28
melwittargh.. ok, I was just seeing that in the code22:29
mriedemcinder does stash the host connector in the volume attachment record when we update the attachment, but as noted in resize_instance we blow away the source host attachment and create a new empty attachment for the dest host to update22:29
mriedemhaving said that, we (nova) do stash the last host connector used in the bdm.connection_info i think22:30
melwitt:'(22:30
melwittcray cray22:30
mriedemthat might only be when using initialize_connection for legacy attachments though22:30
mriedemi don't think we do that pack rat stash for new style volume attachments because cinder stores that information per volume attachment22:32
*** rcernin has joined #openstack-nova22:32
mriedemwhereas before with initialize_connection we worked with a single attachment record and duplicated it in our bdm.connection_info22:32
melwittlooks like we're ok, this is using ceph and somehow it's connected to the correct host (source)22:33
mriedemin this very specific scenario22:33
melwittand it's boot from volume22:33
melwittyeah22:33
*** JamesBen_ has joined #openstack-nova22:33
mriedemi think another idea that has come up before when talking about resize fail recovery is allowing the admin to reset the status on the server so they could then revert the resize22:33
mriedemor allow the revert resize api to work with instances in ERROR status22:34
melwittyup. that is the first thing I told them to try and there was no way to get the api to allow it22:34
mriedemthe instance was in ERROR state in this case right?22:34
*** JamesBen_ has quit IRC22:34
mriedemso if the api was changed to allow reverting a resize on an instance in ERROR status that could be potentially one way22:35
melwittit was originally yeah22:35
mriedemand we can detect that the instance was being resized because of the old_vm_state and old/new flavor stuff stashed on the instance that isn't cleaned up until confirm/revert22:35
mriedemmeaning, the api could puke if you tried reverting a resize on an ERROR instance that wasn't actually being resized22:35
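[Editor's sketch] The API-side check mriedem outlines (allow revert on an ERROR instance only if it was actually mid-resize) might look roughly like this. It is a sketch under assumptions: the instance is a plain dict, the key names `old_vm_state`, `old_flavor` and `new_flavor` stand in for wherever nova stashes that data, and `check_can_revert_resize` / `RevertNotAllowed` are hypothetical names, not existing nova API code.

```python
# Hypothetical guard for "revert resize from ERROR"; not nova's API code.
class RevertNotAllowed(Exception):
    pass


# Assumed markers left on the instance until confirm/revert cleans them up.
RESIZE_MARKERS = ("old_vm_state", "old_flavor", "new_flavor")


def resize_in_progress(instance):
    """True if all resize markers are still present on the instance."""
    return all(instance.get(key) is not None for key in RESIZE_MARKERS)


def check_can_revert_resize(instance):
    """Raise RevertNotAllowed unless the instance can be reverted."""
    vm_state = instance.get("vm_state")
    if vm_state == "resized":
        return  # the normal case that is already allowed
    if vm_state == "error" and resize_in_progress(instance):
        return  # the proposed opt-in recovery path
    raise RevertNotAllowed("instance is not in a revertable resize state")


wedged = {
    "vm_state": "error",
    "old_vm_state": "active",
    "old_flavor": "m1.small",
    "new_flavor": "m1.large",
}
plain_error = {"vm_state": "error"}
```

With these samples, `check_can_revert_resize(wedged)` passes while `check_can_revert_resize(plain_error)` raises, which is the "api could puke" behaviour described above.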
*** JamesBenson has quit IRC22:36
mriedemi want to say dansmith was in that discussion when it happened and wasn't crazy about the idea - maybe around the time of the proposal from booth to auto-revert on failed resize22:37
mriedemor maybe dan preferred just reverting from ERROR instead, i can't remember22:37
melwitthmm ok22:38
mriedemhttps://review.opendev.org/#/c/462521/ was booth's patch btw22:38
melwittah ok22:40
melwitttbh I think reverting from ERROR would be nice as just something22:41
melwittit's painful knowing all the code is there to do the thing but you can't get to it because of a vm_state block and then have to do a bunch of gnarly manual stuff22:42
*** abaindur has joined #openstack-nova22:46
mriedemyeah it's definitely a kick in the ass to fix up a failed resize because of volumes, networking, allocations in placement, and some day cyborg devices,22:46
mriedemplus the quota stuff involved to even move the server back22:46
mriedemlike you said, the source host was full when they tried moving it back22:47
mriedemi left a note in https://review.opendev.org/#/c/462521/12/releasenotes/notes/resize-auto-revert-6e1648828aba16b2.yaml@5 if it helps start a new discussion about changing the API22:47
melwittyeah thanks. and thanks for talking through this, I'm gonna refresh on what was going on in that review and then I was thinking start a ML thread about the API angle based on your comment22:49
mriedemi believe the tl;dr on why that patch to auto-revert was not great was dan's comment in the reno22:51
mriedemexternal tooling could get screwed up22:51
melwittgiven the behavioral/api change called out on that patch, I'm not sure whether there's a decent way to roll back automatically. if there's not, the api change would be a big help imho22:51
mriedemjust allowing reverting an ERROR'ed resize from the api though is pretty straight-forward and opt-into-y22:51
melwittyeah22:51
openstackgerritTakashi NATSUME proposed openstack/nova master: Update keypairs in saving an instance object  https://review.opendev.org/68304322:52
*** jaosorior has quit IRC22:55
dansmithmriedem: yeah, not sure I remember revert-from-ERROR specifically, but definitely not in favor of auto rollback23:01
*** tkajinam has joined #openstack-nova23:06
openstackgerritTakashi NATSUME proposed openstack/nova master: Update keypairs in saving an instance object  https://review.opendev.org/68304323:09
openstackgerritTakashi NATSUME proposed openstack/nova master: Update keypairs in saving an instance object  https://review.opendev.org/68304323:13
openstackgerritMerged openstack/nova master: cond: rename 'recreate' var to 'evacuate'  https://review.opendev.org/69290023:22
*** xek_ has quit IRC23:24
*** awalende has joined #openstack-nova23:25
*** igordc has quit IRC23:26
*** mlavalle has quit IRC23:26
openstackgerritMerged openstack/nova master: Remove PlacementAPIConnectFailure handling from AggregateAPI  https://review.opendev.org/66085223:27
*** awalende has quit IRC23:29
openstackgerritMatt Riedemann proposed openstack/nova master: Do not reschedule on ExternalNetworkAttachForbidden  https://review.opendev.org/69417923:30
*** ociuhandu has joined #openstack-nova23:31
openstackgerritMatt Riedemann proposed openstack/nova master: Do not reschedule on ExternalNetworkAttachForbidden  https://review.opendev.org/69417923:34
*** ociuhandu has quit IRC23:35
openstackgerritEric Fried proposed openstack/nova master: WIP: Add emulated TPM support to Nova  https://review.opendev.org/63136323:36
openstackgerritEric Fried proposed openstack/nova master: Add support for resize and cold migration of emulated TPM files  https://review.opendev.org/63993423:36
openstackgerritEric Fried proposed openstack/nova master: WIP: vTPM request_filter  https://review.opendev.org/67832523:36
*** efried has quit IRC23:39
openstackgerritMerged openstack/nova master: Remove dead HostAPI.service_delete code  https://review.opendev.org/69342223:43
openstackgerritMerged openstack/nova master: Add support matrix for Delete (Abort) on-going live migration  https://review.opendev.org/62578123:43
openstackgerritMerged openstack/nova master: Implement update_provider_tree for mocked driver in test_resource_tracker  https://review.opendev.org/69343123:44
openstackgerritMatt Riedemann proposed openstack/nova master: api-ref: mark device response param as optional for list/show vol attachments  https://review.opendev.org/69038323:45
*** brault has quit IRC23:59
