Thursday, 2020-02-06

*** igordc has quit IRC00:08
*** sapd1_ has joined #openstack-nova00:15
*** TxGirlGeek has joined #openstack-nova00:16
*** dtantsur has joined #openstack-nova00:16
*** mdbooth_ has joined #openstack-nova00:18
*** lee1 has joined #openstack-nova00:19
*** sapd1 has quit IRC00:21
*** TxGirlGe_ has quit IRC00:21
*** dtantsur|afk has quit IRC00:21
*** szaher has quit IRC00:22
*** mdbooth has quit IRC00:22
*** mvkr has quit IRC00:22
*** lyarwood has quit IRC00:22
*** sorrison has quit IRC00:22
*** sorrison has joined #openstack-nova00:22
*** mvkr has joined #openstack-nova00:22
*** johanssone has quit IRC00:23
*** dustinc has quit IRC00:25
*** cz3 has quit IRC00:25
*** mnasiadka has quit IRC00:25
*** dustinc has joined #openstack-nova00:25
*** knikolla has quit IRC00:25
*** arne_wiebalck has quit IRC00:25
*** ildikov has quit IRC00:25
*** pas-ha has quit IRC00:25
*** andreaf has quit IRC00:25
*** CeeMac has quit IRC00:25
*** vdrok has quit IRC00:25
*** donnyd has quit IRC00:25
*** rajinir has quit IRC00:25
*** ab-a has quit IRC00:25
*** fyx has quit IRC00:25
*** logan- has quit IRC00:25
*** mnasiadka has joined #openstack-nova00:25
*** andreaf has joined #openstack-nova00:25
*** CeeMac has joined #openstack-nova00:25
*** pas-ha has joined #openstack-nova00:25
*** knikolla has joined #openstack-nova00:25
*** cz3 has joined #openstack-nova00:25
*** donnyd has joined #openstack-nova00:25
*** rajinir has joined #openstack-nova00:25
*** vdrok has joined #openstack-nova00:25
*** fyx has joined #openstack-nova00:25
*** arne_wiebalck has joined #openstack-nova00:26
*** johanssone has joined #openstack-nova00:26
*** ildikov has joined #openstack-nova00:26
*** rm_work has quit IRC00:27
*** StevenK has quit IRC00:27
*** logan- has joined #openstack-nova00:27
*** StevenK has joined #openstack-nova00:27
*** openstackstatus has quit IRC00:28
*** jamesdenton has quit IRC00:28
*** Jeffrey4l has quit IRC00:29
*** melwitt has quit IRC00:29
*** rm_work has joined #openstack-nova00:30
*** melwitt has joined #openstack-nova00:30
*** ab-a has joined #openstack-nova00:30
*** Jeffrey4l has joined #openstack-nova00:31
*** jamesdenton has joined #openstack-nova00:31
efriedThanks alex_xu00:44
*** TxGirlGeek has quit IRC00:49
*** brinzhang has joined #openstack-nova00:56
*** spatel has joined #openstack-nova01:00
*** spatel has quit IRC01:00
*** vishalmanchanda has quit IRC01:04
*** adriant has quit IRC01:10
*** adriant has joined #openstack-nova01:13
*** jhesketh has quit IRC01:13
*** jhesketh has joined #openstack-nova01:14
*** mdbooth_ has quit IRC01:40
*** mdbooth has joined #openstack-nova01:41
*** Liang__ has joined #openstack-nova01:48
*** brinzhang has quit IRC02:04
*** brinzhang has joined #openstack-nova02:05
*** brinzhang has quit IRC02:06
*** brinzhang has joined #openstack-nova02:07
openstackgerritMerged openstack/nova master: Revert "nova shared storage: rbd is always shared storage"  https://review.opendev.org/68252303:00
*** gyee has quit IRC03:03
*** zhurong has joined #openstack-nova03:03
*** damien_r has joined #openstack-nova03:06
*** mkrai has joined #openstack-nova03:28
*** jmlowe has joined #openstack-nova03:31
openstackgerritQiu Fossen proposed openstack/nova-specs master: specify mac for creating instance  https://review.opendev.org/70042903:38
*** psachin has joined #openstack-nova03:43
*** jmlowe has quit IRC03:43
*** jmlowe has joined #openstack-nova03:45
*** xek has joined #openstack-nova04:00
*** udesale has joined #openstack-nova04:15
*** tetsuro has joined #openstack-nova04:27
*** tetsuro has quit IRC04:34
*** tetsuro has joined #openstack-nova04:37
*** zhurong has quit IRC04:41
*** jmlowe has quit IRC04:45
*** zhurong has joined #openstack-nova05:07
openstackgerritBrin Zhang proposed openstack/nova master: Add SYSTEM_READER role to server actions API  https://review.opendev.org/70617905:16
*** artom has quit IRC05:27
*** adriant has quit IRC05:27
*** adriant has joined #openstack-nova05:27
*** evrardjp has quit IRC05:33
*** evrardjp has joined #openstack-nova05:34
*** udesale_ has joined #openstack-nova05:39
*** udesale has quit IRC05:42
*** sean-k-mooney has quit IRC05:53
*** Liang__ has quit IRC05:54
*** sean-k-mooney has joined #openstack-nova05:55
*** ratailor has joined #openstack-nova06:05
*** artom has joined #openstack-nova06:10
*** yedongcan has joined #openstack-nova06:16
*** ccamacho has quit IRC06:26
*** brinzhang has quit IRC06:32
*** links has joined #openstack-nova06:32
*** brinzhang has joined #openstack-nova06:32
*** brinzhang has quit IRC06:35
*** brinzhang has joined #openstack-nova06:36
*** yedongcan has quit IRC06:40
*** vishalmanchanda has joined #openstack-nova06:40
*** yedongcan has joined #openstack-nova06:40
*** zhurong has quit IRC06:42
openstackgerritMerged openstack/nova master: Avoid calling neutron for N networks  https://review.opendev.org/70578406:50
*** sean-k-mooney has quit IRC07:01
openstackgerritBrin Zhang proposed openstack/nova master: Add SYSTEM_READER role to servers actions API  https://review.opendev.org/70617907:02
*** sean-k-mooney has joined #openstack-nova07:03
*** lpetrut has joined #openstack-nova07:09
*** lpetrut has quit IRC07:10
*** lpetrut has joined #openstack-nova07:10
*** yedongcan has quit IRC07:20
*** artom has quit IRC07:20
*** yedongcan has joined #openstack-nova07:22
*** artom has joined #openstack-nova07:23
*** artom has quit IRC07:24
*** artom has joined #openstack-nova07:24
*** iurygregory has joined #openstack-nova07:46
*** mkrai has quit IRC07:47
*** gentoora- has joined #openstack-nova07:50
*** gentoorax has quit IRC07:50
*** gentoora- is now known as gentoorax07:50
gibiefried: yeah max 20 I t was more on the 12 - 15 side in my memory07:53
gibiefried, dansmith: I was avoiding cyborg integration code recently as I saw that both of you are actively handled that but now I feel you want me to read in. I hope I will not blow up things with my fresh oppinion07:54
*** xek has quit IRC07:56
*** xek has joined #openstack-nova07:56
openstackgerritBrin Zhang proposed openstack/nova master: Expose instance action event details out of the API  https://review.opendev.org/69443008:04
*** tkajinam has quit IRC08:04
*** tesseract has joined #openstack-nova08:13
*** xek has quit IRC08:16
*** xek has joined #openstack-nova08:16
*** maciejjozefczyk has joined #openstack-nova08:21
*** amoralej|off is now known as amoralej08:23
*** mkrai has joined #openstack-nova08:23
*** brinzhang_ has joined #openstack-nova08:27
*** brinzhang_ has quit IRC08:28
*** brinzhang has quit IRC08:31
*** ccamacho has joined #openstack-nova08:33
*** tosky has joined #openstack-nova08:40
*** slaweq has joined #openstack-nova08:40
*** brinzhang has joined #openstack-nova08:46
*** brinzhang has quit IRC08:46
*** brinzhang has joined #openstack-nova08:46
*** ralonsoh has joined #openstack-nova08:51
*** ratailor has quit IRC09:01
*** abhishekk|away is now known as abhishekk09:13
*** brinzhang has quit IRC09:19
*** brinzhang has joined #openstack-nova09:19
ralonsohstephenfin, sorry for bothering you again...09:30
ralonsohhttps://review.opendev.org/#/c/705760/2/nova/network/neutron.py@261509:30
ralonsohI didn't realize this when I reviewed the patch09:31
ralonsohbut a FIP can NOT be attached to a port09:31
ralonsohso this exception... is unnecessary. Actually we are still having some problems in the Neutron OVN CI09:32
ralonsohhttps://7147d7e07a0a4e67ee4e-8dbb68c995c97f9fb5b4258bb26a4c49.ssl.cf2.rackcdn.com/703537/7/check/neutron-ovn-tempest-ovs-release/7b55444/testr_results.html09:32
stephenfindamn. okay, let me fix that up09:32
ralonsohstephenfin, let me first check the test09:33
ralonsohI would like to confirm that in the logs09:33
*** bbowen has quit IRC09:35
*** bbowen has joined #openstack-nova09:35
*** martinkennelly has joined #openstack-nova09:35
ralonsohstephenfin, https://7147d7e07a0a4e67ee4e-8dbb68c995c97f9fb5b4258bb26a4c49.ssl.cf2.rackcdn.com/703537/7/check/neutron-ovn-tempest-ovs-release/7b55444/controller/logs/screen-n-api.txt09:39
ralonsohwe are creating IPs without any port associated09:39
*** brtknr has quit IRC09:40
*** ociuhandu has joined #openstack-nova09:42
*** ociuhandu has quit IRC09:42
*** ociuhandu has joined #openstack-nova09:44
*** ivve has joined #openstack-nova09:58
*** xek has quit IRC10:03
stephenfinralonsoh: What happens if I do 'list_ports(ids=['foo'])' and 'foo' doesn't exist?10:05
openstackgerritStephen Finucane proposed openstack/nova master: Don't error out on floating IPs without associated ports  https://review.opendev.org/70621310:06
stephenfinI assume I'll just get an empty response, i.e. {'ports': []}10:06
stephenfinralonsoh, gibi: That's the fix, btw ^10:06
ralonsohstephenfin, yes, you have an empty list10:08
ralonsohshow_port will return an exception10:08
ralonsohlist_ports don't10:08
stephenfincool, that's what I'd assumed in that patch10:08
*** slaweq_ has joined #openstack-nova10:13
*** slaweq has quit IRC10:15
openstackgerritStephen Finucane proposed openstack/nova master: Rework how we check for extensions  https://review.opendev.org/70579210:16
*** ratailor has joined #openstack-nova10:29
*** slaweq__ has joined #openstack-nova10:39
*** slaweq_ has quit IRC10:41
*** rpittau|afk is now known as rpittau10:45
*** slaweq__ is now known as slaweq10:46
*** mkrai has quit IRC10:52
*** udesale_ has quit IRC11:02
*** lee1 is now known as lyarwood11:05
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Rename _is_storage_shared_with to _is_path_shared_with  https://review.opendev.org/69333711:09
lyarwoodstephenfin: ^ can you +W that again, had to rebase to clear the conflict.11:09
stephenfindone11:09
lyarwoodta11:10
gibistephenfin: a floating ip without a port is an unused floating ip?11:11
openstackgerritGuo Jingyu proposed openstack/nova-specs master: Proposal for a safer noVNC console with password authentication  https://review.opendev.org/62312011:11
stephenfingibi: Yup. We're using11:12
stephenfin...it for the 'GET /os-floating-ips' API, and that describes itself as "Return(ing) a list of floating IPs allocated to a project."11:13
gibicool, thanks11:13
openstackgerritStephen Finucane proposed openstack/nova-specs master: Re-propose "Secure Boot support for KVM & QEMU guests" for Ussuri  https://review.opendev.org/69384411:24
brinzhanggibi, lyarwood: this is my spec's invalid link fix patch https://review.opendev.org/#/c/705933/, please review11:25
brinzhanggibi, lyarwood: https://review.opendev.org/#/q/status:open+project:openstack/nova+branch:master+topic:bug/1663456 this is the bug 1663456 fix, one is +2 by stephenfin11:27
openstackbug 1663456 in OpenStack Compute (nova) "Field 'updated_at' always 'None' when show aggregate" [Low,In progress] https://launchpad.net/bugs/1663456 - Assigned to Brin Zhang (zhangbailin)11:27
*** ociuhandu has quit IRC11:35
*** ociuhandu has joined #openstack-nova11:37
*** ociuhandu has quit IRC11:42
*** tbachman has quit IRC11:43
*** brtknr has joined #openstack-nova11:54
*** ociuhandu has joined #openstack-nova11:58
*** amoralej is now known as amoralej|lunch12:00
*** yedongcan has left #openstack-nova12:07
openstackgerritBrin Zhang proposed openstack/nova master: Add server actions v82 samples test  https://review.opendev.org/70625112:09
*** nicolasbock has joined #openstack-nova12:12
*** pcaruana has quit IRC12:13
gibistephenfin: fyi if you need I can reproduce the stestr bug https://github.com/mtreinish/stestr/pull/285 in a pretty stable way in nova12:16
*** slaweq has quit IRC12:17
stephenfingibi: I had this https://review.opendev.org/#/c/700522/12:17
*** slaweq has joined #openstack-nova12:23
*** ociuhandu has quit IRC12:30
*** ociuhandu has joined #openstack-nova12:30
gibistephenfin: you won :)12:31
gibistephenfin: is there a way I can help the stestr fix merged?12:31
artomstephenfin, left you a drive-by comment on https://review.opendev.org/#/c/705792/312:34
*** ociuhandu has quit IRC12:35
openstackgerritArnaud Morin proposed openstack/nova master: Force refresh of instance info cache on event  https://review.opendev.org/70239412:37
*** spatel has joined #openstack-nova12:39
*** jcosmao has joined #openstack-nova12:42
*** spatel has quit IRC12:44
*** damien_r has quit IRC12:49
*** ociuhandu has joined #openstack-nova12:50
*** rpittau is now known as rpittau|bbl12:52
openstackgerritMartin Midolesov proposed openstack/nova master: Implementing graceful shutdown.  https://review.opendev.org/66624512:55
*** b3nt_pin is now known as beagles12:59
*** ratailor has quit IRC13:00
*** jmlowe has joined #openstack-nova13:07
*** tbachman has joined #openstack-nova13:07
kashyapstephenfin: Thanks for taking time to address your own nits here!  https://review.opendev.org/#/c/693844/13:07
kashyapstephenfin: For future, don't worry about it.  I don't consider it "tedious" to adddress reviewer's feedback (and yours fall "within reason" almost always) :-)13:08
kashyapAppreciate it regardless.  Ah, I see that in some places I've used single back ticks instead of the double back ticks13:08
*** fyx has quit IRC13:13
*** slaweq_ has joined #openstack-nova13:14
gibiefried: I cannot join to the today's nova meeting as I have a F2F meeting at the same time13:15
*** guilhermesp has quit IRC13:16
*** damien_r has joined #openstack-nova13:16
*** slaweq has quit IRC13:16
*** Liang__ has joined #openstack-nova13:16
*** bcm has quit IRC13:20
*** gmann has quit IRC13:23
*** irclogbot_1 has quit IRC13:26
*** belmoreira has joined #openstack-nova13:27
*** nweinber has joined #openstack-nova13:27
*** cz3 has quit IRC13:28
*** irclogbot_0 has joined #openstack-nova13:30
*** rpittau|bbl is now known as rpittau13:30
*** links has quit IRC13:32
*** jmlowe has quit IRC13:50
stephenfingibi: Other than annoying mtreinish and lifeless, probably not :)13:51
gibi:)13:51
openstackgerritStephen Finucane proposed openstack/nova master: Rework how we check for neutron extensions  https://review.opendev.org/70579213:51
stephenfinartom: oh yeah, done13:51
openstackgerritBalazs Gibizer proposed openstack/nova master: Fix obj_what_changed for InstancePCIRequests ovo  https://review.opendev.org/70626613:52
openstackgerritBalazs Gibizer proposed openstack/nova master: Fix Instance.obj_what_changed behavior for the "extra" field  https://review.opendev.org/70626713:52
brinzhangstephenfin: hi, I have a question with the cmd.test_policy, https://github.com/openstack/nova/blob/master/nova/tests/unit/cmd/test_policy.py#L11913:54
*** amoralej|lunch is now known as amoralej13:54
efriedgibi: specifically for that one patch the important thing was seeing how the steps taken to extract device profiles and convert them into resource requests correspond to the same steps for bandwidth resources.13:55
efriedgibi: I put a patch above that one that moved some of that stuff around, would be nice to get your nod on that.13:55
openstackgerritBalazs Gibizer proposed openstack/nova master: Selectively refresh instance.pci_requests during rollback  https://review.opendev.org/70626913:55
brinzhangstephenfin: I added a new policy to the os-instance-action https://review.opendev.org/#/c/706179/2/nova/policies/instance_actions.py@27, but while run the unit test in cmd.test_policy, it was failed, and the "passing_rules " cannot get the new rule 'os_compute_api:os-instance-actions:show_details13:55
gibiefried: ack, I will try to review those13:56
brinzhangstephenfin, eric: I saw this file was changed or add by you (from github), can you give me some idea?13:56
brinzhangefried13:56
efriedme?13:59
sean-k-mooneyyou :)13:59
*** cz3 has joined #openstack-nova14:00
gibidansmith: I spent time on the selective refresh of instance.pci_requests based on obj_what_changes(), see the my stab at it here https://review.opendev.org/#/c/706269/1/nova/compute/manager.py@8566 and in the parent patches14:00
*** fyx has joined #openstack-nova14:01
*** guilhermesp has joined #openstack-nova14:01
brinzhangefried: yeah, but it's in two years ago :)14:02
*** gibi is now known as gibi_on_call14:02
efriedbrinzhang: ping me after the nova meeting14:03
brinzhangefried: ok, I am looking into the "def _check_filter_rules()", but cannot get some useful info, need some help :)14:04
*** jmlowe has joined #openstack-nova14:04
*** bcm has joined #openstack-nova14:06
efriedbrinzhang: if this is a policy thing, I'm definitely not the best person to ask. Have you checked with gmann?14:06
efriedjohnthetubaguy would also be a good resource if he's around...14:06
*** spatel has joined #openstack-nova14:10
openstackgerritsean mooney proposed openstack/nova-specs master: move implemented spec for train  https://review.opendev.org/70627614:14
umbSublimeo/14:18
*** mriedem has joined #openstack-nova14:19
*** ociuhandu has quit IRC14:21
*** ociuhandu has joined #openstack-nova14:22
*** slaweq_ is now known as slaweq14:24
*** ociuhandu has quit IRC14:24
*** ociuhandu has joined #openstack-nova14:25
*** ccamacho has quit IRC14:25
dansmithgibi_on_call: re: cyborg, know that I'm about +2 on the rest of the set aside from the second patch I've been deferring to efried on.. hopefully no major concerns as it's been a looong road :)14:29
bauzasefried: you're always using this acronym : NTF14:30
bauzaswhat's this ?14:30
efriedbauzas: NumaTopologyFilter, sorry14:31
bauzasoh ah14:32
bauzasgdi14:32
bauzasthanks14:32
*** ccamacho has joined #openstack-nova14:33
*** belmoreira has quit IRC14:38
artomstephenfin, cheers! I'll do a proper review soon14:38
*** mriosfer has joined #openstack-nova14:48
bauzassean-k-mooney: question for you14:49
bauzassean-k-mooney: if I want 8VCPUs, 8GB of RAM and hw:numa_nodes=214:49
bauzasdoes my memory split in twice ?14:49
*** spatel has quit IRC14:49
sean-k-mooneyyes and if you cant devidie the vcpus and ram an integage number of times it raise an error14:51
sean-k-mooneyso if you didn  8VCPUs, 8GB of RAM and hw:numa_nodes=314:51
sean-k-mooneythen its an error unless you also use the other args for asymetric numa toplogies14:52
bauzaswhat kind of error ?14:52
bauzassean-k-mooney: it only tells that it raises an exception in https://docs.openstack.org/nova/latest/user/flavors.html#extra-specs14:54
brinzhanggmann: are you around?14:54
sean-k-mooneyyes it raise an exception and the vm goes to error state14:55
*** Sundar has joined #openstack-nova14:56
sean-k-mooneybauzas: actully the vm wont go to error14:56
sean-k-mooneyit will rais the excptionin the api before we create teh vm db record14:56
bauzascool, that's what I hoped14:56
bauzasa synchronous 40x14:56
sean-k-mooneyproably a 400 bad request14:56
bauzasbecause we also need to make it for the translation mechanism14:56
sean-k-mooneya 4xx for sure14:57
bauzasideally, same exception14:57
*** abhishekk is now known as abhishekk|out14:57
brinzhangefried: how about now? :)14:57
sean-k-mooneyya so this is all handeled in the hardware.py module14:57
sean-k-mooneyand its reused in several places14:57
efriedbrinzhang: for which? The spec review?14:58
sean-k-mooneybauzas: its all handeled by https://github.com/openstack/nova/blob/014c1ab864cec6213750c5ef57d8080ae8cd973d/nova/virt/hardware.py#L174214:58
brinzhangefried: I added a new policy to the os-instance-action https://review.opendev.org/#/c/706179/2/nova/policies/instance_actions.py@27, but while run the unit test in cmd.test_policy, it was failed, and the "passing_rules " cannot get the new rule 'os_compute_api:os-instance-actions:show_details14:59
bauzassean-k-mooney: thanks, looking14:59
bauzasokay, will look14:59
brinzhangefried:https://github.com/openstack/nova/blob/master/nova/tests/unit/cmd/test_policy.py#L119 the check function is here14:59
Sundarefried, dansmith, sean-k-mooney: Re. https://review.opendev.org/#/c/631243/55/nova/accelerator/cyborg.py@86, I am adding checks for keystone exceptions here. However, if this 'get device profile' call fails, there will be no further calls to Cyborg for that instance. So, there is no point in adding redundant exception checks to other calls for15:00
Sundarcreating ARQs, etc. Do you agree?15:00
sean-k-mooneybauzas: the importnt bit for you is https://github.com/openstack/nova/blob/014c1ab864cec6213750c5ef57d8080ae8cd973d/nova/compute/api.py#L925 in _validate_and_build_base_options15:00
dansmithSundar: you should assume every call to another service could fail15:01
bauzassean-k-mooney: anyway, I'll write something in the spec and you'll chime in15:01
efriedbrinzhang: I'm afraid I have no clue about this. I would have to dig in fresh, and I don't have time to do that right now. However, I think gmann might be able to address your questions much more easily.15:01
sean-k-mooneybauzas: cool15:01
efriedbrinzhang: not sure where he is...15:02
bauzasbasically, my concern is 'I want 8 vCPUs, 7GB of RAM and hw:numa_nodes=2', accordingly the RAM has to be provided by two different memory pages that are in each NUMA nodes15:02
bauzassean-k-mooney: ^15:02
bauzasmeaning that the translation mechanism will split this memory amount in twice (or more)15:02
stephenfinI want to use an unreleased feature in neutronclient in a nova patch. For 'Depends-On' to work, I need to to add 'python-neutronclient' to 'required-projects' in '.zuul.yaml', right?15:02
sean-k-mooneyya you dont need to wory about that really15:03
brinzhangefried: ok, I will wait him, and I pinged him another question too, I will wait for him ,thanks15:03
sean-k-mooneyif the requst_spec has a numa_toplogy objct its a numa instnace and all of that has already been calualted15:03
efriedbauzas: yes, that sounds correct to me15:03
sean-k-mooneyso you just need to read the value form the numa toplogy object15:03
efriedstephenfin: yes, that sounds correct. But we wouldn't want to merge that change.15:04
openstackgerritStephen Finucane proposed openstack/nova master: Use neutronclient's port binding APIs  https://review.opendev.org/70629515:04
stephenfinefried: Yup, just a WIP for now15:04
efriedstephenfin: and that only works for zv3 jobs btw, not for legacy (IIUC)15:04
sean-k-mooneySundar: im goign to redeploy with the latest patches are they all updated15:04
Sundardansmith: Yes, but we are talking of failures like EndpointNotFound, MissingAuthPlugin, and Unauthorized. If the first call succeeds, is the second one going to fail for one of these reasons?15:05
sean-k-mooneyyou said you tought i was using an old set of cybog pataches when you looked at the error i got on monday but that was the latest set that were up at that point15:05
Sundarsean-k-mooney: Great, thanks15:05
dansmithSundar: no, this is a distributed system.. you should never make that assumption15:05
*** ociuhandu has quit IRC15:06
mriosferSean: if host got two sockets , is recomended to "force" hw:numa_nodes=2 ? If vcpus are more 2 / 4 / 6 / 8 etc...15:06
efriedSundar: If you wanted to trap those failures (you can use the superclass ClientException) and convert them to DeviceProfileError("Couldn't communicate with cyborg!") I think that would be okay.15:06
dansmithSundar: calls from nova to cyborg (and back) may be routed to different api endpoints on subsequent calls, in different states of broken, misconfigured, or transient upgrade15:06
openstackgerritStephen Finucane proposed openstack/nova master: WIP: Use neutronclient's port binding APIs  https://review.opendev.org/70629515:06
sean-k-mooneymriosfer: that depens on your workloads but i do for larger vms15:06
dansmithefried: agreed15:06
stephenfinkashyap: np, I figured it would just be easier do it myself real quick. No skin off my back15:07
mriosferSean : larger VM from 8 vcpu?15:07
mriosferor more15:07
sean-k-mooneyfor small vms of 4-8 CPUs i will often keep them with hw:numa_nodes=115:07
*** ociuhandu has joined #openstack-nova15:07
sean-k-mooneyya over 8 i tend to always split15:07
openstackgerritBrian Rosmaita proposed openstack/nova master: Do not inherit cinder_ image properties  https://review.opendev.org/70629815:07
sean-k-mooneyto hw:numa_nodes=215:07
stephenfindansmith: That neutron fix yesterday was incomplete, btw. ralonsoh and I worked out the fix this morning if you care to look again https://review.opendev.org/#/c/706213/15:08
dansmithstephenfin: I'm reading now15:08
stephenfinta15:08
sean-k-mooneybut that is based on host core counts of 10-16 cores per socket. if you have 32 cores per socket then you might adjust what you consider big15:08
mtreinishstephenfin: thanks for the reminder, I'll re-review that PR today or tomorrow15:08
dansmithstephenfin: where is the code that actually uses this?15:08
stephenfindansmith: Just nova/api/openstack/compute/floating_ips.py15:09
dansmithstephenfin: "port_details" is not in that file15:09
dansmithdo you mean it just passes through the structure to the api requester?15:10
Sundarefried, dansmith: Got it. Yes, I would handle service failures in distributed systems in general. I asked because I don't see these specific checks everywhere e.g.  https://github.com/openstack/nova/blob/master/nova/image/glance.py . But, I agree it is better to do it everywhere.15:10
stephenfindansmith: sure it is - line 4015:10
mtreinishstephenfin: the only thing is we're in prep for 3.0.0 (which removes 2.7). So that PR probably won't get released until then which we're shooting for mid-March15:11
dansmithstephenfin: % grep -c port_details nova/api/openstack/compute/floating_ips.py15:11
dansmith015:11
stephenfindansmith: It's definitely there :) https://github.com/openstack/nova/blob/master/nova/api/openstack/compute/floating_ips.py#L4015:12
dansmithoh is this a recent change? /me rebases15:12
sean-k-mooneydansmith: stephenfin started usign it as part of the nova net removal15:13
sean-k-mooneyso yes15:13
dansmithyeah I see15:13
stephenfinYup, it was picked up in one of the "Stop using nova-network objects as containers" patches15:13
sean-k-mooneyso floating ips are not ports and do not normally have port details15:13
stephenfinmtreinish: That's okay. To be honest, the more crucial fix is the subunit fix and I need lifeless for that15:13
dansmithokay just wanted to see what used it and that it was okay with None15:13
sean-k-mooneythere is a neutorn extnion to expose the port_details of the prot that they are attached too when attached15:14
sean-k-mooneybut its optional15:14
sean-k-mooneyapparently15:14
stephenfinmtreinish: https://github.com/testing-cabal/subunit/pull/40 for reference (I think I referenced it the stestr PR)15:14
sean-k-mooneystephenfin: when did we deprecate teh proxy api15:17
dansmithstephenfin: so because this broke the neutron gate, I assume we have tempest tests that actually use this proxy api, right?15:18
*** ociuhandu has quit IRC15:18
stephenfinsean-k-mooney: nova/api/openstack/api_version_request.py tells me 2.3615:18
*** ociuhandu has joined #openstack-nova15:18
stephenfindansmith: correct. the reason we didn't see it was because we test using ML2-OVS or ML2-LB (I don't know which) in the gate, not ML2-OVN15:18
sean-k-mooneywell it was only added to neutron in 201815:18
sean-k-mooneyhttps://github.com/openstack/neutron-lib/commit/24a0877d1d234830a36794388bb342d3d91b923015:18
sean-k-mooneywhich is well after we had deprecated it15:19
dansmithstephenfin: yeah I get that. are the tests _testing_ the proxy or using it out of convenience?15:19
sean-k-mooney2.36 was newton15:19
*** tbachman has quit IRC15:19
stephenfinsean-k-mooney: Yup, it's not the proxy API that was wrong, it was how I updated things to stop using nova-network o.vo's15:19
*** lpetrut has quit IRC15:20
sean-k-mooneystephenfin: what im saying is the proxy api should not conatin that info15:20
sean-k-mooneywe deprecated itn in 2016 that extention was only added in 201815:20
dansmithsean-k-mooney: I'm guessing we had it in there before via other mechanisms15:21
sean-k-mooneyso there should be noting relying on it for fip port details15:21
sean-k-mooneyya proably15:21
dansmithsean-k-mooney: looking at the older code I was originally looking at, it was still putting those things in there but in a different way15:21
stephenfindansmith: That I don't know. Maybe ralonsoh knows since he had a patch up against neutron to disable the broken job yesterday15:21
sean-k-mooneythe correct way to look this up before was you get all the ports attach to the vm then get the fip for each port15:21
openstackgerritBrin Zhang proposed openstack/nova master: Add SYSTEM_READER role to servers actions API  https://review.opendev.org/70617915:22
ralonsohstephenfin, yes, we are marking this CI job as non-voting15:22
stephenfinsean-k-mooney: yeah, what dansmith said. We were getting this information by always making a second call to neutron for the ports15:22
stephenfinralonsoh: Which job is it? dansmith is curious as to what's actually using these deprecated APIs15:22
sean-k-mooneystephenfin: yes which is the correct way to do it if the extention is not available15:22
stephenfinsean-k-mooney: I know. I made a mistake when updating things because I didn't know port_details was optional and figured I should use it15:23
dansmithralonsoh: job, but specifically which test(s)15:23
ralonsohdansmith, an example of CI job failing https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_db7/705779/1/check/neutron-ovn-tempest-ovs-release/db7becd/testr_results.html15:23
stephenfinIn hindsight, it should have been a separate patch :(15:23
ralonsohstephenfin, dansmith those ones with exception message15:24
ralonsoh"<class 'nova.exception.PortNotFound'>"15:24
dansmithralonsoh: okay so some specifically for that api, but also some general ones it looks like15:24
*** spatel has joined #openstack-nova15:24
stephenfindansmith: I assume you're suggesting that the general ones shouldn't be using the proxy APIs?15:24
spatelsean-k-mooney: morning15:24
stephenfini.e. test_server_basic_ops15:25
sean-k-mooneystephenfin: ya i think the secnairo test shoudl be updated15:25
spatelCould you check this CPU policy for vm - http://paste.openstack.org/show/789225/15:25
dansmithstephenfin: well, maybe, but it's also maybe interesting that if the alternative is very onerous ... that tells us something15:25
*** tbachman has joined #openstack-nova15:25
sean-k-mooneythe other way is not really that hard. its just more api requests15:26
stephenfindansmith: that everyone is probably ignoring us and using these deprecated APIs anyway? :)15:26
dansmithstephenfin: like, if the alternative is 20 calls to neutron and a bunch of local mapping, then that may be an indication that we kinda messed up15:26
dansmithstephenfin: indeed15:26
* stephenfin votes to remove all previous microversions15:27
stephenfinless users = less bugs = more time for stephenfin to eat popcorn15:27
*** gmann has joined #openstack-nova15:28
sean-k-mooneythe old way is just neutron port list device-id=<vm uuid> then you loop over that calling /v2.0/floatingips?port_id=<port uuid>)15:29
sean-k-mooneyso you get the port detalis form the orginal call to list all the ports on a vm and then map it to the fip from the second call15:30
dansmithyeah, this is exactly what I'm talking about :)15:30
sean-k-mooneyif you know the fip you jsut wrok backward first do a fip show get the port and then do a port show15:30
sean-k-mooneyport_details on the fip is a proxy  api in neutorn to get info form a different resouce15:31
sean-k-mooneythe added it as an optional exteion to make it 1 call15:31
*** tbachman has quit IRC15:31
sean-k-mooneybut each ml2 driver has to advertise support and ovn did not15:32
kashyapstephenfin: Hehe, figured as much.  (And I like the expression)15:32
*** ccamacho has quit IRC15:36
*** ociuhandu has quit IRC15:39
*** ociuhandu has joined #openstack-nova15:40
Sundar sean-k-mooney: Re. your message "that was the latest set that were up at that point", it got updated on Jan 10 to remove the specific exception that you posted: https://review.opendev.org/#/c/698846/2/cyborg/common/nova_client.py. Please rebase if your env is older. Sorry for the trouble.15:43
*** eharney has quit IRC15:43
openstackgerritSylvain Bauza proposed openstack/nova-specs master: Proposes NUMA topology with RPs  https://review.opendev.org/55292415:46
sean-k-mooneySundar: i used the local.conf i posted15:46
sean-k-mooneywhich was pulling in form a new branch15:46
*** ociuhandu has quit IRC15:46
sean-k-mooneyill double check15:47
openstackgerritMerged openstack/nova master: libvirt: Rename _is_storage_shared_with to _is_path_shared_with  https://review.opendev.org/69333715:47
sean-k-mooneystrang it seam to have checkout out master...15:48
openstackgerritStephen Finucane proposed openstack/nova master: trivial: Merge unnecessary 'NovaProxyRequestHandlerBase' separation  https://review.opendev.org/70565515:49
openstackgerritStephen Finucane proposed openstack/nova master: trivial: Remove 'run_once' helper  https://review.opendev.org/70565615:49
openstackgerritStephen Finucane proposed openstack/nova master: tox: Integrate mypy  https://review.opendev.org/67620815:49
openstackgerritStephen Finucane proposed openstack/nova master: mypy: Add type annotations to 'nova.pci'  https://review.opendev.org/67620915:49
openstackgerritStephen Finucane proposed openstack/nova master: mypy: Add nova.cmd, nova.conf, nova.console  https://review.opendev.org/70565715:49
openstackgerritStephen Finucane proposed openstack/nova master: WIP: mypy: Add type annotations to top-level modules  https://review.opendev.org/70565815:49
sean-k-mooneySundar: im using enable_plugin cyborg https://review.opendev.org/openstack/cyborg refs/changes/61/703261/115:50
sean-k-mooneythat should have checkout the specific version15:50
stephenfinbauzas: Care to do me the honour? https://review.opendev.org/#/c/703796/15:52
bauzassure15:52
bauzassean-k-mooney: efried: others, I'm glad to say I'm eventually done with https://review.opendev.org/55292415:53
sean-k-mooneySundar: it looks like there is either a devstack bug or the cybog plugin is nit support the gitref15:53
*** spatel has quit IRC15:53
sean-k-mooneySundar: https://github.com/openstack/devstack/blob/ae73b23373f8673722fac2fbd349fa2075fda8d6/doc/source/plugins.rst#plugin-interface15:53
efriedbauzas: that's what YOU think :P15:53
bauzashonestly, thanks you all15:53
bauzasbut I'm dead with this spec :p15:54
bauzasno battery left.15:54
* sean-k-mooney thinks about sending bauzas some deathwish coffee15:54
*** mdbooth has quit IRC15:55
sean-k-mooneybauzas: https://www.deathwishcoffee.com/ i have a friend that quite likes it15:55
Sundarsean-k-mooney: I think it is the latter: https://opendev.org/openstack/cyborg/src/branch/master/devstack/lib/cyborg#L8515:55
bauzassean-k-mooney: I've been told that an english tea is *always* black with milk15:56
*** mdbooth has joined #openstack-nova15:56
bauzascan't tell what an english coffee is then15:56
sean-k-mooneySundar: ok i can just define CYBORG_BRANCH in my local.conf15:56
sean-k-mooneybut you actully dont need to and should not clone the git repo in the plugin15:57
sean-k-mooneythat is done for you by devstack15:57
dansmithsean-k-mooney: devstack or devstack-gate?15:58
sean-k-mooneybauzas: are you confusing me with an english man becasue as a irish eu citizen i coudl be offended by that. also irish <anything> usally meens add good wiskey15:59
sean-k-mooney*irish <beverage or food item>15:59
sean-k-mooneydansmith: devstack16:00
dansmithsean-k-mooney: the other projects have git_clone calls in their modules16:00
*** gibi_on_call is now known as gibi16:00
sean-k-mooneyso in the gate they are disabled via a diffenert parmater16:00
mnaserok, i'm absolutely hitting a wall.  nova list --all-tenants returns only 11 vms, but i have 1297 vm_state='ACTIVE' instances.  instance_mappings are correct in nova_api16:01
sean-k-mooneybut the plugin interface specific was designed for devstack to do the clone of the pugin repo16:01
mnaseri double checked the db creds inside nova_api.cell_mappings16:01
sean-k-mooneyanyway its not a big deal16:01
mnaserthe crazier thing is running something like placement heal allocations only returns 11 instances only, which that is doing objects.InstaceList....16:02
*** redrobot has joined #openstack-nova16:02
mnaserso its note even an API level problem, it's in the db layer (i assume objects.* API inside nova-manage commands talks directly to db, not via conductor?)16:02
dansmithmnaser: nova-manage does not talk to conductor16:03
mnaserok, so that's what i thought as well16:04
sean-k-mooneynova manage will try and get the db infor form your nova.conf16:04
sean-k-mooneyis it perhaps geting the wrong db info16:04
openstackgerritStephen Finucane proposed openstack/nova master: Configure max-line-length to 88  https://review.opendev.org/70631116:04
mnaseri used the mysql client to use the same credentials in nova.conf and i see all the instances16:05
* stephenfin throws cat among pigeons16:05
dansmithmnaser: one non-cell0 cell?16:05
stephenfinbauzas: thanks :)16:05
mnaseryes, only one non-cell0 cell16:05
dansmithmnaser: and the 11 instances you do see, are they from that cell or in cell0?16:06
bauzassean-k-mooney: I've been told that by an irish man in Grenoblre16:06
mnaseri ddint check all of them but one of them is from that cell dansmith (aka inside `nova`)16:06
* bauzas remembers a song16:06
dansmithmnaser: is it complete? meaning, it's not returned as a shell instance with no real state information?16:07
mnaseroh good call16:07
mnasernova show brings even the user_data16:07
mnaserso it sounds like its pulling it in from nova and not nova_api16:07
dansmiththat's not the same16:07
dansmithwhen you see it in nova list, does it have vm_state?16:07
mnaseryes16:07
dansmithof active/16:08
mnaseryep16:08
mnaserand those instances were all created post upgrade so its almost like the db just like.. only started reading specific records16:08
mnasernow to note this is a s=>t upgrade, but those have been pretty smooth :16:08
dansmithso I think you can rule out cells, cell_mappings, instance_mapping related issues16:09
mnaserwell to me the weird thing is how objects.InstanceList which is called inside heal placement allocation.. only sees those 11 instances16:09
mnaseryet hypervisors are happily complaining like "66 vms on host but 65 in db" and things like that still happen16:10
dansmithyep, another reason I think you can rule out the cell infrastructure16:10
mnaserand if i hit a "Show" on a specific instance, i can retrieve it (just found out)16:10
mnasereven if i dont see it listed16:10
mnaseroh wth16:11
dansmithshow finds instances in a pretty different way,16:11
dansmithbut it also means that instance_mappings is correct16:11
mnaserit looks like a db migration has failed16:11
mnaser..but it somehow KEPT going with the migrations?16:11
dansmithwhat migration failed?16:11
mnaserhttp://paste.openstack.org/show/789229/16:12
* dansmith wonders why people *ever* use paste.o.o16:12
dansmithwow, that's interesting16:12
mnaserdansmith: pastebin.com sounds like it only contains leaks of *stuff*16:12
mnaserlet me look at what migration is that16:12
mnaserso i think it failed but the version was bumped16:13
dansmith"add instances hidden"16:13
mnaserand then it kept going after16:13
dansmithyeah, that'd do it16:13
*** ociuhandu has joined #openstack-nova16:13
dansmithalthough, it doesn't use server_default so it's a little weird16:13
dansmithmnaser: does your instances table have a "hidden" column?16:13
mnaserhttps://github.com/openstack/nova/blob/master/nova/db/sqlalchemy/migrate_repo/versions/399_add_instances_hidden.py <- for whoever is following and my notes16:14
mnaseryes i'm checking that now16:14
dansmithright16:14
mnaseri am gonna assume16:14
mnaserit does16:14
dansmiththe thing is that default is not server_default, so I would expect you're not missing that16:14
mnasertahts probably why new instances are ok16:14
dansmithmnaser: you're assuming or you checked?16:14
mnaserand old ones are not16:14
mnaser(iam going to check now)16:14
mnaserbut that might make sense as to why the new instances (11) show up16:15
dansmithhidden defaults to false on the code side, which means if it exists, they should all default to hidden=False,16:15
dansmithand if it didn't exist, you should be getting query errors16:15
mnaser| hidden                   | tinyint(1)            | YES  |     | NULL    |                |16:15
dansmithI think that's the right schema16:15
mnaserthere is 57 instances with hidden=016:15
mnaserand the rest are null.16:15
mnaserid be update instances set hidden=0; would fix it16:16
dansmithright, and those NULLs should be presumed =False by the python side16:16
dansmithwell,16:16
dansmithmaybe to get you out of the jam, but... that's not what *should* be happening16:16
mnaserright yeah, you're right, hidden being null should default to false16:17
mnaserlet me get out of my jam and i will look at the code16:17
dansmithoooh,16:17
dansmiththis might be a bug16:17
dansmitha really bad one16:17
dansmithlet it be known that mriedem may have made a mistake16:18
mnaserhe left us with a fun one :(16:18
dansmithit's not a hard fix16:18
*** READ10 has joined #openstack-nova16:19
dansmithmnaser: is this the first s->t upgrade for you?16:19
mnaserdansmith: no, and i didnt hit the first time16:19
mnaserbut i also dont think i hit that db migration issue that time too16:19
*** dtantsur is now known as dtantsur|afk16:19
dansmithoh, hrm16:19
dansmiththe db migration doesn't seem to have left the schema incorrect though16:19
mriedemfake news16:21
mnaserlol16:21
mnaserdansmith: i think it acutally tried to do an update and set things hidden=016:21
dansmithmnaser: can you compare that bit of schema to one of your other clusters, and also look at the hidden values on those instances/16:21
mnaser(this is a much bigger cloud)16:21
mnaserand my stupid update instances set hidden=0 gave me this16:21
mnaserERROR 1180 (HY000): Got error 90 "Message too long" during COMMIT16:21
dansmithmnaser: okay it shouldn't be doing that16:21
mnaserso its almost like once it does the schema update, it bumps the version, and then tries to backfill default values16:21
mnaserlet me check the value of hidden in the other train deployemnt16:22
dansmithyeah, that's not the intent of that migration for sure16:22
openstackgerritMerged openstack/nova-specs master: FUP: Fixed the invalid index in References  https://review.opendev.org/70593316:22
dansmithmaybe in some situations sqla-migrate actually tries to patch up data for you on existing tables, which is a real problem16:23
mnaserok yeah16:23
mnaserthe other deployment has hidden=0 for every single record16:23
dansmithand if so, this is a good example of why we don't do data transformation (intentionally) in schema migrations16:23
dansmithokay damn16:23
*** TxGirlGeek has joined #openstack-nova16:23
dansmithso we need to change that migration retroactively, and patch the load code to properly interpret the null as the default (which sqla does not do with default=)16:23
*** psachin has quit IRC16:24
mnaserso i think what happened is: schema updated and column added, db version bumped, backfill values to 016:24
mnaserand then i failed in the backfill, so the next run it just started from the next migration16:24
dansmithyeah, and on your big cloud, that timed out16:24
dansmithyeah, probably because the backfill wasn't technically a schema fail16:24
dansmithwhich is also scary16:24
mnaseri think your theory is best16:25
mnasers/theory/solution/16:25
openstackgerritStephen Finucane proposed openstack/nova master: objects: Add MigrationTypeField  https://review.opendev.org/70601316:25
dansmithmnaser: can you file a bug for me to write this up against?16:25
mnasersure16:25
dansmiththanks16:25
mriedemfwiw i count 5 non-api db schema migrations that add a new boolean column with default=False,16:26
dansmithmriedem: yup16:27
mriedemif it's actually backfilling, then maybe just not an issue for those b/c they are smaller tables16:27
dansmithmriedem: we've been doing this without realizing16:27
dansmithyu[16:27
dansmithwill need a poison test too16:27
dansmithlike the others I have for alter operations16:27
*** TxGirlGeek has quit IRC16:27
mriedemi sort of remember this discussion with jaypipes on https://review.opendev.org/#/c/557958/16:29
mriedemwhich uses server_default16:29
*** slaweq_ has joined #openstack-nova16:29
*** N3l1x has joined #openstack-nova16:30
dansmithI guess server_default would be one option, but it involves another schema change which I think is less good since we have to backport this16:30
*** slaweq has quit IRC16:31
*** spatel has joined #openstack-nova16:31
mriedemhttps://docs.sqlalchemy.org/en/13/core/metadata.html#sqlalchemy.schema.Column.params.default16:34
mriedem"which will be invoked upon insert if this column is otherwise not specified in the VALUES clause of the insert"16:34
mriedemdidn't realize it would backfill though16:34
dansmithright, I always assume it fills on read too, but just insert16:35
mriedemhttps://docs.sqlalchemy.org/en/13/core/metadata.html#sqlalchemy.schema.Column.params.server_default - "A FetchedValue instance, str, Unicode or text() construct representing the DDL DEFAULT value for the column."16:35
mriedemso yeah, i guess never use default in migrations16:35
dansmithright16:35
mriedembut is default in the models OK?16:35
mriedemi thought it was16:35
*** gyee has joined #openstack-nova16:35
dansmithit is,m16:36
mriedemmnaser: just purge your db :)16:36
dansmithand I just tested that the model sync test does not fail you for having default on the column and not the migration16:36
mnasermriedem: you're not wrong16:36
mnaserdansmith: https://bugs.launchpad.net/nova/+bug/186220516:36
openstackLaunchpad bug 1862205 in OpenStack Compute (nova) "Instances not visible when hidden=NULL" [Undecided,New]16:36
*** nweinber has quit IRC16:36
dansmithmnaser: thanks16:37
*** nweinber has joined #openstack-nova16:37
mnaserdansmith: if we come up/discuss a solution, i'm happy to work out a fix, i have bigger clouds to upgrade ahead of this sooo16:37
dansmithmnaser: I've got a patch started, but yeah, will appreciate real testing for sure16:38
mnaserdansmith: ok cool, i can try and help reviewing to my ability and id be able to provide some feedback on a cloud with 1.5 times the # of records (but only in 2 weeks :X)16:39
mriedemi've still got core if you need to poke someone later16:39
dansmithcool16:39
*** rpittau is now known as rpittau|afk16:40
melwittI can help with review too if needed16:41
*** tbachman has joined #openstack-nova16:43
*** vishalmanchanda has quit IRC16:50
gibidansmith, efried, Sundar: I agree with the discussed direction in https://review.opendev.org/#/c/631243/55 See details in my review16:52
gibiefried: is there other patches in the series that need my eyes? I can spend some extra time on the series tomorrow if needed16:54
*** tesseract has quit IRC16:54
*** eharney has joined #openstack-nova16:59
*** TxGirlGeek has joined #openstack-nova17:03
*** tbachman has quit IRC17:03
*** iurygregory has quit IRC17:07
*** READ10 has quit IRC17:08
openstackgerritMerged openstack/nova master: Don't error out on floating IPs without associated ports  https://review.opendev.org/70621317:12
openstackgerritMerged openstack/nova master: nova-net: Update API reference guide  https://review.opendev.org/70379617:12
openstackgerritMerged openstack/nova stable/queens: Add functional recreate test for bug 1852610  https://review.opendev.org/69970517:12
openstackbug 1852610 in OpenStack Compute (nova) queens "API allows source compute service/node deletion while instances are pending a resize confirm/revert" [Low,In progress] https://launchpad.net/bugs/1852610 - Assigned to Matt Riedemann (mriedem)17:12
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove native LUKS compat code  https://review.opendev.org/66912117:16
*** ivve has quit IRC17:22
efriedgibi: Well, I've been hoping that I would get the chance to review the series from top to bottom and be the second +2 (with dansmith) but I'm no longer sure I'm going to be able to get to it soon. So if you have time, your help would be much appreciated. But I would say you shouldn't push other stuff out of the way for it.17:25
*** ociuhandu has quit IRC17:29
openstackgerritDan Smith proposed openstack/nova master: Fix instance.hidden migration and querying  https://review.opendev.org/70633117:30
dansmithmnaser: mriedem melwitt ^ haven't run all tests yet, so it might break other things, but that's what I'm thinking as a backportable change17:31
* dansmith notes that this is the M-club apparently17:31
melwittin da club17:31
* sean-k-mooney ignore the british meaning of that https://www.urbandictionary.com/define.php?term=in%20the%20pudding%20club17:33
*** evrardjp has quit IRC17:33
dansmithgross.17:34
*** evrardjp has joined #openstack-nova17:34
*** tosky has quit IRC17:34
*** READ10 has joined #openstack-nova17:34
melwittlol wtf. never heard that before17:34
sean-k-mooneyin the club to mean pregnant ya ist an old old way of saying that in polite company when gossiping17:35
*** tbachman has joined #openstack-nova17:36
melwittinteresting. here it means like at the dance club, in a bunch of rap songs17:36
dansmithas someone who feels like society favors those who procreate unfairly, I definitely see it as "a club" in the negative sense17:36
sean-k-mooneyyes as i said it was used when gossiping about someone behind there back so it had a negitive over tone in many cases17:38
dansmiththat's not really what I meant :)17:38
melwittlol17:38
mriedemi'm in da costco club17:39
dansmithmnaser: how many instances in the database that failed, and how many in the one that didn't?17:39
mriedemif you need a years supply of toilet paper i'm your guy17:39
sean-k-mooneylol17:39
melwittme too, Costco Executive Club17:39
*** mlavalle has joined #openstack-nova17:43
*** ociuhandu has joined #openstack-nova17:44
melwittmaybe this is too out there but I wonder if this might be related to the cell timeout gate bug (affecting only grenade jobs)17:47
dansmiththe first query after upgrade works, so ... I don't think so17:48
dansmithand also, mnaser said it worked fine for his smaller cluster, which I'm sure is waaaay larger than a grenade setup with a tempest-run worth of instances in the db17:48
melwittok. I wondered if there could be something to do with the table rewriting in the background or something17:48
dansmithsurely hope not :) but it seems like it was synchronous in mnaser's case17:49
melwittbecause when I dug into that, it seemed like mysql just overloaded somehow and logging errors "reading communication packets"17:49
melwittok17:49
*** ociuhandu_ has joined #openstack-nova17:52
*** martinkennelly has quit IRC17:53
*** ociuhandu has quit IRC17:55
*** ociuhandu_ has quit IRC17:57
sean-k-mooneyany table updates or data migration should be taking a full table write lock. reads should see the old data until the transaction completes17:59
*** ociuhandu has joined #openstack-nova18:00
sean-k-mooneyat least if you are using the innodb schema for mysql18:00
*** derekh has quit IRC18:00
*** ociuhandu has quit IRC18:06
openstackgerritDan Smith proposed openstack/nova master: Fix instance.hidden migration and querying  https://review.opendev.org/70633118:14
*** jmlowe has quit IRC18:18
*** amoralej is now known as amoralej|off18:22
*** ralonsoh has quit IRC18:22
openstackgerritDan Smith proposed openstack/nova master: Fix instance.hidden migration and querying  https://review.opendev.org/70633118:28
openstackgerritDan Smith proposed openstack/nova master: Disallow specifying a default= on new Columns in schema changes  https://review.opendev.org/70635118:34
dansmith^ makes it impossible to use Column(default=) in migrations after 39918:34
dansmithefried: did you catch the earlier discussion about the relatively serious upgrade-corrupts-database bug in train?18:35
efriedno18:35
dansmithefried: on largeish deployments, upgrading to Train can make all your instances hidden silently18:35
dansmithwe're going to want to backport my fix and release a train update with definitive action at some point here soon18:36
sean-k-mooneydansmith: do we have that fix downstream by the way18:36
dansmithsean-k-mooney: we don't have it upstream yet, so..no.18:37
sean-k-mooneyok cause you know what day it is :)18:37
efrieddansmith: do I need to review things?18:38
dansmithefried: you're welcome to.. I think we're looking for a test run, a report from mnaser and melwitt/mriedem are also going to review18:38
dansmithefried: https://review.opendev.org/#/c/706331/18:38
dansmithjust for YI if nothing else18:38
efriedack18:42
mnasercatching up on buffer...18:45
mnaserdansmith: ~220kish it didnt fail, ~340kish it failed, mariadb 10.3.something i believe18:45
dansmithmnaser: okay thanks. I guess if anyone asks me I'll say "somewhere around 330k"18:46
mnaseryeah and in this case it would be nice if we purged records so18:46
dansmither, "300k" I mean18:46
mnaserimportant note that this doesnt mean "300k active instance" but "300k db records"18:47
dansmithyup18:47
*** ivve has joined #openstack-nova18:47
*** jmlowe has joined #openstack-nova18:48
*** dpawlik has quit IRC18:50
mnaserdansmith: i wonder if this might start a whack-a-mole of references to hidden=false or hidden=null18:51
efriedmnaser: I was just leaving a comment to that effect.18:51
*** Liang__ has quit IRC18:52
dansmithmnaser: maybe except I think these are the only two places :)18:52
efriedfor now18:52
mnaseryeah, thing is i have no better solution18:52
dansmithnot really, because everything else sits above the db layer18:52
sean-k-mooneywe porably should be normalising that when we load if form the db18:52
dansmiththis is what the db and object apis are *for*18:53
dansmithsean-k-mooney: we are, that's what I'm doing18:53
sean-k-mooneydansmith: in https://review.opendev.org/#/c/706331/18:53
dansmiththe second case is specifically doing a very targeted non-ORM query18:53
dansmithotherwise this would be a single place18:53
*** mvkr has quit IRC18:54
*** jmlowe has quit IRC18:58
*** tosky has joined #openstack-nova18:59
*** imacdonn has joined #openstack-nova19:00
dansmithmnaser: added comments to the test to mirror your self-discovery19:01
dansmithefried: added the hidden=False case to the test19:01
dansmithgonna wait to push it up so we can get a run on it19:01
dansmithefried: also added that ref to the commit message19:01
efriedack. dansmith when I re-add the default in the poison patch and run just that test, it works, but when I run the whole suite, the exception is weird19:02
efriedAttributeError: hidden19:02
dansmithhmm, that's weird19:02
mriedemdansmith: question in that test19:02
dansmithI was running just one19:02
dansmithmriedem: replied19:05
openstackgerritMerged openstack/nova stable/queens: Add functional recreate revert resize test for bug 1852610  https://review.opendev.org/69970819:05
openstackbug 1852610 in OpenStack Compute (nova) queens "API allows source compute service/node deletion while instances are pending a resize confirm/revert" [Low,In progress] https://launchpad.net/bugs/1852610 - Assigned to Matt Riedemann (mriedem)19:05
efrieddansmith: repro is:19:05
efriedactivate the venv19:05
efriedfails with the expected message: stestr run -n nova.tests.unit.db.test_migrations.TestNovaMigrationsSQLite.test_walk_versions19:05
efriedfails with AttributeError: stestr run -n nova.tests.unit.db.test_migrations19:05
efried...venv is py3619:06
dansmithefried: oh I was running the whole test_migrations module myself and wasn't seeing the error, but let me try again19:06
dansmithI'm on py3719:06
* efried creates py37 venv...19:07
efrieddansmith: zuul agrees with me19:07
efriedbut it also fails despite the migration having been 'fixed'.19:09
dansmithhrm19:10
dansmithwhat version of sqla is in your venv?19:10
efriedI just repro'd with a fresh py37 env, so whatever's in requirements.19:11
dansmithcan you look so I can compare?19:11
efriedSQLAlchemy==1.3.13,sqlalchemy-migrate==0.13.0,sqlparse==0.3.019:11
sean-k-mooneytox -e py37 -- nova.tests.unit.db.test_migrations.TestNovaMigrationsSQLite.test_walk_versions19:11
sean-k-mooneypasses form me with dans change19:11
dansmithokay I just did a fuuuul run and that failed like yours but doesn't fail for me on just the module19:12
dansmithso must be an ordering thing or something?19:12
efriedor something.19:12
mriedemshouldn't column_default_banned default to False for everything before 399?19:12
efriedI skimmed looking for other monkey patches of Column, but didn't see any at a glance.19:12
mriedemi guess the default maybe doesn't matter since the test is always passing a value19:12
dansmithmriedem: it does.. set it to zero and it will fail for 216_havana19:12
mriedemyeah it was the default in the fixture that i was thinking about - the default isn't used since the test always overwrites it19:13
dansmithcorrect19:13
efriedif I'm being super picky, either remove the default or specify the kwarg name in the call.19:13
dansmithefried: you see that the original doesn't specify either I assume19:14
efriedbut that's so nittish for a test I won't even put it in the review19:14
dansmiththe existing argument I mean19:14
efriedhah, the number of times we've busted Sundar for following a precedent that was wrong...19:14
dansmithpython2 used to not allow multi-line with statements so it had to be short19:14
mriedemonly other comment on the actual fix is do you want a reno to advertise it since it's going to be backported19:14
dansmithmriedem: yep19:15
mriedemcan i -1 for it to get stats!?19:15
mriedemafter that i'd be +2 on the fix19:15
mriedemwell i guess there is no test for the quota thing but i'll leave that up to you guys,19:16
mriedemin my new job tests are a low concern for people so i'm getting used to not asking for them.19:16
sean-k-mooneyefried: dansmith ya i also can repoduce the error with "tox -e py37 -- nova.tests.unit.db.test_migration" so this is like due to incorrect mocking19:16
dansmithmriedem: I didn't because we kinda already test the non-null site, and didn't want to have to replicate the raw-sql create of a null-having record for that too, but I can19:16
mriedemup to you, like i said, any tests are good :)19:17
mriedemmnaser: now that you're getting to train does this mean you're going to start cross-cell resizing like a mad man?19:19
mriedemoh wait, did that land in ussuri?19:20
mriedemah right it's available in ussuri, nvm me19:20
dansmithmriedem: he would need multiple cells to cross :)19:21
mriedemi know, it was implied as a nudge19:21
mriedembecause there has been some vexxhost multi-cell chatter for awhile19:21
mriedemdoes cern still upgrade nova?19:21
*** martinkennelly has joined #openstack-nova19:31
*** jmlowe has joined #openstack-nova19:32
dansmithah I see the problem19:37
dansmithI dunno why it doesn't always happen, but it's also not going to be an easy fix :/19:37
mnasermriedem: hah.  yeah,ii think for cross-cell there's a few mountains to climb first19:39
mnaserlike figuring out glance with multiple backends and nova cells with different ceph backends in each one19:40
mnaserand time19:40
dansmithmnaser: that's in the works fwiw19:40
dansmithglance has to do a thing first and then I plan to get on the nova side19:40
dansmithefried: mriedem: the reno for this should just have an item in "fixes:" ? or critical? or upgrade?19:46
*** jmlowe has quit IRC19:47
mriedemeither fixes or upgrade, or both i guess?19:47
mriedemso i guess if you do upgrade, you can say if you haven't rolled to this point yet make sure you do first rather than like train GA, but if you have already upgraded to train GA and hit this issue, you can manually update the records (maybe after an archive/purge)?19:48
mriedemmy guess is people that hit this will be looking for some guidance on what to do19:48
dansmithaight19:52
*** openstackstatus has joined #openstack-nova19:56
*** ChanServ sets mode: +v openstackstatus19:56
melwittdoes anyone know if there's a similar issue with a column default when using alembic for migrations?20:03
openstackgerritSundar Nadathur proposed openstack/nova master: ksa auth conf and client for Cyborg access  https://review.opendev.org/63124220:04
openstackgerritSundar Nadathur proposed openstack/nova master: Add Cyborg device profile groups to request spec.  https://review.opendev.org/63124320:04
openstackgerritSundar Nadathur proposed openstack/nova master: Define Cyborg ARQ binding notification event.  https://review.opendev.org/69270720:04
openstackgerritSundar Nadathur proposed openstack/nova master: Create and bind Cyborg ARQs.  https://review.opendev.org/63124420:04
openstackgerritSundar Nadathur proposed openstack/nova master: Pass accelerator requests to each virt driver from compute manager.  https://review.opendev.org/69858120:04
openstackgerritSundar Nadathur proposed openstack/nova master: Compose accelerator PCI devices into domain XML in libvirt driver.  https://review.opendev.org/63124520:04
openstackgerritSundar Nadathur proposed openstack/nova master: Delete ARQs for an instance when the instance is deleted.  https://review.opendev.org/67373520:04
openstackgerritSundar Nadathur proposed openstack/nova master: Enable hard/soft reboot with accelerators.  https://review.opendev.org/69794020:04
openstackgerritSundar Nadathur proposed openstack/nova master: Enable start/stop of instances with accelerators.  https://review.opendev.org/69955320:04
openstackgerritSundar Nadathur proposed openstack/nova master: Enable and use COMPUTE_ACCELERATORS trait.  https://review.opendev.org/69955420:04
openstackgerritSundar Nadathur proposed openstack/nova master: Bump compute rpcapi version and reduce Cyborg calls.  https://review.opendev.org/70422720:04
openstackgerritSundar Nadathur proposed openstack/nova master: Add cyborg tempest job.  https://review.opendev.org/67099920:04
mriedemmelwitt: i'm not actually sure if it's sqlalchemy or sqlalchemy-migrate that is applying that default value to existing records,20:20
mriedemprobably need to ask zzzeek20:20
dansmiththere are two things here: first reading the null values and needing to do the defaulting is a SQLA thing, not related to alembic20:21
dansmithsecond is the application of the default to the existing rows, which could be different20:21
melwittasking because the proposed consumer_types table in placement is specifying a default column value https://review.opendev.org/#/c/669170/10/placement/db/sqlalchemy/alembic/versions/422ece571366_add_consumer_types_table.py@8320:21
melwittok20:21
dansmithmelwitt: should be pretty easy to test.. I'm sure you have a devstack with that applied for your own testing20:21
dansmithmelwitt: just create rows without the patch applied, then apply and roll over that migration and see if the field for existing rows is NULL or the default20:22
melwittyes ... right ...20:22
*** damien_r has quit IRC20:24
mriedembingo bango https://opendev.org/x/sqlalchemy-migrate/src/branch/master/migrate/changeset/schema.py#L59420:25
mriedemdansmith: there is the proof20:25
*** maciejjozefczyk has quit IRC20:25
mriedemso you *could* use default= in schema migrations but you'd have to also set populate_default=False20:26
dansmithyeah, but no20:26
dansmithnot sure what purpose that would serve20:26
dansmithif we shared Column() defs with migrations and models or something maybe20:26
dansmithbut the models in sync test didn't even fail for me20:26
*** eharney has quit IRC20:30
melwittI'm realizing there's a difference between 'default' and 'server_default'? the placement table is using 'server_default'20:31
melwitthttps://stackoverflow.com/questions/14002631/why-isnt-sqlalchemy-default-column-value-available-before-object-is-committed#1401309020:35
zzzeekmelwitt / mriedem not reading everyhing but when you add a column to a database that has a default value and it's "not null", the DB adds that default.  that is how you get a MySQL migration that is very slow for large tables btw20:35
*** gmann has quit IRC20:36
*** damien_r has joined #openstack-nova20:37
*** gmann has joined #openstack-nova20:37
melwittzzzeek: is that true regardless of whether it's specified as a 'default' vs a 'server_default'? will 'server_default' also try to backfill in already existing records that do not have a value set?20:38
zzzeekmelwitt: oh...server default only, sorrhy20:38
zzzeekmelwitt: for "default" that is not a server default, I have no idea what sqlalhcemyt-migrate does20:38
zzzeeki'd be surprised if they use it, though, because the "add not null column / populate server default" is necessarily atomic20:38
zzzeekyou can't do that from the client using a python-side default20:39
melwittzzzeek: ok, it also does a backfill, this is the patch where it's being fixed if you're curious https://review.opendev.org/70633120:39
zzzeekonly if migrate takes the crazy insane step of making the column as nullable first, then populating, then not-nulling20:39
melwittzzzeek: this is the link from migrate https://opendev.org/x/sqlalchemy-migrate/src/branch/master/migrate/changeset/schema.py#L59420:39
zzzeekmelwitt: wow, yuck :)20:40
melwittlol20:40
zzzeeki hate migrate20:40
zzzeekb.c. you know that fails if the DB is live and new rows still getting added20:41
melwittmnaser knows ;)20:41
*** martinkennelly has quit IRC20:42
mnaseryeah, i do20:47
*** efried has quit IRC20:47
*** efried has joined #openstack-nova20:48
*** irclogbot_0 has quit IRC20:53
*** irclogbot_2 has joined #openstack-nova20:54
*** martinkennelly has joined #openstack-nova20:58
*** slaweq__ has joined #openstack-nova21:02
*** slaweq_ has quit IRC21:03
*** READ10 has quit IRC21:03
*** jmlowe has joined #openstack-nova21:12
sean-k-mooneySundar: i redeploy and have been able to boot a vm with the fake cyborg driver21:15
sean-k-mooneyhttp://paste.openstack.org/show/789247/21:16
efriednice21:16
*** Sundar has quit IRC21:18
*** nweinber has quit IRC21:18
sean-k-mooneyi will start trying different life cycle operation and testing and recording info on placement allcoation, db dumps and the like21:18
sean-k-mooneyi have a bash script i have written to automate this so ill script up a few test cases. and do some manually21:19
sean-k-mooneysince  i have created a cyborg flavor i shoudl be able to run some of the standard tempest tests with that flavor21:20
*** eharney has joined #openstack-nova21:21
*** slaweq__ has quit IRC21:27
*** eharney has quit IRC21:30
openstackgerritDan Smith proposed openstack/nova master: Fix instance.hidden migration and querying  https://review.opendev.org/70633121:39
lifelessstephenfin: hi, what do you need? mtreinish has commit rights on subunit21:39
*** spatel has quit IRC21:50
*** jmlowe has quit IRC22:00
*** damien_r has quit IRC22:21
*** jmlowe has joined #openstack-nova22:24
*** martinkennelly has quit IRC22:27
efriedmnaser: did your +1 at PS3 here https://review.opendev.org/#/c/706331/ indicate that you had successfully tested this locally?22:30
*** ivve has quit IRC22:31
*** jmlowe has quit IRC22:32
*** jmlowe has joined #openstack-nova22:35
*** mriedem has left #openstack-nova22:37
*** mriedem has joined #openstack-nova22:38
mriedemdansmith: i've got to run but will take a look at the latest later tonight22:38
*** mriedem has left #openstack-nova22:38
dansmithefried: I think he's going to have to apply it at the time he does his next upgrade which might not be for a while, it sounded like22:41
melwittdansmith: wouldn't your change unhide the instances for him today that are being incorrectly hidden? or are you saying he already fixed that via manual db update22:44
dansmithmelwitt: he already fixed up his db, as I understand it22:44
melwittgotcha22:45
*** jmlowe has quit IRC22:45
*** jmlowe has joined #openstack-nova22:48
*** tkajinam has joined #openstack-nova22:54
*** jmlowe has quit IRC22:56
*** tkajinam has quit IRC22:57
*** tkajinam has joined #openstack-nova22:57
sean-k-mooneydansmith: i can check the code but are we not storing the resouce requests form the cybrog device profile in the request spec?23:01
sean-k-mooneydansmith: im seeing "requested_resources": null in the request spec for the cyborg nova instance23:03
sean-k-mooneythe embeded flavor has   "accel:device_profile": "FakeDeviceProfile"23:04
*** jmlowe has joined #openstack-nova23:04
sean-k-mooneyhowever since we are not storing the groups if you change the device profil after the fact and we migration and instance or did something else that would need us to call plamcnet wwe would have to go back to cyborg which could have changed23:05
*** damien_r has joined #openstack-nova23:16
dansmithsean-k-mooney: I think that's the idea23:18
dansmithsean-k-mooney: you live migrate, scheduler calls placement with a new set of resources constructed from the device profile and what cyborg told you when you asked23:18
dansmithsean-k-mooney: maybe we need to be doing something like examining the existing ARQs to generate those resource requests if the instance already exists?23:19
sean-k-mooneynew arqs sure but we dont want to hard reboot or live migate and change form an nvida gpu to an intel fpga23:19
sean-k-mooneyi think we need to be storing the groups retruned by cycborg when we instilly created the vm the same way we embed the flavor or image23:20
dansmithnot sure how that would happen on a hard reboot, but obviously agree on live-migration, but that's why I'm saying maybe we should look at the device profile on boot, and look at our existing arqs on any other move operation when asking cyborg for the resources23:20
*** jmlowe has quit IRC23:21
sean-k-mooneydansmith: i guess hardreboot it would not23:21
sean-k-mooneywe woudl just use the exising arq23:22
dansmithmaybe we need to ask sundar if the dp can change in a predictable or restricted way23:22
*** nweinber has joined #openstack-nova23:22
*** damien_r has quit IRC23:22
dansmithI would expect the arqs can't be changed after they're allocated, and if we used those to base our new request on that would get us what we want23:22
sean-k-mooneylooking at the client i only see a create and delete23:22
sean-k-mooneyso i guess not23:22
sean-k-mooneyill check the api23:23
dansmithare you saying the device profile can too?23:23
dansmithcan change in place Imean23:23
sean-k-mooneythe device profile in osc can only be create or delete like a flavor23:23
sean-k-mooneyim going to check the api to see if there is an update23:23
sean-k-mooneyif they are imutable no issue23:24
dansmithah okay I thought you were implying it *could* change but you were just assuming?23:24
dansmithyea23:24
dansmithI need to run off for a bit, but we can sync with sundar tomorrow23:24
sean-k-mooneyya no worries23:24
sean-k-mooneyi was just going to write a function to pull the resouce request form the nova db23:24
sean-k-mooneyform the request spec and compar them to the allcoations23:24
sean-k-mooneyto ensure they matched up23:25
*** CeeMac has quit IRC23:26
*** artom has quit IRC23:27
*** artom has joined #openstack-nova23:27
sean-k-mooneyok device profiels cannot be updated. https://docs.openstack.org/api-ref/accelerator/v2/index.html#device-profiles but ARQs can be whcih kind of make sense23:31
sean-k-mooneyyou update ARQs with the host when binding or move operations23:31
sean-k-mooneyyou cant change the device profile name however23:32
*** nweinber has quit IRC23:33
openstackgerritDan Smith proposed openstack/nova master: Fix instance.hidden migration and querying  https://review.opendev.org/70633123:37
dansmithsean-k-mooney: cool23:37
* dansmith &23:37
*** damien_r has joined #openstack-nova23:43
*** mriosfer has quit IRC23:44
*** nicolasbock has quit IRC23:44
*** damien_r has quit IRC23:45
*** brinzhang_ has joined #openstack-nova23:47
*** tbachman has quit IRC23:51
*** brinzhang has quit IRC23:51
*** N3l1x has quit IRC23:52
*** tosky has quit IRC23:55

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!