opendevreview | keerthivasan proposed openstack/nova master: Move nova-manage db purge to nova-audit https://review.opendev.org/c/openstack/nova/+/708783 | 00:11 |
---|---|---|
*** bauzas_ is now known as bauzas | 00:13 | |
*** bauzas_ is now known as bauzas | 01:20 | |
*** bauzas_ is now known as bauzas | 02:29 | |
*** bauzas_ is now known as bauzas | 02:54 | |
*** bauzas_ is now known as bauzas | 03:33 | |
*** bauzas_ is now known as bauzas | 04:08 | |
opendevreview | Balazs Gibizer proposed openstack/nova master: Stabilize iso format unit tests https://review.opendev.org/c/openstack/nova/+/923924 | 05:32 |
opendevreview | Balazs Gibizer proposed openstack/nova master: Stabilize iso format unit tests https://review.opendev.org/c/openstack/nova/+/923924 | 06:56 |
*** bauzas_ is now known as bauzas | 07:16 | |
opendevreview | Balazs Gibizer proposed openstack/nova stable/2024.1: fix qemu-img version dependent tests https://review.opendev.org/c/openstack/nova/+/923878 | 07:27 |
*** bauzas_ is now known as bauzas | 07:53 | |
opendevreview | Takashi Kajinami proposed openstack/nova master: libvirt: Reject cpu_model_extra_flags when cpu_mode='none' https://review.opendev.org/c/openstack/nova/+/923933 | 08:06 |
*** bauzas_ is now known as bauzas | 08:49 | |
gibi | stephenfin: we are short on cores this week. Could you check a small unit test stabilization patch https://review.opendev.org/c/openstack/nova/+/923924 ? | 09:09 |
songwenping__ | gibi, sean-k-mooney: hi, does evacuate have a reschedule process? | 09:13 |
sean-k-mooney | songwenping__: no | 09:13 |
sean-k-mooney | if evacuate fails you should just evacuate again | 09:13 |
sean-k-mooney | we do not reschedule any move operation as far as i'm aware | 09:14 |
songwenping__ | i find build, unshelve and migrate have a rescheduler, right? | 09:14 |
sean-k-mooney | live migrate definitely does not, i don't think cold migrate/resize does either | 09:14 |
sean-k-mooney | build does but i'm not sure about unshelve | 09:15 |
sean-k-mooney | well that's not entirely true | 09:15 |
sean-k-mooney | we got rid of reschedule entirely and replaced it with alternate_host 5 or 6 years ago | 09:15 |
sean-k-mooney | maybe more | 09:15 |
sean-k-mooney | i think if pre_live migrate fails we may check the alternate hosts and the same might be true of other move ops | 09:16 |
sean-k-mooney | but we never retry and reschedule once we attempt to start the migration of data | 09:16 |
songwenping__ | does build still reschedule entirely? | 09:16 |
opendevreview | Balazs Gibizer proposed openstack/nova stable/2024.1: Stabilize iso format unit tests https://review.opendev.org/c/openstack/nova/+/923935 | 09:16 |
sean-k-mooney | no it's not a reschedule because we don't go back to the scheduler, it uses the alternative hosts the scheduler originally provided | 09:17 |
sean-k-mooney | by default the scheduler gives us 3 hosts | 09:17 |
sean-k-mooney | the conductor tries those in order | 09:17 |
opendevreview | Balazs Gibizer proposed openstack/nova stable/2023.2: Stabilize iso format unit tests https://review.opendev.org/c/openstack/nova/+/923936 | 09:18 |
sean-k-mooney | provided the allocation candidate can still be claimed in placement | 09:18 |
songwenping__ | got it. | 09:18 |
sean-k-mooney | if we fail early enough in evacuate/resize we may do the same | 09:18 |
opendevreview | Balazs Gibizer proposed openstack/nova stable/2023.1: Stabilize iso format unit tests https://review.opendev.org/c/openstack/nova/+/923937 | 09:20 |
sean-k-mooney | songwenping__: https://docs.openstack.org/nova/latest/contributor/resize-and-cold-migrate.html#resource-claims | 09:22 |
songwenping__ | as nova-scheduler workers have the same host_state info, if two evacuations come at the same time, the resources are enough in nova-scheduler, but the nova-compute claim fails as the two vms' required resources exceed the remaining resources on nova-compute. | 09:22 |
sean-k-mooney | songwenping__: that won't happen in general | 09:22 |
sean-k-mooney | it will only happen if the vm is using a resource not tracked in placement or with numa topology | 09:23 |
sean-k-mooney | so before we call the compute, the conductor creates an atomic claim in placement | 09:23 |
sean-k-mooney | for cpu/ram/disk and some other resources | 09:24 |
songwenping__ | yeah, but in our HA mechanism, it occurs. | 09:24 |
sean-k-mooney | the vm should never get to the compute part, are you running placement on an Active Active galera by any chance | 09:24 |
sean-k-mooney | active/active meaning multi master/multi writer setup | 09:25 |
songwenping__ | maybe our nova-scheduler uses host_state cache info, and does not get ar from placement. | 09:31 |
sean-k-mooney | that's not how that works | 09:31 |
sean-k-mooney | unless this is pre rocky | 09:32 |
songwenping__ | yeah, we use rocky :) | 09:32 |
songwenping__ | the cache mechanism is deprecated after rocky? | 09:35 |
sean-k-mooney | there is a cache but we first filter in placement before the scheduler filters run and then claim atomically after to avoid races | 09:35 |
songwenping__ | sean-k-mooney: got it, thanks. | 09:38 |
frickler | elodilles: you offered help some days ago, additional reviews for https://review.opendev.org/q/topic:%22format-inspector%22+status:open would be nice I think | 10:47 |
elodilles | frickler: ACK, thanks for the ping. i did not say it upstream, but i'm mostly off today. so i might only get there to review the patches tomorrow ~afternoonish | 10:55 |
frickler | elodilles: ah, ok, thx for the update, let's hope things might be resolved by then | 11:09 |
elodilles | frickler: i've reviewed and +2+W'd the stable/2024.1 patches (except the last patch as that is not merged on master yet) | 12:14 |
*** bauzas_ is now known as bauzas | 12:41 | |
sean-k-mooney | the last 2 patches that stabilize the unit tests are, we believe, just needed for old versions of the tools | 13:07 |
sean-k-mooney | frickler: elodilles basically we were backporting those downstream to train/wallaby and had ci failures | 13:07 |
opendevreview | Arnaud Morin proposed openstack/nova master: Avoid failing if compute_id file is empty https://review.opendev.org/c/openstack/nova/+/923953 | 13:09 |
frickler | sean-k-mooney: yes, I'm not too worried about those, but we are running with the major iso format-inspector patch downstream now, so it would be comforting to see it merged | 13:19 |
sean-k-mooney | frickler: oh good to know. have you found any issue with it i should be aware of? | 13:33 |
sean-k-mooney | if you do just ping and i'll take a look | 13:33 |
frickler | sean-k-mooney: sure, but no issues found so far | 13:36 |
mnaser | sean-k-mooney: btw https://lists.openstack.org/archives/list/openstack-discuss@lists.openstack.org/thread/M3BBTNE22VMMZSFMGFWMJJDYEPW7VNHL/ of interest based on our discussion the other day :) | 13:38 |
mnaser | when you have a second i'd appreciate feedback | 13:38 |
sean-k-mooney | you realise that is O(hosts*aggregates^2) right | 13:41 |
sean-k-mooney | i'm not saying that won't work | 13:42 |
sean-k-mooney | but we might need to be a little smarter if we were to do that in nova proper in terms of performance | 13:43 |
sean-k-mooney | like see that there is a server group in the request spec, grab all the relevant hosts for that server group and look up the aggregates once in a map and pass in the relevant aggregate in the host state object to each filter | 13:44 |
sean-k-mooney | i'm not sure if that will work but looking up every aggregate related to every other host that has an instance in the same server group per host is expensive... | 13:45 |
mnaser | sean-k-mooney: I realize it is expensive, but i couldn't think of a better approach, but also I am looking up aggregate for server groups only, so it is limiting it to the scope of hosts that are part of that host aggregate | 13:57 |
mnaser | so potentially a single lookup if there is one other node for example | 13:58 |
mnaser | and I think I can't really do a cache inside of the filter for a spec because it's invoked for each host | 13:59 |
sean-k-mooney | the filter runs per host however | 14:00 |
sean-k-mooney | so you multiply that by the number of hosts returned from placement * the number of vms in the build request | 14:01 |
opendevreview | Merged openstack/nova stable/2024.1: port format inspector tests from glance https://review.opendev.org/c/openstack/nova/+/923722 | 14:01 |
opendevreview | Merged openstack/nova stable/2024.1: Reproduce iso regression with deep format inspection https://review.opendev.org/c/openstack/nova/+/923723 | 14:02 |
sean-k-mooney | mnaser: what i was thinking is if we were to do this we would have the hostmanager get all the hosts in the group and the aggregates for those groups once before running the filter on each host, storing the info in either the hoststate object or request spec so we can share that info across each vm and across each host | 14:03 |
sean-k-mooney | mnaser: we do something similar for groups already in terms of maintaining a mapping of instance uuids to hosts | 14:03 |
mnaser | yeah we have host_state.instances which anti affinity can easily use | 14:05 |
sean-k-mooney | yep that's why it's there | 14:05 |
sean-k-mooney | it's only optionally populated based on a config option | 14:05 |
sean-k-mooney | as it uses potentially a lot of memory | 14:05 |
sean-k-mooney | we also have a list of aggregates in the host_state too | 14:06 |
mnaser | I think I end up using that | 14:06 |
mnaser | but then I have to check what is the failure domain set to _this_ host aggregate, and lookup what it is for all the other VMs | 14:06 |
sean-k-mooney | yep | 14:07 |
sean-k-mooney | so i'm not saying your filter won't work | 14:07 |
mnaser | oh yeah no it does work it's just.. eww. | 14:07 |
sean-k-mooney | but if we were to support this in nova properly we likely would need to do something else to optimise this | 14:07 |
mnaser | sean-k-mooney: I think maybe an extra kwarg for host_passes to include all of the HostState for all the nodes still eligible in the filter | 14:08 |
sean-k-mooney | we try not to do that | 14:08 |
mnaser | so that reduces how much I need to lookup | 14:08 |
sean-k-mooney | it breaks all out of tree filters when we do | 14:09 |
sean-k-mooney | not that we technically support those | 14:09 |
sean-k-mooney | but adding fields to the host state object does not break the api | 14:09 |
mnaser | hmm ok valid | 14:09 |
sean-k-mooney | so we would pass this either via the host_state object or request spec | 14:09 |
mnaser | I mean if the failure domain concept becomes a little bit more concrete then there are other places we can leverage it to optimize the way it's looked up for sure | 14:10 |
mnaser | and maybe just maybe also expose it as a sha'd string in nova's api... but that's stretching my luck :) | 14:10 |
mnaser | the goal here is to expose this to k8s cluster provisioners and so that the failure domain can be exposed to the user so they can make sure they schedule their pods on nodes with different failure domains | 14:11 |
*** bauzas_ is now known as bauzas | 14:56 | |
*** bauzas_ is now known as bauzas | 16:04 | |
sean-k-mooney | mnaser: you asked for feedback :) | 16:41 |
opendevreview | Merged openstack/nova stable/2024.1: Add iso file format inspector https://review.opendev.org/c/openstack/nova/+/923724 | 16:58 |
opendevreview | Merged openstack/nova stable/2024.1: fix qemu-img version dependent tests https://review.opendev.org/c/openstack/nova/+/923878 | 16:59 |
*** bauzas_ is now known as bauzas | 17:14 | |
opendevreview | keerthivasan proposed openstack/nova master: Move nova-manage db purge to nova-audit https://review.opendev.org/c/openstack/nova/+/708783 | 17:28 |
opendevreview | keerthivasan proposed openstack/nova master: Move nova-manage db purge to nova-audit https://review.opendev.org/c/openstack/nova/+/708783 | 17:44 |
opendevreview | Merged openstack/nova master: Stabilize iso format unit tests https://review.opendev.org/c/openstack/nova/+/923924 | 17:57 |
opendevreview | Jens Harbott proposed openstack/nova master: DNM: Test devstack change https://review.opendev.org/c/openstack/nova/+/923759 | 19:24 |
*** bauzas_ is now known as bauzas | 19:35 | |
*** bauzas_ is now known as bauzas | 21:11 | |
*** bauzas_ is now known as bauzas | 21:31 |
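
A minimal Python sketch of the alternate-host behaviour sean-k-mooney describes at 09:17: the conductor walks the host list the scheduler originally returned, in order, and only builds on a host whose resources can still be claimed, without going back to the scheduler. The function name and the `claim`/`build` callables are hypothetical stand-ins for illustration, not nova's actual conductor or placement API.

```python
from typing import Callable, Iterable, Optional


def build_on_first_claimable_host(
    hosts: Iterable[str],
    claim: Callable[[str], bool],
    build: Callable[[str], bool],
) -> Optional[str]:
    """Try scheduler-provided hosts in order; never re-ask the scheduler.

    `claim` and `build` are hypothetical callables: `claim` stands in for
    the atomic resource claim, `build` for the attempt on the compute host.
    """
    for host in hosts:
        # Skip a host whose allocation can no longer be claimed.
        if not claim(host):
            continue
        # The claim succeeded, so attempt the build on this host.
        if build(host):
            return host
        # Build failed: fall through to the next alternate.
        # (This sketch does not model releasing the failed host's claim.)
    # Every host in the original list was exhausted.
    return None
```

With three hosts where the first cannot be claimed and the second fails to build, the call returns the third host; if no host can be claimed it returns `None`.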
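
A rough Python sketch of the optimisation suggested at 13:44 and 14:03: gather the hosts already used by the server group, resolve their aggregates to failure domains once, and let the per-host check compare against that precomputed set instead of repeating the lookup for every candidate. All names, the plain dict inputs, and the anti-affinity rule on failure domains are assumptions for illustration; this is neither nova's filter API nor mnaser's filter.

```python
from typing import Iterable, Mapping, Set


def domains_for_host(
    host: str,
    host_aggregates: Mapping[str, Set[str]],
    aggregate_domain: Mapping[str, str],
) -> Set[str]:
    """Failure domains of one host, via the aggregates it belongs to."""
    return {
        aggregate_domain[agg]
        for agg in host_aggregates.get(host, set())
        if agg in aggregate_domain
    }


def used_failure_domains(
    group_hosts: Iterable[str],
    host_aggregates: Mapping[str, Set[str]],
    aggregate_domain: Mapping[str, str],
) -> Set[str]:
    """Computed once per request, before any per-host filtering."""
    used: Set[str] = set()
    for host in group_hosts:
        used |= domains_for_host(host, host_aggregates, aggregate_domain)
    return used


def host_passes(
    candidate: str,
    used_domains: Set[str],
    host_aggregates: Mapping[str, Set[str]],
    aggregate_domain: Mapping[str, str],
) -> bool:
    """Per-host check: pass only if no failure domain is already in use."""
    candidate_domains = domains_for_host(
        candidate, host_aggregates, aggregate_domain
    )
    return candidate_domains.isdisjoint(used_domains)
```

The per-host step then only resolves the candidate's own aggregates and does one set comparison, so the group-wide lookup cost is paid once per request rather than once per candidate host.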