Thursday, 2022-05-26

*** mnasiadka_ is now known as mnasiadka05:08
*** tkajinam_ is now known as tkajinam05:11
songwenping_sean-k-mooney:can we support multi-create vgpu vms?07:56
*** songwenping_ is now known as songwenping08:25
gibikashyap: I played with a fedora36 container but I cannot reproduce the slow pip resolving issue either there09:35
gibiso I think you might have extra python3- packages installed that creates some version confusion09:35
gibiit should now do it09:35
gibias venv is separate 09:36
gibibut I have no other ideas09:36
sean-k-mooneysongwenping: no that is not supported09:40
sean-k-mooneysongwenping: we do not suppport multi create with nested resouce providres09:41
sean-k-mooneysongwenping: https://bugs.launchpad.net/nova/+bug/187466409:42
sean-k-mooneysongwenping: actully it looks like the patches are merged09:42
sean-k-mooneyhttps://review.opendev.org/c/openstack/nova/+/723884/ and https://review.opendev.org/c/openstack/nova/+/723858/09:43
sean-k-mooneysongwenping: i dont think this fully resolves the issue bauzas ^09:45
sean-k-mooneysongwenping: ya reading them we only added documentation and test to show that it does not work in some cases09:46
sean-k-mooneyso form the comit09:46
sean-k-mooney- you can ask for a flavor with 2 VGPU with --max 209:46
sean-k-mooney - but you can't ask for a flavor with 4 VGPU and --max 209:46
kashyapgibi: Hey, thanks a ton for trying!  Also, what do you mean by "it should now do it as venv is separate"?  I can't quite parse it :)09:47
kashyapPls rephrase it09:47
gibitox should build a new virtual env that is sealed off from python packages on the host09:47
kashyapYeap; thanks!09:49
gibiinterestingly the dockerfile in the bugreport shows the issue to me but my dockerfile doesn't 09:49
gibicraxy09:49
opendevreviewRajat Dhasmana proposed openstack/nova-specs master: Repropose volume backed server rebuild spec  https://review.opendev.org/c/openstack/nova-specs/+/84015509:53
whoami-rajatsean-k-mooney, ^^ updated the spec09:54
sean-k-mooneywhoami-rajat: thanks ill review it shortly09:57
whoami-rajatthanks!09:57
songwenpingsean-k-mooney: no, not this bug.10:28
sean-k-mooneysongwenping: are you refering to creating a vm with multipel vgpus?10:28
songwenpingi create two vms with flavor that has 1 vgpu metadata10:28
sean-k-mooneyok10:29
songwenpinglibvirt allocate the same vgpu for these two vms10:29
sean-k-mooneywe only support doing that via 2 differnt api request today10:29
sean-k-mooneywe do not supprot creating 2 vms that use vgpus in the same api request10:30
songwenpingdoes the api check the request?10:30
sean-k-mooneyno10:30
sean-k-mooneywe will not block it at the api10:30
sean-k-mooneywe docuement it in the api as part of the previous bug10:31
*** carloss_ is now known as carloss10:31
songwenpingwhy donot we block it at api?10:31
sean-k-mooneygibi: speaking of which if you use pci device in placment for now at least multi create will not work for the same reason10:31
sean-k-mooneysongwenping: becuase at the api we dont know fi it uses nested resouce provder or not10:32
sean-k-mooneyso we documented the limistaion10:32
songwenpingcould you please find the doc?10:32
sean-k-mooneysongwenping: https://docs.openstack.org/nova/yoga/admin/virtual-gpu.html#caveats10:35
sean-k-mooneywe added this section https://review.opendev.org/c/openstack/nova/+/723884/6/doc/source/admin/virtual-gpu.rst10:36
sean-k-mooneywe coud make that more clear that we do not support multi creat a all but that is our current stnace10:37
sean-k-mooneyif a server uses neste reseouce proverder we do not support multi create10:37
sean-k-mooneythis applys to vgpu, cyborg, neuton bandwith or packet per second qos or any other usage of resouce form nested resouce providers10:38
sean-k-mooneylike pmem10:38
*** songwenping__ is now known as songwenping10:45
opendevreviewRajesh Tailor proposed openstack/nova master: Fix typos  https://review.opendev.org/c/openstack/nova/+/84312710:50
songwenpingsean-k-mooney: i see, thanks.10:50
gibisean-k-mooney: I don't have the context why the multi create fail on nested allocations but I can take a look later10:59
sean-k-mooneygibi: we just dont have any code for it to handel mappign the allcoation to the differnt instance properly11:00
sean-k-mooneywe woudl need to do multipel placment queries and orchstrate that in the schduler11:00
gibihm, I thought that normal multicreate uses the alternatives from the scheduler 11:00
sean-k-mooneyit does i think but apprently that is not enough11:01
sean-k-mooneythis has basicaly never worked if you have nested rps11:02
sean-k-mooneyit work fine if you dont11:02
kevkoHi nova team ! 11:04
kevkoI would like to ask if there is any chance to do something as i will describe below : 11:05
kevkoI really would like to schedule 2 instances in server group with anti-affinity policy to be scheduled to different host but also in different datacentre (or az)11:06
kevkosomething similar as it is here 11:06
kevkohttps://specs.openstack.org/openstack/nova-specs/specs/rocky/implemented/complex-anti-affinity-policies.html11:06
kevkoinstead of max_server_per_host key should be something as max_server_per_az11:07
sean-k-mooneykevko: that is not currently possibel or trivial to do11:07
kevkoyeah, i was checking code :/11:07
sean-k-mooneythe request spec currently modles an az requst as a string11:07
kevkoit's mainly because of octavia and two datacentres ..11:07
sean-k-mooneyit cant accpet a list of stings so you cant do that in a singel request11:08
sean-k-mooneykevko: in the past we have declared this out of scope of nova11:08
sean-k-mooneykevko: as somethign a higher level orchestrator shoudl mange11:08
kevkohmm11:08
kevkoit's mainly because of octavia loadbalancers ..11:09
sean-k-mooneyyep so octavia coudl orchestrate this11:09
sean-k-mooneythey already mange the vm creation11:09
sean-k-mooneyvia calling nova11:09
sean-k-mooneyso they could make requsts with differnt AZs11:10
sean-k-mooneyi woudl have ot refresh my memory of server goups but i cant recall if you can add more server to the group in later request or not11:11
sean-k-mooneyif you can you could just group the loadbalnce request per AZ11:11
kevkoi am not sure if I understand 11:12
sean-k-mooneyi was trying to recall if you needed to populate the server group in one go or coudl you make multiple server create request  and resuse the same server group11:13
sean-k-mooneyi think its the latter11:13
sean-k-mooneywe have 2 or 3 ways to do anti afinity in nova11:13
kevkoyou can set az in octavia ...an if you have az created from two datacentres ... server group and anti affinity will not work ..there is big chance that nova just schedule to two hosts ..but in same datacentre11:13
sean-k-mooneyand some requrie it all in one request and other allow it to be exteded.11:13
sean-k-mooneykevko: right so ocativa could have an az anti affintiy policy in its api11:14
kevkocurrently I think there is only one way to do it ... have 1 az from two hosts from two different datacentres ... but this is no way ..because we have hundreds of amphoraes :( 11:14
sean-k-mooneyso what i would expect is you would creat at least 1 AZ per datacenter right11:15
sean-k-mooneyand then create a singel server group11:16
sean-k-mooneywith anti affintiy11:16
sean-k-mooneyand then in 2 differnet calls to nova create the vms for datacenter A and B11:16
kevkoit's not true i think 11:17
sean-k-mooneyim describing the change you need to make to ocativa not how it works today11:18
kevkoanti-affinity works per host ...so there could be situation that in one datacentre instance will be scheduled on two hosts 11:18
sean-k-mooneykevko: correct11:18
kevkoyeah, 11:18
sean-k-mooneythat is why you need ot use differnt AZ in the two diffenrt calls to nova11:18
kevkooctavia has availability_zone as one parameter in configuration 11:18
kevkoif this can be list and setup list of AZs where AZ is 2 hosts from two datacentres ... it will be work i think 11:19
sean-k-mooneyright if you want to supprot this you need to supprot this at the octavia api11:19
sean-k-mooneynot config driven11:19
sean-k-mooneykevko: az in nova are nto the same as AZs in aws11:19
sean-k-mooneythe are jsut metadtaa on a a host aggreate.11:19
sean-k-mooneythere is not gurantee or expectiaon of a sperate falut domain11:20
sean-k-mooneyand conversily there is not guranteeor or expecation fo network connectivy between them11:20
sean-k-mooneythat is all determin by how you deploy your cloud11:20
kevkoyeah i know that it is only metadata defined for host (or how to say :P ...i really don't know how it looks like in deep)11:21
sean-k-mooneyits a strign defiend on a host aggareate11:21
sean-k-mooneyand then host are mapped to aggreates11:21
kevkowhat about regions ? 11:23
kevkocan be helpfull ? 11:23
sean-k-mooneythe do not existin in nova11:23
sean-k-mooneythey are a keystone only construct11:23
kevkoyeah ... correct11:24
kevkocells ? 11:24
sean-k-mooneycells are internal to nova11:24
sean-k-mooneyand not expsoed at the api to end users11:24
sean-k-mooneythey are a way to shared the db and message bus11:24
kevkoyeah i know ..11:24
sean-k-mooneyand are explictly not for fault tollerance11:24
sean-k-mooneybut they can kind fo help in a way11:25
kevkoso currently i don't have way how to deal with it 11:25
sean-k-mooneywe coudl support az anti affintiy in nova11:25
sean-k-mooneybut we have rejected it in the past the last 2 times it came up11:25
sean-k-mooneykevko: filters and weigher can be added out fo tree by the way11:27
sean-k-mooneybut all instnace will be mapped ot an az even if oenis not requested11:28
sean-k-mooneyso really if you want to use AZs to map to datacenters 11:28
sean-k-mooneythen octavia need to accpet the AZ as an api parameter when creating loadblancers11:29
sean-k-mooneyand if you want ti to supprot spreaing loadbalncer between azs then it allso need to supprot doing that iself really11:29
sean-k-mooneysince AZ are not really visable to neutron by the way there is no guarentee at teh api level that a newton network can span azs11:30
opendevreviewsean mooney proposed openstack/os-vif master: [trivial] update job template to zed  https://review.opendev.org/c/openstack/os-vif/+/84343211:31
kevkowhy it was rejected ? 11:33
kevkothis would be super usefull 11:33
sean-k-mooneyhttps://github.com/openstack/nova/blob/master/doc/source/contributor/project-scope.rst#no-more-orchestration=11:42
sean-k-mooneykevko: it came up on the ptg in the past i can see if there was  a draft spec11:44
sean-k-mooneykevko: https://review.opendev.org/c/openstack/nova-specs/+/75638011:44
sean-k-mooneykevko: as i noted here https://review.opendev.org/c/openstack/nova-specs/+/756380/5#message-ba89445e7b2b033fe39f9bbd32a49271e4780e75=11:45
sean-k-mooneydoing this in a filter or weigher is far far too late11:45
sean-k-mooneyas the az is alreay asigned at that point11:46
sean-k-mooneykevko: so the only way to do it today woudl be to have an AZ span both datacenters and then use somethign else to model the datacenters and provide anti afinity 11:46
kevkohmm11:47
kevkounderstand 11:47
sean-k-mooneyyou coudl do that with metadata on host aggrats and then a weigher or filter that used that datacenter metadata11:47
sean-k-mooneyso 1 AZ that spans both datacheners 1 host aggreate per datacenter and a filter/weigher that used a datacenter tag on the host aggreate + server group infor to implment antiafinity11:49
sean-k-mooneybut its much simpelr to have 1 AZ per datacenter and have octaivr just schdule differnt loadbanceer to differnt AZs11:49
kevkoand this is not possible :/11:49
sean-k-mooneyits cleaner form an api level and more efficnt form a schdulign perspecitve11:50
kevkobecause octavia has only one AZ to configure 11:50
sean-k-mooneyright you woudl have to change octavia11:50
sean-k-mooneyso im suggestign you make a featur equest to octavia instead of nova11:50
kevkoyeah 11:50
kevkothank you ! 11:50
kevkoyou are always replying ;-) 11:50
sean-k-mooneyperhaps i reply too much on ocation11:52
sean-k-mooneybut when people have ligtimate usecase i try to help them find a solution11:52
sean-k-mooneyyou do i just dont think the clean solution is in nova11:52
sean-k-mooneyfor older release you coud do this as i descibe with an out of tree weigher/filter if you have 1 AZ and multiple host aggreates but thats messy for a number of reasons11:53
kevkostill thank you 11:56
opendevreviewMiguel proposed openstack/nova master: Pin autopep8 to 1.5.5 in tox  https://review.opendev.org/c/openstack/nova/+/84344312:16
kevkosean-k-mooney last question : Can i import octaviaclient in my own custom nova filter ? :P 12:26
sean-k-mooneyits your filter so :)12:26
sean-k-mooneyupstrem no12:27
sean-k-mooneyfilters are not ment to make any db or api calls in general12:27
sean-k-mooneyfilters run per host so we really dont want to do that on large clouds12:27
kevkohmm, ok12:28
kevkocan I write filter which will check security group assigned ? 12:28
sean-k-mooneysecurity groups? or server groups12:29
kevkoboth :D 12:31
kevkoi mean, if it is possible to check server groups it will be better, if no .. security group is same also 12:32
sean-k-mooneyfor server groups you can use the existin filter as a refernce12:32
sean-k-mooneyhttps://github.com/openstack/nova/blob/master/nova/scheduler/filters/affinity_filter.py#L82-L165=12:33
sean-k-mooneybut you cant add new policies12:33
sean-k-mooneyso you cant jsut add an AZ-anti-affinity policy12:33
sean-k-mooneykevko: we dont have any example of the security gorups but if its in teh request spec then yes12:34
kevkoi will have to try12:35
sean-k-mooneykevko: filters are passed 2 objects the hoststate object and a request spec object 12:35
sean-k-mooneyhttps://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L42=12:35
sean-k-mooneykevko: which has security groups https://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L98=12:35
sean-k-mooneybut only the security groups used for nova created ports12:36
kevkobut no server group :/12:36
sean-k-mooneyit will not have the security groups of neutron ports that are passed in as uuids12:36
sean-k-mooneykevko: its called instance_group i think https://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L91=12:37
kevkoyeah, i just saw it 12:37
sean-k-mooneythe network request whil it has the port uuid https://github.com/openstack/nova/blob/e44b1a940fdc45cc9dbb08e193a8c25052cf64e7/nova/objects/network_request.py#L39= does not have any port secuirty groups12:40
*** mnaser_ is now known as mnaser12:41
*** hemna2 is now known as hemna13:15
sean-k-mooneygibi: stephenfin hopefully a quick one https://review.opendev.org/c/openstack/os-vif/+/84343213:21
* gibi clicks13:21
sean-k-mooneyjust changing the template to zed since the py36 job is failing13:21
gibisean-k-mooney: do you want to drop 3.6 support from the setup.cfg at the same time as we stop testing with 3.6?13:24
sean-k-mooneyoh ya i can do that13:24
sean-k-mooneyshould i add a release note then too13:25
gibiyepp that a reno would make sense13:26
sean-k-mooneyok ill do that now13:26
stephenfinsean-k-mooney: replied13:40
opendevreviewsean mooney proposed openstack/os-vif master: update job template to zed  https://review.opendev.org/c/openstack/os-vif/+/84343213:40
sean-k-mooneystephenfin: hehe well ^ is an updated version13:40
sean-k-mooneystephenfin: no we are not release independent13:41
sean-k-mooneyi raised at the ptg and we said no keep it as release-with-intermidary13:41
sean-k-mooneyso we shoudl not use the unversioned one13:41
sean-k-mooneyif we want to make it release independent then sure13:41
opendevreviewsean mooney proposed openstack/os-vif master: update job template to zed  https://review.opendev.org/c/openstack/os-vif/+/84343213:43
sean-k-mooneyactully some typos fixed ^13:44
sean-k-mooneyill leave it for now for ye to review but if you want me to change anything i can13:44
sean-k-mooneystephenfin: by the way your dont happen to be a bindep core?13:44
gibisean-k-mooney: so we say py 3.10 is supported, which probably works, but I'm not sure the overall Zed release will go out with py3.10 supportr13:45
sean-k-mooneygibi: debain and ubuntu ship yoga on 3.1013:45
sean-k-mooneyso zed will be released on at least 3.10 by them13:46
sean-k-mooneyits technially not in the testing runtimes but we have tests using it so i think it will be fine13:46
gibiyeah, hence my statement that it probably works without problem13:46
gibiOK, I'm convinced +@13:46
gibi+213:46
sean-k-mooneyi can drop that from the list if you like untile we have a 22.04 based job13:46
sean-k-mooneybut i could also do that as a follow up13:47
sean-k-mooneyi.e. add a 22.04 job13:47
gibino, it is OK, we run py3.10 on os-vif so we know it works13:47
sean-k-mooneycool13:47
opendevreviewArtom Lifshitz proposed openstack/nova stable/wallaby: DNM: Testing live migration with local attach  https://review.opendev.org/c/openstack/nova/+/84314613:49
*** artom_ is now known as artom14:04
*** bhagyashris is now known as bhagyashris|ruck14:22
opendevreviewArtom Lifshitz proposed openstack/nova stable/xena: Reproduce live migration rollback w/o multi port bindings error  https://review.opendev.org/c/openstack/nova/+/84333614:50
opendevreviewArtom Lifshitz proposed openstack/nova stable/xena: Fix LM rollback w/o multi port bindings extension  https://review.opendev.org/c/openstack/nova/+/84333714:50
opendevreviewArtom Lifshitz proposed openstack/nova stable/wallaby: Reproduce live migration rollback w/o multi port bindings error  https://review.opendev.org/c/openstack/nova/+/84333814:58
opendevreviewArtom Lifshitz proposed openstack/nova stable/wallaby: Fix LM rollback w/o multi port bindings extension  https://review.opendev.org/c/openstack/nova/+/84333914:58
kashyapgibi: Hey, before I go out on PTO I can't get this to completion this week :-( - https://review.opendev.org/q/topic:bp%252Fcpu-selection-with-hypervisor-consideration15:34
gibikashyap: is there anything I can help with there?15:35
kashyapgibi: Stephen has a bunch of valid comments; need to address those and get it in.15:36
gibiack15:37
kashyapgibi: Also, one more is the separate config class to parse the "cpu" bit from `virsh domcapabilitles`15:38
gibiyeah I remember15:38
gibithat15:38
kashyapAlso have to address your own good comments on this patch: https://review.opendev.org/c/openstack/nova/+/76233015:39
kashyapgibi: Ah, forgot that I already have the patch here for: https://review.opendev.org/c/openstack/nova/+/838191 (libvirt/config: Parse the 'cpu' element in domainCapabilities)15:40
kashyap(Modulo test)15:40
gibiyeah only test needed15:41
kashyapActually, no that's not it...it's not adding a separate config class there15:41
kashyapgibi: From IRC log15:42
kashyap(Of this channel):15:42
kashyap17:13 < kashyap> gibi: To tie up the lose end on caps vs domCaps -- the guidance from the libvirt folks is (a) no, we can't treat the caps == domCaps w/ 'host-model' mode; and (b) we should use domCaps wherever possible.15:42
kashyap17:13 < kashyap> gibi: So that means, we should introduce a new config object 15:42
kashyap17:13 < gibi> kashyap: ack, make sense15:42
gibihehh, I'm just following you guid here :) 15:43
kashyapSo, to sum up:15:43
kashyap(1) address your + Stephen's comments; (2) add the separate config object class to parse "cpu" element from domCaps, and work that in "properly"15:44
kashyapI could definitely use some help, if you have some time for this!  I keep duking around this and never quite managing it fully15:44
gibiI cannot promise I can jump on this right away but I keep it in mind15:45
gibibut thanks for summing it up it helps keeping the context 15:46
opendevreviewMerged openstack/os-vif master: update job template to zed  https://review.opendev.org/c/openstack/os-vif/+/84343217:44
opendevreviewRajat Dhasmana proposed openstack/nova-specs master: Repropose volume backed server rebuild spec  https://review.opendev.org/c/openstack/nova-specs/+/84015518:41
opendevreviewMiguel Lavalle proposed openstack/os-vif master: Delete trunk bridges to avoid race with Neutron  https://review.opendev.org/c/openstack/os-vif/+/84149919:28

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!