Thursday, 2020-05-07

*** mriedem has left #openstack-nova00:18
*** vishalmanchanda has quit IRC00:18
*** swp20 has joined #openstack-nova00:31
*** songwenping_ has joined #openstack-nova01:01
*** swp20 has quit IRC01:04
*** Liang__ has joined #openstack-nova01:07
*** spatel has joined #openstack-nova01:27
*** kaisers_ has quit IRC01:29
*** brinzhang_ has joined #openstack-nova01:35
*** songwenping_ has quit IRC01:42
*** lbragstad has quit IRC01:52
*** songwenping_ has joined #openstack-nova02:10
*** spatel has quit IRC02:31
*** spatel has joined #openstack-nova02:34
*** brinzhang_ has quit IRC02:44
*** brinzhang_ has joined #openstack-nova02:45
*** spatel has quit IRC02:48
*** mkrai has joined #openstack-nova02:55
*** brinzhang01 has joined #openstack-nova03:03
*** brinzhang_ has quit IRC03:06
*** psachin has joined #openstack-nova03:37
openstackgerrittianmaofu proposed openstack/nova master: Delete queued image after deleting instance when creating image from instance  https://review.opendev.org/72483903:41
*** mkrai has quit IRC04:06
*** mkrai has joined #openstack-nova04:09
*** songwenping_ has quit IRC04:12
*** brinzhang_ has joined #openstack-nova04:31
*** ratailor has joined #openstack-nova04:31
*** munimeha1 has quit IRC04:32
*** brinzhang01 has quit IRC04:34
*** evrardjp has quit IRC04:36
*** evrardjp has joined #openstack-nova04:36
*** udesale has joined #openstack-nova04:41
*** HagunKim has joined #openstack-nova05:09
HagunKimHello Nova, I have a question. When I set my flavor vcpu=19, the vm only has 1 cpu core in linux. It is okay vcpu 18 or 17 flavor. What wrong with my Openstack?05:11
HagunKimlibvirt vm info has 19 vpus.05:15
*** songwenping_ has joined #openstack-nova05:37
*** vishalmanchanda has joined #openstack-nova05:45
*** songwenping__ has joined #openstack-nova05:46
*** mkrai has quit IRC05:47
*** mkrai has joined #openstack-nova05:48
*** jamesdenton has quit IRC05:48
*** jamesdenton has joined #openstack-nova05:49
*** songwenping_ has quit IRC05:49
*** brinzhang has quit IRC05:49
*** brinzhang_ has quit IRC05:50
*** brinzhang has joined #openstack-nova05:50
*** huaqiang has quit IRC05:55
*** gyee has quit IRC06:07
*** brinzhang_ has joined #openstack-nova06:17
*** ociuhandu has joined #openstack-nova06:17
*** brinzhang has quit IRC06:20
*** belmoreira has joined #openstack-nova06:27
*** songwenping_ has joined #openstack-nova06:34
*** songwenping__ has quit IRC06:36
*** huaqiang has joined #openstack-nova06:39
*** brinzhang has joined #openstack-nova06:41
*** brinzhang_ has quit IRC06:44
*** slaweq has joined #openstack-nova06:47
*** dklyle has quit IRC06:48
*** tbachman_ has joined #openstack-nova06:51
*** tbachman has quit IRC06:52
*** tbachman_ is now known as tbachman06:52
*** brinzhang_ has joined #openstack-nova07:01
*** brinzhang has quit IRC07:04
*** maciejjozefczyk_ has joined #openstack-nova07:04
*** brinzhang has joined #openstack-nova07:06
*** brinzhang_ has quit IRC07:06
*** Yumeng has joined #openstack-nova07:12
*** ttsiouts has joined #openstack-nova07:15
*** ccamacho has joined #openstack-nova07:18
*** ociuhandu has quit IRC07:19
*** tesseract has joined #openstack-nova07:20
*** ralonsoh has joined #openstack-nova07:26
*** mgariepy has quit IRC07:28
*** brinzhang_ has joined #openstack-nova07:29
*** tosky has joined #openstack-nova07:31
*** nightmare_unreal has joined #openstack-nova07:31
*** brinzhang has quit IRC07:32
bauzasgood morning Nova07:38
bauzasgibi: fwiw, stephenfin found some races for the multiple_instance_create() vGPU functest http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22testtools.matchers._impl.MismatchError%3A%202%20!%3D%201%5C%22%20AND%20build_name%3A%5C%22nova-tox-functional-py36%5C%2207:40
bauzasnot a lot tho (8 for 7 days)07:40
*** mgariepy has joined #openstack-nova07:42
*** brinzhang has joined #openstack-nova07:46
*** rpittau|afk is now known as rpittau07:47
*** brinzhang_ has quit IRC07:50
gibibauzas: good morning. I will check that race soon07:50
*** maciejjozefczyk_ is now known as maciejjozefczyk07:50
*** TheJulia has quit IRC07:56
*** mnasiadka has quit IRC07:56
*** Anticimex has quit IRC07:56
*** TheJulia has joined #openstack-nova08:01
*** mnasiadka has joined #openstack-nova08:01
*** Anticimex has joined #openstack-nova08:01
*** belmoreira has quit IRC08:03
*** xek has joined #openstack-nova08:05
bauzasgibi: nah, don't worry, I'll look08:09
bauzasif I can't find a solution soon, I'll write an e-r change08:10
gibiOK08:11
*** belmoreira has joined #openstack-nova08:15
*** ociuhandu has joined #openstack-nova08:20
gibibauzas: I was able to recreate the race locally08:22
* bauzas a bit on teaching mode but just looking08:22
bauzasgibi: I wonder, when i tested, I wasn't having this race08:23
*** martinkennelly has joined #openstack-nova08:25
gibicould be something else changed around the test. In the recent weeks I did not run a lot of full functional suites so I don't know how far this problem goes back08:25
bauzasgibi: when you reproduced, have you run only this test, or more ?08:27
gibiI run the whole functional suite08:27
gibilet me try running the test alon repeatedly08:27
bauzasok, because I only run the two multiple_create() methods08:28
bauzas(looping over)08:28
bauzasso, I wonder whether some global object would be raced08:28
gibiyeah, that is possible08:29
gibiI have to jump on a call now, but I will run the tests in different ways to see if it is an interference or not08:29
bauzaskk thanks08:30
openstackgerritJie Li proposed openstack/nova master: replace the "hide_hypervisor_id" to "hw:hide_hypervisor_id"  https://review.opendev.org/72218708:33
*** salmankhan has joined #openstack-nova08:34
* bauzas tries to remember how to ask ostestr to run the exact same concurrency from subunit that the gate ran08:41
*** songwenping_ has quit IRC08:45
*** songwenping_ has joined #openstack-nova08:46
bauzasah, bingo https://docs.openstack.org/os-testr/latest/user/subunit_trace.html08:47
bauzasor rather https://stestr.readthedocs.io/en/latest/MANUAL.html#running-previously-failed-tests08:49
gibibauzas: running just the test_multiple_instance_create* test case I can still reproduce the problem08:50
gibibut it needed like ~50 execution to get it once08:50
gibinow trying just only the single test_multiple_instance_create$ case as I suspect that it is not an interference08:51
gibiwhen I run the two test cases the, the test_multiple_instance_create$ failed, and the two case run in two separate executor so they cannot interfere08:52
*** kaisers_ has joined #openstack-nova08:53
*** martinkennelly has quit IRC09:00
bauzasgibi: you can ask to use a single subunit09:03
bauzasgibi: that's why I'd like to get the subunit output from the failing job and trying to rerun it09:04
gibibauzas: just paste me the command you want me to run and I will try that as well09:04
bauzasgibi: https://stestr.readthedocs.io/en/latest/MANUAL.html#parallel-testing09:04
gibiso far, after 77 execution no failure in the single test_multiple_instance_create$ test case execution09:04
bauzasgibi: I got one after a shit number of executions09:05
YumengHi gibi! Good morning, I'm Yumeng from Cyborg Team.09:08
YumengCyborg wants to have a "cyborg/neutron/nova sriov integration" discussion with NOVA and NEUTRON during the Virtual PTG.09:09
YumengCould you please find 1 hour for nova team to discuss this topic with cyborg and neutron team?09:09
YumengI notice that there is a nova-neutron session on PTG Friday 13:00 UTC, can we join after that on Friday 15:00 UTC?09:11
*** salmankhan1 has joined #openstack-nova09:11
gibiYumeng: hi! the schedule already closed so we have to use an exsiting booked slot for that discussion. Do you have a room booking from Friday 15:00 ?09:12
*** salmankhan has quit IRC09:12
*** salmankhan1 is now known as salmankhan09:12
Yumenggibi: no, I don't have now.  But I can email Kendall and book the room if it's ok for nova and neutron.09:13
gibiYumeng: I think we can try Friday 14:00 - 15:00 where we already have a room booked, or if you can get a room from Kendall then I we can do the discussion from 15:00 - 16:0009:15
Yumenggibi: aha!  Friday 14:00 - 15:00 is fantastic!  I am just not sure if it is already occupied.09:17
gibiYumeng: I kept that as an overflow slot so I think we can use that for cyborg. If you need the neutron team present as well then you have to synch with slaweq too09:18
*** dtantsur|afk is now known as dtantsur09:18
bauzasYumeng: tbh, what use case are you talking of ?09:19
YumengGreat! Thank you so much gibi! I will sync with  slaweq! ^^09:19
gibiYumeng: cool. I added it to the nova etherpad09:19
bauzasSR-IOV but how ?09:19
Yumengbauzas: yes, exactly!09:19
gibiYumeng, bauzas: yeah that is my next request, please try to prepare some material before the discussion09:20
bauzasYumeng: okay, but why do you need all contributors ?09:20
bauzasin nova and neutron ?09:20
bauzaswhat kind of feature request would you want to do in Cyborg ?09:20
gibi^^ +109:20
YumengWe will try to write a spec to describe this topic. And one of our team member from Intel is preparing a POC.09:21
Yumengyes, a lot question should be answered. ^^09:21
bauzasYumeng: before writing a spec, can you just summarize the reason ?09:22
bauzasYumeng: just saying, we still need to work on providing the NUMA topologies in Placement and then providing the PCI devices09:28
bauzasand when we discussed about SR-IOV to be in Placement, we said 'well, why not, but it's a mid-term issue'09:29
bauzasat least we first need to do the above before ^09:29
bauzasso, even if we would like to do it for Victoria, it wouldn't be possible09:29
bauzasYumeng: that's why I just want to be honest to say that I'm not sure we could do anything for you during Victoria09:30
bauzasand then, not sure we would need to ask all the contributors to be around just for this09:30
bauzassorry, 'this' being the virtual PTG09:31
bauzasYumeng: what we *could* do tho is to discuss about that once you provide some docs, maybe just an etherpad for explaining what you'd like09:31
*** songwenping_ has quit IRC09:32
*** songwenping_ has joined #openstack-nova09:32
Yumeng bauzas,gibi: the use case is for smartnic and the main integration feature is to pass device_profile from nova to neutron to create a port.09:32
openstackgerritHuaqiang Wang proposed openstack/nova-specs master: Use PCPU and VCPU in one instance  https://review.opendev.org/72580109:34
bauzasYumeng: that's not an usecase, it's rather a possible solution ;)09:35
bauzasYumeng: why do you need to pass this argument to Neutron ?09:35
*** avolkov has joined #openstack-nova09:39
YumengI think this doc makes sense: https://etherpad.opendev.org/p/Shanghai-Neutron-Cyborg-xproj09:39
*** masterpe has quit IRC09:39
Yumengplease check from line 41 to end.09:40
*** irclogbot_3 has quit IRC09:41
*** irclogbot_0 has joined #openstack-nova09:42
Yumengand line 46 describes why we need to pass the device_profile argument to neutron09:42
openstackgerritStephen Finucane proposed openstack/nova master: Fix ut error on Aarch64(And other non-x86 platform)  https://review.opendev.org/71316309:42
gibiYument so a smartnic provides both a programable device and an SRIOV NIC at the same time and the end user would like to specify what program should be loaded to the smarnic and then use that program via a SRIOV device09:46
gibiis this a correct view?09:46
Yumenggibi: yes,exactly. I think your are right.09:48
gibibauzas: does this help you approaching the problem?09:52
bauzasgibi: I'm literrally reading the etherpad09:52
bauzasgibi: but I don't get tho why nova needs to tell Neutron what Cyborg wants09:53
bauzaswe already call Cyborg at boot time09:53
bauzasso Cyborg could ping Neutron09:53
bauzasthe less informations Nova would have to carry, the better I think it would be :)09:54
bauzasalso, with us stuck with neutron late binding, this would require subtantial change in nova IIUC09:55
gibibauzas: current SRIOV support works by creating a port in neutron and providing that port in the nova boot request. if cyborg creates the port in neutron during the boot then we need a way in nova to know about such port09:55
bauzasgibi: I don't want to change this09:55
bauzasgibi: I'm just saying there is a piece of metadata information that neutron has to know09:55
bauzaswhich is carried by Nova09:56
*** brinzhang_ has joined #openstack-nova09:56
bauzasbut which could be provided by Cyborg before the nova port creation call09:56
gibisure. I'm not suggesting to change this. I just noted down the logical problem about letting cyborg handle neutron09:56
gibibauzas: nova does not create the neutron port for sriov09:56
gibithose ports need to be pre-created and provided in the boot request to nova09:56
gibiafaik09:57
bauzasoh you're right09:57
*** masterpe has joined #openstack-nova09:57
bauzaswe don't late bind09:57
bauzasactually, we bind09:57
bauzasbut we don't create09:57
*** brinzhang has quit IRC09:58
*** martinkennelly has joined #openstack-nova10:02
*** songwenping_ has quit IRC10:04
Yumengbauzas: emm IMHO, cyborg doesn't create the port in neutron during the boot. When Cyborg discovers SRIOV device and stores the device's info, user need to use Cyborg create a port/bandwith request, i.e. device_profile10:05
bauzasyeah I was confused, my apologies10:05
bauzasso, there is a pre-creation step10:06
bauzasand then nova gets the port at boot time10:06
Yumengyes10:06
bauzasOK, I think I still need to go thru the etherpad to understand the reasoning but that's a good first step10:10
*** rpittau is now known as rpittau|bbl10:10
Yumenggibi,bauzas: Thanks for your time!  I am going off line now. my time is UTC+8. You can either ping me in IRC at #openstack-cyborg channel if you have more question. or just leave comments at ehterpad(topic 13):https://etherpad.opendev.org/p/cyborg-victoria-goals https://etherpad.opendev.org/p/cyborg-victoria-goals10:10
Yumengbauzas: thanks for taking time reading it!10:11
gibiYumeng: thanks I tried to add some not the nova etherpad.10:11
bauzasgibi: fwiw, playing with stestr run --until-failure and then stestr --last helps me reproduce the failure everytime10:11
Yumenggibi: ok, thanks. I will go back to nova ehterpad later.10:11
gibibauzas: meantime I was able to reproduce the problem only runnig the single test case test_multiple_instance_create$ so it is not test case interference10:12
gibi(it took 356 runs though)10:12
gibiYumeng: OK. have a nice evening10:12
bauzasyou mean, the single test method() without nova.tests.functional.libvirt.test_vgpu.VGPUTests.test_multiple_instance_create_filling_up_capacity ?10:12
bauzasYumeng: ack, will do10:13
*** Yumeng has left #openstack-nova10:13
gibibauzas: yes a single test method nova.tests.functional.libvirt.test_vgpu.VGPUTests.test_multiple_instance_create [7.793940s] ... FAILED10:13
bauzasgibi: if so, the race is probably because we need to hold for a few things10:13
gibibauzas: yeah it is most likele a missing wait_for_<something>  at some point in the test10:14
bauzasgibi: lemme see if adding a pause of 1 sec will just solve the problem10:14
gibiack10:14
bauzasgibi: because what's failing is that we lookup the mdevs10:14
bauzasgibi: we should potentially wait a bit10:14
bauzasthis being said, we wait for the ACTIVE status10:15
bauzasoh wait10:15
bauzasthis is a multiple instance creation10:15
bauzasso we only check the first instance creation10:15
gibithat could explain it10:16
bauzaslemme try10:16
bauzas--until-failure allows me to run the test easily10:16
* bauzas tries to remember in test_servers how we managed the multi-instance creation checks10:17
bauzasahah, interesting10:18
bauzaswe don't hold10:18
*** derekh has joined #openstack-nova10:21
*** Liang__ has quit IRC10:25
bauzasgibi: okay, changed my test to wait for all servers, running it10:26
gibicool10:26
bauzasgibi: will leave it run for one hour or so10:26
*** jazeltq has joined #openstack-nova10:26
gibiif you push the patch up I can do the same while I have lunch10:26
bauzasand if I no longer see any issues after, say, 200 runs, then we will know it's fixed10:26
bauzask10:27
bauzasgibi: just writing a bug and uploading then10:27
* bauzas rushes as my daughters don't look at Netflix yet10:27
gibiack10:28
bauzasah shit, they started it :p10:29
bauzashttps://bugs.launchpad.net/nova/+bug/187728110:30
openstackLaunchpad bug 1877281 in OpenStack Compute (nova) "vGPU multiple instance creation test is racey" [High,Confirmed] - Assigned to Sylvain Bauza (sylvain-bauza)10:30
bauzasgibi: worth thinking about RC3 for this one ? :(10:31
bauzasor, maybe we just release and then backport later10:31
bauzasthe issue isn't that bad10:31
jazeltqhello gibi10:32
*** ociuhandu has quit IRC10:32
jazeltqI have one question about nova-scheduler10:32
*** ociuhandu has joined #openstack-nova10:33
openstackgerritSylvain Bauza proposed openstack/nova master: Wait for all servers to be active when testing vGPUs  https://review.opendev.org/72608710:35
bauzasgibi: ^10:36
* bauzas goes to lunch (while running the tests locally)10:36
bauzasgibi: FYI (functional) [sbauza@sbauza nova]$ stestr --test-path=./nova/tests/functional run --until-failure nova.tests.functional.libvirt.test_vgpu.VGPUTests.test_multiple_instance_create10:37
*** ociuhandu has quit IRC10:38
stephenfinmelwitt: When you're about, can you take a look at https://review.opendev.org/#/c/677444/ please?10:42
*** ociuhandu has joined #openstack-nova10:42
*** mkrai has quit IRC10:48
gibibauzas: I don't think we need an RC for a racy func test. It is OK to merge it after GA10:48
*** jazeltq has quit IRC10:59
*** ociuhandu has quit IRC11:04
*** ociuhandu has joined #openstack-nova11:04
*** brinzhang has joined #openstack-nova11:05
*** brinzhang_ has quit IRC11:08
*** ociuhandu has quit IRC11:10
*** jsuchome has joined #openstack-nova11:12
*** brinzhang_ has joined #openstack-nova11:23
*** brinzhang has quit IRC11:25
*** mgariepy has quit IRC11:44
*** artom has quit IRC11:52
*** brinzhang has joined #openstack-nova11:53
*** nweinber has joined #openstack-nova11:54
*** brinzhang_ has quit IRC11:56
*** songwenping_ has joined #openstack-nova11:58
bauzasgibi: okay, after 4560 seconds, no errors were found12:00
gibibauzas: no error seen in 412 runs12:00
gibiso I agree you fixed it!12:01
gibinice work12:01
*** brinzhang_ has joined #openstack-nova12:10
*** songwenping__ has joined #openstack-nova12:10
*** rpittau|bbl is now known as rpittau12:11
*** ygk_12345 has joined #openstack-nova12:12
*** songwenping_ has quit IRC12:13
*** brinzhang has quit IRC12:13
*** brinzhang_ has quit IRC12:13
*** ociuhandu has joined #openstack-nova12:25
*** raildo has joined #openstack-nova12:25
efriedstephenfin: go ahead and update that commit message. I'm here to re+1, for all the good it'll do ya.12:27
*** ociuhandu has quit IRC12:28
*** ociuhandu has joined #openstack-nova12:28
openstackgerritStephen Finucane proposed openstack/nova master: config: Explicitly register 'remote_debug' CLI opts  https://review.opendev.org/67744412:35
bauzasstephenfin: https://review.opendev.org/#/c/726087/1 closing https://bugs.launchpad.net/nova/+bug/187728112:35
openstackLaunchpad bug 1877281 in OpenStack Compute (nova) "vGPU multiple instance creation test is racey" [High,In progress] - Assigned to Sylvain Bauza (sylvain-bauza)12:35
stephenfinefried: ack, thanks12:35
stephenfinbauzas++12:35
efriedstephenfin: done (with bonus)12:37
* stephenfin loves bonus stuff \o/12:38
stephenfinbauzas: I have an ask on https://review.opendev.org/#/c/726087/12:39
stephenfinif you don't want to, I can live with it as-is12:40
*** mgariepy has joined #openstack-nova12:40
*** udesale_ has joined #openstack-nova12:40
*** songwenping_ has joined #openstack-nova12:42
*** ratailor has quit IRC12:42
*** ratailor has joined #openstack-nova12:43
*** udesale has quit IRC12:43
bauzasstephenfin: no worries, I can write a new rev12:43
bauzasthere is no urgency to fix this race12:44
bauzaswe only get it for around 0.5%12:44
*** songwenping__ has quit IRC12:45
openstackgerritSylvain Bauza proposed openstack/nova master: Wait for all servers to be active when testing vGPUs  https://review.opendev.org/72608712:45
bauzasstephenfin: fer your plizure ^12:46
stephenfin+"12:46
stephenfin*212:46
*** ratailor has quit IRC12:56
*** spatel has joined #openstack-nova12:58
*** lbragstad has joined #openstack-nova13:12
*** ttsiouts has quit IRC13:17
francoispbauzas, salut, when you have time: https://review.opendev.org/#/c/725272/ (this time for Rocky) - thanks13:18
*** psachin has quit IRC13:20
*** ociuhandu has quit IRC13:20
*** ociuhandu_ has joined #openstack-nova13:20
*** ttsiouts has joined #openstack-nova13:21
*** ttsiouts has quit IRC13:25
*** HagunKim has quit IRC13:29
*** songwenping_ has quit IRC13:32
*** songwenping_ has joined #openstack-nova13:32
*** jhesketh has quit IRC13:43
*** jangutter has joined #openstack-nova13:45
*** artom has joined #openstack-nova13:45
*** artom has quit IRC13:46
*** maciejjozefczyk has quit IRC13:46
*** maciejjozefczyk has joined #openstack-nova13:46
*** artom has joined #openstack-nova13:46
*** jhesketh has joined #openstack-nova13:47
*** maciejjozefczyk_ has joined #openstack-nova13:57
*** maciejjozefczyk has quit IRC13:58
*** belmoreira has quit IRC14:00
*** belmoreira has joined #openstack-nova14:00
*** songwenping_ has quit IRC14:06
*** songwenping_ has joined #openstack-nova14:06
*** ociuhandu_ has quit IRC14:09
*** ociuhandu has joined #openstack-nova14:10
*** slaweq_ has joined #openstack-nova14:14
*** slaweq has quit IRC14:14
*** ociuhandu has quit IRC14:15
*** ygk_12345 has left #openstack-nova14:19
*** ociuhandu has joined #openstack-nova14:24
bauzasfrancoisp: ack, will try to do it asap14:27
francoispbauzas, thank you!14:27
*** efried has quit IRC14:28
*** ociuhandu has quit IRC14:29
*** ociuhandu has joined #openstack-nova14:29
*** tosky_ has joined #openstack-nova14:29
*** slaweq_ has quit IRC14:32
openstackgerritJiri Suchomel proposed openstack/nova-specs master: Add spec for downloading images via RBD  https://review.opendev.org/57280514:32
*** tosky has quit IRC14:32
*** slaweq has joined #openstack-nova14:34
openstackgerritTakashi Kajinami proposed openstack/nova master: Check py3 fix in python-barbicanclient  https://review.opendev.org/72616414:36
*** dklyle has joined #openstack-nova14:38
*** tkajinam has quit IRC14:43
*** mriedem has joined #openstack-nova14:53
*** vishalmanchanda has quit IRC14:54
openstackgerritSasha Andonov proposed openstack/nova master: rbd_utils: increase _destroy_volume timeout  https://review.opendev.org/70576414:57
*** tosky_ is now known as tosky15:04
*** sapd1_x has joined #openstack-nova15:21
*** jdillaman has quit IRC15:23
*** jdillaman has joined #openstack-nova15:24
*** kaisers_ has quit IRC15:25
*** songwenping__ has joined #openstack-nova15:27
*** songwenping_ has quit IRC15:30
*** gyee has joined #openstack-nova15:32
*** nicolasbock has quit IRC15:32
*** nicolasbock has joined #openstack-nova15:32
*** songwenping_ has joined #openstack-nova15:43
*** songwenping__ has quit IRC15:45
gibinova meeting starts in 15 minutes on #openstack-meeting-315:45
*** priteau has joined #openstack-nova15:45
*** spatel has quit IRC15:49
*** spatel has joined #openstack-nova15:50
*** mgariepy has quit IRC15:52
openstackgerritStephen Finucane proposed openstack/nova master: docs: Rewrite 'nova-manage db' docs to use proper directives  https://review.opendev.org/67750916:15
openstackgerritStephen Finucane proposed openstack/nova master: docs: Use proper headers in nova-manage man page  https://review.opendev.org/72619116:15
openstackgerritStephen Finucane proposed openstack/nova master: docs: Rewrite 'nova-manage api_db', 'cell_v2' docs  https://review.opendev.org/72619216:15
openstackgerritStephen Finucane proposed openstack/nova master: docs: Rewrite 'nova-manage placement' docs  https://review.opendev.org/72619316:15
openstackgerritStephen Finucane proposed openstack/nova master: docs: Only use section headers for HTML  https://review.opendev.org/72619416:15
openstackgerritStephen Finucane proposed openstack/nova master: docs: Document options for nova-*proxy  https://review.opendev.org/72619516:15
*** dtantsur is now known as dtantsur|afk16:28
*** belmoreira has quit IRC16:29
*** jdillaman has quit IRC16:32
*** rpittau is now known as rpittau|afk16:34
*** evrardjp has quit IRC16:36
*** evrardjp has joined #openstack-nova16:36
*** songwenping__ has joined #openstack-nova16:43
openstackgerritShi Yan proposed openstack/nova master: Fix aggregate placement sync issue  https://review.opendev.org/72620116:44
*** songwenping_ has quit IRC16:45
openstackgerritMerged openstack/os-vif master: Remove six  https://review.opendev.org/72496616:47
openstackgerritMerged openstack/os-vif master: Update master for stable/ussuri  https://review.opendev.org/72471416:47
openstackgerritMerged openstack/os-vif master: Add Python3 victoria unit tests  https://review.opendev.org/72471516:47
*** udesale_ has quit IRC16:52
*** derekh has quit IRC16:53
*** jdillaman has joined #openstack-nova16:54
*** mgariepy has joined #openstack-nova16:56
artomgmann, continuing from #-meeting, yeah, sounds like more discussion would have been nice17:01
gmannstephenfin: gibi py3.8 functional job as voting on check and gate pipeline - https://review.opendev.org/#/c/725372/17:02
artomI just... I don't get the opposition. Like, yeah, the name could have been better chosen, and yeah, we may want to improve it in the future17:02
artomBut as long as we document what it does for the former, and make it evolve-able for the latter, why not do it?17:02
stephenfingmann: Oh, it works now? I thought there were issues with mock from Python 3.817:03
melwittartom: well, I think we need to know whether it's extendable right? we don't fully understand that yet, do we?17:03
gmannartom: i will say if old things have limit and need more clarity then documenting those is fine but any future thing and we can better design so better design is my preference than going with document-limits17:04
*** toabctl has quit IRC17:04
artomgmann, agreed, but we were pre-empted by projects with lesser standards than ours ;)17:04
gmannstephenfin: i think it worked. let me again check log that it is correctly doing py3.817:05
melwittlike can we build a health data cache in that middleware or is there something about it that would preclude doing that? I don't think basic investigation has been done17:05
artommelwitt, so, I'd be OK with just 2 URLs (and I know dansmith will hate me for that) - /healthcheck for unauthed, unversioned HTTP alice checks, and something like /status that's authed and versioned and we can evolve17:05
melwittthat's backward, /status should be the ping check17:05
gmannartom: that is something to fix since starting of openstack :), making uniform things across all projects is still an issue :).17:06
melwittand so to me seems worse to do that, expose two things that are doing the opposite of their names17:06
gmannstephenfin: it does and worked fine - https://zuul.opendev.org/t/openstack/build/bc80a0fe21354e10b81b1e4a4db35356/log/job-output.txt#117217:07
artommelwitt, so find a way for /healthcheck to do both?17:07
melwittyeah, at least do basic due diligence on it. look at the middleware enough to determine whether we could extend the health check to actually be a health check. or if we would need to expose it some other way in order to be able to extend it. that's mho17:08
stephenfingmann: okay. comments left17:09
gmannlet me do some poc for extend things.but may be next week i will be able to do. so that we can check if current proposed things can be done in compatible way with extendable.17:10
stephenfinwho's actually going to work on this extended health check?17:10
*** nightmare_unreal has quit IRC17:10
stephenfinI mean, without someone signed up to do that work, isn't this all effectively bikeshedding?17:11
* bauzas missed the consensus about healthcheck, some tl;dr: folks ?17:11
artombauzas, 💣17:11
bauzas\o/17:11
artomstephenfin, look like gmann wil try it? I kinda wanna have a whack at it too17:12
*** salmankhan1 has joined #openstack-nova17:13
melwittI'm also willing to help work on it (if need be) since I know at least some operators would want it to be a real health check17:13
dansmithme too, I have some ideas on how to make it useful without requiring auth17:14
bauzasagain, a good healthcheck is something about L7 checks17:14
bnemecNot a tldr, but most of the previous discussions are linked on https://storyboard.openstack.org/#!/story/200143917:14
gmann+117:14
bauzaswhat dansmith said17:14
zigoThere's already some extra parameters to give to /healthcheck with the Accept: header, so we could extend that and add more headers to do deeper checks...17:14
bnemecThere's also the detailed opt in the middleware that already allows operators to choose how much they want checked.17:15
zigoX-OpenStack-Check-Type: full-check17:15
artomSo we're all just going to have a massive healthcheck party? :)17:15
bauzasthis being said, hitting the WSGI nova v2 app could be sufficient IMHO17:15
artombauzas, that's a '300 multiple choices', not '200 OK', and is different from other projects17:15
bauzasand AFAIK, we can hit it without being auth'd17:15
bauzasartom: I'm just talking of the healthcheck being part of the WSGI app17:16
bauzasand returning 20017:16
bauzasnot operators using the existing app call17:16
artombauzas, you mean not using the middleware at all?17:16
artomIt'd be easier to make it extensible, yah17:16
bauzasthe middleware just verifies that the WSGI server runs, honestly17:17
bauzasand even worst, it's now a new app with another endpoint17:17
*** salmankhan has quit IRC17:17
bauzaswhich means that the wsgi main app can silently drop and the healthcheck would still tell 'heh, fine enough'17:17
bauzashaving it as a separate app or a middleware just means (again) that you verify uswgi, that's it17:18
*** salmankhan1 has quit IRC17:18
*** ociuhandu has quit IRC17:19
* zigo goes for dinner and will read the backlog later, enjoy trolling folks! :)17:21
openstackgerritGhanshyam Mann proposed openstack/nova master: Moving functional jobs to Victoria testing runtime  https://review.opendev.org/72537217:25
*** gmann is now known as gmann_afk17:25
*** priteau has quit IRC17:25
openstackgerritGhanshyam Mann proposed openstack/nova master: Moving functional jobs to Victoria testing runtime  https://review.opendev.org/72537217:26
* bauzas goes off too17:31
bauzasgibi: won't be there tomorrow, yet again bank holiday17:32
*** jsuchome has quit IRC17:43
sean-k-mooneyartom: some service allwo the middelware to be customisable other hardcode it17:49
sean-k-mooneyim not sure what nova does17:50
artomsean-k-mooney, nova doesn't do anything yet.17:50
artom(And with how things are going, probably never will :P )17:50
sean-k-mooneyartom: i mean in terms of middelware enabling17:50
sean-k-mooneyi cant rememebre if we hard code the list or if you can custimise it by a file17:50
*** ccamacho has quit IRC17:51
*** ralonsoh has quit IRC17:51
melwittI don't think we need to be so negative about it, just want there to be some thought put into what we're doing before pulling the trigger17:51
sean-k-mooneyartom: we do some of it via https://github.com/openstack/nova/blob/master/etc/nova/api-paste.ini17:52
sean-k-mooneyim not entirly shure how that works but i think we can add more midelware or customise it via that file17:53
melwittsean-k-mooney: do you know a lot about the middleware? the question I have is would it in anyway preclude us from doing a real health check in the future, such as accessing a cache built lazily in the background containing db/mq etc healthy kind of info?17:53
sean-k-mooneymelwitt: i do not17:53
sean-k-mooneyi assume we coudl swap out the implemenation in the future17:53
sean-k-mooneywith our own version17:54
sean-k-mooneyprovided it still returned the correct info17:54
sean-k-mooneymelwitt: i assume you would want to check if we can still acess the db and if the conductors are alive17:55
sean-k-mooneyor something like that17:55
melwittyeah, something like that. dansmith idea was to gather those lazily while normal requests are going, and an unauth'ed healthcheck could check the cache17:55
sean-k-mooneythat sound like it shoudl also be doable i think the current healthcheck midelware would jsut serve to say hay the api is running17:55
melwittright, as-is it checks whether wsgi is working17:56
sean-k-mooneymelwitt: dont we have heartbeats for each service in service groups or something17:56
sean-k-mooneyand the status is stored in the db17:56
melwittand we were talking about having that be the first version, then next we'd extend it to do real health checks17:56
*** ociuhandu has joined #openstack-nova17:57
melwittand we're looking to have a plan for this and the first thing to know is whether going forward with the middleware way of doing it would lock us in a corner in a way we couldn't extend it17:57
melwittor if we need to expose /healthcheck in a different way17:57
melwittwe do have heartbeats for each service via the service groups stuff yes17:58
sean-k-mooneymelwitt: so this is how neturon enables it17:59
sean-k-mooneyhttps://opendev.org/openstack/neutron/src/branch/master/etc/api-paste.ini#L47-L4817:59
melwittgmann_afk is going to investigate next week but since you mentioned the middleware I was curious if you knew more about it already17:59
sean-k-mooneyso we could just copy past that17:59
sean-k-mooneyand we woudl get healtcheck endpoint17:59
melwittyeah that's what's proposed already17:59
sean-k-mooneyoperator can already do that too17:59
sean-k-mooneythey just need to add it17:59
sean-k-mooneybut then we coudl cange the class to a nova one later18:00
sean-k-mooneyso paste.filter_factory = oslo_middleware:Healthcheck.factory would be paste.filter_factory = nova.middelware:Healthcheck.factory18:00
sean-k-mooneyor something like that18:00
melwittok18:00
melwittand then present the same header/response options, same interface18:01
melwittI wonder if we can subclass the middleware, that might be likely. anyway, gmann_afk is gonna figure it out :)18:01
sean-k-mooneycool it looks like this is the midealware code https://opendev.org/openstack/oslo.middleware/src/branch/master/oslo_middleware/healthcheck18:02
sean-k-mooneywe could derive form the Healthcheck class i thikn18:02
*** ociuhandu has quit IRC18:03
bnemecHealthcheck is pluggable.18:03
sean-k-mooneythen override process_request https://opendev.org/openstack/oslo.middleware/src/branch/master/oslo_middleware/healthcheck/__init__.py#L54418:03
* bnemec does not have context for this yet18:03
sean-k-mooneybnemec: yes nova does not use the middelware at all by default18:04
sean-k-mooneybnemec: but if we start enabling it we would like something more advanced that check if critical serivce like the scheduler and condcutoer were alive18:04
sean-k-mooneyrather then jsut saying yes the api is alive18:04
bnemecRight, I'm saying you don't need to reimplement it, just write a plugin for those checks.18:05
sean-k-mooneybnemec: right and we woudl do that by inheritinbg and then overriding process_request?18:05
sean-k-mooneybnemec: do you have an example we could follow?18:05
bnemecNo, it's a stevedore-based plugin system. You just register your plugin in the appropriate namespace.18:06
bnemechttps://github.com/openstack/oslo.middleware/blob/master/oslo_middleware/healthcheck/__init__.py#L38818:06
sean-k-mooneybnemec: sure but we dont kwno what the plugin should do18:06
sean-k-mooneye.g. what i shoudl retrun18:06
sean-k-mooneywhat the api for this is as a user18:06
bnemechttps://github.com/openstack/oslo.middleware/blob/master/oslo_middleware/healthcheck/pluginbase.py18:07
sean-k-mooneyah perfect18:07
sean-k-mooneyso we have to retrun a HealthcheckResult18:07
bnemecRight18:08
sean-k-mooneythere are some examples of the respocnes here too https://opendev.org/openstack/oslo.middleware/src/branch/master/oslo_middleware/healthcheck/__init__.py#L68-L16318:08
sean-k-mooneybnemec: anyway im sure gmann_afk will figure it out or be in touch18:10
*** ociuhandu has joined #openstack-nova18:11
sean-k-mooneybnemec: what i was more interested was if there was a standard way to report erros and infomation when it not healty18:11
sean-k-mooneyif not and we ant it to be consumable in an automated fashion then we would have too treat it like a version notification or other api resonce and managen and document it in our api ref18:12
sean-k-mooneyhttps://opendev.org/openstack/oslo.middleware/src/branch/master/oslo_middleware/healthcheck/disable_by_file.py i guess are the example plugins18:13
bnemecYep, versioning these checks is definitely something we should work out on the spec.18:13
bnemecLooks like it, yeah.18:14
sean-k-mooneyyeah so other then establishing conventiosn this look like its not much code to add on our side to have a funtioning plugin18:15
sean-k-mooneythen we can just make the healt check more advanced and imporve the info we return iteritivly18:15
dansmithI'm surprised that nobody thinks that exposing python version, gcc version, distro, version, kernel version, etc to everyone unauthenticated is not a huge problem18:19
dansmithbecause all those things are included in the healthcheck response without any recourse, as far as I can tell18:20
artomdansmith, only if you configure detailed=True in the paste file18:20
artomWhich nobody should ever do18:20
dansmithah, okay, I thought that was a query param18:20
artomIt isn't - at least, based on what I saw18:21
mriedemif your healthcheck api would give me some credentials i could do some health checking of my own, thanks18:22
dansmithso the details of the healthchecks aren't either, other than the reason thing18:22
dansmithokay, so in that case, the config causes us to return different data types18:25
artomMy own personal healthcheck api is to down a pint of beer. Still awak = 200 OK, pass out = 503 service unavailable18:25
*** efried has joined #openstack-nova18:26
dansmithwhich is unfortunate.. if nobody should ever turn on detailed=true, then hopefully nothing will ever see the conflicting structure, but..18:26
artomdansmith, I suppose you could if you're absolutely certain your Nova API endpoints aren't accessible from any unsafe network18:26
artomLike, you trust your firewalls 110%18:26
dansmithhmm, I don't see the if detailed switch in the html version tough18:27
*** ociuhandu has quit IRC18:27
dansmithso the html version is always detailed?18:28
dansmithor maybe the default template doesn't include those things?18:28
dansmithoh I see, conditionals in the template, although I can't eye-parse what that all looks like by default18:29
dansmithah, I see the if detailed at the top18:30
*** efried has quit IRC18:30
*** efried has joined #openstack-nova18:31
*** ociuhandu has joined #openstack-nova18:32
sean-k-mooneydansmith: so yes detailed is a vaild security consern. im not sure if we are required to exose any of the detailed info you listed18:33
sean-k-mooneydansmith: e.g. if we can retrun different detailed infor per service that is less scare form a security point of view18:33
sean-k-mooneyliek the equivalent of a service list18:33
dansmithsean-k-mooney: no, it looks like that's all off unless show_details, but it's one switch to turn on details which then exposes a ton of internals, which may surprise someone18:34
sean-k-mooneyok so other then the last hour or so i have never bother looking at this in detail other then being aware it was configurable via api-paste.ini18:35
sean-k-mooneyso im ok with the idea of a healt check api endpoint. but im not sure it we want to use teh default implemantion18:35
dansmithright, well, the fact that it is configurable with api-paste means this is not an emergency :)18:35
sean-k-mooneythem more i lean about it the mor ei think no18:35
sean-k-mooneyyep18:36
sean-k-mooneyoperatoer can add it today if they want. if the should is a different matter18:36
dansmithso, it's totally fine to have a spec, get people on the same page here, and make sure we have a plan for how this happens now, soon, and later18:36
sean-k-mooneyi thikn with out detailed=true its ok but does not tell you much18:36
dansmithright18:36
dansmithsean-k-mooney: it tells you basically nothing about heath.. other than that, it's great! :)18:37
dansmith*health18:37
sean-k-mooneyya it tells you apache is running  or well the wsgi app18:37
sean-k-mooneyit does not tell you if nova is working18:37
sean-k-mooneywhich is where wrighing our own plugin would come in18:38
*** ociuhandu has quit IRC18:39
*** spatel has quit IRC18:39
sean-k-mooneydansmith: since your about can you take a look at https://review.opendev.org/#/c/724188/4 and the follow up18:40
*** maciejjozefczyk_ has quit IRC18:41
sean-k-mooneyits jsut silencing the amqp heartbeat warning18:41
dansmithsean-k-mooney: omg, tell me you were the one that wrote "heartbeart" originally18:43
sean-k-mooneyhehe for one i think not18:43
dansmithdamn.18:43
sean-k-mooneybut i mean i would not put it past me that is a very sean thing to do18:44
dansmithsean-k-mooney: jeez, I missed all the typos in the first commit message18:45
dansmithsean-k-mooney: can we please get those fixed since you'regoing to backport I assume?18:46
sean-k-mooneydo you want to comment on them18:46
dansmithlemme just fix inline18:46
sean-k-mooneyi can can fix them up with the other one18:46
sean-k-mooneydansmith: ya we have a downstream bug for osp 13 so im going to back prot this upstream as far as people are ok with18:46
sean-k-mooneydansmith: want to remvoe the +w on the first patch and ill fix all typos and resubmit shortly18:47
openstackgerritDan Smith proposed openstack/nova master: Silence amqp heartbeat warning  https://review.opendev.org/72418818:47
dansmithsean-k-mooney: I just edited the bottom one to kick it out18:47
sean-k-mooneyah ok cool18:47
dansmithsean-k-mooney: I can do the top if you want or you can18:47
dansmithsean-k-mooney: look over that edit I made and then I'll +W18:48
sean-k-mooneydamb it i misspelled heartbeats lol18:48
dansmithyou also misspelled "damn" :P18:49
dansmithsean-k-mooney: okay re +Wd the bottom one18:50
sean-k-mooneyi am honestly surpised i write code that works sometimes. if it wasnt for the fact im consitet in adjcent lines it totally would not work18:50
dansmithI honestly have no idea how you do, so.. there's that at least :)18:51
dansmithkeep working on that compiler for English and you'll be set :)18:51
dansmith(and rich)18:51
sean-k-mooney:)18:51
openstackgerritsean mooney proposed openstack/nova master: remove support of oslo.messaging 9.8.0 warning message  https://review.opendev.org/72472518:56
sean-k-mooneydansmith: stephenfin ^ ok that should have the typos fixed18:56
*** ociuhandu has joined #openstack-nova18:56
dansmithno need to wake him, I'll get it18:56
*** songwenping_ has joined #openstack-nova18:57
sean-k-mooneyare we still holding backport to stable/ussuri until the upstream release by the way18:57
* dansmith does't know18:58
sean-k-mooneythis obviosly isnt a RC candiate but ill start preparing the backports tomorow and we can merge them whenever18:58
melwittyeah there shouldn't be non RC related backports to stable/ussuri until after GA IIRC18:59
melwittmerged I mean18:59
sean-k-mooneycool ill mark the backport as workflow -1 untill after then18:59
melwitt+118:59
*** songwenping__ has quit IRC19:00
sean-k-mooneyok im going to get something to eat ttyl19:00
*** gmann_afk is now known as gmann19:06
*** sapd1_x has quit IRC19:06
*** ociuhandu has quit IRC19:07
*** ociuhandu has joined #openstack-nova19:10
*** ociuhandu has quit IRC19:15
*** yaawang has quit IRC19:16
*** dpawlik has quit IRC19:23
*** ociuhandu has joined #openstack-nova19:59
*** nweinber has quit IRC20:10
openstackgerritGhanshyam Mann proposed openstack/nova master: Moving functional jobs to Victoria testing runtime  https://review.opendev.org/72537220:21
*** yaawang has joined #openstack-nova20:32
*** tesseract has quit IRC20:36
*** jamesdenton has quit IRC20:38
*** jamesdenton has joined #openstack-nova20:38
*** martinkennelly has quit IRC20:47
*** jangutter_ has joined #openstack-nova20:51
*** jangutter_ has quit IRC20:52
*** jangutter_ has joined #openstack-nova20:52
*** jangutter has quit IRC20:54
*** jangutter has joined #openstack-nova20:55
*** jangutter_ has quit IRC20:58
*** ociuhandu has quit IRC21:13
*** ociuhandu has joined #openstack-nova21:14
*** ociuhandu has quit IRC21:19
*** ociuhandu has joined #openstack-nova21:20
*** hemna has quit IRC21:26
*** ociuhandu has quit IRC21:36
*** ociuhandu has joined #openstack-nova21:36
*** avolkov has quit IRC21:38
*** hemna has joined #openstack-nova21:39
*** ociuhandu has quit IRC21:42
*** raildo has quit IRC21:45
openstackgerritMerged openstack/nova stable/rocky: Add retry to cinder API calls related to volume detach  https://review.opendev.org/72527221:50
zigosean-k-mooney: It's like this in Neutron because I wrote a wrong patch, that method in api-paste.ini is deprecated in the favor of the app thingy.21:52
zigodansmith: re [ exposing versions of verything ] what I'd like to see happen, with my operator hat, is having the healthcheck be 1/ unauth 2/ only answering for who I decided, that is only from the haproxy IP, with a check of the X-Forwarded-For header to make sure only who I decide can read the healthcheck (ie: not everyone on the internet, just 10.x.x.x in my case).21:58
*** xek has quit IRC22:04
*** mriedem has left #openstack-nova22:04
sean-k-mooneyzigo: well api-paste.ini was the old way to config midileware in general22:09
sean-k-mooneyits still what nova uses22:09
zigosean-k-mooney: What's the new way then?22:09
sean-k-mooneymainly because its not broke22:10
sean-k-mooneywell im not sure if it has been replace but i know some service decied to not make it configurable and just hard code the middel ware they support22:10
sean-k-mooneyso the remove the api-paste.ini file entirely22:11
*** slaweq has quit IRC22:14
*** slaweq has joined #openstack-nova22:20
*** slaweq has quit IRC22:24
*** hoonetorg has quit IRC22:34
*** hoonetorg has joined #openstack-nova22:46
*** tkajinam has joined #openstack-nova22:49
*** spotz has quit IRC23:16
*** tosky has quit IRC23:36
*** mlavalle has quit IRC23:40
*** tetsuro has joined #openstack-nova23:52
*** gyee has quit IRC23:54

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!