Wednesday, 2021-10-20

opendevreviewMerged openstack/nova stable/ussuri: Fix request path to query a resource provider by uuid  https://review.opendev.org/c/openstack/nova/+/80500400:01
melwittstephenfin: I noticed that when I try to run a subset of unit tests for db like "tox -epy38 nova.tests.unit.db" the tests fail with "oslo_db.sqlalchemy.enginefacade.AlreadyStartedError: this TransactionFactory is already started". I assume it's related to the db migration stuff but I don't understand how. any ideas?00:31
sean-k-mooney[m]isnt that the error we fixed in the past with the run once decorator00:32
sean-k-mooney[m]when it was reinitalised after SIG_HUP00:32
sean-k-mooney[m]im not really sure why you would see that in the unit tests however00:33
melwittsean-k-mooney[m]: yeah but this is unit tests? when you run all tests everything passes, when you run only nova.tests.unit.db you get 70+ failures ¯\_(ツ)_/¯00:34
sean-k-mooney[m]melwitt regarding self +1 ill check i think it might be possible to do yes by defining on +0  as allowed for the owner00:34
sean-k-mooney[m]so the owner of the patch would be allowed to clear the review priorty but not set it to +1 or +200:35
melwittsean-k-mooney[m]: ack. I brought it up as a possible option for making sure the +1 doesn't get used as a "ping" since some were concerned about that00:35
sean-k-mooney[m]ill see if i can create a draft patch tomorow and include that and we can discuss it in the review00:36
melwittcool00:37
sean-k-mooney[m]regarding the db failure i have not run them in a few days but i can try it in the morning00:37
sean-k-mooney[m]is it only when you filter00:37
sean-k-mooney[m]or do they fail if you run all them00:37
melwittyes only when filtering00:37
sean-k-mooney[m]ok that is odd00:37
melwitteverything passes if you run all00:37
sean-k-mooney[m]im not sure why filtering would break them00:37
melwittme neither. it's weird00:38
sean-k-mooney[m]maybe stephenfin will spot something00:38
melwittit might only be just me but I tried wiping everything, upgrading tox, re-cloning the nova repo00:39
sean-k-mooney[m]i wonder if there was  an sqlalchemy release or soemthing that has chnaged behaivor00:39
sean-k-mooney[m]so ya just quickly ssh to my home server cloned nova and ran the db test and ya it failed for me too00:52
sean-k-mooney[m]i only got 16 failure but it proably not deterministic00:53
sean-k-mooney[m]its likely differnt based on the number of cores/parallel tests we have00:53
sean-k-mooney[m]the server i ran it on had 48 threads so there will be less tests running in the same process since it will use more of them00:53
melwittoh, yeah ok00:54
sean-k-mooney[m]looks like its coming direclty form the db fixture which is od00:55
sean-k-mooney[m]*odd00:55
melwittyeah00:55
melwittI have run tests like this subset a lot of times in the past and this is the first time I see it fail like this. so I figure it must be a fairly recent change00:56
sean-k-mooney[m]im wondering if its a recent nova change or oslo.db or sqlalchmy version change00:56
melwittyeah I don't find sqlalchemy in my tox env which ... I don't understand00:58
sean-k-mooney[m]i have 1.4. something01:01
sean-k-mooney[m]i downgraded its uppercase in pip freeze01:02
sean-k-mooney[m]but downgrading it and oslo.db had no reall effect so i looks like that is not the issue01:02
sean-k-mooney[m]or rather it happens with oslo.db 11 and 10 and sqlalchmey 1.4.x and 1.3.x01:03
sean-k-mooney[m]so ya maybe a recent nova change01:03
sean-k-mooney[m]ill try and take a look again in the morning.01:04
melwittoh derp01:04
melwitthaha thanks01:04
sean-k-mooney[m]night all o/01:04
melwittSQLAlchemy==1.4.2501:04
melwittgnite o/01:04
bauzasgood morning Nova06:13
opendevreviewPierre-Samuel Le Stang proposed openstack/nova master: Fix instance's image_ref lost on failed unshelving  https://review.opendev.org/c/openstack/nova/+/80755107:13
gibimorning08:30
gibibauzas: do we start with the tox.ini python version pinning topic today at 13:00 UTC?09:33
bauzasI was thinking so09:33
bauzasgibi: updated the etherpad to make it clear where we start09:36
gibibauzas: thanks, I will gather my thoughts then09:36
opendevreviewBalazs Gibizer proposed openstack/nova master: Query ports with admin client to get resource_request  https://review.opendev.org/c/openstack/nova/+/81139609:41
opendevreviewBalazs Gibizer proposed openstack/nova master: Enable min pps tempest testing in nova-next  https://review.opendev.org/c/openstack/nova/+/81174809:41
opendevreviewBalazs Gibizer proposed openstack/nova master: Avoid unbound instance_uuid var during delete  https://review.opendev.org/c/openstack/nova/+/80560509:48
opendevreviewBalazs Gibizer proposed openstack/nova master: [nova-manage]support extended resource request  https://review.opendev.org/c/openstack/nova/+/80206009:48
opendevreviewBalazs Gibizer proposed openstack/nova master: Reno for qos-minimum-guaranteed-packet-rate  https://review.opendev.org/c/openstack/nova/+/80504609:48
gibisean-k-mooney[m]: btw, there is a settings in gerrit "Set new changes to "work in progress" by default" so the gerrit dev also has similar thinking as you, that we need a separate way to signal when a patch is ready and it is not the push09:56
opendevreviewBalazs Gibizer proposed openstack/nova master: DNM: turn off two unit test cases  https://review.opendev.org/c/openstack/nova/+/81473510:23
gibisean-k-mooney[m], melwitt: I found the two offending unit test case that causes .AlreadyStartedError error if the db tests are run selectively ^^ 10:24
gibibut I don't know why they are breaking our test runs10:24
gibimore funnier. I can fix it, but I don't know why the fix helps :D10:31
opendevreviewBalazs Gibizer proposed openstack/nova master: Fix db migration unit test  https://review.opendev.org/c/openstack/nova/+/81473510:37
gibisean-k-mooney[m], melwitt: after this ^^ I cannot reproduce the problem locally any more. But I need somebody to help with explainig why this helps10:38
opendevreviewBalazs Gibizer proposed openstack/nova master: Fix db migration unit test  https://review.opendev.org/c/openstack/nova/+/81473510:40
stephenfingibi: gdi, good find10:59
stephenfinall that context manager stuff is global. I suspect we're doing something somewhere to break this global-ness but not when we only run those tests11:02
stephenfinhttps://github.com/openstack/nova/blob/master/nova/tests/unit/api/openstack/test_wsgi_app.py#L71-L7711:04
opendevreviewWenping Song proposed openstack/nova master: Report gpu devices that only configured  https://review.opendev.org/c/openstack/nova/+/81474311:35
sean-k-mooneygibi: the engie facade mocking is similar to what we did in https://review.opendev.org/c/openstack/nova/+/805663/9/nova/tests/unit/db/test_migration.py11:57
sean-k-mooneygibi: in that case we needed to create our own db instance in the test to allow the url to be modifed since that is read only when the engin is created12:00
sean-k-mooneywell configured12:00
sean-k-mooneyso it does kind of make sense to me that any test that do manual db sync might need this type of mocking but i would not exepct the db syncy to actill try to restart the engin facade12:02
sean-k-mooneystephenfin: while your looking at gate issues https://review.opendev.org/c/openstack/nova/+/814036 also ill joing the ptg shortly12:07
sean-k-mooneyoh right we are currently utc +112:13
opendevreviewWenping Song proposed openstack/nova master: Cleanup guest process when vm evacuate failed and then deleted  https://review.opendev.org/c/openstack/nova/+/81474612:15
stephenfinsean-k-mooney: I don't think it's trying to start it. I think it's trying to _configure_ it12:15
stephenfinbauzas: I'll miss the first 30 minutes of the PTG, unfortunately12:16
bauzasstephenfin: OK, then I'll flip some topics12:16
bauzasstephenfin: we'll start with gibi's topic about the tox usage12:17
sean-k-mooneywell i think stephen is interested in that12:17
stephenfinyeah, I was going to say. I can miss it though. I think a few people understand the problem so I'm sure you'll settle on something sensible12:18
sean-k-mooneystephenfin: im hoping py39 will become a required testing env for yoga12:18
sean-k-mooneyjust an fyi so we will need to be able to test with that locally but ya we understand what you were tryign to do12:20
gibistephenfin: ok, so we have global context_manager objects one for the api db and one for the main db. So that is what gets dirty and leaking between tests12:22
gibido we have some special handling for them in a fixture we use for every test case so in general test cases are not interacting? And don't we use the same fixture in the db_sync tests for a reason?12:23
stephenfingibi: Sigh, all the DB tests are using NoDBTestCase, so we're not actually using the Database fixture https://github.com/openstack/nova/blob/master/nova/tests/fixtures/nova.py#L611-L61712:28
stephenfinI still don't know why running other tests first fixes the issue though...12:29
gibihm, so the normal way to deal with these globals is the Database fixture but for explicit DB testing we dont use that fixture12:30
stephenfinthat's my understanding from a quick inspection, but I'm juggling many things today and haven't had time to investigate it properly yet /o\12:32
stephenfinit does seem strange that DB tests would not use the DB fixture though12:32
gibiI think we have a poison fixture that prevent us touching the DB from a NoDBTestCase but it does not hit for the db_sync tests12:32
stephenfinhmm, true12:33
gibiI'm lost in oslo_db / sqla I guess our poision specific for the way nova objects are accessing the db and it still allows our direct get_engine calls12:39
bauzasreminder : nova sessions start in 10 mins12:50
* gibi fetches coffee12:54
bauzassessions start by now :)13:02
bauzasif people wanna come => https://www.openstack.org/ptg/rooms/newton13:03
* stephenfin joins late13:53
stephenfinI assume we're on a break?13:53
gibiyepp13:53
gibistephenfin: you joined for the break :D13:54
stephenfinclassic me13:54
gibi:)13:54
bauzasstephenfin: we basically NACK'd all your ideas while you were gone :p13:55
bauzasterrible me13:55
bauzasbelmoreira: we have an issue with the interop session being discussed one hour later that conflicts the vncproxy discussion we were planning to have at 15:40pm13:58
bauzasbelmoreira: can we make the discussion start at 4pm ?13:58
bauzasdang13:58
bauzasbelmoreira: I meant, can we discuss the vncproxy thing at 3pm ?13:59
bauzas(UTC)13:59
stephenfinbauzas: just another PTG so13:59
stephenfin:)13:59
belmoreirabauzas 3pm utc works for me. thanks13:59
bauzasjust hope melwitt can be present at this time14:00
bauzasbelmoreira: ack, will flip the sessions 14:00
bauzasstephenfin: we'll do neutron things until 3pm UTC14:11
*** ganso_ is now known as ganso14:24
*** vishalmanchanda_ is now known as vishalmanchanda14:24
*** andrewbonney_ is now known as andrewbonney14:26
*** carloss_ is now known as carloss15:00
bauzasdansmith: are you available ? belmoreira is discussing about moving instances between projects and we're talking of os-chown15:07
dansmithbauzas: not right this moment, but I can try to join soon15:08
bauzasdansmith: cool, appreciated.15:08
dansmithbauzas: actually, this next topic is something I have to stay for, so it'll be a bit yet15:11
bauzasok15:11
bauzasno worries15:11
dansmithbauzas: I'm here for 15 mins15:14
bauzasta15:14
dansmithwhich line is the vnc proxy thing?15:17
stephenfindansmith: 39415:17
dansmithah, in the "after 3pm" thing15:18
stephenfinyeah, I've lost track of how we're picking stuff/what we've covered :)15:18
bauzasside note: Jabra Evolve 2 65 are terrible headsets for a day long wearing them15:58
bauzasmy ears are pretty done with it15:58
bauzasstephenfin: yeah, we cherry-picked topics15:58
bauzasI don't wanna move topics too much as it's also confusing15:59
bauzasbrinzhang: when you're up, tell me when you want to discuss the cyborg suspend/resume spec ? we can do this either this Thursday 1pm or Friday 1pm if that suits your TZ17:26
bauzasfor the moment, placing it at the top of the discussions we have left so it would be discuss this at 1pm, but we can punt it for later17:27
bauzasto nova folks, I flipped topics so the agenda should reflect the first topics to discuss, starting at L35717:28
* bauzas calls it a productive day17:29

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!