Monday, 2023-01-30

*** ministry is now known as __ministry [03:31]
*** blarnath is now known as d34dh0r53 [05:06]
plibeau: hello guys, if you have some time to review, please: https://review.opendev.org/c/openstack/nova/+/861172 [09:27]
*** elodilles is now known as elodilles_afk [09:39]
*** elodilles_afk is now known as elodilles [11:12]
opendevreview: Merged openstack/nova master: Fix rescue volume-based instance  https://review.opendev.org/c/openstack/nova/+/852737 [11:17]
elodilles: bauzas: hi, you've told me to ping you about the ussuri and train gate issue (failed to build cryptography bcrypt). If you have any memories about this error then please let me know :) an example: https://zuul.opendev.org/t/openstack/build/0bc1bf3540c5446fa0026558f30fbec8 [12:26]
opendevreview: Rajesh Tailor proposed openstack/nova stable/zed: Handle InstanceInvalidState exception  https://review.opendev.org/c/openstack/nova/+/872115 [12:46]
opendevreview: Rajesh Tailor proposed openstack/nova stable/zed: Fix rescue volume-based instance  https://review.opendev.org/c/openstack/nova/+/872116 [12:46]
opendevreview: Rajesh Tailor proposed openstack/nova stable/yoga: Handle InstanceInvalidState exception  https://review.opendev.org/c/openstack/nova/+/872117 [13:01]
opendevreview: Rajesh Tailor proposed openstack/nova stable/yoga: Fix rescue volume-based instance  https://review.opendev.org/c/openstack/nova/+/872118 [13:02]
sean-k-mooney: elodilles: we used to work around that by not building it and using the prebuilt wheel [13:04]
sean-k-mooney: elodilles: the cryptography module added rust as a build dep [13:04]
sean-k-mooney: which is not installed by devstack [13:04]
sean-k-mooney: without rust the module could not be built and installed in the tempest jobs [13:04]
sean-k-mooney: we avoided the need to do that by using the prebuilt wheels [13:05]
sean-k-mooney: it sounds like either those have been pulled from pypi or are not in the wheel cache available in the providers for the ci [13:05]
elodilles: sean-k-mooney: oh. nice. good question how to solve that then in upstream ci... :S [13:07]
sean-k-mooney: the simplest way to fix this is to use rustup to install rust so it can compile from source [13:07]
sean-k-mooney: we might have a role for that [13:08]
sean-k-mooney: we do https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/ensure-rust [13:09]
elodilles: oh, that sounds better then \o/ [13:09]
elodilles: we just have to add it for failing jobs on the failing branches then i guess [13:10]
sean-k-mooney: add it as, or to, the pre playbook, ya [13:10]
sean-k-mooney: that or have devstack do it [13:10]
sean-k-mooney: but the role is probably simpler [13:10]
sean-k-mooney: it looks like it will have rustup install the latest stable version of the toolchain [13:11]
*** tkajinam is now known as Guest2882 [13:11]
sean-k-mooney: https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/ensure-rust/defaults/main.yaml [13:11]
sean-k-mooney: so that probably will do what we want out of the box [13:11]
elodilles: cool, thx, i'll try out this \o/ [13:12]
*** dasm|off is now known as dasm [13:57]
opendevreview: Elod Illes proposed openstack/nova master: DNM: CI test  https://review.opendev.org/c/openstack/nova/+/872178 [14:38]
opendevreview: Elod Illes proposed openstack/nova stable/ussuri: DNM: CI test  https://review.opendev.org/c/openstack/nova/+/872184 [14:57]
* bauzas trying to reboot my knowledge on the CPU series I did one month ago, but it's difficult :( [15:00]
artom: So a CPU is a central processing unit [15:03]
artom: It's found inside computers and other electronics [15:03]
opendevreview: ribaudr proposed openstack/os-traits master: Add 'COMPUTE_SHARE_LOCAL_FS'  https://review.opendev.org/c/openstack/os-traits/+/872185 [15:03]
sahid: artom: ahaha :-) [15:03]
sahid: bauzas: it's regarding the online/offline cpus? [15:03]
bauzas: yes [15:03]
sahid: cool, interesting work [15:04]
sahid: i will try to continue my review on it [15:05]
opendevreview: Jorge San Emeterio proposed openstack/nova master: Dividing global privsep profile  https://review.opendev.org/c/openstack/nova/+/871729 [15:06]
sahid: i was just not sure about the files that you have added in one of your patches to simulate hw ;) [15:06]
opendevreview: Jorge San Emeterio proposed openstack/nova master: WIP: Moving privsep profiles to nova/__init__.py  https://review.opendev.org/c/openstack/nova/+/872010 [15:13]
dansmith: sean-k-mooney: I think I better put this host/node consistency check in its own patch after the last one in the series, because it seems to be death by a thousand cuts on the functional tests [15:23]
dansmith: I also wonder if it might be better to just mock it out completely for functional [15:23]
dansmith: because basically every place we do a compute restart type operation, we'll fail since we don't have the node uuid thing mocked out to what it needs to be at the time [15:24]
opendevreview: Jorge San Emeterio proposed openstack/nova master: WIP: Moving privsep profiles to nova/__init__.py  https://review.opendev.org/c/openstack/nova/+/872010 [15:32]
opendevreview: Aaron S proposed openstack/nova master: Add further workaround features for qemu_monitor_announce_self  https://review.opendev.org/c/openstack/nova/+/867324 [15:46]
elodilles: sean-k-mooney: unfortunately ensure-rust does not work: https://zuul.opendev.org/t/openstack/build/f2efc09c770a448297fe90cd73191d5a [15:54]
elodilles: sean-k-mooney: but meanwhile i've checked and the problem is the old pip version. [15:55]
elodilles: sean-k-mooney: with the old pip version i was able to reproduce the 'missing rust' error, and after upgrading pip it installed cryptography fine [15:56]
sean-k-mooney: ack [16:05]
bauzas: sean-k-mooney: maybe you haven't seen but I created a Core API for the CPU states https://review.opendev.org/c/openstack/nova/+/868236/2/nova/virt/libvirt/cpu/core.py#47 [16:09]
bauzas: sean-k-mooney: but I can modify both modules to rather only have one Core API that would directly call the filesystem instead of calling the backend module [16:10]
sean-k-mooney: ill take a look later [16:41]
sean-k-mooney: elodilles: ack, so using new pip might correct the issue, provided that does not break anything with the new resolver logic [16:41]
elodilles: sean-k-mooney: yes. though i have a fear that it would break something :/ [16:45]
elodilles: sean-k-mooney: anyway, i think the prebuilt wheel disappeared from pypi, that's why we are facing this issue now :( [16:47]
elodilles: (or we use a different pip version now than before, but I haven't found any proof for that yet) [16:53]
sean-k-mooney: ya i suspect it disappeared too [16:53]
opendevreview: Merged openstack/nova master: Persist existing node uuids locally  https://review.opendev.org/c/openstack/nova/+/863918 [18:29]
opendevreview: Merged openstack/nova master: Fix huge-page doc  https://review.opendev.org/c/openstack/nova/+/869689 [18:29]
dansmith: sean-k-mooney: I thought on friday you said you were happy with my answer here: https://review.opendev.org/c/openstack/nova/+/863919/12 [18:41]
dansmith: but I also thought you commented on it, which I don't see... do we need more on that? [18:41]
sean-k-mooney: i think that was in context of https://review.opendev.org/c/openstack/nova/+/863918/12 [18:41]
sean-k-mooney: i have not looked at the rt one since then [18:42]
sean-k-mooney: as in i looked at it before we talked on irc/meet [18:42]
sean-k-mooney: ill try and loop back to it again shortly [18:42]
dansmith: okay just wanted to make sure I wasn't missing some response or action [18:43]
sean-k-mooney: no i think we can likely proceed with that for now [18:44]
sean-k-mooney: https://review.opendev.org/c/openstack/nova/+/863919/12/nova/tests/unit/compute/test_resource_tracker.py#1553 [18:44]
dansmith: ack [18:44]
sean-k-mooney: i was expecting to be blocked because i was expecting us to block hypervisor_hostname changes, which we agreed should be done but is out of scope of the spec [18:44]
sean-k-mooney: im ok to defer that to after the current series of patches as either a bug or something we do next cycle [18:45]
dansmith: I've been working all morning (and some on friday) trying to get the consistency check (i.e. "are there any nodes that claim to be mine that aren't in my list of uuids") to pass functional tests [18:48]
dansmith: but I'm about to slam a body part in a door so I have an excuse to stop working on it.. this stuff is such a tangled mess [18:49]
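
A minimal sketch of the consistency check dansmith describes above ("are there any nodes that claim to be mine that aren't in my list of uuids"). It is not the code from the patch under review: `NodeRecord` and `find_unexpected_nodes` are hypothetical names, and the node list stands in for whatever the database returns.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeRecord:
    """Hypothetical stand-in for a compute node row in the database."""
    uuid: str
    host: str
    hypervisor_hostname: str


def find_unexpected_nodes(db_nodes, my_host, my_uuids):
    """Return nodes that claim my_host but whose UUID is not known locally."""
    known = set(my_uuids)
    return [n for n in db_nodes if n.host == my_host and n.uuid not in known]


# Illustrative data only: two nodes claim "compute1", but just one UUID is local.
nodes = [
    NodeRecord("uuid-a", "compute1", "compute1.example"),
    NodeRecord("uuid-b", "compute1", "compute1-old.example"),
    NodeRecord("uuid-c", "compute2", "compute2.example"),
]
unexpected = find_unexpected_nodes(nodes, "compute1", ["uuid-a"])
print([n.uuid for n in unexpected])  # prints ['uuid-b']
```

A non-empty result would be the condition under which a service could refuse to start. What the real check does with it is up to the patch.
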
sean-k-mooney: i assume as a patch on the end of the series [18:49]
sean-k-mooney: i was hoping we could just tweak _ensure_existing_node_identity [18:49]
sean-k-mooney: but im guessing that when we do that it has test fallout [18:49]
sean-k-mooney: and you have been trying to resolve that [18:50]
dansmith: I don't want to put the logic in there specifically because that's the migration code, which we might remove one day once migrations are long out of our support envelope [18:51]
dansmith: but that has nothing to do with the difficulty [18:51]
sean-k-mooney: oh _ensure_existing_node_identity is only called there ok [18:51]
dansmith: I can push up what I have if you want and you can try to get things to pass [18:52]
sean-k-mooney: sure i can take a look at it [18:52]
sean-k-mooney: ill proceed with the RT change and the last two patch reviews now [18:53]
opendevreview: Dan Smith proposed openstack/nova master: Make resource tracker use UUIDs instead of names  https://review.opendev.org/c/openstack/nova/+/863919 [18:53]
opendevreview: Dan Smith proposed openstack/nova master: Fixup patch for stable-compute-uuid series  https://review.opendev.org/c/openstack/nova/+/871848 [18:53]
opendevreview: Dan Smith proposed openstack/nova master: Detect host renames and abort startup  https://review.opendev.org/c/openstack/nova/+/863920 [18:53]
opendevreview: Dan Smith proposed openstack/nova master: TURBOWIP: Add a host/node consistency check  https://review.opendev.org/c/openstack/nova/+/872204 [18:53]
sean-k-mooney: heh ok [18:54]
sean-k-mooney: i was going to suggest not modifying the make RT one [18:54]
dansmith: I had to rebase to submit [18:54]
dansmith: it's otherwise unchanged [18:54]
sean-k-mooney: ya gerrit agrees so gibi's +2 is still there [18:55]
sean-k-mooney: i see, thats not exactly what i was expecting [19:00]
sean-k-mooney: https://review.opendev.org/c/openstack/nova/+/872204/1/nova/compute/manager.py#1556 [19:00]
dansmith: well, sorry to disappoint you [19:01]
sean-k-mooney: i was expecting you to get the compute node object and see if its hypervisor hostname is the same as what the driver currently says it is [19:02]
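
A minimal sketch of the comparison sean-k-mooney says he expected: take the hypervisor hostname recorded for each node and compare it with what the driver reports now. The function name and the plain-dict inputs are assumptions for illustration, not nova's objects or driver API.

```python
def find_renamed_nodes(recorded, reported):
    """Compare recorded hypervisor hostnames with the driver's current view.

    Both arguments are hypothetical dicts mapping node UUID -> hostname.
    Returns (uuid, old_hostname, new_hostname) for every node whose
    hostname differs. Nodes the driver does not report are skipped.
    """
    renamed = []
    for uuid, old_name in sorted(recorded.items()):
        new_name = reported.get(uuid)
        if new_name is not None and new_name != old_name:
            renamed.append((uuid, old_name, new_name))
    return renamed


# Illustrative data only: /etc/hostname changed from X to Y on one node.
recorded = {"uuid-a": "host-x.example", "uuid-b": "host-b.example"}
reported = {"uuid-a": "host-y.example", "uuid-b": "host-b.example"}
print(find_renamed_nodes(recorded, reported))
```

This keys on the UUID, so it only helps once the node's UUID is stable. It would not catch the case dansmith raises next, where a newly generated UUID hides the rename.
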
sean-k-mooney: i think this covers the first two cases [19:03]
sean-k-mooney: on line 1563 [19:03]
dansmith: but that's not going to catch the case you were concerned about.. where we have a different (generated) uuid, so we think we should create a compute node, but we fail because of the UC on the hostname [19:03]
sean-k-mooney: i was concerned about 2 things [19:03]
sean-k-mooney: i think this covers test 3 and 4 https://etherpad.opendev.org/p/Stable-compute-uuid-manual-testing#L77 [19:04]
sean-k-mooney: im unsure about 9 and 10 [19:04]
sean-k-mooney: https://etherpad.opendev.org/p/Stable-compute-uuid-manual-testing#L216 [19:05]
dansmith: the third bullet will result in us looking up both our node and the conflicting host's node, but the uuids won't match what we expect [19:05]
sean-k-mooney: a change in the hostname does not mean CONF.host changes or our uuid does, but i can check your code now and confirm if it resolves all 4 test cases [19:06]
dansmith: I dunno what test 9 is supposed to be because there's no explanation other than "$old_hostname" so I'm not sure if this addresses that or not [19:07]
dansmith: I think it *does* address what I listed as things it should address, [19:07]
sean-k-mooney: that you change /etc/hostname from X to Y but CONF.host is hardcoded to X [19:07]
dansmith: the latter of which is setting CONF.host to be the same for two separate compute services [19:07]
dansmith: that's the hypervisor_hostname thing right? this doesn't address that [19:08]
sean-k-mooney: thats the hypervisor hostname thing yes [19:08]
dansmith: I haven't even gotten there yet [19:08]
sean-k-mooney: so thats what happened in old OOO [19:08]
sean-k-mooney: ack [19:08]
sean-k-mooney: tripleo started hardcoding CONF.host before they hardcoded /etc/hostname to the same value [19:09]
sean-k-mooney: so 9 is basically testing that old case [19:10]
dansmith: that's cool, but that's not what I'm trying to address with this patch, nor what even the series was trying to address. that's more in the scope of some of the later things in the top-level spec [19:10]
dansmith: I think we can address it here, once we have a stable "who the fsck am I?" notion, but I haven't gotten there yet [19:10]
sean-k-mooney: ack [19:11]
sean-k-mooney: so ill retest, and with your last patch i should expect it to handle 3 and 4 (file delete and file with wrong uuid), correct? [19:11]
sean-k-mooney: and 9/10 should still be out of scope [19:11]
dansmith: probably, but since I haven't gotten the *existing* tests to pass I have no idea [19:12]
dansmith: I thought you said you wanted to look at the test failures, which is why I pushed it up [19:12]
sean-k-mooney: dansmith: i can do that too [19:12]
sean-k-mooney: but i have your code deployed so i can quickly test those edge cases first [19:13]
gmann: gibi: updated placement rbac change for your comment, please check (in case you missed my previous ping) https://review.opendev.org/c/openstack/placement/+/865618 [20:00]
sean-k-mooney: there is a security team irc right? [20:08]
sean-k-mooney: im trying to calculate the CVSSv3 score for the vmdk CVE [20:08]
sean-k-mooney: i come up with 8.2 [20:08]
dansmith: fungi: ^ [20:10]
fungi: #openstack-security yeah [20:10]
sean-k-mooney: ack i can ask there but you said this was not normally done by the security team [20:11]
sean-k-mooney: https://nvd.nist.gov/vuln-metrics/cvss/v3-calculator?vector=AV:N/AC:H/PR:L/UI:N/S:C/C:H/I:N/A:H&version=3.1 [20:11]
sean-k-mooney: that is what i came up with [20:11]
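
The 8.2 figure can be reproduced from the vector in the calculator link. The sketch below uses the base-score equations and metric weights from the CVSS v3.1 specification. The function names are illustrative, and it handles base metrics only (no temporal or environmental scoring).

```python
import math

# Metric weights from the CVSS v3.1 specification (base metrics only).
AV = {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2}
AC = {"L": 0.77, "H": 0.44}
PR_UNCHANGED = {"N": 0.85, "L": 0.62, "H": 0.27}
PR_CHANGED = {"N": 0.85, "L": 0.68, "H": 0.5}
UI = {"N": 0.85, "R": 0.62}
CIA = {"H": 0.56, "L": 0.22, "N": 0.0}


def roundup(value):
    """Round up to one decimal place, as defined in the v3.1 specification."""
    scaled = int(round(value * 100000))
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector):
    """Compute the base score for a vector such as 'AV:N/AC:H/.../A:H'."""
    m = dict(part.split(":") for part in vector.split("/"))
    changed = m["S"] == "C"
    pr = (PR_CHANGED if changed else PR_UNCHANGED)[m["PR"]]
    iss = 1 - (1 - CIA[m["C"]]) * (1 - CIA[m["I"]]) * (1 - CIA[m["A"]])
    if changed:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss - 0.02) ** 15
    else:
        impact = 6.42 * iss
    exploitability = 8.22 * AV[m["AV"]] * AC[m["AC"]] * pr * UI[m["UI"]]
    if impact <= 0:
        return 0.0
    total = impact + exploitability
    if changed:
        total *= 1.08
    return roundup(min(total, 10))


print(base_score("AV:N/AC:H/PR:L/UI:N/S:C/C:H/I:N/A:H"))  # prints 8.2
```

Here the impact sub-score is about 5.76 and exploitability about 1.78. The changed scope multiplies their sum by 1.08, giving about 8.14, which rounds up to 8.2.
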
opendevreview: Dan Smith proposed openstack/nova master: Fixup patch for stable-compute-uuid series  https://review.opendev.org/c/openstack/nova/+/871848 [21:31]
opendevreview: Dan Smith proposed openstack/nova master: Detect host renames and abort startup  https://review.opendev.org/c/openstack/nova/+/863920 [21:31]
opendevreview: Dan Smith proposed openstack/nova master: Check our nodes for hypervisor_hostname changes  https://review.opendev.org/c/openstack/nova/+/872220 [21:31]
opendevreview: Rafael Weingartner proposed openstack/nova-specs master: spec: nova Dynamic scheduler system  https://review.opendev.org/c/openstack/nova-specs/+/872225 [21:40]