Tuesday, 2022-06-28

noonedeadpunkjrosser_: should we merge https://review.opendev.org/c/openstack/openstack-ansible/+/847668 ? Commit message says part of debugging, but it seems we can indeed land that to unlbock stuff?06:13
noonedeadpunkalso fwiw I'm mostly off today06:13
jrosser_oh certainlt yes06:14
jrosser_talking with #opendev yesterday it is only really us and tripleo being hurt with POST_FAILURE06:14
jrosser_and extending the timeout lets use see really how long it takes as useful data points06:14
jrosser_and though a bunch of stuff is now passing, it's very unclear if thats because removing journals/ara has made a difference06:15
jrosser_or perhaps just the swift backends are faster again06:15
opendevreviewMerged openstack/openstack-ansible master: Imported Translations from Zanata  https://review.opendev.org/c/openstack/openstack-ansible/+/84770409:36
mgariepymorning.11:16
jrosser_o/ hello11:16
mgariepyhow are things going this morning ?11:17
jrosser_seems that the POST_FAILURES thing may have subsided a bit11:17
jrosser_i have a bunch of semi-related patches to merge, though its unclear if some underlying thing has just got better as well11:18
mgariepywith a lot of moving parts it's often kinda hard to determine what is causing issue11:19
mgariepyi just reviewed the 3hours ones. is there others that need attentions faster than others?11:19
mgariepyotherwise i'll probably spend a couple hours reviewing 11:19
jrosser_i think we need to let those merge then push through a few on stable/yoga11:24
jrosser_but we will be in zuul-queue fail trouble if we go too quickly on that11:24
mgariepywould it be easier to find / browse logs if we had an html page with all the links ?11:25
mgariepylooking at : https://review.opendev.org/c/openstack/openstack-ansible/+/84481711:25
jrosser_not quite following tbh11:27
jrosser_i was trying to see if we were uploading large amounts of nonsense11:27
mgariepywhen debuging patches, often i end up looking for log files in directory.11:28
jrosser_oh right i see, like a fully expanded tree of all the log files11:28
mgariepyyep11:28
jrosser_that would be a suggestion for the zuul people i think, but it would be really great11:29
mgariepyit would make the logs file easier to find..11:29
jrosser_becasue you can't use the browser back button in any sensible way and i find that really annoying11:29
jrosser_we can't do that in our job as we don't know what the upload URL will be11:30
mgariepyyueah in the zuul page you always endup in root dir if going back. and the index directory is not fun when needing look at multiple files.11:30
mgariepywe can probably use relative paths.11:30
jrosser_sometimes i use the 'raw' link as that goes to an old-school directory browser11:31
mgariepyi use the raw ones when i need multiple files since it's annoying otherwise.11:32
jrosser_we do have ~2000 lines in that tree output though :)11:34
jrosser_that could turn into a different kind of unwieldy11:34
*** dviroel|out is now known as dviroel11:37
mgariepyindeed. 2000 lines is quite a long page :D haha11:39
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/stein: Remove periodic jobs  https://review.opendev.org/c/openstack/openstack-ansible/+/84796612:46
opendevreviewMerged openstack/openstack-ansible stable/yoga: Set zuul post-timeout to 3 hours  https://review.opendev.org/c/openstack/openstack-ansible/+/84766813:37
*** dviroel is now known as dviroel|biab13:52
opendevreviewMerged openstack/openstack-ansible stable/yoga: Switch ARA report type from html to database for openstack-ansible jobs  https://review.opendev.org/c/openstack/openstack-ansible/+/84765214:18
*** dviroel|biab is now known as dviroel14:21
opendevreviewMerged openstack/openstack-ansible master: Set zuul post-timeout to 3 hours  https://review.opendev.org/c/openstack/openstack-ansible/+/84779714:33
opendevreviewMerged openstack/openstack-ansible master: Exclude /var/log/journal from log collection  https://review.opendev.org/c/openstack/openstack-ansible/+/84783614:34
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/yoga: Exclude /var/log/journal from log collection  https://review.opendev.org/c/openstack/openstack-ansible/+/84782714:42
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/xena: Exclude /var/log/journal from log collection  https://review.opendev.org/c/openstack/openstack-ansible/+/84782814:42
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/wallaby: Exclude /var/log/journal from log collection  https://review.opendev.org/c/openstack/openstack-ansible/+/84782914:42
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/xena: Set zuul post-timeout to 3 hours  https://review.opendev.org/c/openstack/openstack-ansible/+/84799014:43
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/wallaby: Set zuul post-timeout to 3 hours  https://review.opendev.org/c/openstack/openstack-ansible/+/84799114:43
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_manila stable/yoga: Create backends when running against manila_share  https://review.opendev.org/c/openstack/openstack-ansible-os_manila/+/84799214:44
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_manila stable/xena: Create backends when running against manila_share  https://review.opendev.org/c/openstack/openstack-ansible-os_manila/+/84799314:45
*** dviroel is now known as dviroel|afk|lunch14:52
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_ironic stable/yoga: Updated from OpenStack Ansible Tests  https://review.opendev.org/c/openstack/openstack-ansible-os_ironic/+/84677715:40
jrosser_mgariepy: these are the things which are important for release parentproject:openstack/openstack-ansible branch:^stable/yoga status:open 15:41
jrosser_argh15:41
jrosser_https://review.opendev.org/q/parentproject:openstack/openstack-ansible+branch:%255Estable/yoga+status:open+15:41
jrosser_unless you spot anything else which needs a backport15:42
mgariepyok i'll check these this afternoon, i'm with kids today.15:43
*** dviroel|afk|lunch is now known as dviroel16:23
spatelwhat is going on here? boostrapping new deployment and hit this - https://paste.opendev.org/show/byO9uh7Hh7bTTpPZGNdk/16:52
spatelserver certificate verification failed. CAfile: none CRLfile 16:53
spatelThis is ubuntu 20.0416:53
jrosser_what if you wget or curl the url16:54
spatelno cert error on curl https://opendev.org/openstack/ansible-collections-openstack/ 16:56
spatelre-running bootstrap again and see 16:57
spatelsame error failed again with same error https://paste.opendev.org/show/byO9uh7Hh7bTTpPZGNdk/16:58
spateljrosser_ check this out - https://paste.opendev.org/show/bf03iQiXuppINrufcWls/16:59
jrosser_sanity check - you've done apt update / dist upgrade before this?17:00
spatelI am running 20.04.1 17:01
spateli did apt update 17:01
spatelwhy do i need to dist upgrade?17:01
mgariepyapt upgrade as well ?17:01
jrosser_`apt policy ca-certificates` ?17:02
spatelapt-get install ca-certificates17:02
spatelworks for me 17:02
spatelwhy don't we add "apt-get install ca-certificates" in boostrap script to make sure we have latest certs 17:03
jrosser_so you didnt have that?17:03
jrosser_apt policy would have told you exactly what was going on17:04
jrosser_too late now17:04
spateli don't know how that missed out 17:04
mgariepydo you run in a stripped off container?17:04
spatelThis is brand new deployment 17:04
jrosser_if you really did boot from an image of Ubuntu 20.04.1 then i am not surprised it doesnt work17:05
jrosser_the first thing you need to do on there is apt update / apt dist-upgrade to get a modern ca-certificates (as well as everything else)17:05
spatelwe have 20.04.1 in kickstart which i didn't upgrade yet 17:05
damiandabrowski[m]PS. we have fixed it: https://opendev.org/openstack/openstack-ansible-openstack_hosts/commit/8c4ef001c148dc16c980bfc7ea695311e6dfd61217:06
jrosser_it does not have the root CA that letsencrypt is using17:06
mgariepyhttps://docs.openstack.org/project-deploy-guide/openstack-ansible/latest/deploymenthost.html17:06
mgariepystep 2.17:06
spateldoes apt dist-upgrade will change 20.04 to 22.10 ?17:06
mgariepyno17:06
mgariepydo-release-upgrade would17:06
spateloh ok so dist-upgrade is safe to do 17:07
jrosser_safe is the wrong word, essential would be better :)17:07
mgariepyapt upgrade will install newer pkg without installing new pkg (ex: new dependency) dist-upgrade will upgrade and install new dependency as well.17:07
spateljrosser_ copy that17:08
mgariepywhere did you clone the opesntack-ansible repo from ? github ? 17:08
spatelI used https://github.com17:09
spatelassuming opendev and github both in sync 17:10
mgariepythey should17:11
mgariepyit does explains why your initial clone worked.17:11
spatelYeah! 17:12
spatelDoes anyone using SmartNIC in their deployment for offloading stuff? 17:15
spatelI am planning to buy 2 Mellanox nic to play in lab for learning 17:16
spatelI want to test ASAP specifically 17:17
opendevreviewMerged openstack/openstack-ansible master: Do not try to run etcd in serial mode  https://review.opendev.org/c/openstack/openstack-ansible/+/84736217:18
jrosser_you can do ASAP with a connectx-6dx17:31
jrosser_that would be my choice today for a not-smartnic to do ASAP17:32
jrosser_dmsimard: o/ hello17:32
spateljrosser_  connectx-6dx is very costly for lab :(17:36
jrosser_well do you really mean a smartnic17:36
spatelhow about this one - https://www.fs.com/products/119649.html17:37
jrosser_this might be better connectx-6-lx17:40
jrosser_https://nvdam.widen.net/s/nqbftvdlh7/connectx-6-lx-en-card-product-brief-173100617:40
jrosser_"Full hardware offload for OVS data plane"17:41
jrosser_but i think you need to see really carefully what they actually mean by that17:41
jrosser_if it is just east<>west L217:41
jrosser_or any support for offload of north<>south L317:41
jrosser_and if security groups work at all17:42
jrosser_etc etc17:42
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/yoga: Do not try to run etcd in serial mode  https://review.opendev.org/c/openstack/openstack-ansible/+/84800217:44
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/xena: Do not try to run etcd in serial mode  https://review.opendev.org/c/openstack/openstack-ansible/+/84800317:44
spatelthat is where i am confused so looking for answers from folks who already using them 17:49
spatelI was thinking i can deploy those kind of card on network node to make them gateway for all VMs 17:50
jrosser_you would need the L3 offload to be working for that17:50
jrosser_it would be excellent if that did work17:51
spatelthat is the plan to make network node stronger and perfect gateway for all traffic 17:55
jrosser_tbh i have never seen any documentation of that17:59
opendevreviewMerged openstack/openstack-ansible-os_horizon stable/yoga: Fix ALLOWED_HOSTS  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/84765318:03
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_horizon stable/xena: Fix ALLOWED_HOSTS  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/84800418:05
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_horizon stable/wallaby: Fix ALLOWED_HOSTS  https://review.opendev.org/c/openstack/openstack-ansible-os_horizon/+/84800518:05
opendevreviewMerged openstack/openstack-ansible stable/yoga: Return facts gathering to pre_tasks for masakari  https://review.opendev.org/c/openstack/openstack-ansible/+/84726018:07
spatelIf you Offload whole OVS switch to NIC then it should use NIC for packet processing instead of CPU correct? - https://docs.nvidia.com/networking/display/TAN10/ASAP+OVS+Offload18:10
opendevreviewJonathan Rosser proposed openstack/openstack-ansible stable/xena: Return facts gathering to pre_tasks for masakari  https://review.opendev.org/c/openstack/openstack-ansible/+/84800618:10
*** akahat|ruck is now known as akahat|out18:11
jrosser_spatel: well, there were two things not really done when i looked at this before, security groups and any form of nat18:12
spatelFull OVS offload only support in OVN because OVN use virtual router in flow (doesn't need any namespace like lagacy design)18:12
jrosser_but that was a long tie ago18:12
jrosser_you'd have to try it, or maybe ask in the neutron channel if anyone is doing it18:12
spatelMellanox doc saying it support NAT/firewall 18:12
jrosser_yes but thats not the same as neutron/OVS/OVN programming rules that are offloadable18:13
jrosser_there was always a functionality gap that has been slowly closing over a very long time18:13
jrosser_and it's quite possible that with OVN it just all magically works18:14
jrosser_like in jamesdenton document here https://docs.openstack.org/openstack-ansible-os_neutron/latest/app-openvswitch-asap.html18:14
jrosser_`Hardware offloading is not yet compatible with the openvswitch firewall driver. To ensure flows are offloaded, port security must be disabled`18:15
spatelI believe full OVS offloading only supported by OVN because its pure flow based function 18:15
jrosser_this is a great topic for more lab work and a blog post :)18:15
spatelhmm I am on it.. i really want to prove that so hunting for hardware :) 18:16
jrosser_the newer hardware gets incrementally better, larger flow tables, more offloads etc18:16
jrosser_thats why i'd get the most recent that you can18:16
spatelits freaking costly so i may need approval from my boss :(18:18
jrosser_try and find a price for the 6-Lx18:18
opendevreviewMerged openstack/openstack-ansible stable/yoga: Fix facts gathering for zun  https://review.opendev.org/c/openstack/openstack-ansible/+/84726118:23
lowercaseBTW, if you ever accidentally upgrade the database up a version, you can hack the api version by changing the service number in the python code which will trick the module working on a future api version but old code.18:30
lowercaseor you know... don't do a nova-manage db upgrade until you are ready to upgrade versions >.>18:31
opendevreviewMerged openstack/openstack-ansible-os_manila stable/yoga: Create backends when running against manila_share  https://review.opendev.org/c/openstack/openstack-ansible-os_manila/+/84799220:57
*** dviroel is now known as dviroel|out21:19
opendevreviewMerged openstack/openstack-ansible stable/yoga: Exclude /var/log/journal from log collection  https://review.opendev.org/c/openstack/openstack-ansible/+/84782721:38

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!