Monday, 2024-02-12

tonybcorvus: I noticed that https://zuul.opendev.org/t/openstack/status isn't working as expected.  Anything in the search/filter box doesn't match items I can see in the queue.01:28
fungitonyb: tried to force-refresh your browser?02:08
fungithough i suppose that was mentioned in relationship to stale css02:09
tonybfungi: Yup.  I'll try firefox for a fresh look02:10
tonybSame thing with FF, I notice that 05:03
tonybI suspect it's a 'react' issue; not sure how to debug it05:03
*** ralonsoh_ is now known as ralonsoh08:08
bbezakI can confirm the same issue on safari/chrome/firefox11:26
opendevreviewMerged openstack/project-config master: Implement kolla/kayobe-unmaintained-core groups  https://review.opendev.org/c/openstack/project-config/+/90832114:34
corvustonyb: remote:   https://review.opendev.org/c/zuul/zuul/+/908797 Fix filtering on status page [NEW]        <-- should be fixed there14:58
* clarkb has tea and will dig into reviews for those fixups momentarily16:13
*** dhill is now known as Guest252316:34
clarkbjgit moved from Gerrit's Gerrit to GerritForge. I've asked on discord if the copy that gerrit hosts is an up to date mirror as we fetch jgit from there for our builds16:41
clarkbI think it may be because gerrit uses it as a submodule and having gerrit host submodules simplifies some things. But definitely don't want us to fetch the wrong jgit content16:42
dpanechHi this review is stuck, Zuul jobs aren't being triggered, could someone have a look? 16:42
dpanechhttps://review.opendev.org/c/starlingx/ha/+/90600316:42
corvuslooking16:43
corvusdpanech: it appears to be based on 906002 which is abandoned16:45
clarkbdue to the depends on in the commit message16:46
dpanechOh duh. Thanks16:46
jrosseri'm having trouble with the zuul status screen search box, is that still working?16:47
jrosseri.e. to narrow down to one project or review number16:48
corvusjrosser: fix is in progress16:48
jrosserahha ok :)16:48
clarkbfungi: doesn't look like you updated https://review.opendev.org/c/opendev/system-config/+/908328 with your test results. Can you review that change? and maybe we should land it?19:24
clarkbor we can wait for after the preptg if we're worried about impacting that19:24
fungioh, indeed thanks, i forgot to do so19:25
clarkbthank you!19:27
fungiinfra-root: for those of you using keycloak to authenticate for zuul's admin webui, were you relying on any social auth logins or just username/password? i'm to the point with keycloak03 where we can start adding accounts but i can add specific social auth identity providers first if needed19:39
clarkbI don't think any social providers were added yet but I may be wrong19:40
tonybfungi: I was using username/password.  I don't recall any social buttons20:33
corvusi think i experimentally added something for openstackid, just for exploration, but i haven't used it for realz.20:33
fungitonyb: corvus: thanks for confirming. infra-root: in that case it's open season for adding our accounts to the zuul realm on keycloak03 i suppose. in the short term, until we merge 908357 to change the cname, you'll have to locally override address resolution for keycloak to that of keycloak0320:40
clarkbbecause keycloak does strict name validation headers ya?20:41
fungiyes, and there's a redirect too20:41
fungii should probably log into the current production server and see if it was set up for things like e-mail notifications so i recreate those settings20:47
corvusfungi: i'll probably just wait until the cname changes, thanks!  and i don't think i set up anything like that.  was basically just the tutorial procedure.20:47
clarkbpossibly related: firefox really doesn't like the national forest service website because it apparently tries to serve random things as http?20:48
fungigot it. we'll almost certainly want to set more of these options before we add it to other services, but for now it's something we can get by with for a handful of service admins20:48
clarkball this extra browser validation to protect users is annoying when it impacts perfectly innocent things like looking up fire closure maps20:49
corvusclarkb: it's for your safety!20:57
funginow that 908797 has merged and promoted, should we do a restart of zuul-web containers, or a larger zuul-wide restart?21:13
fungi(well, image pull and container restart i mean)21:14
clarkbfungi: I think we only need the webs to restart in this case21:14
corvusi will restart the web containers21:15
fungithanks corvus!21:15
fungii know sometimes we've also restarted zuul-scheduler containers for good measure, when it was generally only a web-affecting change, so i wasn't sure21:16
corvussince this is strictly js, it falls well below my paranoia threshold for that21:17
fungimakes sense21:17
clarkbplease add content to the infra meeting agenda if you've got it and update the preptg etherpad with your interest and topics too21:18
corvus#status log restarted zuul-web to pick up webui fixes21:28
opendevstatuscorvus: finished logging21:28
fungiconfirmed the filtering seems to be working for me now21:29
corvusthose should be in effect now21:29
fungitonyb: bbezak: jrosser: ^ please check again21:29
jrosserlooks like it works for me searching by project or review number now, thanks for the fix21:33
clarkbluca reports that the jgit in gerrit.googlesource should be an up to date mirror. It's just not where dev happens anymore. This means we don't need to update our image builds21:37
clarkbI feel like the meeting agenda is so empty possibly because we're keeping the bigger agenda stuff for the preptg22:19
fungii saw a deploy failure for infra-prod-service-tracing on friday. looks like it happens occasionally due to reaching a 2-minute timeout waiting for port 16686 to be listening on the loopback address22:37
fungishould we increase that? or is it possibly a sign of a deeper problem with jaeger22:37
clarkbI suspect that when we update the image the container startup may not always be fast and we should increase the timeout22:38
fungithat was my first instinct, but wanted to double-check22:38
corvuscould be, but i'd be surprised if jaeger takes a long time to start; i feel like that warrants some log reading22:38
corvus(maybe it's related to our local data storage size though?)22:39
fungi`grep 'Timeout when waiting' /var/log/ansible/service-tracing.yaml.log.*` turns up "recent" occurrences at 2024-01-18T04:44:01 and 2024-02-09T21:05:2622:39
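The grep above can be approximated in Python when the matches need further processing. This is a minimal sketch, not part of the deployment tooling: the function name `find_timeouts` is hypothetical, and nothing is assumed about the log line format beyond the search string quoted in the grep.

```python
import glob


def find_timeouts(pattern, needle="Timeout when waiting"):
    """Return (path, line) pairs for every log line that contains needle."""
    hits = []
    for path in sorted(glob.glob(pattern)):
        # errors="replace" keeps the scan going if a log has stray bytes.
        with open(path, errors="replace") as handle:
            for line in handle:
                if needle in line:
                    hits.append((path, line.rstrip("\n")))
    return hits


if __name__ == "__main__":
    # Same glob as the grep command in the log.
    for path, line in find_timeouts("/var/log/ansible/service-tracing.yaml.log.*"):
        print(f"{path}: {line}")
```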
fungilooks like we don't set up any additional logging for that container22:40
clarkbya I suspect it only happens when we update the image version. The latest version is from today (so will run in a few hours) and the one before that is from 6 days ago. The weird thing about those timestamps is that it doesn't seem to align with a periodic job run. Maybe it was tripped by the keycloak inventory updates22:41
corvusindeed looks like last startup took 131 seconds22:41
corvusfrom: "level":"info","ts":1707512780.2357833,"caller":"flags/service.go:119","msg":"Mounting metrics handler on admin server","route":"/metrics"22:42
corvusto: "level":"info","ts":1707512911.3811984,"caller":"app/server.go:284","msg":"Starting HTTP server","port":16686,"addr":":16686"22:42
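The 131-second figure follows from subtracting the two `ts` values quoted above. A small sketch of that arithmetic, assuming each quoted fragment is the body of a JSON object (the surrounding braces are added here; the helper name `startup_seconds` is hypothetical):

```python
import json

# The two log entries quoted above, wrapped in braces to form valid JSON.
first = (
    '{"level":"info","ts":1707512780.2357833,"caller":"flags/service.go:119",'
    '"msg":"Mounting metrics handler on admin server","route":"/metrics"}'
)
last = (
    '{"level":"info","ts":1707512911.3811984,"caller":"app/server.go:284",'
    '"msg":"Starting HTTP server","port":16686,"addr":":16686"}'
)


def startup_seconds(first_line, last_line):
    """Seconds elapsed between two JSON log lines, using their "ts" fields."""
    return json.loads(last_line)["ts"] - json.loads(first_line)["ts"]


elapsed = startup_seconds(first, last)
print(f"startup took about {elapsed:.0f} seconds")  # about 131 seconds
```

That is roughly 11 seconds past the 2-minute (120-second) timeout mentioned earlier.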
fungiaha, yep, i found the same looking at docker-compose logs22:43
corvusand it does seem like most of the slow bits are badger related; like compacting writeahead logs, etc,22:43
corvusso yeah, i guess that's our non-optimized local storage22:44
corvusprobably bumping timeout is fine for our current level of concern with this service then :)22:44
fungishould i up it to 180? i increased a similar timeout on the keycloak deploy to 300 just to be safe, because it does a container rebuild each time it's started22:44
corvusi'd say bump to 300?22:45
clarkb++22:46
opendevreviewJeremy Stanley proposed opendev/system-config master: Increase Jaeger start timeout to 300  https://review.opendev.org/c/opendev/system-config/+/90886722:47
opendevreviewMerged opendev/system-config master: Increase Jaeger start timeout to 300  https://review.opendev.org/c/opendev/system-config/+/90886723:44
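For illustration, the kind of check that timed out (waiting for port 16686 to accept connections on the loopback address) can be sketched in Python. This is not the check the deployment itself runs; the function name `wait_for_port` and the 1-second polling interval are assumptions, with the 300-second default mirroring the new timeout.

```python
import socket
import time


def wait_for_port(port, host="127.0.0.1", timeout=300.0, interval=1.0):
    """Return True once host:port accepts a TCP connection, False on timeout."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Refused or timed out: retry until the deadline passes.
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)


if __name__ == "__main__":
    # Short timeout here so a manual run returns quickly.
    print("jaeger ui port ready:", wait_for_port(16686, timeout=5.0))
```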
tonybfungi, corvus: filtering looks good to me.  Thanks!23:57
