Saturday, 2021-10-30

-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed on behalf of Tobias Henkel: [zuul/zuul] 816072: Increase load_multiplier in tests https://review.opendev.org/c/zuul/zuul/+/81607200:20
-@gerrit:opendev.org- Zuul merged on behalf of Tobias Henkel: [zuul/zuul] 816072: Increase load_multiplier in tests https://review.opendev.org/c/zuul/zuul/+/81607201:55
@jim:acmegating.comwe seem to mostly be looking at dockerhub failures at this point02:18
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 816073: Cancel stats election on shutdown https://review.opendev.org/c/zuul/zuul/+/81607306:28
-@gerrit:opendev.org- Zuul merged on behalf of Simon Westphahl: [zuul/zuul] 815450: Create bundle items during queue deserialization https://review.opendev.org/c/zuul/zuul/+/81545006:33
-@gerrit:opendev.org- Zuul merged on behalf of Simon Westphahl: [zuul/zuul] 815495: Fix Gerrit change (de-)serialization https://review.opendev.org/c/zuul/zuul/+/81549506:45
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 815565: Remove unecessary assignment in re-enqueue https://review.opendev.org/c/zuul/zuul/+/81556506:45
-@gerrit:opendev.org- Zuul merged on behalf of Simon Westphahl:06:52
- [zuul/zuul] 815616: Only reset the pipeline state if needed https://review.opendev.org/c/zuul/zuul/+/815616
- [zuul/zuul] 815617: Ensure same layout UUID across schedulers https://review.opendev.org/c/zuul/zuul/+/815617
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com:06:52
- [zuul/zuul] 815744: Use a metaclass to deserialize event objects https://review.opendev.org/c/zuul/zuul/+/815744
- [zuul/zuul] 815764: Add a pipeline change list object to ZK https://review.opendev.org/c/zuul/zuul/+/815764
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 815916: Reduce use of OrderedDict in PipelineState https://review.opendev.org/c/zuul/zuul/+/81591608:26
-@gerrit:opendev.org- Zuul merged on behalf of Matthieu Huin https://matrix.to/#/@mhuin:matrix.org: [zuul/zuul] 735586: Zuul-web: Add authentication-realm attribute to tenants https://review.opendev.org/c/zuul/zuul/+/73558609:25
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/zuul] 816088: Pin CacheControl to <0.12.7 https://review.opendev.org/c/zuul/zuul/+/81608809:33
@tobias.henkel:matrix.orgcorvus, Clark : this just broke our update today ^09:33
-@gerrit:opendev.org- Zuul merged on behalf of Matthieu Huin https://matrix.to/#/@mhuin:matrix.org: [zuul/zuul] 736968: zuul-web: add pipelines' manager, triggers data in status https://review.opendev.org/c/zuul/zuul/+/73696809:34
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/zuul] 816088: Pin CacheControl to <0.12.7 https://review.opendev.org/c/zuul/zuul/+/81608809:34
-@gerrit:opendev.org- Zuul merged on behalf of Felix Edel:09:50
- [zuul/zuul] 760806: UI: Add actions and reducers to retrieve components https://review.opendev.org/c/zuul/zuul/+/760806
- [zuul/zuul] 760807: UI: Add components page https://review.opendev.org/c/zuul/zuul/+/760807
@tobias.henkel:matrix.orgcorvus, Clark : fyi, I've created an upstream issue in cachecontrol: https://github.com/ionrock/cachecontrol/issues/26310:06
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 815917: Update Pipeline for symmetry https://review.opendev.org/c/zuul/zuul/+/81591711:14
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 813255: Perform actual HTTP requests in gitlab tests https://review.opendev.org/c/zuul/zuul/+/81325512:42
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com:12:51
- [zuul/zuul] 813256: Add gitlab disable_connection_pool option https://review.opendev.org/c/zuul/zuul/+/813256
- [zuul/zuul] 813289: Stop GerritWebServer in tests https://review.opendev.org/c/zuul/zuul/+/813289
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 815979: Use activeContext instead of explicit _save calls https://review.opendev.org/c/zuul/zuul/+/81597914:07
@jim:acmegating.comtobiash: i went ahead and approved the cachecontrol change14:18
@jim:acmegating.comand hey, it looks like all the sos pipeline stuff merged, along with the components page; that will be handy :)14:19
@jim:acmegating.comas soon as i can focus a little better, i'll restart opendev14:19
-@gerrit:opendev.org- Zuul merged on behalf of Tobias Henkel: [zuul/zuul] 816088: Pin CacheControl to <0.12.7 https://review.opendev.org/c/zuul/zuul/+/81608815:33
@jim:acmegating.comi'm restarting opendev's zuul, and the first thing i note is that we may want to remove some of the extra change cache debug log lines i added.  there's a bunch of logs at startup as it adds 70k watches15:43
@jim:acmegating.comthe second thing is i'd like to know why there are 70k watches15:43
@jim:acmegating.comerm, did the fonts on the status page change?15:54
@jim:acmegating.comthere's something weird with the layout in the boxes too15:55
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 816094: Adjust spacing on status page https://review.opendev.org/c/zuul/zuul/+/81609416:04
@jim:acmegating.comthat's a quick fix for the status page spacing16:05
@jim:acmegating.comi'm starting a second scheduler now16:06
@jim:acmegating.comthe first thing that happens is 70k lines about adding elements to the change cache (sorry)16:06
@jim:acmegating.comit's loading the config right now16:09
@jim:acmegating.com(i wonder if we should add a new run state for priming config?)16:09
@jim:acmegating.comit is currently handling some gearman jobs (like status requests) which it shouldn't16:10
@jim:acmegating.combut that's not a long term problem; we'll be removing that functionality16:10
@jim:acmegating.comit's processing pipelines now16:16
@jim:acmegating.comapparently successfully16:16
@tobias.henkel:matrix.orgyay :)16:17
@jim:acmegating.comit processed a completed build16:17
@jim:acmegating.com2021-10-30 16:16:26,280 DEBUG zuul.Pipeline.openstack.check: Build <Build 6a8f03bb9ff0432bbee0698215fec9ff of nova-tox-functional-py39 voting:False> of <Change 0x7fa33e71f880 openstack/nova 806683,17> completed16:17
@jim:acmegating.comhttps://zuul.opendev.org/t/openstack/build/6a8f03bb9ff0432bbee0698215fec9ff16:18
@jim:acmegating.comthe first build to be started by one scheduler and finished by another? :)16:18
@jim:acmegating.comjust confirmed in the logs that 02 did not process the completion event, only 01 did16:20
@jim:acmegating.comnot much happening now... saturday... :)16:22
@jim:acmegating.com2021-10-30 16:24:05,903 DEBUG zuul.Scheduler: Skipping locked pipeline check in tenant openstack16:24
@jim:acmegating.comthat's a good sign16:24
@jim:acmegating.comand on the other scheduler immediately afterwords:16:26
@jim:acmegating.com2021-10-30 16:24:06,125 DEBUG zuul.Scheduler: Skipping locked pipeline gate in tenant openstack16:26
@jim:acmegating.comone thing we may want to think about: the schedulers can get into a pattern where one follows the other, so we might see one process a pipeline and then the other immediately process the same one.  of course, the second one is fast since there are no events left to process.  This may be no big deal and we can just leave it alone; or we could consider skipping the actual pipeline processing if there are no events.16:31
@jim:acmegating.comi think we should leave it, unless we find that it uses too much cpu time, then consider optimizing.16:32
@tobias.henkel:matrix.orgOtherwise when having many pipelines this would reduce locking overhead consuderably16:37
@tobias.henkel:matrix.org* Otherwise when having many pipelines this would reduce locking overhead considerably16:37
@jim:acmegating.comthis error happened: https://paste.opendev.org/show/810298/16:41
@jim:acmegating.comit's not critical; i wonder if that's a race on 2 schedulers16:41
@jim:acmegating.comonly one of them logged that error; the other just noted it was deleting an entry from the cache16:42
@tobias.henkel:matrix.orgDoes the cleanup do locking?16:42
@jim:acmegating.comthe general cleanup/connection cache maintenance does, so there was only one copy of that running at that time.16:48
@jim:acmegating.comi don't know why it had a cache entry with no data though.  i don't see any cache removals near then.  so i don't understand why that happened16:49
@jim:acmegating.comthat was the only general cleanup that's run since we started, so it may have been cleaning up an old error in zk.16:58
@jim:acmegating.com(and that cache key was in zk before startup; it was not added after the restart)16:58
@jim:acmegating.comi think i'm going to leave this running as long as possible today and see if anything major breaks.  we know that the branch cache is going to get out of date, but that shouldn't be a big deal for a relatively short period of time on a not-too-busy system.17:09
@clarkb:matrix.orgcorvus I guess it went well. Exciting. Can you expand a bit on the branch cache problem? I'm curious to know what we are missing there.18:01
@jim:acmegating.comClark: yes, is here: https://review.opendev.org/81578818:13
@jim:acmegating.comi still haven't dived into it yet, but that's next i think.18:13

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!