Tuesday, 2022-08-30

-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/85455600:13
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/85455600:26
@iwienand:matrix.orgcorvus: / anybody -- how does the spacing of the > pull-outs look on the console of https://fc7ee29928d7d35d8931-689502246b0e06bff85a4de58b84e796.ssl.cf2.rackcdn.com/854556/6/check/zuul-build-dashboard-opendev/943bb51/npm/html/ ?  00:47
-@gerrit:opendev.org- Zuul merged on behalf of Ian Wienand: [zuul/zuul] 854555: web: fix package task results in console https://review.opendev.org/c/zuul/zuul/+/85455501:45
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 855107: web: console -- clarify package: comment https://review.opendev.org/c/zuul/zuul/+/85510701:52
@iwienand:matrix.orgI've noticed that zuul-tox-remote is failing fairly randomly03:44
@iwienand:matrix.orgi've had a look at the last ~6 failures that appear unrelated to anything happening in the change03:45
@iwienand:matrix.orghttps://etherpad.opendev.org/p/zuul-tox-remote-2022-0803:45
@iwienand:matrix.orgthey all seem to fail in test_command @ https://opendev.org/zuul/zuul/src/branch/master/tests/remote/test_remote_zuul_stream.py#L11703:47
@iwienand:matrix.orgthis is a mixin -- 4 times it was tests.remote.test_remote_zuul_stream.TestZuulStream29, twice TestZuulStream2803:48
@iwienand:matrix.orgTestZuulStream5 did not seem to fail afaics03:48
@iwienand:matrix.orgi've extracted the end of executor job output logs before it got killed for about 4 of them, results in the etherpad.  they all appear to be at different points03:49
@iwienand:matrix.orgall of them just seem to stop, then ~2 minutes later are aborted03:50
@iwienand:matrix.orgi know we've had that grantpt() deadlock, and also Albin Vass has seen some too.  it does make me wonder if that's related03:56
@iwienand:matrix.orga backtrace would certainly help.  but i'd have to catch a failing job's ansible while it's in the two-minute quiescent period before it's reaped, i think03:58
@iwienand:matrix.org... on the remote side too ...03:59
@iwienand:matrix.orgalthough it's ssh-ing to itself04:01
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 855116: [dnm] zuul-tox-remote : try to dump some sort of backtraces https://review.opendev.org/c/zuul/zuul/+/85511604:25
@avass:vassast.orgianw: yeah, we got "lucky" that the job hung in cleanup :)06:41
@avass:vassast.orgwe've seen some issues lately, but way fewer than before glibc was updated.06:41
-@gerrit:opendev.org- Zuul merged on behalf of Tony Breeds: [zuul/zuul] 854846: Make the day-of-week difference from cron a warning https://review.opendev.org/c/zuul/zuul/+/85484606:44
@iwienand:matrix.orgi've run zuul-tox-remote now 5 times in 855116 and of course nothing :/07:50
@avass:vassast.orgUh, what's the difference between ABORTED and CANCELED in zuul? fail-fast seem to be causing both to be reported08:24
@avass:vassast.orgweird, the result shown for buildsets in zuul-web and what's reported by mqtt is different. mqtt doesn't show end_time for all canceled builds while zuul-web does, and some build reported canceled are actually successful. I the sql and mqtt reporters use different data somehow?09:37
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 763540: Support openstack server groups https://review.opendev.org/c/zuul/nodepool/+/76354011:59
-@gerrit:opendev.org- Zuul merged on behalf of Simon Westphahl: [zuul/nodepool] 854417: Consider all node types when adjusting label quota https://review.opendev.org/c/zuul/nodepool/+/85441712:13
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 614079: WIP: Add request handler timer to stats https://review.opendev.org/c/zuul/nodepool/+/61407912:18
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 743790: Check for images to upload single threaded https://review.opendev.org/c/zuul/nodepool/+/74379012:41
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 743790: Check for images to upload single threaded https://review.opendev.org/c/zuul/nodepool/+/74379012:43
@jim:acmegating.comAlbin Vass: they do use different data.  SQL isn't really a reporter any more, so it can report more accurate information about the result after the reporters run.  it has the ability to update data after the report is complete (so if the report itself fails, like a merge, it can record that correctly).13:36
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 855096: WIP: Tracing: implement span save/restore https://review.opendev.org/c/zuul/zuul/+/85509614:35
@jim:acmegating.comClark: swest tobiash regarding the instance type fallback change ( https://review.opendev.org/853371 ) and the idea to move the configuration of that into zuul, how about this syntax for zuul config?  https://etherpad.opendev.org/p/5myx14_07u5oWSJz8mYx  (and the fallback handling itself would be in zuul, which would potentially make it more useful/applicable)16:18
@tobias.henkel:matrix.orgI like that16:29
@clarkb:matrix.orgcorvus: I like that but "alternatives" doesn't really convey if there is a preference.16:36
@clarkb:matrix.orgI think there isn't one? Would a different term indicate that better maybe?16:36
@clarkb:matrix.org(to me alternatives indicate there is some primary and then alternatives, but I may be over thinking this)16:37
@jim:acmegating.comClark: yeah, i spent some time thinking about that and perusing the thesaurus, and haven't come up with anything better than the word "alternatives" and the fact that it's a list which has an inherent order.  obviously the docs would clarify, but i agree, we want to make it as intuitive as possible.  i'd be happy to change it if we think of a better one.16:38
@jim:acmegating.com(at any rate, we can probably s/alternatives/something/ later if we find it, so i'll start working on the implementation soon and if we manage to think of something better, i'm happy to s/ then.)16:41
@clarkb:matrix.orgaliases maybe, or standins16:44
@jim:acmegating.comi think aliases might be more confusing (since we name nodes -- it makes me think of that somehow)16:45
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/85455620:35
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/85455621:24
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed:21:34
- [zuul/zuul] 854458: WIP: Add support for configuring and testing tracing https://review.opendev.org/c/zuul/zuul/+/854458
- [zuul/zuul] 855096: WIP: Tracing: implement span save/restore https://review.opendev.org/c/zuul/zuul/+/855096
- [zuul/zuul] 855291: Fix and improve Keycloak tutorial https://review.opendev.org/c/zuul/zuul/+/855291
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 855293: Add tracing tutorial https://review.opendev.org/c/zuul/zuul/+/85529322:26
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 852929: Add detail to "depends on a change that failed to merge" https://review.opendev.org/c/zuul/zuul/+/85292922:37
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 855297: web: console: compress datalist https://review.opendev.org/c/zuul/zuul/+/85529723:22
@iwienand:matrix.org> <@gerrit:opendev.org> Ian Wienand proposed: [zuul/zuul] 855116: [dnm] zuul-tox-remote : try to dump some sort of backtraces  https://review.opendev.org/c/zuul/zuul/+/85511623:32
this did catch *a* error -- in tests.remote.test_remote_action_modules.TestActionModules5:test_shell_module @ https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_b95/855116/1/check/zuul-tox-remote/b9598b9/job-output.txt
@iwienand:matrix.orgin all the other tests i never saw that one time out.  unfortunately i guess due to a combination of bwrap, namespaces, etc, etc. the trace is basically useless23:32
@iwienand:matrix.orgvery open to ideas.  one might be to switch this job from jammy back to focal and see if we see any timeouts23:33
@iwienand:matrix.orgthat would at least be a datapoint that it's something underneath ansible23:34

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!