-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/854556 | 00:13 | |
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/854556 | 00:26 | |
@iwienand:matrix.org | corvus: / anybody -- how does the spacing of the > pull-outs look on the console of https://fc7ee29928d7d35d8931-689502246b0e06bff85a4de58b84e796.ssl.cf2.rackcdn.com/854556/6/check/zuul-build-dashboard-opendev/943bb51/npm/html/ ? | 00:47 |
---|---|---|
-@gerrit:opendev.org- Zuul merged on behalf of Ian Wienand: [zuul/zuul] 854555: web: fix package task results in console https://review.opendev.org/c/zuul/zuul/+/854555 | 01:45 | |
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 855107: web: console -- clarify package: comment https://review.opendev.org/c/zuul/zuul/+/855107 | 01:52 | |
@iwienand:matrix.org | I've noticed that zuul-tox-remote is failing fairly randomly | 03:44 |
@iwienand:matrix.org | i've had a look at the last ~6 failures that appear unrelated to anything happening in the change | 03:45 |
@iwienand:matrix.org | https://etherpad.opendev.org/p/zuul-tox-remote-2022-08 | 03:45 |
@iwienand:matrix.org | they all seem to fail in test_command @ https://opendev.org/zuul/zuul/src/branch/master/tests/remote/test_remote_zuul_stream.py#L117 | 03:47 |
@iwienand:matrix.org | this is a mixin -- 4 times it was tests.remote.test_remote_zuul_stream.TestZuulStream29, twice TestZuulStream28 | 03:48 |
@iwienand:matrix.org | TestZuulStream5 did not seem to fail afaics | 03:48 |
@iwienand:matrix.org | i've extracted the end of executor job output logs before it got killed for about 4 of them, results in the etherpad. they all appear to be at different points | 03:49 |
@iwienand:matrix.org | all of them just seem to stop, then ~2 minutes later are aborted | 03:50 |
@iwienand:matrix.org | i know we've had that grantpt() deadlock, and also Albin Vass has seen some too. it does make me wonder if that's related | 03:56 |
@iwienand:matrix.org | a backtrace would certainly help. but i'd have to catch a failing job's ansible while it's in the two-minute quiescent period before it's reaped, i think | 03:58 |
@iwienand:matrix.org | ... on the remote side too ... | 03:59 |
@iwienand:matrix.org | although it's ssh-ing to itself | 04:01 |
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 855116: [dnm] zuul-tox-remote : try to dump some sort of backtraces https://review.opendev.org/c/zuul/zuul/+/855116 | 04:25 | |
@avass:vassast.org | ianw: yeah, we got "lucky" that the job hung in cleanup :) | 06:41 |
@avass:vassast.org | we've seen some issues lately, but way fewer than before glibc was updated. | 06:41 |
-@gerrit:opendev.org- Zuul merged on behalf of Tony Breeds: [zuul/zuul] 854846: Make the day-of-week difference from cron a warning https://review.opendev.org/c/zuul/zuul/+/854846 | 06:44 | |
@iwienand:matrix.org | i've run zuul-tox-remote now 5 times in 855116 and of course nothing :/ | 07:50 |
@avass:vassast.org | Uh, what's the difference between ABORTED and CANCELED in zuul? fail-fast seems to be causing both to be reported | 08:24 |
@avass:vassast.org | weird, the results shown for buildsets in zuul-web and what's reported by mqtt are different. mqtt doesn't show end_time for all canceled builds while zuul-web does, and some builds reported as canceled are actually successful. Do the sql and mqtt reporters use different data somehow? | 09:37 |
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 763540: Support openstack server groups https://review.opendev.org/c/zuul/nodepool/+/763540 | 11:59 | |
-@gerrit:opendev.org- Zuul merged on behalf of Simon Westphahl: [zuul/nodepool] 854417: Consider all node types when adjusting label quota https://review.opendev.org/c/zuul/nodepool/+/854417 | 12:13 | |
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 614079: WIP: Add request handler timer to stats https://review.opendev.org/c/zuul/nodepool/+/614079 | 12:18 | |
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 743790: Check for images to upload single threaded https://review.opendev.org/c/zuul/nodepool/+/743790 | 12:41 | |
-@gerrit:opendev.org- Tobias Henkel proposed: [zuul/nodepool] 743790: Check for images to upload single threaded https://review.opendev.org/c/zuul/nodepool/+/743790 | 12:43 | |
@jim:acmegating.com | Albin Vass: they do use different data. SQL isn't really a reporter any more, so it can report more accurate information about the result after the reporters run. it has the ability to update data after the report is complete (so if the report itself fails, like a merge, it can record that correctly). | 13:36 |
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 855096: WIP: Tracing: implement span save/restore https://review.opendev.org/c/zuul/zuul/+/855096 | 14:35 | |
@jim:acmegating.com | Clark: swest tobiash regarding the instance type fallback change ( https://review.opendev.org/853371 ) and the idea to move the configuration of that into zuul, how about this syntax for zuul config? https://etherpad.opendev.org/p/5myx14_07u5oWSJz8mYx (and the fallback handling itself would be in zuul, which would potentially make it more useful/applicable) | 16:18 |
@tobias.henkel:matrix.org | I like that | 16:29 |
@clarkb:matrix.org | corvus: I like that but "alternatives" doesn't really convey if there is a preference. | 16:36 |
@clarkb:matrix.org | I think there isn't one? Would a different term indicate that better maybe? | 16:36 |
@clarkb:matrix.org | (to me alternatives indicate there is some primary and then alternatives, but I may be over thinking this) | 16:37 |
@jim:acmegating.com | Clark: yeah, i spent some time thinking about that and perusing the thesaurus, and haven't come up with anything better than the word "alternatives" and the fact that it's a list which has an inherent order. obviously the docs would clarify, but i agree, we want to make it as intuitive as possible. i'd be happy to change it if we think of a better one. | 16:38 |
@jim:acmegating.com | (at any rate, we can probably s/alternatives/something/ later if we find it, so i'll start working on the implementation soon and if we manage to think of something better, i'm happy to s/ then.) | 16:41 |
@clarkb:matrix.org | aliases maybe, or standins | 16:44 |
@jim:acmegating.com | i think aliases might be more confusing (since we name nodes -- it makes me think of that somehow) | 16:45 |
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/854556 | 20:35 | |
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 854556: web: console: convert to PF4 DataList https://review.opendev.org/c/zuul/zuul/+/854556 | 21:24 | |
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: | 21:34 | |
- [zuul/zuul] 854458: WIP: Add support for configuring and testing tracing https://review.opendev.org/c/zuul/zuul/+/854458 | ||
- [zuul/zuul] 855096: WIP: Tracing: implement span save/restore https://review.opendev.org/c/zuul/zuul/+/855096 | ||
- [zuul/zuul] 855291: Fix and improve Keycloak tutorial https://review.opendev.org/c/zuul/zuul/+/855291 | ||
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 855293: Add tracing tutorial https://review.opendev.org/c/zuul/zuul/+/855293 | 22:26 | |
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 852929: Add detail to "depends on a change that failed to merge" https://review.opendev.org/c/zuul/zuul/+/852929 | 22:37 | |
-@gerrit:opendev.org- Ian Wienand proposed: [zuul/zuul] 855297: web: console: compress datalist https://review.opendev.org/c/zuul/zuul/+/855297 | 23:22 | |
@iwienand:matrix.org | > <@gerrit:opendev.org> Ian Wienand proposed: [zuul/zuul] 855116: [dnm] zuul-tox-remote : try to dump some sort of backtraces https://review.opendev.org/c/zuul/zuul/+/855116 | 23:32 |
this did catch *an* error -- in tests.remote.test_remote_action_modules.TestActionModules5:test_shell_module @ https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_b95/855116/1/check/zuul-tox-remote/b9598b9/job-output.txt | ||
@iwienand:matrix.org | in all the other tests i never saw that one time out. unfortunately i guess due to a combination of bwrap, namespaces, etc, etc. the trace is basically useless | 23:32 |
@iwienand:matrix.org | very open to ideas. one might be to switch this job from jammy back to focal and see if we see any timeouts | 23:33 |
@iwienand:matrix.org | that would at least be a datapoint that it's something underneath ansible | 23:34 |
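
Editor's sketch, relating to the backtrace discussion above: one generic way to get a stack dump out of a Python process that silently stops is the standard-library `faulthandler` watchdog, which avoids having to attach to the process during the short window before it is reaped. This is not what change 855116 does, and it may not help when the hang is in a child process inside bwrap. The child script, the `stuck` function and `run_stuck_child` are hypothetical names used only for illustration.

```python
import subprocess
import sys
import textwrap

# Hypothetical stand-in for a test that stops making progress.
CHILD = textwrap.dedent("""
    import faulthandler, sys, time
    # If the process is still running after 1 second, dump all thread
    # stacks to stderr and exit immediately with status 1.
    faulthandler.dump_traceback_later(1, exit=True, file=sys.stderr)
    def stuck():
        time.sleep(30)
    stuck()
""")

def run_stuck_child():
    """Run the hanging child and return (exit status, captured stderr)."""
    proc = subprocess.run(
        [sys.executable, "-c", CHILD],
        capture_output=True,
        text=True,
        timeout=20,
    )
    return proc.returncode, proc.stderr

if __name__ == "__main__":
    rc, err = run_stuck_child()
    print("exit status:", rc)
    print(err)
```

The dump names the frame that was executing when the timer fired (here `stuck`), which is the kind of datapoint the discussion above is after. In a real test run the timeout would need to be shorter than the job's own abort timer.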