*** marios is now known as marios|ruck | 05:08 | |
rdogerrit | User pojadhav created rdo-infra/ci-config master: Remove ussuri panels from promotions dashboard https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42425 | 08:04 |
---|---|---|
*** ysandeep is now known as ysandeep|lunch | 08:13 | |
rdogerrit | User pojadhav proposed rdo-infra/ci-config master: Remove ussuri panels from promotions dashboard https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42425 | 08:25 |
Tengu | dpawlik, tristanC, nhicher hello there! would it be possible to old a node for job 42344 ? it's just starting now, the node name should be available shortly | 08:30 |
Tengu | please add my key (https://github.com/cjeanner.keys) as well as slaweq's (he'll provide it shortly) | 08:31 |
Tengu | https://github.com/slawqo.keys <- slaweq keys | 08:32 |
Tengu | ah, hostname: https://github.com/slawqo.keys | 08:32 |
Tengu | fu.. | 08:32 |
Tengu | Hostname: node-0002494462 | 08:32 |
fserucas | Hi tengu, this run will not get hold | 08:38 |
fserucas | but the next one it will | 08:38 |
dpawlik | Tengu: done | 08:39 |
Tengu | fserucas: heya! err... why? dpawlik said it will be hold? don't tell me there are retries in there as well :/ | 08:41 |
dpawlik | Tengu: I hold the node for you | 08:41 |
dpawlik | this one that is currently running | 08:42 |
Tengu | ok. we'll see how it goes when it fails then... hopefully no retry this time :/ | 08:42 |
Tengu | slaweq: you should be able to connect to the node: ssh zuul@38.102.83.38 | 08:42 |
dpawlik | Tengu: you can allways give a trick that I do in your patch | 08:42 |
dpawlik | some time ago | 08:42 |
Tengu | right - but it won't stop retry if it's defaulting again to something else than 0. or 1. or... dang. I'm too new to all of that. | 08:43 |
slaweq | I'm connected to it | 08:44 |
Tengu | slaweq: ok. it's running, so things aren't ready to debug, but at least you have the access :) | 08:45 |
slaweq | ok | 08:45 |
Tengu | slaweq: you can get the live status here: https://review.rdoproject.org/zuul/stream/f13f56b34bb94493b8309579f8a29186?logfile=console.log | 08:46 |
Tengu | I think we can get to the config-dowload content as soon as it's generated (i.e. once the OC deploy starts) in order to get a first hint about the mappings. | 08:46 |
Tengu | and then.... we should be able to get the actual applied mapping pretty quick, it seems to be during step1. | 08:47 |
Tengu | so even if the node is dropped because of a retry, we should be able to get a view on the state | 08:47 |
*** pojadhav is now known as pojadhav|afk | 08:49 | |
*** ysandeep|lunch is now known as ysandeep | 08:53 | |
*** ysandeep is now known as ysandeep|sick | 08:55 | |
*** rlandy|out is now known as rlandy | 10:23 | |
*** pojadhav|afk is now known as pojadhav | 10:27 | |
rdogerrit | Chandan Kumar proposed config master: Remove reference of singlenode Job https://review.rdoproject.org/r/c/config/+/42226 | 11:14 |
Tengu | dpawlik: we can un-hold the node node-0002494462 - OC deploy failed, but for unrelated reason to the issue we're trying to catch. I'll do some more testing on my side and request a new node hold once I'm sure it will fail "as expected" at the right step. | 11:27 |
dpawlik | Tengu: ack. Unholding... | 11:37 |
Tengu | dpawlik: wanting to catch network things in tripleo isn't easy :(. Especially with other CI things ^^'. | 11:38 |
dpawlik | Tengu: I can imagine. Nobody says that it is related to the software... it can be a hypervisor configuration :P | 11:40 |
Tengu | dpawlik: I don't think it is, since it's working without the patch ^^ | 11:41 |
dpawlik | Tengu: if same job fails many times on one host and none on other, that will be strange :P | 11:41 |
dpawlik | ah | 11:41 |
dpawlik | Tengu: we added a hostid field in opensearch | 11:41 |
Tengu | dpawlik: I'm doing some runs in parallel, we found a difference between CI config state and my local lab. Apparently, something is wrong on my side, preventing to actually reproduce the CI issue. | 11:41 |
dpawlik | so if you use good queries in Opensearch, maybe you iwll find something interesting | 11:41 |
Tengu | dpawlik: hmmm ok. keeping that at hand - but I'm pretty sure this isn't related to the hypervisor. | 11:42 |
dpawlik | Tengu: ah. Today I got an issue, that service deployment works perfect on my laptop VM, but in the cloud not. After 3/4 hours spend on that, I saw that one restart was interrupting the DB sync, because the host SSD was too slow. Sleep 2 normally will fix that issue xD | 11:43 |
Tengu | dpawlik: uho - that sounds like the issue we just hit actually XD | 11:44 |
dpawlik | Tengu: you are an expert :) The work that you are doing with slaweq is a magic :D | 11:44 |
Tengu | but that's not the one we're running after | 11:44 |
Tengu | the deployed failed on mysql db sync - sounds like your SSD issue ^^'. This is part of the "with other CI things" I just described :D | 11:45 |
Tengu | anyway. deploying things here and there, I'll check back in a few to ensure it's working up to the interesting point. | 11:46 |
dpawlik | Tengu: yup! Good luck. Don't forget to share the knowledge what was an issue when you discover! | 11:49 |
Tengu | dpawlik: of course! | 11:54 |
Tengu | I also have to see why my lab is acting differently. | 11:54 |
Tengu | maybe that will provide a clue alreday. | 11:54 |
Tengu | dpawlik: ok, found out why my lab wasn't providing the exact same configuration as CI. I think we won't need to hold any node for that weird issue anymore :). | 13:04 |
Tengu | dpawlik: basically, a wrong config override was pushed in the lab at some point, and it wasn't noticed before now. Fun fact: that config thing has apparently no actual impact on the issue.......... | 13:05 |
rdogerrit | User pojadhav proposed rdo-infra/ci-config master: Remove ussuri panels from promotions dashboard https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42425 | 13:09 |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 13:18 |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 13:29 |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 13:30 |
*** pojadhav is now known as pojadhav|afk | 13:41 | |
*** artom_ is now known as artom | 13:42 | |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 13:46 |
rdogerrit | Ananya Banerjee proposed rdo-jobs master: DNM: Added jobs to test container login/push/pull to quay on master and train https://review.rdoproject.org/r/c/rdo-jobs/+/41528 | 14:45 |
*** dviroel is now known as dviroel|lunch | 15:23 | |
*** marios|ruck is now known as marios|out | 16:10 | |
*** dviroel|lunch is now known as dviroel | 16:23 | |
*** rlandy is now known as rlandy|ruck | 16:25 | |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 16:32 |
*** dasm|ruck|off is now known as dasm|ruck | 17:16 | |
rdogerrit | Ronelle Landy created rdo-infra/ci-config master: Temp remove fs035 for promotion https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42441 | 17:52 |
rdogerrit | Ronelle Landy created config master: Remove train c7 pipeline https://review.rdoproject.org/r/c/config/+/42442 | 18:00 |
rdogerrit | Ronelle Landy created rdo-jobs master: Remove train c7 pipeline https://review.rdoproject.org/r/c/rdo-jobs/+/42443 | 18:01 |
*** rlandy|ruck is now known as rlandy|rover | 18:30 | |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 18:55 |
rdogerrit | Douglas Viroel proposed rdo-infra/ci-config master: Refactoring compose promoter role https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 | 19:07 |
rdogerrit | Tristan de Cacqueray created config master: Re-use existing review for kvm trigger https://review.rdoproject.org/r/c/config/+/42444 | 19:10 |
rdogerrit | Tristan de Cacqueray proposed config master: Re-use existing review for kvm trigger https://review.rdoproject.org/r/c/config/+/42444 | 19:11 |
rdogerrit | Tristan de Cacqueray proposed config master: Re-use existing review for kvm trigger https://review.rdoproject.org/r/c/config/+/42444 | 19:28 |
rdogerrit | Dariusz created rdo-infra/ci-config master: [DNM] Promote CS9 Master Network component https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42446 | 20:23 |
rdogerrit | Merged rdo-infra/ci-config master: Temp remove fs035 for promotion https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42441 | 21:07 |
rdogerrit | Dariusz created rdo-infra/ci-config master: Revert "Temp remove fs035 for promotion" https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/41249 | 21:15 |
rdogerrit | Dariusz created rdo-infra/ci-config master: Temporary: Disable fs001 & fs035 to promote CS8 train https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42450 | 21:49 |
*** dasm|ruck is now known as dasm|ruck|bbl | 22:23 | |
*** rlandy|rover is now known as rlandy|rover|bbl | 22:29 | |
*** dviroel is now known as dviroel|out | 23:59 |
Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!