Tuesday, 2022-09-13

*** dviroel|afk is now known as dviroel00:41
*** dviroel is now known as dviroel|out00:50
*** rlandy is now known as rlandy|out01:58
*** ysandeep|out is now known as ysandeep04:42
ysandeepgood morning ci o/04:45
bhagyashris|ruckgood morning all 04:49
*** ysandeep is now known as ysandeep|brb05:02
*** ysandeep|brb is now known as ysandeep05:35
* bhagyashris|ruck lunch afk05:52
ysandeepjm1[m], bhagyashris|ruck, I am seeing mirror related issue on periodic jobs is it known?06:23
ysandeephttps://logserver.rdoproject.org/54/31954/89/check/periodic-tripleo-ci-centos-9-scenario007-multinode-oooq-container-master/9a2c631/job-output.txt06:23
ysandeep~~~06:23
ysandeep2022-09-13 06:09:53.963695 | primary | Errors during downloading metadata for repository 'extras-common':06:23
ysandeep2022-09-13 06:09:53.963727 | primary |   - Curl error (7): Couldn't connect to server for https://mirrors.centos.org/metalink?repo=centos-extras-sig-extras-common-9-stream&arch=x86_64&protocol=https,http [Failed to connect to mirrors.centos.org port 443: No route to host]06:23
ysandeep~~~06:23
mariosi saw that at least once yesterday ysandeep (is the job in RETRY fail?)06:31
ysandeepmarios, yes06:31
mariosysandeep: you remember we had a card against vexx for that but it was moved done. not sure if this is the same thing or something new now though 06:33
mariosysandeep: but not consistent so retry may help you if you are waiting for some specific run 06:33
mariostempest failed in the gate but not sure if this is transient yet (unrelated to the above discussion)06:35
marioshttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_732/854140/1/gate/tripleo-ci-centos-8-standalone/7320f7d/logs/undercloud/var/log/tempest/stestr_results.html 06:35
ysandeepmarios, thanks! this time failure is during pulling content from mirrors.centos.org, afair.. that card was for pulling content from vexx mirrors, I have faced this issue twice, let me try again 06:36
mariosysandeep: right yeah should be a different thing then 06:37
mariosysandeep: but since it was seen yesterday and still today then we should have a cix on that 06:37
ysandeepfound the previous bug: https://bugs.launchpad.net/tripleo/+bug/1983817 , yes that was for vexx mirror.. looks different06:38
mariosack ysandeep thanks for finding06:39
ysandeepmarios, thanks! I will let bhagyashris|ruck and jm1[m] report and investigate once they are in.06:39
mariosysandeep: ack sounds good 06:41
mariosjm1[m]: fyi hopefully transient so just noted them there for now https://review.opendev.org/c/openstack/tripleo-heat-templates/+/854140/1#message-436dc7bef64a98a5462c2703e5fbb0603e2373e406:45
bhagyashris|ruckysandeep, ack07:27
abregman|afkbhagyashris|ruck, ysandeep, mario: hey, can we merge this one? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772707:31
abregman|afkmarios: ^07:31
*** jm1|ruck is now known as jm1|rover07:33
jm1moin 😴07:33
mariosabregman|afk: looking07:35
*** jpena|off is now known as jpena07:37
chemmarios: which one is the "usual" one, ~/.zuul.yaml or zuul.d/layout.yaml ? 07:40
chemmarios: hum zuul.d/layout.yaml seems to be the one07:41
marioschem: we have zuul.d/layout in tripleo land07:42
marioschem: but .zuul.yaml is popular elsewhere 07:42
jm1bhagyashris|ruck, marios, rlandy|out: i have moved todays rr notes to a new hackmd because we are are hitting chars limit soon https://hackmd.io/dKeK6zo9R66heikGyCb4NA07:56
mariosjm1: ack and thanks for updating the index too 07:57
*** abregman|afk is now known as abregman07:57
jm1bhagyashris|ruck: you are editing the old doc..07:59
abregmanbhagyashris|ruck, ysandeep, jm1, marios: and this one please https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42787708:05
mariosabregman: looking08:15
mariosabregman: lets hold on that for a bit commented 08:17
mariosplease add https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/427877 to the review list08:18
mariosfrenzyfriday: no bot? ^ 08:18
abregmanmarios: so we should we ping tomorrow/later this week or you will simply merge this at some point?08:18
mariosabregman: please add your reviews to https://hackmd.io/FGMoCiRfSNa8puA1BpTQ-Q?edit08:18
mariosabregman: we only ever check today/yesterday so you'll need to re-add if not merged tomorrow08:19
mariosabregman: usually adding to review list and if needed you are welcome to join the reviews call to present something about the change08:20
mariosabregman: so usually adding to review list is enough but feel free to ping as needed ;)08:20
frenzyfridaymarios, bot is down for maintainence. I'll get it back in a few hrs08:21
frenzyfridayhackmd has suddenly started returning 403, I am checking why08:22
abregmanmarios: got it, thanks08:24
frenzyfridayadd to reviewlist https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42787708:28
mariosfrenzyfriday: thanks 08:29
mariosabregman: there is a bot you can use here abregman fyi frenzyfriday owns it and it should be back soon fyi 08:29
abregmanmarios, frenzyfriday: what's the syntax? :)08:30
mariosabregman: like that 11:28 < frenzyfriday> add to reviewlist 08:30
marioshttps://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42787708:30
frenzyfridaybot add to reviewlist https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42787708:32
reviewbotI have added your review to the Review list08:32
frenzyfridaynow it works. /me putting up a patch08:33
mariosthank you frenzyfriday 08:33
marioswelcome back reviewbot 08:33
marios:(08:33
abregmanadd to reviewlist https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42771008:34
frenzyfridaypls wait for the patch marios :D08:34
marios:)08:35
frenzyfridayhere is the patch, pls add to your review lists https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4491108:36
reviewbotI have added your review to the Review list08:36
frenzyfridaymeanwhile you can use the bot, it is running on my local08:36
mariosack frenzyfriday but how was it working before then 08:37
mariosysandeep: chandankumar: please merge it when you have a minute https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4491108:37
frenzyfridaylooks like hackmd has a bug (or feature?) There are 2 types of notes - personal notes for which the api patch is ...notes/<note_id> and team notes for which patch is .../teams/<team_id>/notes/<note_id> The bot used to use the syntax /notes/ to update the team notes. It worked well for so many months. Since a few weeks this api call returns 403 but still updates the actual note somehow! So the reviews get added to 08:39
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.08:39
frenzyfridaythe list but the bot thinks 403 means it couldnt add reviews to the list08:39
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.08:39
mariosack thanks frenzyfriday 08:40
mciecierskiHi, I would like to ask if it is possible to run tripleo-quickstart against PSI cloud, instead of using libvirt guest. I see in docs this https://docs.openstack.org/tripleo-quickstart/latest/configuration.html#consuming-openstack-hosted-vm-instances-as-overcloud-undercloud-nodes, but it seems experimental feature. 08:41
frenzyfridayjm1, hey  0/ I have a question regarding the incockpit ansible pull thingy. A new patch https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44860 which changes the docker compose yaml merged yesterday. Will it be automatically pulled and used in the incockpit deployment (the one on c7 vm on baremetal) or should I manually do stuff?08:47
jm1frenzyfriday: you edited the docker-compose.yml, hence docker-compose (called from ansible-pull) automatically picked up the stuff. downstream dockpit is already running with influx 1.808:52
frenzyfridayjm1, awesome , thanks!08:53
mariosmciecierski: should be able to do it but your mileage may vary/you may need to tweak things (feel free to ask here). 08:53
* frenzyfriday likes ansible pull now08:53
mariosmciecierski: we have some config there fwiw @ https://opendev.org/openstack/tripleo-ci/src/branch/master/toci-quickstart/config/testenv/multinode-psi.yml https://opendev.org/openstack/tripleo-ci/src/branch/master/toci-quickstart/config/testenv/ovb-psi.yml 08:53
mariosmciecierski: but that config is for use in our jobs so only some might apply 08:54
bhagyashris|ruckjm1, plz08:54
bhagyashris|rucksend me the new hackmd link08:54
bhagyashris|ruckgot it08:57
abregmanmarios: can you please send me an invite for today's review meeting?09:01
mciecierskimarios: After  doing some tweaks, should I pass this config as such `bash quickstart.sh --nodes toci-quickstart/config/testenv/multinode-psi.yml  to use it ?09:02
mciecierskiOr nodes need to be provisioned in advanced and https://opendev.org/openstack/tripleo-ci/src/branch/master/toci-quickstart/config/testenv/multinode-psi.yml is `Feature Configuration`?09:05
ysandeepfrenzyfriday, https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44911 failed on lint09:06
*** ysandeep is now known as ysandeep|lunch09:06
mariosabregman: sure sec 09:06
mariosabregman: tomrrow (today's one is much later you may not want to join)09:06
mariossending both 09:07
mariosabregman: see private please 09:07
jm1ysandeep: regarding the mirror issue thingy: checked todays c9 master failures and we had not had this reason once. we had it yesterday though09:07
mariosmciecierski: not sure about that to be honest it is not something i've tried for a while. in ci we get nodes from nodepool and then run quickstart and plays on those you can see some examples of how we use it at https://opendev.org/openstack/tripleo-ci/src/branch/master/roles/run-test/templates/toci_quickstart.sh.j2 09:15
mariosmciecierski: and yeah the multinode-psi is more about feature config 09:16
mariosabregman: added you for todays call too 09:17
mciecierskimarios: ack, thank you 09:21
mariosfolks anyone tried quickstart.sh with psi lately? I recall someone on one of our calls saying... maybe you can help mciecierski who is trying 09:22
abregmanmarios: thank you09:28
abregmanin the meantime, can we merge this one? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42771009:28
abregmantestproject won't pass and there is CIX but I understand from Ronelle we can merge in the meantime, simply not the criteria change09:28
frenzyfridayysandeep|lunch, thanks, updated https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44911/09:32
mariosamoralej|off: o/ ah maybe you're out today 09:38
jm1ysandeep|lunch: oh i actually got this centos mirror issue just now! https://logserver.rdoproject.org/08/44908/1/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp_1supp-featureset064-wallaby/cd2b20a/job-output.txt10:01
jm1rlandy|out: look at that! https://trunk.rdoproject.org/api-centos9-wallaby/api/civotes_agg_detail.html?ref_hash=f6749f9cda54d017021ab56bf4ec0958 🥳10:06
mariosjm1: we have both... the "current" issue for Failed to connect to mirrors.centos.org port 443: No route to host another example there https://review.rdoproject.org/zuul/build/045fb41162a24832ab1ee61f327db3a410:06
mariosjm1: but also we are still seeing that one https://bugs.launchpad.net/tripleo/+bug/1983817 (old closed bug) Could not resolve host: mirror.regionone.vexxhost-nodepool-tripleo.rdoproject.org] example there https://review.rdoproject.org/zuul/build/417881f0cfad4159936e9d65ce59e765 10:07
jm1rlandy|out: we had to rerun fs64 only a couple of dozen times and voila it passes10:07
mariosjm1: either resurrect +bug/1983817 (and its cix) or probably cleaner file a new one since we are mainly seeing the mirrors.centos.org issue now i think?10:07
jm1marios: why? it is intermittent and not even very often compared to the other failures10:08
mariosjm1: we still track intermittent issues as cix so they might be addressed. intermittent != transient10:08
mariosjm1: we've had that since at least yesterday, and apparently it is still ongoing10:09
jm1marios: we have ~6 other intermittent failures which are coming up dozens of times a week, hence we should report those first10:09
jm1marios: rr notes has a list of intermittent bugs which i am seeing every day10:10
jm1marios: when i get some time i will create bugs10:11
mariosjm1: this seems like a serious issue - first one i checked master buildset see that https://review.rdoproject.org/zuul/buildset/372d55679278445da538e008b4ac301810:11
mariosi see 10 jobs fail on RETRY in that ONE buildset 10:12
mariosjm1: so yeah i think this one is worth a CIX asap10:12
mariosjm1: ack on 13:11 < jm1> marios: when i get some time i will create bugs10:12
mariosjm1: ping if you want me to file something10:13
jm1marios: i completely agree with you that our situation is bad. we have so many intermittent issues that i had to rekick c9 wallaby fs64 dozens of times until it passes. its just i focus on the most outdated components etc. first10:17
jm1marios: i would really appreciate if you could file a bug for that one10:18
jm1rlandy|out: c9 wallaby is promoting.. lets hope all jobs still work.. http://promoter.rdoproject.org/promoter_logs/centos9_wallaby.log10:22
mariosjm1: ok i will file that no problem doing10:22
jm1marios: shall we file bugs for all other intermittent errors as well?10:23
jm1marios: we will be flooded with intermittent issues. even the ones we currently have are not handled10:24
mariosjm1: i don't know jakob i haven't seen them but i can tell you this one is killing all the lines10:24
mariosjm1: i have seen it for master wallaby 9 8 .. not on train but wont be surprised 10:24
mariosthis is just chekcing the last runs for those ^^^ sec will add links in bug10:25
mariosjm1: general rule is if you see something more than once (even 2 or 3 examples) it is enough to file a bug 10:25
jm1marios: that was really meant as a open question. i dont know what the "right" way is here which is why i am asking :D10:25
jm1marios: ack10:26
mariosjm1: it could be transient ... and fixed by end of day but then you still have a timestamp... during this day promotions were blocked by this thing10:26
mariosjm1: for the RETRY/mirror thing we have seen it since at least yesterday might be older 10:26
mariosjm1: we should establish asap that we are blocked on this thing 10:26
*** rlandy|out is now known as rlandy10:29
jm1marios: but how can we be blocked on that thing if jobs pass after rerun?10:31
rlandymarios: chandankumar: ysandeep|lunch: jm1: hey ... wanted to talk to you about OVB in general ... pls ping when around10:31
mariosjm1: there https://bugs.launchpad.net/tripleo/+bug/1989452 10:31
jm1rlandy: around ;)10:31
rlandyie: I don't want to keep doing this rerun dance10:31
rlandyjm1: hey ... how's the network component10:31
rlandywe get by there?10:31
jm1rlandy: sync?10:32
rlandyjm1: yeah - we can10:32
rlandybhagyashris|ruck" ^^ want to join?10:32
rlandyhttps://meet.google.com/soo-vrjn-kgt?pli=1&authuser=010:32
mariosjm1: we are blocked because it kills your promotion lines 13:31 < jm1> marios: but how can we be blocked on that thing if jobs pass after rerun?10:33
rlandyjm1: bhagyashris|ruck: marios: ^^ if you want to join10:33
mariosjm1: and you have to go chase things that should not be failing10:33
mariosrlandy: will join in a couple mins brb10:33
rlandyk10:33
*** ysandeep|lunch is now known as ysandeep10:43
mariosjm1: as discussed i thought you meant they were passing in testproject not on actual rerun in the same buildset. i have removed the flags https://bugs.launchpad.net/tripleo/+bug/1989452/comments/2 10:45
bhagyashris|ruckrlandy, https://logserver.rdoproject.org/74/44874/2/check/periodic-tripleo-ci-centos-9-ovb-1ctlr_2comp-featureset020-master/c38d70b/logs/11:07
abregmanhey. can someone review and possibly merge? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42771011:09
abregmanI know there is the reviews meeting. but after every merge we need to rebase other changes, so we can't simply excepts to merge or review everything on the meeting (assuming it goes this way)11:10
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.11:10
rlandyjm1: chandankumar: ysandeep: marios; sen invite to discuss OVB11:17
ysandeeprlandy, ack11:17
rlandywe will need chandankumar11:17
chandankumarrlandy: ack11:18
rlandyso if he is not around, will recshedule11:18
rlandyoh there he is11:18
rlandyabregman: ack - looking at patches11:18
abregmanexpect*11:19
abregmanthanks11:19
rlandyabregman: frenzyfriday: so sc001 and 002 is in11:19
rlandychecking of we have criteria for those11:19
* jm1 lunch11:19
abregmanrlandy: sc004 is different - it didn't pass testing (there is CIX) and so criteria is not relevant atm for it11:19
abregmanbut yesterday you said we can get it in, simply without criteria. so was hoping to do that now11:20
* rlandy checking criteria for 001 and 00211:20
abregmanand move to the next scenario11:20
rlandyabregman: ack11:20
* pojadhav brb11:20
rlandycriteria for 001 is in11:21
rlandyok - O see mario +1'ed here11:23
rlandymarios11:23
rlandygoing to rework that order11:23
mariosack rlandy thanks got the invite lgtm 11:23
rlandycould we get a second vote on https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/427710?11:24
rlandyany cores? ^^11:24
rlandytest has a CIX so we expect failures11:25
rlandybut also a fix11:25
rlandyso ok with adding this now11:25
mariosack rlandy wf11:29
rlandymarios: ty11:32
rlandyabregman: ^^11:33
rlandygoing to edit criteria patch11:33
rlandythen will get that merged later today11:33
*** dviroel|out is now known as dviroel11:36
abregmanthank you!11:46
abregmanmoving to the next one11:46
chandankumarrlandy: jm1 ovb meeting12:01
frenzyfridayafuscoar, hey, I have moved grafana to 8.3.11 here: http://10.0.111.235/?orgId=1 Lemme know if you find anything broken12:21
frenzyfridayhere is the patch, pls add to review list https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4495012:21
reviewbotI have added your review to the Review list12:21
jsanemethello12:22
jsanemetrlandy marios: could I please get a review on this change request?: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42775412:22
jsanemetit is the followup to abregman's one12:22
*** frenzyfriday is now known as frenzyfriday|lunch12:32
mariosjsanemet: i'll add it to my next reviews 12:34
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.12:34
mariosjsanemet: you can add things to our review list with the reviewbot here or directly at https://hackmd.io/FGMoCiRfSNa8puA1BpTQ-Q?edit FYI12:35
reviewbotI have added your review to the Review list12:35
jsanemetmarios: awesome, thank you very much12:38
mariosnp12:41
jm1marios: btw thanks for pointing out the difference between transient and intermittent failures in tripleo ci. feel free to explain terms when you feel i have a different/wrong understanding of terms, because that is probably true XD12:44
jm1rlandy: skiplist patch to help c9 master network component https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/85742212:45
rlandyjm1: thanks  - merging that12:45
jm1rlandy: btw ykarel has a patch up to fix this issue but it has not been merged yet https://bugs.launchpad.net/tripleo/+bug/1989197/comments/312:45
rlandyyep12:45
mariosthank you jm1 np and you were right about that retry error anyway 12:47
jm1rlandy: our c9 wallaby promotion has jobs that fail so i guess the whole promotion will fail? what do we do about that?12:48
rlandyfailed?12:49
* rlandy looks12:49
jm1rlandy: its still wip, just looking at openstack-periodic-integration-stable112:49
rlandyjm1: not sure what you mean12:50
rlandyopenstack-periodic-integration-stable1 has failures ack12:51
rlandylast wallaby promotion happened 2 hours ago12:52
rlandy its still wip - what is?12:52
jm1rlandy: ah the one in progress has a different hash. ok i got it.12:53
jm1rlandy: i was expecting to see something in the log but could not find anything about promotion of f6749f9cda54d017021ab56bf4ec0958 http://promoter.rdoproject.org/promoter_logs/centos9_wallaby.log12:57
rlandyjm1: http://promoter.rdoproject.org/promoter_logs/container-push/12:57
rlandyyou can see what go to container push there12:57
jm1rlandy: ack thx!12:59
rlandyhttp://promoter.rdoproject.org/promoter_logs/centos9_wallaby_2022-09-13T11:06.log13:00
rlandyjm1: ^^ shows you promoted 1 hash13:00
jm1rlandy: this was the logfile i was looking for. so it had been rotated already13:01
rlandyyep13:01
*** dasm|off is now known as dasm13:21
dasmo/13:21
*** frenzyfriday|lunch is now known as frenzyfriday13:26
pojadhavcommunity call : arxcruz, rlandy, marios, ysandeep, bhagyashris|ruck , svyas, soniya29, pojadhav, akahat, chandankumar, frenzy_friday, anbanerj,  dviroel, rcastillo, dasm, jm1|rover 13:29
pojadhavin 1 min13:29
pojadhavhttps://hackmd.io/MMg4WDbYSqOQUhU2Kj8zNg13:29
jm1rlandy: c9 master promotion is missing only sc10 internal kvm (failed on both internal and vexxhost), fs64 and fs35. so our usual suspects13:44
rlandyjm1: give it a few13:45
rlandyfs064 and fs035 running now13:45
rlandyfs035 internal passed13:45
jm1rlandy: both are running several times atm ;)13:45
rlandywill likely skip sc010 kvm13:45
rlandywill see after this meeting13:45
jm1rlandy: will have aoc mtg after community call, then have to go to city. but will be back in evening. rr doc is up to date.13:46
jm1rlandy: ah, rekicked c9 master network comp. job as well. running atm13:47
rlandyjm1: k - np - I'll take care of the promotion13:47
rlandyfs064 juts passed13:47
jm1rlandy: spraying with machine guns helped :D13:48
ibernalHello everyone, I have a quick question, if I want to ssh into a machine that is running a job in component pipeline, but my job is passing, can I still use autohold feature in zuul?14:11
ibernalOr do we have a keyword we can use in testproject to hold that machine?14:11
ibernalThanks!14:11
* jm1 hopping on a bike, cycling to tailer. bbl14:38
ysandeepibernal, zuul only hold a hold if job fails, but i think there is a var to force fail the job.. i will find the var and share with you14:41
ysandeepzuul only hold a job*14:41
ibernalawesome, thank you14:41
ysandeepibernal, force_job_failure: true 14:42
ibernaland I should use it on testproject or the job definition itself?14:42
ysandeepibernal, but you need to request someone from infra/ci team to add you node on hold and add your keys.14:42
ysandeepibernal, yes under job vars14:42
ibernalI requestec access earlier today so I can hold the job myself14:43
ibernalysandeep: thank you for the quick response14:44
dviroelrlandy: fips enabled image still failing to boot - I have a guess that is something related to vexxhost. Because we also have a problem there when we try to enable fips with the reboot process.14:52
dviroelrlandy: so I have an idea of trying on ibm cloud instead14:52
dviroelto see the results14:52
*** ykarel is now known as ykarel|afk14:54
ysandeeprlandy, pojadhav dasm since you are planning rr , just a headup that 06-12 nov, doug/me/chandan will be in next gen mtg 15:01
dasmysandeep: ack15:02
*** eliadcohen__ is now known as eliadcohen15:04
*** ykarel|afk is now known as ykarel15:10
dasmfrenzyfriday: o/ qq about elastic recheck. are you actively involved in that right now? can it be put on back burner for the next sprint?15:11
pojadhavysandeep, yep considered that already15:11
frenzyfridaydasm, yeah, i am working on ER when I get time. It is not on priority15:12
rlandypojadhav: marios: ok - rr schedule sorted15:14
pojadhavrlandy, great :)15:14
pojadhavthanks !!15:14
mariossomeone light the chimney with white smoke 15:15
pojadhavmarios, :D15:15
rlandypojadhav: dasm: pls ping the team to look at the rest of the year and check if they are on pto on any weeks assigned to them15:15
pojadhavrlandy, yes will ping15:15
rlandydasm: ok to remove the proposal section so it does not confuse anyone?15:16
dasmrlandy: removed the proposal section15:17
rlandyty15:18
dasmfrenzyfriday: ack. should we keep it in active sprint then? or can we move it to backlog? if it's not continuous, ongoing effort15:18
frenzyfridaydasm we can move it to backlog. 15:19
pojadhavhello all : arxcruz, rlandy, marios, ysandeep, bhagyashris|ruck , svyas, soniya29, pojadhav, akahat, chandankumar, frenzy_friday, anbanerj,  dviroel, rcastillo, dasm, jm1|rover15:19
pojadhavplease check RR schedule and switch based on your PTO plans.15:19
mariosthanks pojadhav will do 15:21
dviroelpojadhav: thanks15:22
bhagyashris|ruckpojadhav, ack15:23
rlandyjm1: marios: bhagyashris|ruck: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44964 Skip kvm internal job to promote master15:27
dviroeljm1: btw, internal kvm master still failing - https://sf.hosted.upshift.rdu2.redhat.com/logs/32/425432/11/check/periodic-tripleo-ci-centos-9-scenario010-kvm-internal-standalone-master-1/1125beb/logs/undercloud/var/log/extra/errors.txt15:31
dviroeljm1: "Skylake is not correct, or your host CPU arch does not support this model."15:31
chandankumarsee ya people!15:33
abregmanrlandy, dviroel, bhagyashris|ruck: can we merge this? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42775415:37
*** marios is now known as marios|out15:49
*** dviroel is now known as dviroel|lunch15:51
*** ysandeep is now known as ysandeep|out15:57
frenzyfridayhey folks, whet is the right way to push a patch to gerrit using git commands (not git review)? I tried something like  git push origin HEAD:refs/changes/65/44965/1 but that does not work16:27
dasmfrenzyfriday: "git commit --amend" to change the commit content and "git review"16:28
dasmit should update it16:28
frenzyfridayno, I mean using only usual git commands, not git review16:29
frenzyfridayI am trying to do it through python, using GitPython which I think supports only the usual git commands16:29
dasmi don't know about that one.16:29
dasmgit-review is a python package16:29
dasmhmm. it says it could work, frenzyfriday: https://gerrit-review.googlesource.com/Documentation/user-upload.html#_git_push16:32
frenzyfridayI cant figure out how do I get this part ssh://sshusername@hostname:29418/projectname from my python code :/16:33
*** jpena is now known as jpena|off16:35
frenzyfridayarxcruz, jm1 follow up on our auto testproject discussion: pls lemme know if you have some ideas on this when you get some time: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44966/1/ci-scripts/infra-setup/roles/rrcockpit/files/telegraf_py3/git_utils.py#1716:35
rlandyabregman: w+'ed16:36
rlandydviroel|lunch: you could give psi a shot16:37
rlandyibm cloud is fine as well16:37
dasmfrenzyfriday: wouldn't it be easier to run bash script to do that?16:37
dasmfrenzyfriday: i'd prefer avoiding more options to rr script16:38
frenzyfridaydasm yeah probably that will be a better option. But that means we need to parse the RR script's output and get the testproject again through the bash script whereas in rr script we alreay get it in a variable16:39
dasmfrenzyfriday: or we can dump rr output to a file. it should be straighforward with jinja templates16:40
dasmthen, bash script picks up the file and based on user's config -- sends it to review16:40
dasmfrenzyfriday: i don't know what was the initial discussion here. is it going to be automatic, or does it require manual intervention from the user?16:41
frenzyfridaydasm, it is the same discussion from the retro - "something" to automate the number of times we are running testproject to reckick failed jobs (nothing more concrete)16:43
dasmfrenzyfriday: that's a good starting point, but probably wrong direction.16:43
frenzyfridayI think your approach is better - we add one flag to RR script - which makes it write the testprojs to a temp file16:43
dasmfrenzyfriday: i would be more inclined towards gathering some stats about number of rechecks to see what is costing us the most16:43
dasmfrenzyfriday: we can always create a temp file which can be used, or ignored16:44
abregmancan we get review for scenario 010? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42802216:52
*** dviroel|lunch is now known as dviroel16:57
abregmanrlandy, dasm: ^18:24
rlandyyep - in a bit18:24
abregmantnx18:25
* jm1 3-year-old little girl comes to her father holding a floppy disk in her hand. She says: “Daddy, Daddy, somebody 3D-printed the save icon.” 😆 https://eyeondesign.aiga.org/we-spoke-with-the-last-person-standing-in-the-floppy-disk-business/18:27
rlandybhagyashris|ruck' hi ... https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/428003 - reviewing that18:30
rlandycan you comment of we need the args?18:30
jm1dviroel: which cpu module are you testing with? "Skylake"? how about the generic ones i linked yesterday?18:32
jm1rlandy: c9 master has promoted, fs35 passed in one of my reruns18:34
rlandyjm1: ack - only skipped kvm job18:36
rlandyhere is the revert on that: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4475818:36
rlandyI'm rechecking the network jobs18:36
dviroeljm1: i can try that too - i was not expecting Skylake failing on Skylake18:37
jm1rlandy, bhagyashris|ruck, pojadhav, dasm and team: can we please postpone the rr shifts discussion to our retro or planning meeting? having rlandy as a ruck or rover is ridiculous18:37
rlandyjm1: we just spent a whole hour on that18:38
jm1dviroel: actually to me it looks like this cpu models in l2 does not work at all. but we know for sure after you have tested the generic profile18:39
rlandypojadhav: pls bring your victoria tear down patches to wed review meeting18:39
rlandyI would think our easiest path here is to get PSI to assign us the right nodes18:40
dviroeljm1: ack - will try that in a bit18:41
jm1dviroel: and btw, thanks for testing all that and retriggering the job thousands of times. i know it sucks and i am really glad you help us with that!18:41
dviroeljm1: np, I know that rrs have lot of things to do, so at least I can help with something18:42
jm1rlandy: i rekicked c9 master network as well, just to be increase the chance that one of those jobs passes ;)18:43
rlandyjm1: need to see why those are failing rather than just a random rekick18:51
jm1rlandy: ^ is why i had to create another rr doc. failure reasons in rr docs take a looot of space18:53
rlandyI get it18:54
rlandyjm1: https://hackmd.io/ulFAL5DBRoGQB6eUuOpC1Q18:54
rlandygoing to try get some order to the madness18:54
rlandythis can't carry on18:54
rlandyI know it's bad18:54
jm1rlandy: for fs35 and fs64 i check the logs 2-3 times and if it is always an intermittent/transient failure, then i scale up the number of "parallel" jobs and do not look at each logs any more. it just takes too much time. i would still be checking the errors of todays runs. how many reruns did i do today? 50?18:56
jm1rlandy: but usually i would not do that. i did it because we wanted to get promotions ;)18:57
rlandyjm1: it's ok18:59
rlandyyou have the promos you need18:59
rlandyyou can take a day off that18:59
rlandyI would like to start splitting where we are running jobs18:59
rlandyjm1: for fs03519:00
rlandyif it ran and passed on internal19:00
rlandyok to ski[19:00
rlandyskip19:00
rlandydasm: dviroel: rcastillo: review time?19:02
jm1rlandy: but is moving jobs etc. the right way? to me it looks as if tripleo or rhos in general is very sensitive to load or latency (cpu? network? disk?) of the underlying system. we need help from dfgs to make rdo/rhos more robust.19:02
rlandyjm1: on review time  - sec19:03
rlandyfeel free to add notes to hackmd19:03
jm1rlandy: ack19:03
rlandydasm: doing your reviews + docker compose updates19:15
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.19:15
rlandydasm: can you review pls: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4495019:16
jm1rlandy: ovb status hackmd is now in ruck_rover section19:17
jm1rlandy: also added some more content to the intro section in case we want to show it to someone outside our team19:18
abregmanadd to review list https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42828619:22
reviewbotI could not add the review to Review List19:22
abregman:(19:26
rlandydasm: hey - when you are back we can look at your series of patches19:30
dasmrlandy: sorry, i had an emergency. everything is fine, but i was afk19:40
rlandydasm: np - we were just struggling to understand your reviews19:47
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.19:47
dasmack19:48
abregmancan we merge this one? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42828619:48
rlandydasm: we should merge these if they help19:52
dasmrlandy: the intend is to lower number of queries to zuul. they *should* help. similar to our 3 or 4 other attempts from th past19:54
rlandydasm: can we test that?19:55
rlandyprove it works?19:55
rlandywon't break anything else?19:55
rlandyrr script is high impact19:55
dasmrlandy: i did diff comparison on cs9 master branch for integration and all components (influx)19:56
dasmthere is no difference. i can provide it again19:56
jm1rlandy: looks like nothing is urgent on rr front for now. c9 wallaby promoted today, c9 master promoted today and it took only a liiiiiiitle bit of cheating. ok and thousands of reruns.. anyway. only c9 master network is way out but this is in rerun. good results for a single rr day 😆 with that i can sleep well tonight. eod now19:56
rlandyjm1: thanks - have a good night19:56
dasmrlandy: also, there are unittests in place to ensure backcompat19:56
jm1rlandy: thanks to you :D19:56
dasmjm1: take care, good night19:57
* jm1 out for today, have a good night #oooq19:57
rcastilloo/ jm1 19:57
rlandydasm: ok - so what's thebets way here: you want us to get back on a call?19:57
dasmrlandy: if you need additional explanations we can talk f2f19:57
rlandydviroel: rcastillo: ^^?19:58
dasmif there are some missing pieces which can be left as a review, i can address those too19:58
rlandydasm: yeah - let's see if we can get the others back19:59
rlandyso your reviews don't lag19:59
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.19:59
dasmk, i'm back19:59
rcastilloI'm going through the script changes atm19:59
rlandyok19:59
rlandyrcastillo++19:59
rcastillodasm: I'll let you know if I have any questions19:59
rlandydasm: ok - so ping if you don't see good reviews/movement there19:59
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.19:59
dasmack19:59
rlandydasm: also -can your review ananya's patch on grafana?20:00
rlandyyou would know that better20:00
dasmsure20:00
rlandyty20:00
*** dviroel is now known as dviroel|brb20:10
abregmanadd to review list https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42781620:14
reviewbotI could not add the review to Review List20:14
abregman:'(20:14
rlandyabregman: re: https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/427816 - pls follow the same order as 1720:19
abregmanrlandy: done20:21
rcastillodasm: patches LGTM, see comments about tests20:43
dasmrcastillo: ack, thx.20:44
dasmrcastillo: i'm gonna respond to you on the patch, but you asked about track_component and track_integration inside "class TestInfluxDBMeasurements(unittest.TestCase):"20:45
dasmthere is no influx for them anymore20:46
arxcruzreviewbot please add https://review.opendev.org/c/openstack/tripleo-quickstart/+/839725 to review20:46
arxcruzreviewbot help20:46
rcastilloright but there's no test for the non-influx case20:46
arxcruzi never know the damn pattern 20:46
rcastilloafaict20:46
dasmrcastillo: because i never wrote it20:46
dasmrcastillo: i can't test rendering tables20:47
arxcruzreviewbot add to review list https://review.opendev.org/c/openstack/tripleo-quickstart/+/83972520:47
reviewbotI could not add the review to Review List20:47
arxcruz...20:47
rcastillodasm: ah, ack20:47
rcastillooh well20:47
dasmrcastillo: i'm slowly working towards smaller, chunk-sized functions, so I'm gonna be able (hopefully) to write some tests20:47
rcastilloyeah, I could see that intention in the patches20:48
dasmfor now, we're just using manual testing (by "we" i mean ysandeep|out and myself) to compare outputs20:48
rcastilloyeah it's ok since that path is just used by us locally20:48
dasmrcastillo: i left similar note on the review, to keep the history in one place20:50
dasmrcastillo: thank you for taking time to review those20:50
rcastillothanks for giving some attention to the script20:51
dasmrcastillo: otherwise dpawlik would come back to us more often, saying we're killing zuul :)20:51
dasm(actually, it might be just me ;P )20:52
rlandyabregman: thanks - voted20:57
rlandydviroel|brb: need help with the fips image?21:03
rlandywaiting for network tests - back later21:27
*** rlandy is now known as rlandy|bbl21:27
* dasm leaves22:28
*** dasm is now known as dasm|off22:28
dasm|offo/22:28
* rcastillo out as well22:28
rcastilloo/22:28
*** dasm|off is now known as Guest30523:03
*** rlandy|bbl is now known as rlandy23:27

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!