Monday, 2022-09-12

*** ysandeep|out is now known as ysandeep04:36
ysandeepgood morning team o/ 04:40
akahatGood morning o/04:43
ysandeepakahat: hey Amol o/ I heard about heavy rainfall near Pune, How is everything near your place?04:44
akahatysandeep, hey.. it's raining a lot.. rivers are flooded. 04:50
akahatweather forecast is showing heavy rain for few more days.04:50
ysandeepFor few more day - okay, stay safe and enjoy the weather mate :)04:51
akahatysandeep, you might be missing monsoon trips around Pune. :P04:57
ysandeepYes missssssing it a lot, I once went to Tamani ghat and we also went to hidden waterfall in Monsoon, It was an awesome experience.04:59
ysandeepakahat, also missing bike trip to mahabaleswar and lonavala :D05:03
ysandeepakahat, you got a chance to visit any nearby mountains in this season?05:03
akahatysandeep, no.. not this time.. 05:04
akahatysandeep, come back to Pune05:04
akahatEvery weekend we will go for ride. :D05:04
ysandeepthanks mate, I can definitely visit for a short trip for a week :D05:07
tonybrlandy, frenzyfriday, marios: I think I have updated https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/427518 Add 17.1 standalone jobs to promotion criteria correctly.05:48
tonybrlandy, frenzyfriday, marios: Sorry for missing that part.  I'll update our (CRE team) internal docs to include the missing step.05:50
mariostonyb: o/ will check thanks05:53
mariostonyb: not your fault it was really ours for not spotting it ;) 05:53
tonybI'll go though the open chnages from our team and try to address feedback and update the pipeline similarly05:54
*** ysandeep is now known as ysandeep|afk05:54
abregmanhey everyone06:00
abregmancan anyone tell me what's the issue here? https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/build/d6f8439050ab4cc6b75aa906f652e74e06:00
mariosabregman: looks like tempest issu e06:02
mariosi mean tempest test fail06:02
mariosabregman: that https://sf.hosted.upshift.rdu2.redhat.com/logs/86/427986/3/check/periodic-tripleo-ci-rhel-9-scenario004-standalone-glance-rhos-17.1/d6f8439/logs/undercloud/var/log/tempest/stestr_results.html 06:02
mariosabregman: for future reference i found that in 2 steps. 1. open job-output.txt search for "failed: 1" (https://sf.hosted.upshift.rdu2.redhat.com/logs/86/427986/3/check/periodic-tripleo-ci-rhel-9-scenario004-standalone-glance-rhos-17.1/d6f8439/job-output.txt)06:04
mariosabregman: from there you learn which file you need for step2 (in this case tempest so i knew to go look in /var/log/tempest )06:04
abregmanoh k, I saw the failed: 1 but totally missed the "TASK [os_tempest : Fail if tempest tests did not succeed]"06:12
abregmanthanks!06:12
abregmanmarios: in such case where a test fails, what do we do exactly with this change? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42771006:13
abregmanmarios: to clarify that job executed as part of testing the change 42771006:14
mariosabregman: yeah you have to scroll up from failed: 1 i forgot to say that 06:16
mariosabregman: so "what to do" 1. could be a legit issue, so we file a bug and track it as CIX (is it legit? is it consistently failing on this in consecutive executions?)06:17
mariosabregman: and 2. check with ruck|rover if this is a known issue. for that start with this irc channel's topic: tripleo-ci || rr status: https://hackmd.io/2hB-P772SqyqDs0KKZzZEQ?view06:18
abregmanmarios: got it. already executed again. so if it fails once more with that test failure, I'll open a bug and escalate it06:18
mariosabregman: yeah but also check if there is an existing bug/known issue for this. for example the ruck|rover may decide to skip a test or some other temp workaround that may unblock you 06:18
abregmanmarios: sure, will do. thanks06:19
mariosabregman: dont see something in the current notes (there https://hackmd.io/s4TgnCY-QQGKv2ONxTjOZA)06:19
mariosabregman: if you do file a new bug please alert the ruck|rover about it ( bhagyashris|ruck and jm1[m] )06:20
abregmanmarios: should I ask Bhagyashris or Jakob? (or both?)06:20
mariosabregman: usually you tell the 'ruck' about it06:20
mariosabregman: but either if one may be away/different timezone etc06:20
abregmangot it. thanks again06:20
abregmanmarios++06:20
mariosnp abregman 06:20
abregmanno karma bot I guess06:20
marios:) appreciate it all the same 06:21
*** ysandeep|afk is now known as ysandeep06:30
abregmanadded the procedure of what we discussed here https://docs.engineering.redhat.com/display/PRODCHAIN/Component+pipeline+%5B17.1%5D+standup+Notes#Componentpipeline[17.1]standupNotes-Unabletoverifytestprojectchangetestproject_verification06:35
*** jm1|ruck is now known as jm1|rover06:45
jm1moin #oooq06:45
* bhagyashris|ruck lunch06:49
jm1abregman: o/ a quick way to check what has failed is to go to the logs/ subdirectory and watch out for files such as "_Tempest_tests_FAILED.log" https://sf.hosted.upshift.rdu2.redhat.com/logs/86/427986/3/check/periodic-tripleo-ci-rhel-9-scenario004-standalone-glance-rhos-17.1/d6f8439/logs/06:55
jm1abregman: sometimes it says something like "no failure reason found" but often you get an idea where to look next06:55
jm1abregman: you will appreciate this micro optimization when you have to check dozens of logs a day ;)06:56
marios\o good morning jm1|rover 06:57
jm1marios: o/06:58
*** jpena|off is now known as jpena07:10
tonybjm1: nice.07:12
tonybSo we have a few changes that will confliuct with each other as the all edit the pipeline file.  Apart from building a review chain is there a good way to make the review/merge process easier?07:13
tonybsee: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/427727 "Merge Conflicts" for example07:14
jm1|rovermarios: ^07:26
jm1marios: pls :)07:27
abregmanjm1: good tip. I'll add it. thanks!07:28
mariostonyb: not aware of another way than to create a chain/rebase07:29
tonybmarios: Okay.07:30
abregmanjm1, bhagyashris|ruck: is there a bug for these issue with the OSP 17.0 job? https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/build/23752a8efa844e99afdbf084453bb428/log/job-output.txt07:39
abregmanperhaps related to what I see in the 17.1 job although different failure07:40
abregmanjm1, bhagyashris|ruck: if not, then I guess I'll open a bug for this issue https://sf.hosted.upshift.rdu2.redhat.com/logs/86/427986/3/check/periodic-tripleo-ci-rhel-9-scenario004-standalone-glance-rhos-17.1/d6f8439/logs/undercloud/var/log/tempest/stestr_results.html07:40
bhagyashris|ruckabregman, hey let me check07:40
mariosjm1: bhagyashris|ruck: let me know if you want to discuss anything or need any help08:00
jm1marios: ack. still walking wading through todays fallout08:01
mariosbhagyashris|ruck: jm1: gentle reminder that we need to have an update for latest status on all active cix cards for the call this afternoon 08:02
bhagyashris|ruckabregman, hey is this consistent issue ^ i dont see it's failing consistently https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/builds?job_name=periodic-tripleo-ci-rhel-9-scenario004-standalone-glance-rhos-17&skip=0 but will hit testproject and will verify it 08:03
abregmanbhagyashris|ruck: should I open a bug for this one? (executed twice) https://sf.hosted.upshift.rdu2.redhat.com/logs/86/427986/3/check/periodic-tripleo-ci-rhel-9-scenario004-standalone-glance-rhos-17.1/d6f8439/logs/undercloud/var/log/tempest/stestr_results.html08:07
jm1marios: updated cix cards, will need some input from rlandy on some cards08:32
bhagyashris|ruckabregman, yes you can08:39
bhagyashris|rucki will also check at my end08:40
bhagyashris|ruckmarios, ack08:40
*** amoralej is now known as amoralej|afk08:51
* jm1 mtg 1h08:58
frenzyfridayjm1, pojadhav Hey, have you seen this error while setting up the cockpit manually? ERROR: Missing mandatory value for "environment" option interpolating 09:51
frenzyfridayI am trying to change the influxdb container version to 1.8 on the development server which does not have ansible pull)09:52
frenzyfridayit works if I export the missing env values, but I didnt see this error the last time09:56
* marios food biab10:08
rlandyjm1: bhagyashris|ruck: hey - I'm around if you need anything10:31
bhagyashris|ruckrlandy, ack10:32
bhagyashris|ruckchasing 16.2 promotion rest are good ...10:32
bhagyashris|rucksc010 is away for 16.2 re-running that 10:33
*** ysandeep is now known as ysandeep|lunch10:34
rlandytonyb: thanks for the update - merging https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/427518 - marios +2'ed10:36
rlandybhagyashris|ruck: nice10:39
rlandybhagyashris|ruck: components ok?10:40
bhagyashris|ruckchecking10:40
bhagyashris|ruckon that10:42
rlandythanks10:43
rlandychandankumar: jm1: bhagyashris|ruck: I see some passes here: https://review.rdoproject.org/zuul/builds?job_name=tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001&skip=0 :)10:44
rlandydid we merge a patch to fix?10:44
rlandyare there further fixes to stabilize check?10:44
chandankumarrlandy: nope, promotion and daniel kernel update in image fixed it10:44
rlandydpawlik++10:45
rlandythank you for that10:45
chandankumarrlandy:  we still need this https://review.opendev.org/c/openstack/tripleo-quickstart/+/856603 to avoid it10:46
dpawlikhey, soon we will move from our customize image to upstream image10:46
dpawliktoday I will try to push more that topic10:47
rlandychandankumar: dpawlik: ok - I added https://review.opendev.org/c/openstack/tripleo-quickstart/+/856603 to our review list10:47
rlandyand +2'ed it10:48
rlandydpawlik: thanks - that should help our stats on OVB check10:48
rlandyTengu: hi - will miss DF call again today ... pls tell the DF about ^^10:49
rlandywe are seeing some passes on OVB check now10:49
jm1rlandy: o/ had a long mtg, will have lunch now, then we can sync10:49
rlandyalso ... Tengu pls remind DF members to vote for TC10:49
bhagyashris|ruck16.2 promoted10:49
rlandyjm1: sure ... pls ping bhagyashris|ruck and me when ready10:49
rlandyjm1: bhagyashris|ruck: we just need to run through CIX10:51
rlandybhagyashris|ruck: thanks for taking care of https://trello.com/c/8tGYExhe/2603-cixlp1980255tripleociproa-tripleo-ci-centos-9-standalone-and-multinode-ipa-are-failing-the-testminimumbasicinstancehardrebootaft - can you post your results when you have them?10:57
bhagyashris|ruckrlandy, sure10:58
Tengushall we +W that oooq patch, rlandy ?11:04
Tenguto me, it makes perfectly sense to NOT update the OC image kernel - only the tripleo packages.11:05
rlandyTengu: I think so - added it to today's review list to see if anyone else can spot an issue with doing that  - if not - we will w+ shortly11:06
TenguI +2 it11:06
rlandythanks11:14
Tenguysandeep|lunch: I'll add the nftables switch topic for tomorrow's CI community call.11:16
rlandyTengu: we have Robert coming to tomorrow's call to discuss our scrum methodologies11:17
Tengurlandy: meh...11:17
rlandycan we move you to next week?11:17
Tenguwe're ready to switch (missing one or 2 patches), and I have a slot in next week all-hand to present the nftables thingy11:17
Tenguwhile we can wait a bit, it would be nice to get things as close as possible to "it's switched"11:18
rlandyTengu: maybe you can come to today's scrum?11:18
Tengusure? when is it? is there a hackmd as well?11:19
rlandy1:30 pm UTC11:19
rlandypojadhav: ^^11:19
TenguI don't see it in my calendar - care to invite me?11:19
Tenguand I'll have to jump on the DF call at 2pm UTC. not a big deal, topic shouldn't take too long anyway :)11:20
jm1frenzyfriday: cockpit is expected to fail when environment variables are not defined, please refer to https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4468011:39
jm1rlandy, bhagyashris|ruck: ready for sync11:39
rlandysec - still on review time11:39
rlandywill ping11:39
chandankumarrlandy: pojadhav our today's scrum collides with Product Engineering open office hours11:45
*** dviroel_ is now known as dviroel11:45
chandankumarif we finish it in 30 mins then there is no overlap11:45
rlandychandankumar: pojadhav; yeah - we can try be quick11:45
rlandywe did scrum on thurs11:45
rlandyso we can do Tengu's topic and blockers only11:46
rlandyTengu: sent you meeting invite11:46
Tenguthanks!11:48
rlandyjm1: bhagyashris|ruck: can sync now if you want11:52
rlandyjm1: bhagyashris|ruck: https://meet.google.com/hvc-qpjh-hna?pli=1&authuser=0 - when ready11:53
rlandywe should run though CIX11:54
*** amoralej is now known as amoralej|lunch12:18
rcastilloo/ happy monday all12:29
rlandyarxcruz: chandankumar: added revert https://review.rdoproject.org/r/c/rdo-jobs/+/44757 per bug https://bugs.launchpad.net/tripleo/+bug/198881012:38
rlandybhagyashris|ruck: https://review.opendev.org/c/openstack/tripleo-quickstart/+/85660312:38
*** ysandeep|lunch is now known as ysandeep12:44
*** amoralej|lunch is now known as amoralej12:51
abregmancan anyone send me an invite to tripleo reviews meeting?12:57
mariosabregman: sent13:12
jm1marios: looks like this one merged a couple of minutes too early XD https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/4487213:12
abregmanmarios: thanks :)13:13
mariosjm1: yes :D :(13:13
rlandymarios: jm1: let's revert that tomorrow if needed13:15
rlandyline promoted 20 hours ago13:15
mariosjm1: rlandy: i am posting the criteria removal now sec 13:16
jm1rlandy, marios: 20 hours ago... we have plenty of time! lets wait for rdo folks to get ovs updated13:17
mariosjm1: rlandy: will add the info on the bug and then up to you rlandy .. you can at least get the gate clear today once zuul reports workflow https://review.opendev.org/c/openstack/tripleo-ci/+/85714213:18
jm1rlandy, marios: ah saw your decision on #rdo13:18
mariosjm1: yea added the info in comment#1 on the bug 13:18
mariosjm1: i mean about the 'plan' 13:18
mariosadded now the patches 13:18
jm1marios: yep, thank you :)13:19
rlandy https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44885 - so we're merge or not?13:21
rlandyboth 8 and 9 criteria there13:21
rlandyand we need a 9 promo13:22
rlandyjm1; marios: ^^13:22
dpawlikchandankumar, ysandeep: hey, may I ask you one think: some of the settings https://github.com/openstack/tripleo-ansible/blob/master/tripleo_ansible/roles/tripleo_kernel/vars/main.yml are decreased a lot, comparing to example values set in https://www.rabbitmq.com/networking.html#dealing-with-high-connection-churn-time-wait , so don't you have many13:23
dpawlik"errors" in the rabbitmq logs or nova conductor?13:23
mariosrlandy: yeah so let me check if it hits both (should do since we have bump on c9 should hit both lines) but yes we do want to  merge those per our plan https://bugs.launchpad.net/tripleo/+bug/1989341/comments/113:28
ysandeepdpawlik, decreased recently or in general?13:29
dpawlikgeneral13:30
rlandyok - will vote13:30
dpawlikysandeep: as I see last time change was done some time ago13:30
ysandeepdpawlik, need to check some ci jobs result, I will get back to you(In a mtg)13:31
*** dasm|off is now known as dasm13:33
dasmo/13:33
dasmfrenzyfriday: o/ i'm trying to understand what is done and what needs to be done wrt our infra. If you can go to our's board backlog, you might see a few infra named tasks.13:49
dasmfrenzyfriday: I'm not sure about how much is done, but I'd like to use your knowledge to see if i'm missing something13:50
dasmfrenzyfriday: do you have a few minutes to sync up?13:50
frenzyfridaydasm, ack, lemme check13:50
frenzyfridayyep sure13:50
mariosrlandy: re 'is it in both 8 and 9' for the mixed os thing - yes both https://bugs.launchpad.net/tripleo/+bug/1989341/comments/4 13:54
chandankumarjm1: bhagyashris|ruck good train hash 489d88a4b22eb070acc39b218844ac82 & buildset: https://review.rdoproject.org/zuul/buildset/34a8704d9fb3424ea756be801bc17c3b14:11
chandankumarfs039 and full-tempest-api are failing14:11
chandankumarwhile running the testproject on ibmcloud, please include these vars : https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/integration-pipeline-wallaby.yaml14:12
dasmjm1: thanks for your comment on trello board wrt ouathlib releas.14:13
dasm*release14:13
rlandymarios: voted on https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/44885 - you can w+ when ready14:17
mariosrlandy: ack thanks it can go but may as well hold that till tomorrow morning maybe will be avoided... but gate one can go in as soon as ready i think14:24
* jm1 having a longer break now15:05
bhagyashris|ruckrlandy, fyi jenkins component 17.1 on rhel9  jobs trigger...15:06
bhagyashris|ruckhopefully it will pass ...15:06
bhagyashris|ruckand rest of the stuff updated on hackmd 15:07
bhagyashris|ruckleaving for the day15:07
jm1bhagyashris|ruck: have a nice evening o/15:07
bhagyashris|ruckjm1, thanks see you tomorrow 15:08
chandankumarsee ya people!15:08
rlandybhagyashris|ruck: thank you15:12
rlandychandankumar: have a good night15:12
*** ysandeep is now known as ysandeep|out15:16
*** ysandeep|out is now known as ysandeep15:16
ysandeepdpawlik, error grep from green upstream ci standalone job - https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_49e/853860/15/check/tripleo-ci-centos-9-standalone/49eaf6d/logs/undercloud/var/log/extra/errors.txt15:18
ysandeepseeing nothing rabbit related from quick look15:18
ysandeepits getting late for me, but I can discuss more in my morning15:20
*** ysandeep is now known as ysandeep|out15:20
dpawlikysandeep|out: that's right. Nothing interesting in https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_49e/853860/15/check/tripleo-ci-centos-9-standalone/49eaf6d/logs/undercloud/var/log/containers/nova/nova-conductor.log15:20
dpawlikwhere on IBM02 there are many of rabbit issues -,-15:21
ysandeep|outdpawlik, maybe worth pinging someone from pidone15:21
ysandeep|outmay be damien/eck from pidone15:22
dpawlikfrom time to time it disconnects and then reconnects - that's fine, but when I have "s unreachable: Server unexpectedly closed connection. Trying again in 1 seconds.: OSError: Server unexpectedly closed connection"15:22
dpawlik"A recoverable connection/channel error occurred, trying to reconnect: [Errno 104] Connection reset by peer"15:22
dpawliktomorrow15:22
dpawlikthanks ysandeep|out!15:22
ysandeep|outo/ let's continue tomorrow 15:22
* ysandeep|out out15:22
rlandyjm1: hey - what can I help with?15:29
mariosrlandy: jm1: i set workflow on that rlandy so you may want to keep an eye to get it through gate 15:31
mariosrlandy: 'that' https://review.opendev.org/c/openstack/tripleo-ci/+/857142/1#message-41d7487dcb1b979f906120f51fb066c944d73e76 :)15:31
rlandyk - will do15:32
rlandyif that clears, we don;t need the opendev email15:32
rlandyarxcruz: hey - have some time now - want to move your 1-1 up?15:36
arxcruzrlandy sure 15:37
arxcruzjoining 15:37
rlandyjoined15:37
rlandyarxcruz: ^^15:38
arxcruzrl15:38
arxcruzrlandy authenticating, one sec15:38
rlandyk15:38
*** marios is now known as marios|out15:47
*** dviroel is now known as dviroel|lunch15:55
rlandylunch - brb16:09
*** jpena|off is now known as jpena16:09
*** jpena is now known as jpena|off16:10
rlandyrekicked https://review.rdoproject.org/r/c/testproject/+/44661 for wallab c916:39
rlandycool - train only out of fs039 - rerun16:40
abregmanrlandy, arxcruz, chandankumar: is there anything else we need to do here or can we merge it? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42770816:49
*** dviroel|lunch is now known as dviroel16:52
*** amoralej is now known as amoralej|off16:52
rlandyabregman: looking17:19
rlandyabregman: I linked the testroject to the review17:21
rlandyhttps://code.engineering.redhat.com/gerrit/c/testproject/+/42770917:21
rlandyextra-vars @/home/zuul/workspace/.quickstart/config/release/tripleo-ci/RedHat-9/rhos-17.1.yml git right file17:24
rlandydviroel: I +2'ed https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/427708 - can you review?17:25
rlandydasm: ^^17:25
rlandythen we can merge for abregman 17:25
* dviroel looks17:26
dviroelrlandy: reviewed17:31
rlandyty17:33
dasmrlandy: i can't review. kerberos is playing games with me: "not found in Kerberos database while getting initial credentials"17:34
dasmprobably i need to restart pc17:34
rlandydasm: bhagyashris|ruck reported some issues today17:34
rlandydviroel reviewed so we are ok for now17:34
abregmanrlandy, dasm: ty, can we merge also this https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/427727?17:50
dasmabregman: quick check shows merge conflict. dviroel can you check that too? i still can't login17:52
rlandygoing to need a rebase17:53
rlandyabregman: also - we'll need a criteria patch for sc00117:53
abregmanI'll rebase it now17:53
dviroelyep, needs rebase since scn001 just merged17:54
abregmanrebased but let's see if the gates pass17:59
abregmanrlandy, dviroel, dasm: criteria patch for sc001: https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42772918:01
rlandyopenstack-promote-component running again now downstream18:01
rlandyhoping it will pick up new 17.118:01
rlandyabregman: thanks - will merge after current promot ejobs runs18:01
rlandyI want to see 17.1 components promote first18:01
rlandythey were missing the jenkins jobs links18:02
abregmansure18:02
rcastillolunch, brb 18:07
abregmanrlandy, dviroel, dasm: rebased sc002: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772718:13
abregmantest project change: https://code.engineering.redhat.com/gerrit/c/testproject/+/42772518:13
abregmanrlandy, dviroel, dasm: the the criteria patch for sc002: https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42787718:15
rlandyabregman: thanks18:17
rlandy14 minutes agobaremetalpromoted-components3 days ago6b374ec1359019e602c296db57c728e418:17
rlandy13 minutes agouipromoted-components2 days ago5f26c39a9e57e972904b976f29f9d2d718:17
rlandy13 minutes agotempestpromoted-components5 days ago92436dd5bd89e8fa1364e8644e757a2e18:17
rlandy13 minutes agovalidationpromoted-components5 days ago76cd2c2752da8817c0f976961006bf4b18:17
rlandy13 minutes agocomputepromoted-components2 days ago2febf07d2f0e2b528da4fb390826324f18:17
rlandygood 18:17
rlandylooking better18:18
rlandywaiting for common18:18
rlandythen we can add more criteria18:18
abregmanrlandy: a question on sc004 - we know the sc004 jobs fail due to ceratin bug (which was escalated as a CIX), should we wait for the CIX/bug to be resolved before we merge it? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42771018:18
rlandyit's ok to merge job definitions and add the job18:19
rlandyjust not criteria 18:19
abregmanack18:20
abregmank, once we merge sc002, I'll rebase it18:20
rlandyok - looking at criteria now18:29
rlandyabregman: left one comment18:34
rlandyhttps://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42772918:34
rlandycan fix that in a later patch if you want18:34
abregmanrlandy: fixed18:48
jm1rlandy: i bet we have to cheat again to get a c9 master promotion18:59
rlandyisk yet18:59
rlandyidk yer18:59
rlandyugh - yet18:59
jm1rlandy: ^^19:00
jm1rlandy: i updated and rerun all failed jobs once again (except for the ones you reran already)19:00
jm1rlandy: rr notes has been updated19:00
rlandyjm1: thanks - it would be a lot to skip 19:00
jm1rlandy: jobs failed on either known bugs or intermittent failures19:01
jm1rlandy: c9 wallaby tripleo component should promote19:01
rlandyjm1: also - there is a master run going on now19:01
jm1rlandy: c9 wallaby network component is the oldest one and is still failing on intermittent errors19:01
rlandymaybe that one will be better19:01
rlandyjm1: so that one we may want to skip promote19:02
jm1rlandy: you can give it a couple of rechecks19:02
rlandyjm1: ok19:02
jm1rlandy: its the only job left for c9 wallaby components19:03
rlandyI'll leave you notes for tomorrow19:03
rlandyyou can decide that when you get in19:03
* rlandy looks at component19:03
rlandyperiodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-network-wallaby 19:03
rlandyonly job missing19:03
jm1rlandy: yeah this one is in rerun right now19:04
rlandymaybe we try with depends-on chandan's patcj19:04
rlandyI see it's in deploy now19:04
rlandyok - I'll watch it19:04
jm1rlandy: last time it failed on tempest19:04
* rlandy checks that19:04
jm1rlandy: link in rr notes ;)19:04
jm1rlandy: https://logserver.rdoproject.org/58/44658/6/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-network-wallaby/8ada848/logs/undercloud/var/log/tempest/stestr_results.html.gz19:04
rlandyhttps://logserver.rdoproject.org/openstack-component-network/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-network-wallaby/448033a/logs/undercloud/var/log/ - deploy failure - from line19:05
jm1rlandy: previous error reasons for that job are also listed in rr notes ;)19:06
rlandyhttps://logserver.rdoproject.org/58/44658/6/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-network-wallaby/8ada848/logs/undercloud/var/log/tempest/stestr_results.html.gz - yep - ok - this one is tempest19:06
rlandyjm1: late fr you - I'll watch it19:06
jm1rlandy: we should invest some time and write a auto-rerun script. i burned most of my day with rerunning jobs and wading through intermittent errors. 19:09
jm1rlandy: did not even have time to watch robert's scrum literature19:09
rlandyjm1: that is what frenzyfriday's elastic recheck is suppsed to do19:11
rlandycatch repeated error traces19:11
rlandyI know OVB is a time sync19:11
rlandyI know we need to do something about it19:11
jm1rlandy: can we please put that on top of our scrum board, highest prio? :D19:11
rlandyon whose time though?19:12
rlandydasm is slammed with infra now19:12
dasm-?19:12
rlandyrcastillo and you and stuck in collections19:12
jm1rlandy: you said frenzyfriday is working on elastic recheck, so she is already on it19:12
rlandyysandeep|out, chandankumar anf dviroel are on next gen19:12
dasmcurrently i've changed the way how we're querying zuul to spare some infra resources19:12
rlandyarxcruz is busy with tempest19:12
rlandyso it goes19:13
dasmi might look into auto-rechecks soon19:13
rlandydasm: don;t want to derail you from fixing infra19:13
dasmfrenzyfriday did a great head start on that19:13
dasmrlandy: it's not gonna be one-time thing. infra is gonna be ongoing effort, which will take next few monshr19:14
dasm*months19:14
rlandycorrect19:14
rlandyso derailing it will delay stuff19:14
jm1rlandy: okeeee, will try to hack something when i get some time19:16
jm1dviroel:  still online?19:16
frenzyfridayjm1, rlandy the graph is broken but the bot should work fine. Try adding a query to https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/85350519:16
dviroeljm1: yes, sup19:17
frenzyfridaythe erbot should comment on the patches linking the existing bug and asking people to recheck19:17
jm1dviroel: oh great :) regarding this cpu issue.. what do we do about it? we are still facing it a lot19:17
jm1dviroel: cant we edit your patch and choose the lowest possible qemu cpu model? then simply merge it and hope for the best?19:18
dviroeljm1: yeah, I see your comment, this is not happening on the job that I was testing (16-2) - but we can try with those failing on master19:19
dviroeljm1: master job does not consume bits from internal repos, we need to create a new upstream change to test it19:19
dviroel jm1: I can do that19:19
jm1dviroel: that would be awesome19:21
dviroeli will give a try19:21
jm1dviroel: somewhere i saw a cpu model qemu64-x86_64-cpu19:21
jm1dviroel: qemu64-x86_64-cpu is deprecated but it reads as if it is very generic19:21
rlandy  [error] RuntimeError: Certificate issuance failed (CA_UNREACHABLE: Error 7 connecting to http://ipa.ooo.test:8080/ca/ee/ca//profileSubmit: Couldn't connect to server.)19:21
rlandyCertificate issuance failed (CA_UNREACHABLE: Error 7 connecting to http://ipa.ooo.test:8080/ca/ee/ca//profileSubmit: Couldn't connect to server.)19:21
rlandyThe ipa-server-install command failed. See /var/log/ipaserver-install.log for more information19:21
rlandy^^ real issue of fs03919:22
jm1rlandy: fs39 c9 what?19:22
rlandymaster atm19:22
rlandyonly one of those errors19:23
rlandyso ignore 19:23
rlandywill see if it repeats19:23
jm1rlandy: i bet its intermittent. someone wants to place another bet? XD19:24
jm1dviroel: oh wait, there is qemu64: https://qemu.readthedocs.io/en/latest/system/qemu-cpu-models.html19:25
rlandyjm1: no - because you will win19:25
jm1rlandy: yeah this is an easy one ;)19:25
rlandyjm1: well train will promote19:31
rlandyjm1: maybe I'll try ibm cloud for master/wallaby failure19:32
jm1rlandy: kvm internal?19:34
rlandyna - for ovb19:34
rlandywallaby is missing 39, 64 and 2019:35
* rlandy sees if c9 nodes are available there19:36
jm1rlandy: atm c9 master fs39 fs35 fs64 fail. for c9 wallaby its fs20 fs39 fs64. you want to move those to ibm?19:36
rlandygoing to testproject it19:36
rlandyc9 wallaby its fs20 fs39 fs6419:36
jm1rlandy: we could update our promoter to support conditionals, e.g. "c9-master-fs20-vexxhost||c9-master-fs20-ibm"19:37
rlandysec - trying it out first19:37
rlandyhttps://review.rdoproject.org/r/c/testproject/+/31165 - let's see what it does19:40
jm1rlandy: aye, aye, i am out for today :)19:44
dasmjm1: o/ take care19:46
* jm1 have a nice evening, oooci#oooq :)19:46
rlandyjm1[m]: have a good night19:48
abregmanhey. checking again. are we good to go with change? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772720:06
abregmanthis*20:14
dviroelrlandy: voted on https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772720:20
* dviroel biab20:28
*** dviroel is now known as dviroel|afk20:28
afuscoardviroel|afk: Hello, I'll add you as reviewer also for this one https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/428034 Thx20:35
abregmanrlandy, dasm: ^20:35
abregmanafuscoar: seems like there's merge conflict there20:36
afuscoarI see all of them have it, e.g. https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42775420:37
afuscoarWell, i don't see merge conflict on https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/428034 just same topic area 20:38
abregmanmaybe, but they really shouldn't have. I'll fix 007, maybe you can do it for 012 in the meantime20:38
afuscoarOh I see20:38
afuscoarMmm, I'll check what happens.20:38
abregmanwell, that was quite a rebase20:46
afuscoarBetter then20:47
afuscoarI'm checking this one https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/428072 20:47
afuscoarChecking why it fails20:47
abregmanafuscoar: clone -> cherry-pick -> tox -e linters -> amend -> submit20:48
afuscoarOh20:49
afuscoarI'll check, thank you abregman20:49
rlandywe just need the testproject listed20:50
abregmanrlandy: can we merge this one? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42775421:03
rlandychecking21:03
rlandyabregman: so ... going back to https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/427729/3/ci-scripts/dlrnapi_promoter/config/RedHat-9/component/rhos-17.1.yaml21:07
rlandythe order is still off here21:08
rlandyabregman: same here ... 21:09
rlandyhttps://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772721:09
rlandypromote job is the bottom of the line21:09
abregmanyup, will fix it21:09
rlandyabregman: ^^ will merge this one21:09
rlandybut can you fix the order in the next patch21:09
abregmanrlandy: no no21:10
abregmanwill do it now21:10
rlandythere's a logic to it :)21:10
rlandyie: not random21:10
abregmanyup21:11
abregmanrlandy: fixed https://code.engineering.redhat.com/gerrit/c/tripleo-environments/+/42772921:11
rlandyok - merging that21:12
rlandyabregman: pls fix the order here next https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/427727/6/zuul.d/component-pipeline-rhos-17.1-rhel9.yaml21:13
rlandypls copy the 17 line21:13
rlandyand I'll merge that next21:13
abregmanyes, doing it right now. I need 2m21:13
rlandyno problem21:14
abregmanI think it's good now but let's see what the gates say21:17
abregmanrlandy: how does it looks now? https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772721:20
*** abregman is now known as abregman|afk21:22
rlandyok - I see the  issue21:22
rlandyscen001 was added before standalone21:23
* rlandy edits21:23
abregman|afkit should be standlone -> scenario001 -> scenario002?21:24
abregman|afkwhen you said sorted I thought alphabetically :D21:24
abregman|afkrlandy: should I modify it to be standlone -> scenario001 -> scenario002?21:25
rlandyabregman|afk: no worries21:25
rlandyI am editing the patch21:25
rlandylate ofr you21:25
rlandythen it will be simple to carry one21:26
rlandyon21:26
abregman|afkk, thank you. yes, we (the team) will continue tomorrow morning with the other changes21:26
abregman|afkrlandy: just one question to understand it better - the order matters because they are triggered sequentially and so standalone is the most basic one and it fails no reason to run the other scenarios? 21:28
abregman|afkif it fails*21:28
rlandyno - they all trigger after the deps21:28
rlandyit's just easier to find them in this order21:29
rlandyall files kind of comply21:29
rlandyreally a small change21:29
rlandystandalone at top21:29
abregman|afkoh k. got it21:29
rlandypromote job at bottom21:29
rlandyetc.21:29
rlandyno big deal21:29
rlandyabregman|afk: ok ... https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/427727 Add scenario002-standalone-component jobs for OSP17.121:33
rlandydasm: dviroel|afk: can you take one more look at: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/42772721:36
rlandythen we can merge21:36
afuscoarrlandy: in this case https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/428072 should I move them?21:39
afuscoarThis is the test with the depends-on https://code.engineering.redhat.com/gerrit/c/testproject/+/42807421:39
rlandyafuscoar: yes pls try match the 17 pattern21:39
rlandyand link your testproject in the commit message21:40
rlandyafuscoar: ^^ helps reviewers21:40
afuscoaroh yes, that's true21:42
afuscoarI don't get the order, I've checked the zuul.d/component-jobs-rhel-9.yam and the ovb jobs are at the end21:44
rlandyafuscoar: it's the line21:58
rlandybiab22:29
*** dasm is now known as dasm|off22:59
dasm|offleaving for tonight. take care23:00

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!