Thursday, 2021-01-14

*** jmasud has joined #oooq00:08
*** jmasud has quit IRC00:09
*** tosky has quit IRC00:36
*** jmasud has joined #oooq01:26
*** jmasud has quit IRC01:37
*** jmasud has joined #oooq01:45
*** jmasud has quit IRC01:52
*** jmasud has joined #oooq04:00
*** udesale has joined #oooq04:19
*** saneax has joined #oooq04:32
*** jmasud has quit IRC04:47
*** jmasud has joined #oooq04:57
*** ykarel has joined #oooq05:22
*** ykarel has quit IRC05:39
*** ykarel has joined #oooq05:41
*** ysandeep|out is now known as ysandeep|afk05:43
*** ykarel_ has joined #oooq05:54
*** ratailor has joined #oooq05:55
*** ykarel has quit IRC05:57
*** ykarel__ has joined #oooq06:05
*** marios has joined #oooq06:07
*** ykarel__ is now known as ykarel06:07
*** ykarel_ has quit IRC06:08
*** udesale_ has joined #oooq06:09
*** udesale has quit IRC06:12
*** jmasud has quit IRC06:22
pojadhavchandankumar, 0/06:24
pojadhavas per https://hackmd.io/IhMCTNMBSF6xtqiEd9Z0Kw?both#2021-01-07-Unified-Sprint-38-Planning , frenzy_friday also interested in the promoter work. I think she is missing in invite. Idk about her time zone.06:25
*** jfrancoa has joined #oooq06:26
chandankumarpojadhav: ah my bad, we can sync once again in evening if ok06:27
pojadhavchandankumar, its totally fine :)06:27
akahat|roverykarel, o/06:51
akahat|roverykarel,  i need to hold one node for testing purpose.06:51
akahat|roverthis is review link: https://review.rdoproject.org/r/#/c/28014/ and job is: tripleo-ci-promotion-staging-single-pipeline-centos-806:51
ykarelakahat|rover, hi06:52
ykarelok putting up hold request06:52
akahat|roverykarel, thank you :)06:52
ykarelakahat|rover, your pub key06:55
ykarelhttps://github.com/amolkahat.keys ?06:55
ykareladded these, try ssh zuul@38.102.83.4606:55
akahat|roverykarel, yes.. add any one.07:03
akahat|roverykarel, ok07:03
akahat|roverykarel, i'm in thanks :D07:04
ykarelack07:05
akahat|roverchandankumar, directory location is /home/zuul/src/review.rdoproject.org/rdo-infra/ci-config/07:06
akahat|roverand it's there.07:06
chandankumarakahat|rover: not this path07:16
chandankumarakahat|rover:  look for /home/promoter07:16
akahat|roverchandankumar, yes. this is also there.07:17
chandankumarakahat|rover: then why it is not able to access it07:18
chandankumarakahat|rover: can you try running the playbook locally itself there07:18
akahat|roverchandankumar, ok07:18
*** ysandeep|afk is now known as ysandeep07:21
mariosarxcruz: chandankumar: what is the status on https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/77018807:37
*** udesale has joined #oooq07:37
mariosarxcruz: chandankumar: it was merging yesterday and it still blocks ussuri (i also saw it on victoria today)07:37
mariosarxcruz: chandankumar: was it fixed some other way?07:37
chandankumarmarios: arxcruz was working on some other fixes instead of revert07:37
marioschandankumar: why workflow -1/07:37
marioschandankumar: arxcruz: but those can come after the revert? it was basically in the gate?!07:38
marioschandankumar: thanks07:38
mariosarxcruz: any update on that please07:38
mariosarxcruz: do you need reviews on the other fixes?07:38
mariosarxcruz: can you please put the other fixes on the bug and add some words about them https://bugs.launchpad.net/tripleo/+bug/191102007:39
openstackLaunchpad bug 1911020 in tripleo "Ugrades ussuri jobs fail in CI" [Critical,Triaged]07:39
chandankumarmarios: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/770357 was one of the fix07:39
*** udesale_ has quit IRC07:39
*** akahat|rover is now known as akahat|lunch07:45
*** udesale has quit IRC07:48
marioschandankumar: thanks07:49
marioschandankumar: arxcruz: but that one is from alex, were there others arxcruz ?07:50
*** jpena|off is now known as jpena07:52
chandankumarmarios: https://review.opendev.org/c/openstack/tripleo-quickstart/+/77035907:52
marioschandankumar: thanks07:52
mariosarxcruz: https://bugs.launchpad.net/tripleo/+bug/1911020/comments/6 https://bugs.launchpad.net/tripleo/+bug/1911020/comments/7 add if there are more than those07:54
openstackLaunchpad bug 1911020 in tripleo "Ugrades ussuri jobs fail in CI" [Critical,Triaged]07:54
*** apetrich has joined #oooq08:02
*** slaweq has joined #oooq08:03
*** amoralej|off is now known as amoralej08:09
*** matbu has quit IRC08:29
*** matbu has joined #oooq08:31
zbrgood read: https://clig.dev/ -- Command Line Interface Guidelines08:35
*** tosky has joined #oooq08:49
*** slaweq has quit IRC08:55
*** slaweq has joined #oooq09:00
mariosarxcruz: are you around today?09:01
*** udesale has joined #oooq09:12
*** ykarel_ has joined #oooq09:13
*** ykarel has quit IRC09:16
arxcruzmarios: yes i am09:25
mariosarxcruz: hi can you please update the bug 09:54 < marios> arxcruz: https://bugs.launchpad.net/tripleo/+bug/1911020/comments/609:26
openstackLaunchpad bug 1911020 in tripleo "Ugrades ussuri jobs fail in CI" [Critical,Triaged]09:26
openstackbug 9 in Launchpad itself "Rosetta's po parser is too strict" [Medium,Fix released] https://launchpad.net/bugs/9 - Assigned to Carlos Perelló Marín (carlos)09:26
mariosarxcruz: and comment 709:26
mariosarxcruz: in particular what else are we waiting for please that is blocking ussuri patches09:26
mariosarxcruz: are there more patches that need workflow09:26
arxcruzmarios: sure, give me a second09:27
arxcruzmarios: https://bugs.launchpad.net/tripleo/+bug/1911020/comments/8 i hope i made myself clear09:31
openstackLaunchpad bug 1911020 in tripleo "Ugrades ussuri jobs fail in CI" [Critical,Triaged]09:31
mariosarxcruz: did you test that https://review.opendev.org/c/openstack/tripleo-quickstart/+/770359 fixed the upgrade job?09:34
mariosarxcruz: is there a testproject somewhere09:34
arxcruzmarios: the upgrade job is passing on the patch09:35
arxcruzis there a reason to do a testproject ?09:35
mariosarxcruz: 11:34 < marios> arxcruz: did you test that https://review.opendev.org/c/openstack/tripleo-quickstart/+/770359 fixed the upgrade job?09:35
mariosarxcruz: for that reason ^09:35
mariosarxcruz: ?09:36
arxcruzmarios: tripleo-ci-centos-8-standalone-upgrade https://zuul.opendev.org/t/openstack/build/11e906f36c87494a81b68ffc04c6f9a8 : SUCCESS in 2h 33m 33s (non-voting)09:36
mariosarxcruz: the failure is on undercloud-upgrade-ussuri09:36
arxcruztripleo-ci-centos-8-undercloud-upgrade https://zuul.opendev.org/t/openstack/build/76026dfaff8e4d93ae73901008ea088c : SUCCESS in 1h 49m 27s (non-voting)09:36
mariosarxcruz: https://zuul.openstack.org/builds?job_name=tripleo-ci-centos-8-undercloud-upgrade-ussuri09:36
mariosarxcruz: it blocks ussuri gate09:37
mariosarxcruz: for a few days now09:37
mariosarxcruz: master upgrade jobs are non voting anyway09:37
arxcruzmarios: the issue was tempest running on upgrade jobs, which we don't do09:37
arxcruzupdating the featureset to not run tempest, it will fix it09:38
mariosarxcruz: we do run tempest on upgrade jobs09:38
arxcruzbut i can of course create a testproject09:38
mariosarxcruz: e.g. on standalone-upgrade09:38
mariosarxcruz: never mind man... my objection is that you blocked the revert after it was already in the gate. so i was hoping you had tested the thing you proposed instead of the revert.09:38
mariosarxcruz: let's hope it all merges today09:39
mariosarxcruz: no point doing a testproject now09:39
arxcruzmarios: come on man, i got the wrong information, if you check my conversation with alex, he told we don't run tempest on upgrade09:39
mariosarxcruz: k thanks09:40
arxcruzhe asked for revert, and i said, let's not revert, let's set the variable to false, since the issue is with tempest09:40
mariosarxcruz: ack ok it's ok i am grumpy cos it's always my fault when upgrade jobs are borked and i am blocked on ussuri there ussuri https://review.opendev.org/c/openstack/tripleo-heat-templates/+/761412 https://review.opendev.org/c/openstack/tripleo-common/+/769166 https://review.opendev.org/c/openstack/python-tripleoclient/+/769336 https://review.opendev.org/c/openstack/puppet-tripleo/+/76934009:41
marioshttps://review.opendev.org/c/openstack/os-net-config/+/76949309:41
mariosarxcruz: i mainly objecting to blocking the revert after it hit the gates. i think you could have just let it go through, take off any pressure from yourself and then fix it in your own peace after09:42
arxcruzmarios: the latest passing ussuri upgrade doesn't run tempest09:42
arxcruz2021-01-09 08:02:50.549224 | primary | TASK [Run os_tempest role] *****************************************************09:42
arxcruz2021-01-09 08:02:50.549282 | primary | Saturday 09 January 2021  08:02:50 +0000 (0:00:00.206)       0:00:43.723 ******09:42
arxcruz2021-01-09 08:02:50.585544 | primary | skipping: [undercloud]09:42
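[Editor's sketch] The three job-output lines quoted above show the "Run os_tempest role" task being skipped on the undercloud. A minimal Python sketch of how such a check could be automated follows. The function names (task_results, was_skipped) and the regular expressions are hypothetical illustrations, not part of any TripleO or Zuul tooling, and the sample text is just the quoted lines with the IRC timestamps removed.

```python
import re

# Sample lines copied from the job output quoted in the log
# (IRC timestamps at the end of each line removed).
JOB_OUTPUT = """\
2021-01-09 08:02:50.549224 | primary | TASK [Run os_tempest role] *****************************************************
2021-01-09 08:02:50.549282 | primary | Saturday 09 January 2021  08:02:50 +0000 (0:00:00.206)       0:00:43.723 ******
2021-01-09 08:02:50.585544 | primary | skipping: [undercloud]
"""

# Hypothetical patterns: a task header line and a per-host result line.
TASK_RE = re.compile(r"\| TASK \[(?P<name>[^\]]+)\]")
RESULT_RE = re.compile(r"\| (?P<status>skipping|ok|changed|fatal): \[(?P<host>[^\]]+)\]")


def task_results(text):
    """Map each task name to a list of (host, status) tuples."""
    results = {}
    current = None
    for line in text.splitlines():
        task = TASK_RE.search(line)
        if task:
            current = task.group("name")
            results.setdefault(current, [])
            continue
        result = RESULT_RE.search(line)
        if result and current is not None:
            results[current].append((result.group("host"), result.group("status")))
    return results


def was_skipped(text, task_name):
    """True only if the task appears and every recorded host result is 'skipping'."""
    statuses = task_results(text).get(task_name, [])
    return bool(statuses) and all(status == "skipping" for _, status in statuses)
```

Calling was_skipped(JOB_OUTPUT, "Run os_tempest role") on the sample returns True, which matches what arxcruz points out in the pasted output.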
mariosarxcruz: cos you removed it09:42
arxcruzhttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_5aa/769510/2/gate/tripleo-ci-centos-8-undercloud-upgrade-ussuri/5aadfa6/job-output.txt09:42
arxcruzmarios: no, i haven't09:42
arxcruzthis is from a few days ago09:43
arxcruzon january 909:43
arxcruzbefore my os_tempest everywhere patch09:43
mariosarxcruz: you posted a patch to remove the tempest execution from fs50 didn't you?09:43
arxcruzmarios: can we chat?09:43
mariosarxcruz: ah you abandoned that https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/77035109:45
arxcruzmarios: https://meet.google.com/ymr-bcig-egh09:46
*** derekh has joined #oooq09:50
chandankumarstepping out for a bit09:55
mariosarxcruz: https://bugs.launchpad.net/tripleo/+bug/191119409:59
openstackLaunchpad bug 1911020 in tripleo "duplicate for #1911194 Ugrades ussuri jobs fail in CI" [Critical,Triaged]09:59
arxcruzmarios: http://eavesdrop.openstack.org/irclogs/%23tripleo/%23tripleo.2021-01-12.log.html#t2021-01-12T14:49:1010:03
*** ykarel_ is now known as ykarel10:06
mariosarxcruz: ack10:07
arxcruz:(10:07
mariosarxcruz: thanks for chatting ... hopefully it gets resolved today10:07
zbrthat day started just right: my lenovo (f33) failed to boot, grub stuff related to the 5.10 kernel.10:30
marioszbr: oh really? is it a known issue? I'm on 5.9.16 right now10:34
bhagyashrispojadhav, sshnaidm|afk zbr hi, could you please help me to complete the sprint report. please reply to an email 'sprint report'10:38
bhagyashristhank you :)10:38
bhagyashrischandankumar, ^^10:39
pojadhavbhagyashris, ack10:39
*** sshnaidm|afk is now known as sshnaidm|ruck10:40
bhagyashrispojadhav, thanks!10:54
*** dtantsur|afk is now known as dtantsur10:56
zbrmarios: that was the one still working.... this is how I managed to boot again.10:59
zbranyway i am going to do a full reinstall with formatting, i have nothing valuable on it.11:00
marioszbr: ouch thanks for the heads up11:02
zbrlast time i got a surprise like this was with fedora 6/7, and switching to something else. Now I cannot afford that luxury.11:08
zbrusually i would have tried to dig more and fix it but i observed that the installation was made using classic BIOS, and this prevented fwupdate from running, and there is no way to convert to UEFI. Full reinstall needed.11:10
zbrmarios: if you have "quiet" or "rhgb" on grub conf, i would worry. https://bugzilla.redhat.com/show_bug.cgi?id=190333211:13
openstackbugzilla.redhat.com bug 1903332 in kernel "Can't boot with kernel-5.9.10-200.fc33.x86_64 on Asus UX305CA/UX305CA" [Urgent,New] - Assigned to kernel-maint11:13
marioszbr: thx11:16
*** ysandeep is now known as ysandeep|afk11:17
marioszbr: looks like i do (must be a default i haven't touched that or changed anything here from vanilla 33 install)11:18
marioszbr: both quiet and rhgb11:18
*** chem has quit IRC11:18
*** chem has joined #oooq11:20
ykarelis it just me seeing errors in http://dashboard-ci.tripleo.org/d/_ZOYIidMk/vexxhost?orgId=1 or it's a known issue?12:04
bhagyashrisarxcruz, hi, Bugs related to os_tempest that is affecting upgrade jobs -> do you have bug link ? it would be great if you share the bug link with me . thank you :)12:05
arxcruzbhagyashris: sorry my lack of information https://bugs.launchpad.net/tripleo/+bug/191102012:06
openstackLaunchpad bug 1911020 in tripleo "Ugrades ussuri jobs fail in CI" [Critical,Triaged]12:06
bhagyashrisarxcruz, np thank you :)12:06
arxcruzbhagyashris: would you like me to send a followup email with this info, or is it fine?12:06
bhagyashrisit's fine12:06
bhagyashris:)12:06
bhagyashrisjust one more thing : Add the add test command on tempest-skiplist and documentation -> is there any WIP review link ?12:07
arxcruzbhagyashris: yes https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/75499412:10
arxcruzi'll update in a few, just finishing writing the documentation12:10
bhagyashrisok . thank you arxcruz :)12:10
mariossshnaidm|ruck: thanks for comments but i don't understand can you check my reply at https://review.opendev.org/c/openstack/tripleo-ci/+/770766 when you have time thank you12:21
sshnaidm|ruckmarios, solution you pointed is not the best also, if we fix it - ovb can return to use multiple playbooks. The point was to have playbooks so independent, so one can run one of them and it works12:24
sshnaidm|ruckmarios, mostly for devs that need to rerun various parts of deploy12:25
sshnaidm|ruckI'm not sure it's the case now, but it shouldn't be removed just so12:25
mariossshnaidm|ruck: maybe easier to discuss in scrum but, i don't see what the difference is from the ovb case 14:24 < sshnaidm|ruck> marios, solution you pointed is not the best also, if we fix it - ovb can return to use12:26
mariossshnaidm|ruck: we can also do the same here...12:26
mariossshnaidm|ruck: return to use the multiple if we fix it? i don't see the difference12:26
sshnaidm|ruckmarios, difference where? between multiple playbooks and single?12:26
mariossshnaidm|ruck: no you're saying in the ovb case https://review.opendev.org/c/openstack/tripleo-ci/+/764657 "ovb can return to use multiple playbooks"12:27
mariossshnaidm|ruck: so what's the difference between ovb case and this one12:27
sshnaidm|ruckmarios, yes, ovb patch is not the solution, it's a workaround and should be reverted when we fix the bug12:27
sshnaidm|ruckI don't see a point to make another workaround12:27
mariossshnaidm|ruck: k, well if we don't have a fix now then what do we do, besides apply the workaround?12:27
sshnaidm|ruckmarios, we can make it conditional and not use in CI as we discussed before12:28
mariossshnaidm|ruck: for the record, i am not sure it is OK yet, i have workflow -1 it until i test with the testproject reviews as i commented there12:28
mariossshnaidm|ruck: https://review.rdoproject.org/r/31555 for the train update https://review.rdoproject.org/r/31556 for victoria upgrade12:28
sshnaidm|ruckthis part is mostly for quickstart.sh runs from devs hosts, we don't need it in ci12:28
*** dsneddon has quit IRC12:31
*** ratailor has quit IRC12:31
*** jpena is now known as jpena|lunch12:33
*** akahat|lunch is now known as akahat|rover12:36
bhagyashrishi all,  do we need to keep the scrum today as we just finished the planning meeting two days before ? needs vote accordingly will decide12:51
bhagyashrisakahat|rover, frenzy_friday arxcruz chandankumar marios pojadhav sshnaidm|ruck zbr soniya29 ^^12:52
bhagyashrisysandeep|afk, ^12:52
*** ysandeep|afk is now known as ysandeep12:52
soniya29bhagyashris, i think we don't need scrum today since planning meeting has happened just two days before12:53
ysandeepbhagyashris, I am okay to skip it, if everyone agrees..12:55
akahat|roverbhagyashris, i'm with soniya29 ysandeep !!12:56
bhagyashrisysandeep, soniya29 ok , others plz let me know ...12:56
zbrysandeep: "if nobody comments" ;) sure.12:56
pojadhavbhagyashris, we can skip the scrum for today :)12:57
*** rlandy has joined #oooq12:58
mariosbhagyashris: seems a bit late to be asking the question though, e.g. US folks are just waking up now12:58
bhagyashrisrlandy,  do we need to keep the scrum today as we just finished the planning meeting two days before ? needs vote accordingly will decide12:58
mariosbhagyashris: imo we should have it since scrum != planning meeting12:59
rlandybhagyashris: marios: I'd say yes12:59
bhagyashrismarios, ok12:59
*** amoralej is now known as amoralej|lunch12:59
rlandylet's look at the boards12:59
rlandywhich I don't think I can access12:59
rlandyif cards are available12:59
mariosrlandy: right and figure out who/what is doing what with who ;)12:59
rlandywhat's listed12:59
rlandymarios: ack12:59
bhagyashrisrlandy, marios ok np :)12:59
rlandyso from tomorrow/monday people can get going13:00
ysandeepfolks, could i please get some eyes on https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/762350/ whenever time permits.13:00
mariosrfolco: are you around?13:03
rfolcomarios, o/13:03
mariosrfolco: we need someone to start the sprint for us (apparently we don't have permissions to do that)13:03
mariosrfolco: never mind bhagyashris just told me she already asked you13:04
rfolcomarios, something happened in jira, I can't do it anymore, I don't have permissions either13:04
mariosrfolco: thanks13:04
rfolcomarios, she has the ticket number that weshay|ruck opened13:05
mariosrfolco: ack thx13:05
rfolcoyw13:05
chandankumarsshnaidm|ruck: hello13:08
chandankumarsshnaidm|ruck: need some help here https://logserver.rdoproject.org/14/28014/84/check/tripleo-ci-promotion-staging-single-pipeline-centos-8/96266e4/job-output.txt13:08
chandankumarsshnaidm|ruck: https://review.rdoproject.org/r/#/c/28014/85/ci-scripts/infra-setup/roles/promoter/tasks/promotion_run.yml@16 this part tries to copy the file to the zuul execute then and try to load the vars, but message": "Could not find or access '/home/promoter/ci-config/ci-scripts/dlrnapi_promoter/config_environments/staging/defaults.yaml' on the Ansible Controller.\nIf you are using a module and13:12
chandankumarexpect the file to exist on the remote, see the remote_src option"13:12
mariosrlandy: https://projects.engineering.redhat.com/browse/TRIPLEOCI-19713:13
sshnaidm|ruckchandankumar, I'm not familiar with it, do you have a playbook that fails there?13:18
akahat|roversshnaidm|ruck, this is the playbook: https://review.rdoproject.org/r/#/c/28014/84..85/ci-scripts/infra-setup/roles/promoter/tasks/promotion_run.yml13:22
*** ksambor has quit IRC13:23
rlandyzbr: you around?13:27
rlandyscrum13:27
ysandeeprlandy, If you don't get better slot, I am okay to join the mtg for first half an hour and then i will drop for another mtg13:27
*** jpena|lunch is now known as jpena13:29
mariosbhagyashris: https://projects.engineering.redhat.com/browse/TRIPLEOCI-19713:30
mariosbhagyashris: https://projects.engineering.redhat.com/browse/TRIPLEOCI-24913:31
bhagyashrischandankumar, akahat|rover frenzy_friday pojadhav can we continue the promoter sync?13:37
frenzy_fridayyep13:37
zbrrlandy: now i am13:44
rlandyzbr: no worries ... we just wanted to go through the elastic recheck epic at sync13:45
rlandyand you're the main contact on that epic13:45
*** pojadhav is now known as pojadhav|afk13:45
rlandycan do it on monday's call13:45
akahat|roverbhagyashris, yes13:46
zbrrlandy: link to the epic?13:47
zbrsomehow jira board seems empty https://projects.engineering.redhat.com/secure/RapidBoard.jspa?rapidView=428513:48
rlandyzbr: ack - none of us can get to the board - jira issue13:49
rlandyyou can view the epics from backlog13:49
rlandybhagyashris: ^^ can you point zbr to the elastic recheck epic - where you accessed it?13:49
bhagyashrisrlandy, sure13:57
bhagyashriszbr, https://projects.engineering.redhat.com/secure/RapidBoard.jspa?rapidView=4285&projectKey=TRIPLEOCI&view=planning.nodetail&selectedIssue=TRIPLEOCI-58&epics=visible&issueLimit=100&selectedEpic=TRIPLEOCI-12913:58
bhagyashriszbr, https://projects.engineering.redhat.com/secure/RapidBoard.jspa?rapidView=4285&projectKey=TRIPLEOCI&view=planning&selectedIssue=TRIPLEOCI-177&epics=visible&issueLimit=100&selectedEpic=TRIPLEOCI-17613:58
zbrso our sprint didn't even start because we have no issues in it, and it is ending tomorrow.13:59
*** dsneddon has joined #oooq14:01
*** ykarel has quit IRC14:02
*** amoralej|lunch is now known as amoralej14:03
rlandyysandeep: attila has passing job on the current 16.s hash14:06
rlandy16.214:06
rlandywe have a failure on scenario01014:06
ysandeeplooking14:07
rlandyrerunning scenario01014:07
ysandeepack14:07
rlandyysandeep: ^^ timeout14:07
rlandyalso fs001 timeout14:07
rlandyfs035 passed14:08
ysandeepyes timedout on tempest , but i don't see any failures in tempest https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-16.2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-ovb-3ctlr_1comp-featureset001-internal-rhos-16.2/44dc062/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz14:13
*** ysandeep is now known as ysandeep|cinder_14:14
*** ysandeep|cinder_ is now known as ysandeep|session14:14
rlandymarios: did we stop queens promotions?14:19
rlandysshnaidm|ruck: akahat|rover: are we still watching/trying to promote queens? https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-stable614:25
rlandywe start the 13z15 import on the 25th14:25
rlandy^^ per rel del14:25
sshnaidm|ruckrlandy, hmm.. looks bad14:26
sshnaidm|ruckrlandy, will look into14:26
rlandysshnaidm|ruck: yeah - also looking into it14:26
rlandylast promotion was 11/1814:26
rlandy2021-01-13 12:50:18.853913 | primary | TASK [Create clouds.yaml if it doesn't exist] **********************************14:27
mariosrlandy: not that am aware of14:29
mariosrlandy: we do still need them (osp import)14:29
mariosrlandy: afaik14:29
sshnaidm|ruckrlandy, related to last tempest changes14:33
rlandymarios: apparently so14:33
sshnaidm|ruckrlandy, when we set os_tempest to run14:33
rlandysorry - half listening on osp meeting14:33
rlandysshnaidm|ruck: you working on the fix there?14:40
rlandygoing to create a job to rekick the line14:40
sshnaidm|ruckrlandy, yeah, found where it is14:40
rlandyhttps://opendev.org/openstack/tripleo-quickstart-extras/src/branch/master/playbooks/tasks/tempest.yml#L5814:40
rlandycool thanks14:40
rlandyysandeep|session: k - so if scenario010 passes rerun, we promote14:50
ysandeep|sessionrlandy, yes o/14:51
sshnaidm|ruckrlandy, https://bugs.launchpad.net/tripleo/+bug/191169614:51
openstackLaunchpad bug 1911696 in tripleo "Tempest tries to run on undercloud containers queens job" [Critical,Triaged]14:51
sshnaidm|ruckrlandy, probably should be solved by patches in gates14:52
sshnaidm|ruckarxcruz, can you please take a look, promotion blocker: https://bugs.launchpad.net/tripleo/+bug/1911696 if it will be solved by current patches?14:52
sshnaidm|ruckakahat|rover, fyi ^14:52
rlandysshnaidm|ruck: arxcruz: akahat|rover: adding a testproject with those jobs14:53
rlandylet's see if it works14:53
sshnaidm|ruckrlandy, cool14:53
*** TrevorV has joined #oooq14:55
rlandyhttps://review.rdoproject.org/r/#/c/25325/14:56
rlandysshnaidm|ruck: akahat|rover: ^^ k- let's see what this does14:56
*** ykarel has joined #oooq15:02
arxcruzrlandy: sshnaidm|ruck https://review.opendev.org/c/openstack/tripleo-quickstart/+/770359 fix the problem since it set featureset023 to not run tempest15:03
zbrteam: i need some comments on https://github.com/rdo-infra/queries/pull/3#discussion_r557425557 -- naming challenge, do not lose the chance! ;)15:06
rlandyarxcruz: ^^ to confirm - it will fix the problem?15:06
mariosrlandy: sshnaidm|ruck: need a vote on that when you have a sec https://review.opendev.org/c/openstack/tripleo-heat-templates/+/770160 (rebased weshay|ruck patch for merge conflict)15:09
mjturekmarios: Could you take a look here? We're hitting this error in our container build job. Have you seen anything like it? Maybe we simply need to touch the file? http://paste.openstack.org/show/801625/15:20
mariosmjturek: looking15:21
mjturekif looking for context, see here https://ci.centos.org/job/tripleo-upstream-containers-build-master-ppc64le/3033/consoleFull15:21
sshnaidm|ruckakahat|rover, rerunning train c8 job that failed: https://review.rdoproject.org/r/#/c/23626/15:22
mariosmjturek: don't think the missing file is the root cause trying to find what it is (that is just the log file of the build/error)15:24
sshnaidm|ruckakahat|rover, only 1 test failed ther last time: TestVolumeBootPattern.test_volume_boot_pattern15:24
mariosmjturek: possibly "root" vs "jenkins" user is the problem15:24
sshnaidm|ruckakahat|rover, if it fails again in https://review.rdoproject.org/r/#/c/23626/  need to look for a fix..15:25
mariosmjturek: can we access any more files on this or only the console?15:28
mjturekmarios: let me grab the link for the collected logs15:29
mjturekmarios https://logserver.rdoproject.org/ci.centos.org/tripleo-upstream-containers-build-master-ppc64le/3033/logs/15:29
mariosmjturek: thx15:30
rlandymarios: looking15:32
mariosmjturek: so i suspect it is because jenkins user can't access /root/workspace/build.log15:32
mariosmjturek: https://opendev.org/openstack/tripleo-ci/src/commit/15023d0e98265570547ffd11132608f7045f6c74/roles/build-containers/tasks/main.yaml#L212-L22515:33
rlandyarxcruz: still have a failure on the testproject job ... https://review.rdoproject.org/r/#/c/25325/15:33
mariosmjturek: the tasks aren't executed with become there ... not sure why it is running as root user in https://ci.centos.org/job/tripleo-upstream-containers-build-master-ppc64le/3033/consoleFull15:34
mjturekmarios: is that a recent change??15:34
mjturekbecause this used to work15:35
rlandy2021-01-14 15:02:06.829348 | primary | + export TOCI_JOBTYPE=singlenode-featureset02315:35
rlandy2021-01-14 15:02:06.829441 | primary | + TOCI_JOBTYPE=singlenode-featureset02315:35
mariosmjturek: alternatively, you could try using root (see the next task below the one i pointed to)15:35
mariosmjturek: i mean by adding a become there https://opendev.org/openstack/tripleo-ci/src/commit/15023d0e98265570547ffd11132608f7045f6c74/roles/build-containers/tasks/main.yaml#L220-L23315:35
mariosmjturek: compare those two ^^15:35
rlandyuse_os_tempest: false15:35
rlandyis that correct?15:35
akahat|roversshnaidm|ruck, TestVolumeBootPattern.. i've seen it earlier today.. it shows ssh issue for cirros. I've checked history of job.. found it is very unpredictable.. :|15:36
akahat|roverhttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-train15:36
mariosmjturek: don't think it is a new change git blame doesn't think so at least https://opendev.org/openstack/tripleo-ci/blame/commit/15023d0e98265570547ffd11132608f7045f6c74/roles/build-containers/tasks/main.yaml15:37
mjturekmarios: so that would require us to force ansible distribution to redhat, which is inaccurate15:37
mjturekwe always used the root user as the ansible user15:38
mjturekmaybe something changed in centos-ci then15:38
mariosmjturek: no i meant rather, if you have to run this as root, then you might add become on the https://opendev.org/openstack/tripleo-ci/src/commit/15023d0e98265570547ffd11132608f7045f6c74/roles/build-containers/tasks/main.yaml#L22015:38
mariosmjturek: but this is just a guess i don't have much to go on here. but it would explain the 'no such file' thing15:38
mjturekthat's fair marios - definitely on the right track, I'm going to ask if something changes in centos-ci15:39
mariosmjturek: in our jobs th ansible user is zuul and we have all the things in /home/zuul/...15:39
mjturekthat's fair15:39
mjturekthanks marios!15:41
mariosmjturek: ack hope it helps anyway15:41
ykarelmjturek, marios seems https://opendev.org/openstack/tripleo-ci/commit/d227115b1dc26a65598c5935fba7522ad9aad0d3 caused the issue in ci.centos jobs15:43
ykarellikely we create the logs directory in zuul at some place so working there and in ci.centos jobs it missing15:43
mariosykarel: yeah could be15:44
mjturekahh15:44
mariosykarel: but it didn't get to that point yet i mean the build report15:45
ykarelmarios, that patch changed log path {{ workspace }}/build.log --> {{ workspace }}/logs/build.log15:45
ykareland for that to work logs directory should exist15:46
mjturekykarel: think creating a /root/logs/ dir is an appropriate fix?15:46
ykarelmjturek, i think it should be fixed upstream as it's regression as part of that patch, but for now you can create /root/workspace/logs15:47
ykarelin ci.centos job15:48
ykareli think u must be creating /root/workspace somewhere already15:48
mjturekykarel: yeah I believe so15:48
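[Editor's sketch] ykarel's point above is that a log path of {{ workspace }}/logs/build.log only works if the logs directory already exists. A minimal Python sketch of that failure mode and the workaround follows. It is not the Ansible task from tripleo-ci; write_build_log, demo and the ensure_logs_dir flag are hypothetical names used only for illustration, and a temporary directory stands in for /root/workspace.

```python
import os
import tempfile


def write_build_log(workspace, message, ensure_logs_dir):
    """Append a line to <workspace>/logs/build.log.

    With ensure_logs_dir=False this mimics a job where nothing has
    created the logs directory beforehand.
    """
    logs_dir = os.path.join(workspace, "logs")
    if ensure_logs_dir:
        # The workaround discussed in the log: create the directory first.
        os.makedirs(logs_dir, exist_ok=True)
    log_path = os.path.join(logs_dir, "build.log")
    with open(log_path, "a") as handle:
        handle.write(message + "\n")
    return log_path


def demo():
    """Return (failed_without_dir, log_exists_after_fix)."""
    with tempfile.TemporaryDirectory() as workspace:
        try:
            write_build_log(workspace, "build started", ensure_logs_dir=False)
            failed_without_dir = False
        except FileNotFoundError:
            # Same class of error as the "no such file" seen in the job.
            failed_without_dir = True
        path = write_build_log(workspace, "build started", ensure_logs_dir=True)
        return failed_without_dir, os.path.exists(path)
```

In the sketch the first write fails because the parent directory is missing, and the second succeeds once the directory is created, which is the behaviour the temporary fix of creating /root/workspace/logs relies on.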
rlandysshnaidm|ruck: arxcruz: k - so https://review.opendev.org/c/openstack/tripleo-quickstart/+/770359/4/config/general_config/featureset023.yml will not fix the problem ...15:50
rlandythe featureset passed to the tempest run is different15:50
rlandyactually scratch that15:50
mjturekykarel: I'll also take a quick look and see if I can find where that dir is created upstream, it might be as simple as removing a hardcoded "zuul"15:51
mjturekthanks ykarel and marios for the help15:51
rlandy --extra-vars @/home/zuul/src/opendev.org/openstack/tripleo-quickstart/config/general_config/featureset023.yml15:51
rlandyit is passed15:51
ykarelmjturek, where u see zuul is hardcoded?15:52
mjturekykarel nowhere, sorry just saying it could be something like that15:53
ykarelack that task depend on workspace var so hardcoding shouldn't be there but good to check15:53
mariosnp mjturek15:54
sshnaidm|ruckarxcruz, something is wrong there with tempest_cloud_name maybe: https://logserver.rdoproject.org/25/25325/73/check/periodic-tripleo-centos-7-queens-containers-build/2022671/job-output.txt15:54
sshnaidm|ruckarxcruz, it shouldn't be overcloud..15:54
rlandy'Create clouds.yaml if it doesn't exist' is executing before the switch15:56
rlandyof whether or not to run os_tempest15:56
*** ysandeep|session is now known as ysandeep15:58
ykarelmjturek, so get_hash : Ensure legacy workspace directory is creating /root/workspace16:00
mjturekykarel: right and prepare_node makes the logs dir it seems16:00
ykarelso you need to add additional task to create /root/workspace/logs16:01
sshnaidm|ruckchandankumar, do you know where is tempest_cloud_name defined before gets there: https://github.com/openstack/tripleo-quickstart-extras/blob/master/playbooks/tasks/tempest.yml#L7916:03
rlandyhome/zuul/workspace/.quickstart/playbooks/multinode-validate.yml16:03
rlandysshnaidm|ruck: arxcruz: ^^ https://opendev.org/openstack/tripleo-quickstart-extras/src/branch/master/playbooks/multinode-validate.yml#L2916:04
rlandytempest_cloud_name: 'overcloud'16:04
sshnaidm|ruckok, so it should be a different condition there16:05
sshnaidm|ruckinstead of "not tempest_cloud_name in ['undercloud', 'standalone']16:06
sshnaidm|ruck"16:06
mjturekykarel yep seems so!! Thanks a ton!16:06
*** udesale has quit IRC16:08
rlandyproblem is the reuse of multinode playbook16:08
*** jmasud has joined #oooq16:20
arxcruzsshnaidm|ruck: once featureset023 use_os_tempest pass on gate, this will be fixed because it will not call tempest.yml playbook16:24
*** ykarel is now known as ykarel|away16:24
sshnaidm|ruckarxcruz, we ran job with these changes and it still failed, please read back16:26
arxcruzsshnaidm|ruck: sorry, let me check16:26
arxcruzsshnaidm|ruck: yeah, you're right, i'll submit a fix, tempest.yml should only be called when use_os_tempest is set to true16:28
*** saneax has quit IRC16:33
arxcruzsshnaidm|ruck: do you have the bug quickly?16:37
rlandyarxcruz: sshnaidm|ruck put in another change - under test now16:37
sshnaidm|ruckarxcruz, https://bugs.launchpad.net/tripleo/+bug/191169616:37
openstackLaunchpad bug 1911696 in tripleo "Tempest tries to run on undercloud containers queens job" [Critical,Triaged]16:37
rlandyand correct - the switch on os_tempest was after this task ran16:38
rlandyhence the issue16:38
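[Editor's sketch] The problem described above is one of ordering: the 'Create clouds.yaml if it doesn't exist' task is guarded only by tempest_cloud_name, so it runs before the use_os_tempest switch is consulted. A simplified Python model of that control flow follows. It is an assumption-laden sketch, not the real tempest.yml: the function names are hypothetical, and only the two tasks and the condition quoted in the log are modelled.

```python
def tempest_tasks(tempest_cloud_name, use_os_tempest):
    """Simplified model of the tasks file: returns the task names that would run."""
    executed = []
    # Condition quoted in the log: not tempest_cloud_name in ['undercloud', 'standalone']
    if tempest_cloud_name not in ("undercloud", "standalone"):
        executed.append("Create clouds.yaml if it doesn't exist")
    # The switch on use_os_tempest comes only after the task above.
    if use_os_tempest:
        executed.append("Run os_tempest role")
    return executed


def validate_before_fix(tempest_cloud_name, use_os_tempest):
    """Tasks file is always included, so the clouds.yaml task can still run."""
    return tempest_tasks(tempest_cloud_name, use_os_tempest)


def validate_after_fix(tempest_cloud_name, use_os_tempest):
    """Approach arxcruz describes: include the tasks only when use_os_tempest is true."""
    if not use_os_tempest:
        return []
    return tempest_tasks(tempest_cloud_name, use_os_tempest)
```

With tempest_cloud_name set to 'overcloud' (as in multinode-validate.yml) and use_os_tempest false, the "before" model still runs the clouds.yaml task, while the "after" model runs nothing.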
sshnaidm|ruckarxcruz, trying  https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/770830 now, but feel free to hijack it16:38
*** zbr3 has joined #oooq16:38
arxcruzsshnaidm|ruck: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/770837 it's a better approach and save more time, since all the tasks under tempest.yml only matter if we actually run tempest16:39
arxcruzrlandy: ^16:39
*** zbr3 has quit IRC16:39
sshnaidm|ruckarxcruz, thanks16:39
arxcruznp, it is my mess anyway :)16:40
*** zbr9 has joined #oooq16:40
*** zbr has quit IRC16:40
*** zbr9 is now known as zbr16:40
rlandyarxcruz: k - pls confirm which set of patches I should test with and I will rerun - thanks16:40
*** ykarel|away has quit IRC16:46
rlandyrunning second test16:47
*** amoralej is now known as amoralej|off16:57
*** marios has quit IRC17:03
*** jpena is now known as jpena|off17:07
arxcruzrlandy: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/770837 and https://review.opendev.org/c/openstack/tripleo-quickstart/+/770359 should do it17:18
rlandyarxcruz: already under test here: https://review.rdoproject.org/r/#/c/29969/ thanks17:19
*** ysandeep is now known as ysandeep|out17:35
sshnaidm|ruckrlandy, running arxcruz patch now: https://review.rdoproject.org/r/#/c/25325/17:51
*** derekh has quit IRC18:04
*** dtantsur is now known as dtantsur|afk18:10
rlandysshnaidm|ruck: arxcruz: https://review.rdoproject.org/r/#/c/29969/ just passed18:28
rlandywith18:28
rlandyDepends-On: https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/77083718:28
rlandyDepends-On: https://review.opendev.org/c/openstack/tripleo-quickstart/+/77035918:28
rlandysshnaidm|ruck: pls vote on https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/770837 as well18:30
rlandywe can get this through gate today if lucky18:30
*** apetrich has quit IRC18:31
rlandychandankumar: ^^ if you are still around ... pls vote18:31
*** slaweq has quit IRC18:38
*** apetrich has joined #oooq18:58
weshay|ruckrlandy, https://docs.google.com/spreadsheets/d/1M1U-ekjEsec-bRjRq7q5rzjbJWKE2uT-ESX4SkeC0Uc/edit#gid=0&fvid=177806157619:33
*** slaweq has joined #oooq20:19
*** slaweq has quit IRC20:27
*** jmasud has quit IRC20:30
*** jmasud has joined #oooq20:40
rlandyweshay|ruck: you ok with our promoting 16.2? scenario010 just passed20:41
rlandyfs035 passed in the run20:41
weshay|ruckaye20:41
rlandyfs001 timed out, rerunning now20:41
rlandyfs020 had one tempest failure20:42
rlandyweshay|ruck: k- will promote ... since we have a passing test from the jenkins side20:42
weshay|ruckrlandy, 020 just had one tempest error https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-16.2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-ovb-1ctlr_2comp-featureset020-internal-rhos-16.2/76a2c49/logs/undercloud/var/log/tempest/stestr_results.html.gz20:43
weshay|ruckso fs001 should have passed20:43
rlandytimedout20:43
rlandyrerunning now20:43
rlandybut soon the next hash will kick20:43
rlandyso promoting this one20:44
weshay|ruckrlandy, what about 01020:44
weshay|rucker..20:44
rlandyjust passed20:44
weshay|ruckscenaro 01020:44
weshay|ruckrlandy, k.. promote it20:44
rlandysee testproject rerun20:44
rlandyon it20:44
rlandyand we're rolling20:45
rlandyweshay|ruck: not to jinx anything but possible gate queue will clear up quite a bit20:46
weshay|ruckrlandy, ya.. things are merging20:47
*** jmasud has quit IRC20:53
rlandyshoot - gate failure21:04
rlandyso close21:04
*** TrevorV has quit IRC21:34
sshnaidm|rucktrain c8 should be promoted soon21:41
rlandygreat22:22
*** jmasud has joined #oooq23:13
*** rlandy is now known as rlandy|bbl23:28
*** jmasud has quit IRC23:33
*** jmasud has joined #oooq23:34
*** rfolco has quit IRC23:35
*** sshnaidm|ruck is now known as sshnaidm|afk23:49
