Thursday, 2019-09-05

*** holser has quit IRC00:21
*** Goneri has quit IRC00:47
*** apetrich has quit IRC02:10
*** ykarel|away has joined #oooq02:50
*** gkadam has joined #oooq03:54
*** gkadam has quit IRC03:54
*** ykarel|away has quit IRC04:06
*** ykarel|away has joined #oooq04:19
*** jtomasek has joined #oooq04:29
*** jtomasek has quit IRC04:40
*** raukadah is now known as chandankumar04:49
*** jtomasek has joined #oooq04:51
*** udesale has joined #oooq05:02
*** holser has joined #oooq05:08
*** ratailor has joined #oooq05:19
*** holser has quit IRC05:21
*** ccamacho has quit IRC05:32
*** marios has joined #oooq05:51
chandankumarsshnaidm|afk: Hello05:53
chandankumarsshnaidm|afk: please try with this patch https://review.opendev.org/#/c/617522/ for ansible python interpreter05:54
chandankumarsshnaidm|afk: and here it is working https://logs.rdoproject.org/22/617522/4/openstack-check/tripleo-ci-rhel-8-standalone-rdo/c7c157c/logs/ara_oooq/ara-report/result/372675db-cb2d-4513-bd28-e49501fc5937/05:57
*** kopecmartin|off is now known as kopecmartion06:23
*** kopecmartion is now known as kopecmartin06:24
*** holser has joined #oooq06:25
chandankumarpanda|rover: morning06:25
*** ccamacho has joined #oooq06:27
*** jfrancoa has joined #oooq06:35
*** surpatil has joined #oooq06:36
panda|roverchandankumar: morning06:38
surpatilGood Morning06:41
*** udesale has quit IRC06:56
*** sanjayu_ has quit IRC06:58
*** saneax has joined #oooq07:01
*** ykarel|away is now known as ykarel07:04
ykarelpanda|rover, are the ovb jobs failure in master periodic known?07:04
*** bogdando has joined #oooq07:08
*** tesseract has joined #oooq07:15
*** tosky has joined #oooq07:23
*** jpena|off is now known as jpena07:33
chandankumarpanda|rover: I am updating this patch to address marios https://review.opendev.org/#/c/673481/07:34
chandankumar's comments07:34
zbrchandankumar: panda|rover : can we wf https://review.opendev.org/#/c/673481/ ? i am kinda tired of not seeing any login changes merged.07:34
*** ykarel is now known as ykarel|lunch07:34
panda|roverykarel: probably not07:35
panda|roverykarel|lunch: ^07:35
chandankumarzbr: needs to address few comments07:35
zbrchandankumar: please do an ping, i want to clean these from our plate.07:35
zbri can workflow on rdo side but they depend on stuff like this one07:36
chandankumarzbr: done07:38
zbrpanda|rover: sshnaidm|afk : i also ned you help on https://review.rdoproject.org/r/#/c/21996/ which defaults to 2.8 for ALL rdo jobs. Do not merge but vote/comment, it will be jpena and tristan to workflow it at the right time.07:39
zbrnobody raised worries during yesterday rdo meeting, so I guess we will do it tomorrow (early enough to allow us to revert it needed)07:40
chandankumarpanda|rover: can I break it https://review.rdoproject.org/r/#/c/21820/ into two patches, one to remove the job from container-registry and add it07:41
chandankumarso that third party should vanish and emilienm can merge it07:42
chandankumar?07:42
zbrchandankumar: if you do it do it like this: create new change for removal, we do that one and after this werebase the original one, and voila, is "simplified" :D07:42
chandankumar*third party job07:42
*** sshnaidm|afk is now known as sshnaidm|ruck07:43
sshnaidm|ruckzbr, did you talk with jpena about setting 2.8 as default in rdo zuul? Maybe it's better than hardcoding it in config. When 2.9 arrives, we'll need to remove it07:45
zbrsshnaidm|ruck: yeah, talked with him and with openstack-infra, and pbelanger.07:45
zbrthe way to change default ansible is in "base" jobs.07:45
zbrthe one defined in rdo overrides the base one used upstream.07:46
sshnaidm|ruckzbr, isn't it in zuul config?07:46
zbrin fact is the only practical way to do it, based on pabelanger recommandation: he already did it long time ago on his ansible-network zuul.07:46
chandankumarzbr: panda|rover https://review.rdoproject.org/r/#/c/22097/07:48
zbrsshnaidm|ruck: not really, afaik. but the question here: which jobs should we test now using depends-on?07:48
sshnaidm|ruckzbr, we don't have 2.7 set in our base job, so where is this set?07:49
*** brault has joined #oooq07:50
zbri am sure there is a default on zuul, but we are better off controlling which version we use regardless which version is default in current zuul version, this allows them to upgrade zuul will more confidence.07:50
zbri kinda like splitting zuul upgrades from ansible version upgrades.07:50
zbrit will happen upstream too but they are bit slower, also they do not have the same "business reasons" as us (rhel8/centos8)07:51
chandankumarsshnaidm|ruck: zbr please have a look at this hack https://review.opendev.org/#/c/617522/5/defaults/main.yml /me is looking for better solution so that it works for all distro07:55
zbrchandankumar: why ever doing this when you can do it directly withut hostbars?07:56
panda|roveroh yes remove seomthing so we can replace it with a equal job the patch after.07:57
chandankumarzbr: you mean just ansible_python.executable ?08:00
zbrchandankumar: YES08:00
chandankumaransible_python_intepreter gives undefined var08:00
chandankumarok updating the patch08:00
* marios biab08:02
*** marios has quit IRC08:02
zbrchandankumar: that is because you are using wrong variable, correct one is ansible_python.executable08:02
zbrthe other one is the optional override config option08:03
zbrthus is why it can be undefined08:03
zbryou may need to run: setup setup:08:03
zbr        gather_subset: min08:03
zbrwhich is the minimal gather facts needed to load python interpreter fact from remote host (aka detection)08:04
chandankumarok08:08
chandankumarzbr: please merge this one https://review.rdoproject.org/r/#/c/22097/08:08
zbrchandankumar: done. also please add the setup role, as I mention in last comment.08:10
chandankumarzbr: Dhanyabad :-)08:10
*** brault has quit IRC08:13
zbrchandankumar: did i tell you that I *HATE* how openstack ansible-* repos were configured?08:13
chandankumarzbr: nope08:14
zbrwhere user cannot even run tox normally.08:14
zbrnot sure who did it, but the excuse is that they documented how you are supposed to test the code, including its linting.08:15
zbrbecause "we are reinventing the wheel here, every day" (probably a good tagline for openstack in general)08:15
*** ykarel|lunch is now known as ykarel08:28
zbrchandankumar: code still needs a lot of rework but i am updating it now.08:29
chandankumarok08:31
*** derekh has joined #oooq08:31
chandankumarkopecmartin: Hello08:32
chandankumarkopecmartin: once it merges https://review.rdoproject.org/r/#/c/21879/, bot will generate a new review against tempestconf and reuse the same review for removing deps08:32
kopecmartinchandankumar: ok08:33
ykarelpanda|rover, ack, i noticed all ovb jobs failed08:37
*** dtantsur|afk is now known as dtantsur08:38
*** chem` has quit IRC08:42
zbrchandankumar: please review https://review.opendev.org/#/c/617522/08:42
zbrfixes several bugs08:42
*** marios has joined #oooq08:55
chandankumarzbr: check #openstack-ansible discussion09:00
mariospanda|rover: trying the molecule check on the latest of https://review.rdoproject.org/r/#/c/21895/ right now10:10
mariosconverge looks like it hangs? does it take a whil10:10
marioswhile?10:10
mariospanda|rover: ^ do you recall10:10
zbrmarios: it does not hang, is due to pytest std collection.10:11
zbrlocally it runs with -x but on CI it does not, side effect: you do not get output until it finishes.10:11
marioszbr: ack noting will add -x next time then :D10:12
zbrthat is because pytest limitation: cannot do both: capturing std streams and displaying them. you need to pick one.10:12
mariosthanks zbr10:12
ykarelpanda|rover, so i found atleast https://review.opendev.org/#/c/656027/ is related to the failure, there may be other failures too10:12
zbrwith tox I used a magic trick: {tty:-x} added, which mainly adds -x when there is a tty present.10:12
zbrmarios: sorry option is named -s10:13
zbrmarios: add this to your profile PYTEST_ADDOPTS='-s --color=yes --no-cov'10:14
zbrmaybe even -x if you want to stop at first failure.10:14
marioszbr: ack noting10:15
zbrmarios: my pleasure. glad you asked.10:16
*** udesale has joined #oooq10:20
*** udesale has quit IRC10:28
*** udesale has joined #oooq10:29
*** soniya29 has joined #oooq10:32
panda|rovermarios: where ?10:37
mariospanda|rover: where what? running it on the rdo vm you talked me into setting up the other day10:38
mariospanda|rover: i think am ok for now thanks10:38
panda|rovermarios: where does it hang ?10:38
panda|rovermarios: ok10:38
mariospanda|rover: when i did molecul check10:38
mariospanda|rover: with test instead i am getting a fail and debugging it now10:39
sshnaidm|ruckfolks, please take a look at https://review.opendev.org/#/c/678630/10:41
chandankumarsshnaidm|ruck: I am little confused here https://review.opendev.org/#/c/678630/6/library/sova.py10:44
chandankumarin doc it mentions about podman container10:44
sshnaidm|ruckchandankumar, yeah, copy-paste, please leave a comment in a patch10:45
zbrin the absence of centos 7.7/8.0 it would worth mentioning that oraclelinux has 8.0 available https://hub.docker.com/_/oraclelinux :D10:52
panda|roverwow https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/logs_30/678630/6/check/tripleo-ci-centos-7-containers-multinode/cba2e3c/11:09
panda|rover"payment required"11:09
chandankumarpanda|rover: it was seen yesterday11:09
chandankumari think it is fixed now11:09
panda|roverone hand on the keyboard, and one on the wallet11:11
panda|roverchandankumar: it's still there11:11
*** jpena is now known as jpena|lunch11:21
chandankumarpanda|rover: is something wrong with check-queue today in upstream?11:26
panda|roverchandankumar: not that I see11:31
chandankumarit is more than 3-4 waiting for patch11:32
panda|roverchandankumar: link ?11:37
chandankumarpanda|rover: http://zuul.openstack.org/status put patchno: 67348111:39
chandankumarearlier all jobs were waiting11:39
zbrhahaha11:48
panda|rover?11:50
zbrpanda|rover: chandankumar any chance to get some supporting comments on https://bugs.centos.org/view.php?id=16389 ?11:52
zbrpanda|rover: ^ when I read payment required.11:52
panda|roverare we really going to support python3 on centos7 ?11:54
panda|roverI think I'll kill myself11:54
zbrpanda|rover: you are missing few bits: 7.7 is schedules to be releases in 3 days. 8.0 ... nobody knows.11:55
zbrwe have jobs (even simple testing ones (tox) that do require python3, some even minimum pyhon3.6)11:56
panda|roverzbr: how many things we'll need to change to make centos7 nodes use python3 instead of python2 ? How many had coded conditionals ?11:56
zbrexample docs generation, sphinx already dropped python2.11:56
zbrpanda|rover: py36 is additional on c7.7 is not replacing python2.11:57
zbrmainly is like it was with epel, but now much easier to install.11:57
zbrit does not change default python11:57
zbr(that would be crazy)11:57
panda|roverzbr: yes, the question is; "are we going to suport 2, 3 or both for centos7?"11:57
panda|roverykarel: I had a previous patch to test the CR repositories https://review.opendev.org/618780, probably need some updates on branches, but you think  it should work ?11:58
zbrpanda|rover: let me ask you something, maybe we can find an answer: default node on rdo for tox is centos7, without any python3.11:58
zbri want/need to run some tox jobs (like tox-docs,...) with python3 minimum.11:59
chandankumarzbr: can you check from which repo libselinux comes?11:59
zbrwhat do I do? do I start switching jobs to fedora nodes?11:59
zbrlibselinux needs to be build at the same time as libselinux, version needs to match.12:00
zbrat this moment is coming from nowhere.12:00
ykarelpanda|rover, any reason to not use enable_cr_repo flag like it's done in fs021?12:00
zbrmainly if you want it, you need to get selinux sources and compile the python bindings yourself and install it.12:01
chandankumarzbr: https://cbs.centos.org/koji/buildinfo?buildID=1454212:01
chandankumarit comes from different repo not in base12:02
zbrwe are in a very bad spot with this because everyone avoid the pink elephant: selinux packager does not want to publish a python3 version of the rpm as this is "optional", is not an os component. python36 packages doesn't want either as they say, selinux is not part of python,...12:03
chandankumaror what about enabling epel8 and installing it from there?12:03
panda|roverykarel: I have no idea what does that flag do12:03
zbrchandankumar: libxselinux depends on selinux and kerbel, is not portable.12:04
zbryou cannot get the binary from one kernel and use it with another one.12:04
chandankumarok12:04
zbrit was possible by now I would have published a manylinux bynary for that library, me or someone from ansible team.12:04
zbri kinda envy debian distros now, they don't have this problem.12:05
ykarelpanda|rover, flag is enable_centos_cr_repo and it enables cr repo in job12:05
ykarelhttps://github.com/openstack/tripleo-quickstart/blob/6133bf14f3330d923759e8ae4fdfc12820f90a00/config/release/tripleo-ci/CentOS-7/promotion-testing-hash-master.yml#L11112:05
*** brault has joined #oooq12:06
chandankumarykarel: panda|rover https://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-master/138763a/logs/undercloud/etc/yum.repos.d/CentOS-CR.repo.txt.gz12:07
chandankumarcr repo is there12:07
ykarelchandankumar, not ^^, quickstart-cr12:07
ykarelit enables12:07
panda|roverykarel: mmmhhh, I remember we also needed to install CR in the local machine before anything else, to update everything to centos7.7 as first step12:09
chandankumarok it is also there https://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-master/138763a/logs/undercloud/etc/yum.repos.d/quickstart-centos-cr.repo.txt.gz12:09
ykarelpanda|rover, local machine? u mean in libvirt based jobs?12:10
ykarelvirthost12:10
panda|roverykarel: no, in zuul12:11
panda|roverykarel: if you look at the prepare-node tasks, the first thing we need to do is upgrade the base os in the node from 7.6 to 7.712:11
panda|roverykarel: otherwise we are not testing the job fully in centos7.712:12
chandankumarykarel: panda|rover https://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-master/138763a/logs/undercloud/home/zuul/repo_setup.log.txt.gz#_2019-09-04_19_26_1912:12
chandankumarcr is used here12:12
panda|roverykarel: the first step is simulating that we are getting a node with cento7.7 in it.12:12
zbr... ouch this upgrade downloaded ~600mb on my machine and took forever to install.12:13
ykarelpanda|rover, okk, but repo setup tasks yum update runs iirc, should that take care for upgrading nodes12:13
panda|roverykarel: what about the jobs that are not using repo setup ?12:14
panda|rovercrap12:14
ykarelrepo setup runs at all TripleO jobs iiuc12:15
panda|roverykarel: not the image builds12:15
panda|roverwe need to update tripleo-repos for those12:15
ykareland container builds also12:15
ykarelyes right12:15
panda|roverand I'd like to find the line that updates everything in repo setup12:16
panda|roverand it may not be neough12:16
panda|roverupdateing everything after we started quickstart is skipping some testing under centos7.712:16
zbrsshnaidm|ruck: panda|rover : see if you can comment on https://review.opendev.org/#/c/677971/6 -- i will be trying to persuade infra to display more in the header (so we know for sure which python version is used by a job)12:17
panda|roverI remember we wanted to be sure that *EVERY* step was run on centos7.712:17
panda|roverwell 7.6 at the time12:17
panda|roverto avoid surprises12:17
chandankumarzbr: in RHEl7.6 libselinux comes from base12:24
chandankumarsorry centos12:25
chandankumarweshay: time for 1:!12:31
chandankumar1:112:31
sshnaidm|ruckpanda|rover, we test cr repos in 021 periodic job12:39
weshaychandankumar hey12:42
weshaysorry I'm late12:42
chandankumarweshay: no problem12:42
weshayyou want to chat now or later?12:42
zbrchandankumar: we are talking about python-libselinux here, not libselinux itself which obviously comes from base.12:42
chandankumarweshay: yes12:42
*** ratailor has quit IRC12:42
chandankumarweshay: now would be ebtter12:44
chandankumar*better12:44
weshayjoining12:44
chandankumarweshay: https://wiki.centos.org/About/Building_812:52
*** Goneri has joined #oooq12:55
chandankumarweshay: https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/rdoinfo-jobs.yaml12:55
rfolcoscrum time - https://bluejeans.com/587845809713:00
*** jpena|lunch is now known as jpena13:01
rfolcochandankumar, zbr13:01
rfolcozbr, scrum time13:02
panda|roversshnaidm|ruck: so we are testing CR only partially and only on a single job ?13:14
sshnaidm|ruckpanda|rover, what does mean "partially"?13:17
panda|roverchandankumar: can you shoot the loud people in you room ?13:17
panda|roverchandankumar: with a water gun.13:17
*** brault has quit IRC13:18
chandankumarpanda|rover: good idea, need to buy it today.13:18
sshnaidm|ruckhttp://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-master/381908f/logs/undercloud/etc/yum.repos.d/quickstart-centos-cr.repo.txt.gz13:18
sshnaidm|ruckpanda|rover, ^^13:18
chandankumarsshnaidm|ruck: panda|rover weshay https://review.opendev.org/#/c/673481/ needs +2 and +w13:21
panda|roversshnaidm|ruck: the testing for CR twofold: 1) we need to be sure we are able to deploy with 7.7 packages, and that's covered in the fs021 job. 2) we need to test that the workflow doesn't have any surprises in 7.7. Partially means that repo-setup is updating everything 7.7 only after quickstart runs. If 7.7 is breaking quickstart for some reason, we'll not detect it there13:22
panda|roveras well as build-* jobs.13:22
sshnaidm|ruckpanda|rover, yeah, that's right13:25
*** surpatil has quit IRC13:25
panda|roversshnaidm|ruck: ok, I'm working to increase the coverage to 2 too, reusing what we had for 7.613:27
weshaysshnaidm|ruck should we recheck? https://review.rdoproject.org/r/#/c/22086/13:47
sshnaidm|ruckweshay, no, new jobs running and passing now13:48
*** brault has joined #oooq13:50
ykarelpanda|rover, any chance u looked at master ovb failures13:53
ykarelit would need attention from nova/ironic guys13:54
zbrjust got the answer i was expecting: https://bugs.centos.org/view.php?id=16389#c35071 --- mainly "not my problem"13:54
chandankumarzbr: i have gone through this experience13:55
panda|roverykarel: I think sshnaidm|ruck was looking at it. It he's not I can try.13:55
zbralmost every time we face a centos issue, we get something like that, is frustrating and draining, we need to make our voice heard13:55
zbri do not think that shutting-up is the right approach here.13:56
ykarelpanda|rover, okk it's good then13:56
sshnaidm|ruckykarel, talk to me13:56
ykarel:)13:56
ykarelsshnaidm|ruck, u saw master ovb failures?13:56
sshnaidm|ruckykarel, ya, all of them because of different issues, two of no-valid-host13:57
ykarelsshnaidm|ruck, no valid host seems real issue caused by nova patch13:57
sshnaidm|ruckykarel, which patch? I wanted to see them failing again today to be sure13:58
ykarelsshnaidm|ruck, it failed multiple times i think13:58
ykarel2 runs till now13:58
*** brault has quit IRC13:58
ykarelsshnaidm|ruck, so nova is switching to openstacksdk13:58
ykarelsshnaidm|ruck, so basically https://review.opendev.org/#/c/656027/ and it;s related patcehs13:59
ykareli commented there with logs14:00
ykareland error,14:00
ykarelother ironic related patches can also have issue, i didn't checked in detail,14:03
*** Vorrtex has joined #oooq14:07
sshnaidm|ruckykarel, I see, will create a bug and gonna check last jobs14:07
sshnaidm|ruckykarel, which logs you see errors in ?14:07
ykarelsshnaidm|ruck, Thanks14:07
ykarelsshnaidm|ruck, nova-compute14:08
*** brault has joined #oooq14:08
sshnaidm|ruckykarel, this one BadRequestException: BadRequestException: 400: Client Error for url: http://192.168.24.3:6385/v1/nodes?fields=instance_id&associated=True, Field(s) "instance_id" are not valid14:09
ykarelsshnaidm|ruck, yes14:09
sshnaidm|ruckykarel, iirc ironic changed recently from uuid to id (or opposite), isn't it related in some way?14:10
sshnaidm|ruckdtantsur, ^^14:10
ykarelmay be, but dtantsur would know more14:11
*** soniya29 has quit IRC14:15
sshnaidm|ruckykarel, hmm.. I see also errors like "InstanceNotFound: Instance 2a9c9e29-1cdd-4701-ac09-60bcad1c291d could not be found."14:15
sshnaidm|ruckykarel, with these problems in rdo cloud you never know if it's a bug or it doesn't work again..14:15
bogdandoo/ what is the right place to push a patch for https://github.com/rdo-infra/ansible-role-tripleo-ci-reproducer ?14:17
bogdandoI suppose that is a mirror only?14:17
sshnaidm|ruckbogdando, in rdoproject14:17
bogdandosshnaidm|ruck: thanks14:18
bogdandosshnaidm|ruck: got link?.. :D14:18
sshnaidm|ruckbogdando, https://review.rdoproject.org/r/#/q/project:rdo-infra/ansible-role-tripleo-ci-reproducer14:19
sshnaidm|ruckbogdando, https://review.rdoproject.org/r/#/admin/projects/rdo-infra/ansible-role-tripleo-ci-reproducer14:19
*** sabedevops has joined #oooq14:20
sshnaidm|ruckbogdando, feel free to ping me for review14:20
*** sabedevops has left #oooq14:20
ykarelsshnaidm|ruck, yes but it's not always rdo cloud issue :)14:20
*** sabedevops has joined #oooq14:20
*** sabedevops has quit IRC14:21
bogdandosshnaidm|ruck, ykarel: https://review.rdoproject.org/r/#/c/22104/14:24
sshnaidm|ruckjpena, do you know maybe why experimental jobs are not queued for this patch? https://review.opendev.org/#/c/656027/ I'm posting "check rdo experimental" according to pipeline config14:25
bogdandow/o that log dir, it seems the reproducer fails ansible in zuul executor but the job reports succeeded14:25
sshnaidm|ruckbogdando, hmm.. workspace/logs should be created inside a job14:26
bogdandomhm14:26
sshnaidm|ruckbogdando, it's part of toci_quickstart.sh14:26
zbrsshnaidm|ruck: can you please help me with https://review.opendev.org/#/c/674092/ -- do I miss something? -- a second opinnion would be useful before trying to push on infra.14:26
bogdandowell, may be that's just me doing it wrong then14:26
sshnaidm|ruckbogdando, maybe job failed before it's created14:27
chandankumarsshnaidm|ruck: panda|rover weshay https://review.opendev.org/#/c/673481/ needs +2 and +w14:27
bogdandosshnaidm|ruck, ykarel: one more https://review.rdoproject.org/r/#/c/22105/14:28
bogdandoI'm trying basically to trigger an upstream 007 multinode for a patch14:28
bogdandow/o that change zuul can't trace down all the parents14:28
sshnaidm|ruckzbr, not sure I understand purpose of this patch..14:29
jpenasshnaidm|ruck: checking14:29
sshnaidm|ruckchandankumar, well, firstly it needs review :D14:29
zbrsshnaidm|ruck: we need to fix that one as is blocking rdo changes, like https://review.rdoproject.org/r/#/c/21486/14:29
zbrmainly zuul roles upstream are *not* tested with rhel, many of them not even with centos. so there are bugs we encounter when we try to use them in rdo.14:30
sshnaidm|ruckzbr, I don't think we should install manually libselinux though, shouldn't it be in the image?14:31
zbrsshnaidm|ruck: another issue caused by lack of proper testing of ansible roles14:31
bogdandosshnaidm|ruck, ykarel: https://review.rdoproject.org/r/22106 and https://review.rdoproject.org/r/#/c/22107/ a few more :)14:31
bogdandonot sure if really needed but...14:32
sshnaidm|ruckzbr, and ssh build key role seems like not really place to do it14:32
ykarelack14:32
zbrsshnaidm|ruck: i have zero control over who is building images, we have multiple zuul instances,... cannot just hope people are dreaming about what needs to be installed or not.14:32
sshnaidm|ruckbogdando, cool, I'll look at it later14:32
sshnaidm|ruckbogdando, did you get it working locally?14:32
zbreach role should take care of its own requirements.14:33
bogdandosshnaidm|ruck: to the stage I can see a non idle CPU on my nodepool nodes :D14:33
zbrfor example libselinux is not really neeeded for 100% of ansible usages14:33
jpenasshnaidm|ruck: I don't see any experimental job defined for upstream's openstack-nova in review.rdo14:33
sshnaidm|ruckzbr, what do you mean? we can add/remove elements in DIB afaik14:33
bogdandoans some ansible logs failing a little bit later and later in zuul executor14:33
bogdandonot that I've managed to start 007 job tho14:33
zbralternative is to add dependency on a bootstrapping role. but still, it should be part of the role.14:33
sshnaidm|ruckjpena, I think I have them here: https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/zuul.d/projects.yaml#L119-L12314:34
jpenasshnaidm|ruck: hm, true. I misread14:34
* jpena needs caffeine14:34
zbrsshnaidm|ruck: images are build in multiple places, by multiple people.14:34
sshnaidm|ruckjpena, and job itself: https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/project-templates.yaml#L170-L17414:34
bogdandoand one more https://review.rdoproject.org/r/#/c/22108/ , prolly nice to have if using custom gerrit SSH keys != id_rsa for nodes ?..14:34
sshnaidm|ruckzbr, I don't think so14:34
bogdandothat's all yet on my contribution plate14:35
sshnaidm|ruckzbr, if we are talking about CI and OS images there14:35
sshnaidm|ruckzbr, not containers!14:35
zbrwhile I agree that we should do this in the image, i also think that we should also install it inside the role.14:35
zbrconfiguration management 101: bring the system to the desired state.14:35
sshnaidm|ruckbogdando, if it reports tripleo-ci tenant is not ready after a bootstrap, just need to wait about 10-15 mins, then just recheck in local gerrit14:36
bogdandosshnaidm|ruck: no, the tenant is fine, and the job gets queued14:36
bogdandojust not running well :D14:36
sshnaidm|ruckbogdando, ah, ok14:37
sshnaidm|ruckbogdando, logs from job? or from executor14:37
bogdandowill post some snippets14:37
bogdandoas now I'm at the most distant point of failure14:37
sshnaidm|ruckzbr, then maybe a new role? ssh build key should build key, libselinux there is completely off-topic14:38
zbri am trying to explain people that almost any ansible role should take care of its own dependencies, that is why it has the "dependencies" implemented. they are not random piecec of code that are supposed to run only when some magic stuff is happening, like when you already installed libselinux. If you need it to run you have two options: a) install it b) add a dependency on a role that does that already.14:39
sshnaidm|ruckjpena, maybe it's related to low priority? Although I thought it still should be queued there, just not running14:39
zbrotherwise: you have broken role.14:39
*** ykarel is now known as ykarel|afk14:40
sshnaidm|ruckzbr, libselinux is not dependency of any role, it's ansible dependency14:40
jpenasshnaidm|ruck: I think the issue is that the patch is already merged14:40
zbransible knows to skip including roles multiple times, so there is no performance impact if you include the same bootstrapping role 100 times.14:40
jpenayep, that's it14:40
sshnaidm|ruckjpena, oh, it doesn't work with merged? :(14:40
jpenathere's "require: opendev.org: open: True" in the pipeline definition14:40
zbrno is not ansible dependency, is a soft-dependency.14:40
bogdandosshnaidm|ruck: http://paste.openstack.org/show/SROCnzcsBNccmvfeKUJA/14:41
sshnaidm|ruckjpena, I see.. that's sad14:41
zbralso is a soft-dependency for the managed role, even worse.14:41
bogdandosearched in executor by failed=1|FATAL|FAIL|UNREA|Ansible command14:41
zbrs/role/host14:41
sshnaidm|ruckzbr, not sure you can run anything without this package on selinux enabled systems14:41
sshnaidm|ruckzbr, it's very annoying ansible dep :)14:42
zbrsshnaidm|ruck: yes you can run a LOT! mainly the module that needs is the most is "template" module.14:42
bogdandothe most interesting the job reports:14:43
bogdando             Build succeeded.14:43
bogdando14:43
bogdando             - tripleo-ci-centos-7-scenario007-multinode-oooq-container-dlrn-hash-tag http://localhost:8000/01/1001/1/check/tripleo-ci-centos-7-scenario007-multinode-oooq-container-dlrn-hash-tag/92fbf72/ : RETRY_LIMIT in 2m 24s (non-voting)14:43
zbrwithout it you may get ok. in fact you can use ansible even without having python on the target host, "raw" does work ok. I used it several times to boostrap python itself,14:43
bogdandoand executor always logs Ansible complete, result RESULT_NORMAL code 214:43
zbrsshnaidm|ruck: but I agree, is an annoying soft-dependency, probably the most important after python itself.14:44
bogdandoI want to nail it down, feel too much excited about zuul after reading https://model-engineers.com/en/company/references/success-stories/volvo-cars/14:44
bogdandoalso having a local clone of OS infra for dev purposes looks nice :)14:45
sshnaidm|ruckbogdando, I see "changed=4    unreachable=0    failed=1" but don't see failed task14:46
sshnaidm|ruckbogdando, seems like something wrong with keys, maybe14:46
bogdandowhich line is it?14:46
*** udesale has quit IRC14:46
sshnaidm|ruck18414:47
sshnaidm|ruckbogdando, failed on 'secondary' host14:47
*** udesale has joined #oooq14:47
sshnaidm|ruckbogdando, are you trying multinode?14:47
bogdandosshnaidm|ruck: yes14:47
bogdando, 00714:47
sshnaidm|ruckbogdando, I see.. I don't think I tried multinode recently, so might be surprises14:48
sshnaidm|ruckbogdando, on libvirt or rdo cloud tenant?14:48
bogdandolibvirt14:48
bogdandothere is also Add port to bridge on switch node failing (seems taking empty nodepool and switch _ips)14:49
sshnaidm|ruckbogdando, yeah, multinode is not tested well there14:49
bogdandowanted a real overcloud :)14:50
bogdandosshnaidm|ruck: http://paste.openstack.org/show/ghY2RKzHUKEYQmnNGvRO/14:52
sshnaidm|ruckbogdando, then ovb14:52
bogdandosigh14:52
bogdandoovb runs docker yet14:53
bogdandowanted podman14:53
bogdandobut anyway, I'll try14:53
sshnaidm|ruckbogdando, ovb rhel8 runs podman :)14:54
bogdandosshnaidm|ruck: perchance, any write-ups to start it quick and simple?14:54
sshnaidm|ruckbogdando, to start what?14:54
bogdandoovb in rdo cloud14:55
bogdandosomething with more specific instructions than https://github.com/rdo-infra/ansible-role-tripleo-ci-reproducer/blob/master/README.md14:55
sshnaidm|ruckbogdando, yes, just need to share images with you, need your tenant id14:55
bogdandosshnaidm|ruck: thanks, will post14:55
sshnaidm|ruckbogdando, for me it's much easier to run on rdo cloud, libvirt sucks14:55
*** aakarsh has joined #oooq14:58
*** bogdando has quit IRC15:09
*** tosky has quit IRC15:15
weshayzbr https://review.opendev.org/#/c/668851/ merged15:16
weshayzbr so .. scenario03 should work w/ rsyslog now15:17
weshaywait.. I'm getting my scenarios mixed up15:17
weshayzbr sorry.. https://review.opendev.org/#/c/668851/ is just scen001 / 00215:20
*** jfrancoa has quit IRC15:20
weshaywhich are working now :)15:20
zbryep, i knew about that one. i like changes that are removing stuff15:21
*** ykarel|afk is now known as ykarel|away15:25
* marios almost out15:25
weshaysshnaidm|ruck panda|rover I've udpated the cix board https://trello.com/b/j4IcIomh/production-chain-escalation15:26
*** ccamacho has quit IRC15:26
sshnaidm|ruckweshay, ack15:26
weshayscen003 / scen004 are on the board under "waiting on external deps"  Please don't let folks close these tickets until they are resolved15:27
weshaythey tend to like to just close shit15:27
weshaysshnaidm|ruck did the timeserver change fix vexx btw?15:27
*** marios is now known as marios|out15:31
*** ykarel|away has quit IRC15:35
*** holser has quit IRC15:36
*** brault has quit IRC15:40
sshnaidm|ruckweshay, it fixed time sync issues15:40
sshnaidm|ruckweshay, not it times out on containers start, I think it's too slow because registry is far away..15:41
sshnaidm|rucks/mot/now15:41
weshaysshnaidm|ruck ok..  thanks for the update.. I'll update the prod-chain council card15:41
*** saneax has quit IRC15:48
*** marios|out has quit IRC15:50
panda|roverupstream check queue is really clogged today ... :(15:51
sshnaidm|ruckweshay, panda|rover stein started promotion15:52
*** ykarel|away has joined #oooq15:53
sshnaidm|ruckweshay, panda|rover queens promoted..15:53
weshaysshnaidm|ruck woot.. keeping us out of trouble ++15:53
weshaysaw that15:53
sshnaidm|ruckweshay, panda|rover rocky is failing on 020, rerunning it again15:54
weshaysshnaidm|ruck how's master?15:54
weshayrocky < master15:54
sshnaidm|ruckweshay, panda|rover and master fails most likely because of https://bugs.launchpad.net/tripleo/+bug/184293315:54
openstackLaunchpad bug 1842933 in tripleo "OVB: Nova compute fails with error BadRequestException: 400: Client Error for url: http://192.168.24.3:6385/v1/nodes?fields=instance_id&associated=True, Field(s) "instance_id" are not valid" [Critical,Triaged]15:54
sshnaidm|ruckbecause rdo cloud is not stable it's difficult to say if we have a bug or not15:55
weshaysshnaidm|ruck aye.. thanks15:57
sshnaidm|ruckpanda|rover, I'm out for today, just keep etherpad updated, I'll look in Sunday16:00
*** sshnaidm|ruck is now known as sshnaidm|afk16:00
*** altlogbot_1 has quit IRC16:01
ykarel|awaysshnaidm|afk, panda|rover i noticed tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master-vexxhost running in periodic pipeline, is it intentionally running?16:01
*** altlogbot_2 has joined #oooq16:01
ykarel|awayi mean why not running periodic version of it16:02
*** irclogbot_1 has quit IRC16:02
ykarel|awayperiodic release file16:02
*** irclogbot_0 has joined #oooq16:03
panda|roverykarel|away: maybe that's the periodic version, just missing a name16:03
ykarel|awaypanda|rover, it's running with master release file16:04
ykarel|awayQUICKSTART_RELEASE=master16:04
ykarel|awayperiodic have QUICKSTART_RELEASE=promotion-testing-hash-master16:04
panda|roverykarel|away: I don't know the status there, usually we start with the check job as POC, running them peruiodically, then make the periodi version16:05
panda|roverwhen they start to get stable16:05
ykarel|awaypanda|rover, ack, but afair -vexxhost jobs are running from many days in check too16:05
sshnaidm|afkykarel|away, it's not for promotion, just for periodic runs16:05
ykarel|awaysshnaidm|afk, okk but good to run with same release files16:06
sshnaidm|afkykarel|away, running in check for simplicity16:06
ykarel|awaysshnaidm|afk, if it runs with same hash that would be help in ruling out the issue is due to rdo cloud16:06
sshnaidm|afkykarel|away, fair enough16:07
*** irclogbot_0 has quit IRC16:07
sshnaidm|afkykarel|away, will change it then16:07
*** irclogbot_0 has joined #oooq16:07
ykarel|awaysshnaidm|afk, ack16:07
chandankumarzbr: sshnaidm|afk openstack-ansible-deploy-aio_metal-debian-stable (2. attempt)16:18
chandankumarif a job failed, rechecked again, it will show attempts16:18
*** kopecmartin is now known as kopecmartin|off16:18
chandankumarweshay: we also need to start preparing for train release16:21
weshaychandankumar not quite yet16:21
weshaybut yes16:21
chandankumarone more ping for review https://review.opendev.org/#/c/673481/16:33
chandankumarsee ya tomorrow16:33
*** chandankumar is now known as raukadah16:33
zbrweshay: can you sf https://review.opendev.org/#/c/679618/ ? of so thanks.16:47
weshaydone16:49
*** jaosorior has quit IRC16:51
*** tesseract has quit IRC17:06
*** jpena is now known as jpena|off17:07
*** udesale has quit IRC17:18
*** jaosorior has joined #oooq17:22
*** brault has joined #oooq18:05
*** brault has quit IRC18:09
dtantsursshnaidm|afk, ykarel|away, talk to nova folks, they've been refactoring how nova talks to ironic18:27
dtantsuralthough this one may be an openstacksdk bug18:27
*** dtantsur is now known as dtantsur|afk18:36
*** ykarel|away has quit IRC18:56
*** Goneri has quit IRC20:44
*** aakarsh has quit IRC20:45
*** dsneddon has quit IRC20:46
*** Vorrtex has quit IRC20:53
-openstackstatus- NOTICE: Gerrit is being restarted to pick up configuration changes. Should be quick. Sorry for the interruption.21:12
*** dsneddon has joined #oooq21:17
*** dsneddon has quit IRC22:37
*** dsneddon has joined #oooq23:05
*** dsneddon has quit IRC23:10
*** dsneddon has joined #oooq23:40
*** dsneddon has quit IRC23:46

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!