Monday, 2018-11-26

*** tosky has quit IRC00:39
hubbot1FAILING CHECK JOBS on master: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-fedora-28-standalone, tripleo-ci-centos-7-scenario007-multinode-oooq-container, tripleo-ci-centos-7-scenario004-multinode-oooq-container, tripleo-ci-centos-7-containers-multinode, tripleo- (3 more messages)00:56
*** agopi|brb has quit IRC01:09
*** agopi|brb has joined #oooq01:10
hubbot1FAILING CHECK JOBS on master: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-fedora-28-standalone, tripleo-ci-centos-7-scenario007-multinode-oooq-container, tripleo-ci-centos-7-scenario004-multinode-oooq-container, tripleo-ci-centos-7-containers-multinode, tripleo- (3 more messages)02:56
*** udesale has joined #oooq03:45
*** sshnaidm is now known as sshnaidm|afk04:07
*** ykarel has joined #oooq04:23
*** chkumar has joined #oooq04:34
*** ykarel has quit IRC04:42
hubbot1FAILING CHECK JOBS on master: tripleo-ci-centos-7-scenario010-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-fedora-28-standalone, tripleo-ci-centos-7-scenario007-multinode-oooq- (3 more messages)04:56
*** ykarel has joined #oooq05:02
*** chkumar is now known as chkumar|ruck05:27
*** chkumar has joined #oooq05:44
*** chkumar|ruck has quit IRC05:47
*** chkumar has quit IRC05:48
*** ratailor has joined #oooq05:59
*** chkumar246 has joined #oooq06:03
*** apetrich has joined #oooq06:06
*** chkumar246 is now known as chkumar|ruck06:44
hubbot1FAILING CHECK JOBS on master: tripleo-ci-centos-7-scenario010-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-fedora-28-standalone, tripleo-ci-centos-7-scenario007-multinode-oooq- (3 more messages)06:56
*** jbadiapa has joined #oooq06:58
*** jfrancoa has joined #oooq07:00
*** quiquell|off is now known as quiquell07:11
*** skramaja has joined #oooq07:25
quiquellmarios: I don't wnat to workflow mystuff here https://review.openstack.org/#/c/619468/07:26
*** ykarel is now known as ykarel|lunch07:26
quiquellI think itÂ's bad practice07:27
quiquellsshnaidm|afk: Can yuo workflow this https://review.openstack.org/#/c/619468/  ?07:29
mariosquiquell: checking07:29
mariosquiquell: oh07:30
mariosquiquell: yeah but its early still07:30
mariossure we can get a vote before lunch :) ?07:30
mariosquiquell: i already voted there07:30
quiquellmarios: Yep let 's wait for Sagi to workflow this, I think is the early next one07:30
quiquellwith power07:30
mariosyeah IMO avoid the +2a or even +2 your own patch unless emergeycny07:31
mariosemergency07:31
quiquellthis is not emergency is just feature07:31
quiquell§'just'07:31
mariosright and not controversial so we should be able to merge it today if we offer enough blood sacrifice for zuul07:31
*** jtomasek has joined #oooq07:47
*** ccamacho has quit IRC08:01
*** ccamacho has joined #oooq08:01
*** gkadam has joined #oooq08:07
*** saneax has joined #oooq08:10
*** ykarel|lunch is now known as ykarel08:14
*** amoralej|off is now known as amoralej08:31
*** chem has joined #oooq08:43
*** tosky has joined #oooq08:46
arxcruzsshnaidm|afk: chkumar|ruck please take a look at https://review.rdoproject.org/r/#/c/17437/08:53
hubbot1FAILING CHECK JOBS on master: tripleo-ci-centos-7-scenario010-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-fedora-28-standalone, tripleo-ci-centos-7-scenario007-multinode-oooq- (3 more messages)08:57
chkumar|ruckarxcruz, hello09:01
chkumar|ruckarxcruz, https://review.rdoproject.org/r/#/c/17437/ please send a dummy patch like this https://review.rdoproject.org/r/17339 and call this new job there so that we can see it in action09:02
arxcruzack09:03
*** holser_ has joined #oooq09:04
chkumar|ruckarxcruz, what about running cinder/keystone/neutron/horizontempest plugins tests also? then it have full coverage09:05
*** jschluet has joined #oooq09:19
chkumar|ruckssbarnea|bkp2, quiquell Is someone looking into f28 standalone post_failure issue http://logs.openstack.org/92/619492/2/check/tripleo-ci-fedora-28-standalone/0f91b7f/job-output.txt.gz ?09:23
quiquellchkumar|ruck: not yet, could be related to python309:24
*** udesale has quit IRC09:25
*** jschlueter has quit IRC09:25
*** kopecmartin|off is now known as kopecmartin09:25
*** udesale has joined #oooq09:26
*** chkumar|ruck has quit IRC09:29
*** chkumar246 has joined #oooq09:31
*** chkumar246 is now known as chkumar|ruck09:31
*** jaosorior has joined #oooq09:38
*** derekh has joined #oooq09:38
quiquellssbarnea|bkp2: this is good now https://review.openstack.org/#/c/619518/09:54
quiquellmarios: ^ documentation on pre-commit09:54
mariosquiquell: ack.09:56
*** bogdando has joined #oooq10:02
*** sshnaidm|afk is now known as sshnaidm10:07
quiquellsshnaidm: good morning, added a tox -e zuul to start zuul it also checks API to wait until it really starts10:08
quiquellsshnaidm: Also can you workflow this https://review.openstack.org/#/c/619468/ for standalone scenarios ?10:09
sshnaidmquiquell, do you want to override docker/podman?10:10
sshnaidmquiquell, why not to do it in featureset?10:10
quiquellsshnaidm: we need to use docker at scenarios10:11
quiquellsshnaidm: but we don't want to replicate featureset05210:11
sshnaidmquiquell, why not replicate it?10:12
sshnaidmquiquell, I'm just afraid we start overuse this overriding, it was supposed to be only for tempest tests10:12
quiquellsshnaidm: to reduce redudancy, but panda|rover say it not a good idea reuse fs05210:12
quiquellsshnaidm: yep, panda|rover say so, maybe we don't want10:12
sshnaidmquiquell, well, if it's only replicating one featureset I think it's fine..10:13
quiquellsshnaidm: well, we are suppose to use podman in the future, so this will be deleted10:13
quiquellsshnaidm: so maybe we can create anodther featureset for scenarios  ?10:13
quiquellmarios: ^ new featureset for scenarios10:14
sshnaidmquiquell, we did it for containerized scenarios, from 004 to 016 for example10:15
quiquellsshnaidm: ack, then we do the same, looks like it was a pin in the past to reuse featureset you didn't know what you were executing10:15
quiquellsshnaidm: we are also override the environment file for standalone, maybe we don't want that either10:16
quiquellsshnaidm: and we just use featureset_override for tempest stuff10:16
quiquellsshnaidm: we can comment on the scrum and we decide10:16
quiquellpanda|rover: ^10:16
sshnaidmquiquell, yeah, I think it's good point to discuss it in scrum today10:16
quiquellsshnaidm: ok let's not workflow any of this yet10:17
chkumar|rucksshnaidm, Hello10:23
chkumar|rucksshnaidm, regarding this bug https://bugs.launchpad.net/tripleo/+bug/180509410:23
openstackLaunchpad bug 1805094 in tripleo "[master][rocky] No kolla logs getting collected in periodic-tripleo-centos-7-master-containers-build " [Critical,Triaged] - Assigned to chandan kumar (chkumar246)10:23
chkumar|rucksshnaidm, https://review.rdoproject.org/r/#/c/17204/1/playbooks/tripleo-ci-periodic-base/containers-build.yaml is not working10:24
chkumar|rucksshnaidm, can we do something here, always copy the logs whether the job failed or passed10:25
chkumar|ruck?10:25
chkumar|rucklet me put a patch10:25
mariosquiquell: why no i prefer to use overrides10:27
mariosquiquell: why do we want 10 new fs10:27
mariosto change 2 things10:27
*** skramaja_ has joined #oooq10:29
quiquellmarios: Looks like it was like that before10:30
quiquellmarios: At the end we are tripleo CI and tripleo quickstart10:30
quiquellmarios: If someone want to exercise the job without ci, featureset is the place10:30
*** skramaja has quit IRC10:30
quiquellmarios: I mean exercise not the job the scenario10:30
quiquellmarios: I mean running quickstart for standalone scenario001 for example10:30
panda|roverfeaturesets were implemented for a variety of reasons, and they demonstrated to be an excellent way to protect us from CI misuse. THey are the ultimate source of truth to understand what a job is doing, a fixed set of feature ina combination taht we support10:31
quiquellmarios: You just pass @featuresetfoobar.yaml to quickstart and that's it10:31
quiquellpanda|rover: sshnaidm agree and I think I agree too10:31
quiquellpanda|rover: maybe we can fix redudancy with other mechanism10:31
quiquellpanda|rover: we are not workflowing overrides until scrum meeting10:32
panda|roverone of the main reason was when we translated tripleo.sh jobs into quickstart, I spend 2 months looking at the logic in the bash script to understand exaclty what the HA job was doing10:32
marios12:31 < panda|rover> featuresets were implemented for a variety of reasons, and they demonstrated to be an excellent way to protect us  from CI misuse. THey are the ultimate source of truth to understand what a job is doing, a fixed set of feature  ina combination taht we support10:32
mariosi don't see how that conflicts with what is proposed ^10:32
quiquellmarios: with override you need to know that part of the stuff is in the job10:33
mariosthe featureset for the job scenario-standalon1 is 52, plus these two overrides10:33
panda|roverwith this 1 featureset maps to 4 jobs10:33
quiquellmarios: and also how the override works10:33
quiquellmarios: folks have to be able to run standalone with scenario001 without a clue on CI10:33
mariosquiquell:yeah i accept the ci/quickstart distinction i.e. using it outside of ci. i guess ceph is one example10:33
panda|roveryou have to look at the zuul configuration and the playbook code to undestand what are you runnning on your job10:34
marioscos gfidente was asking me about kicking those jobs in ceph-ansible10:34
quiquellmarios: For me at least is what make me clear the panda|rover's point10:34
mariospanda|rover: that is the same currenlty10:34
mariospanda|rover: you have to look at a featureset 052 adn then in tht there is an environment file multinode-containers whatever its called10:34
quiquellpanda|rover: still maybe is not bad to have a meachanism to reduce redundacy within tripleo-quickstart10:34
mariospanda|rover: i mean even without featureset override10:34
marios12:34 < panda|rover> you have to look at the zuul configuration and the playbook code to undestand what are you runnning on your job10:35
mariosthis still holds ^10:35
mariospanda|rover: and i don't see what the environment file has to do with featureset override. we still have to specify that env file. your just sayingwe need a new featureuset for that that10:36
panda|roveras far as I know the multinode env files are fot the environemnt in wchich we run the featureset, they don't alter the set of features we want to test10:36
mariospanda|rover: right they specify services, again same in both cases10:36
panda|roverthe environment for standalone is a n argument to openstack overcloud deploy roght ?10:37
mariospanda|rover: ie. right now https://github.com/openstack/tripleo-heat-templates/blob/master/ci/environments/scenario001-multinode-containers.yaml10:37
mariosthis is used ^10:37
mariosto specify services10:37
panda|roveryes, these are arguments to pass to the overcloud deploy10:38
mariosin our job, we are defining another env file to be used which also defined just services e.g. https://review.openstack.org/619504 for scenario 410:38
*** skramaja_ is now known as skramaja10:38
marioswell, scrum in like 2.3 hours so we can shout at each other on camera in a bit :)10:39
panda|rovergive me your address, I'll send you a Howler10:40
panda|roverthese types of environments are values for variables in different featureset files,  what we did until now was that fsx, has scenario: scenarioY so featureset and scenarios are tied10:43
panda|roverwith zull we have less need for the featuresets10:44
panda|roverand we can discuss removeing or altering them10:44
panda|roverbut the idea of having a single file that contains all the configurations for a specific job should still remains, that's what we realized was something to protect in the past10:45
panda|roveryou look at a single file and understand at a glance and without lookign at anything else, what are the switches activated in a particlar job10:45
panda|roveralso as a warning for the developers :"you can test whatever combination you want, but know that in CI we are currently testing ONLY this sets of combination, if you do something different and have trouble we won't be able to provide as much help"10:46
mariospanda|rover: but you cant do that currently is my point10:46
mariosall of this holds10:46
mariospanda|rover: currently you need to check the featureset and then also https://github.com/openstack/tripleo-heat-templates/blob/master/ci/environments/scenario001-multinode-containers.yaml for example10:47
mariosthat remains the same just different file...10:47
marios?10:47
panda|rovermarios: maybe I misunderstood your proposal, what I understand right now, is that fs052 can map to four standalone scenarios10:47
mariospanda|rover: right. possibly all of them10:47
mariospanda|rover: so its like 10 featuresets10:47
mariospanda|rover: where we override the same thing just 2 lines10:47
marioseven for 1 / 4 with ceph we are using the same fs5210:48
mariosie. we didn't need some new thing to warrant a new fs10:48
panda|roverwe know for th stat of the design that we needed to replicat all the featuresets even for a single different value, because it maps to a different job10:49
panda|roverstart*10:49
panda|roverif we are not doing this, then there's is something that take precedence over the featuresets10:49
panda|roverand that's what we didn't want10:49
panda|roverfeatureset has the last word10:49
sshnaidmmarios, panda|rover, quiquell one of questions is - do we need to run podman or docker on queens/rocky/stein jobs?10:49
panda|roversshnaidm: I don't think so, podman is only for RHEL8 and so master10:50
quiquellsshnaidm: scenarios are not prepared for podman, queens, rocky or stein10:50
panda|roverI don't think thery're back porting podman to RHEL710:50
sshnaidmpanda|rover, so it will be supported from stein, right?10:51
panda|roversshnaidm: I think so10:51
sshnaidmthen we'll need to run both docker and podman jobs, docker for previous branches10:51
sshnaidmand this is better to do with featuresets, not with overrides..10:51
sshnaidmlike we did with non-containerized and containerized jobs10:52
quiquellsshnaidm: so -podman -docker jobs10:53
panda|roverto me the feature we want to test is "container", docker or podman is an inmplementation, and should be set depending on release, but we always ahad difficult in spearating the switches in featuresets from their configurations10:53
quiquellpanda|rover, sshnaidm: so if we have fs per scenario docker have to be podman from steain and beyond10:53
chkumar|ruckpanda|rover, sshnaidm from stein we are switching to podman and there is a var for the same undercloud_container_cli which we have used in validate-tempest role10:53
quiquellthat's it ?10:53
sshnaidmpanda|rover, you can also say that feature is scenario and container is implementation10:54
chkumar|ruckif this var is not there it assumes we have used docker10:54
panda|roversshnaidm: yeah, because when we move forward, what previously was a feature to test, it's now the default, and starts to make less sense to have the same default10:55
sshnaidmjust think about configuring periodic and patch jobs for next branches, we'll need to put "override" in all job definitions there to run podman10:55
panda|roveruh, taht did make more sense in my mind10:55
panda|roveryeah agree, that's also one of the reason why in the featureset distribution etherpad I ask to not use hole in the featureset numbers10:56
panda|roverwe need to leave the featureset for the older jobs alone10:56
panda|roverbecause they represent older job s10:56
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-scenario009-multinode-oooq, tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates, tripleo- (2 more messages)10:57
panda|rovermarios: beside all this, I'm really so sorry to not have caught this at the start of the sprint10:57
panda|rovermarios: now it feels like you're alsmost there and we start making complaints10:58
*** udesale has quit IRC10:58
panda|roverit sucks10:59
quiquellpanda|rover: let's just make it right, even if it's not very sprint friendly11:00
panda|roverit's not even team member friendly, the sprint is not the important thing, the thing is the satisfaction to have something completed at the end of a short period11:00
panda|roverthat's why I always push to plan US to be completed on a single sprint, mentally you close tabs frm the browser on you brain and free memory11:01
panda|roverthis is all the matters for the "sprint" not the sprint itself.11:02
quiquellmarios: reproduce the linting issue11:18
quiquellmarios: do you want a tmate session to try to fix it ?11:22
quiquellmarios: got it11:25
quiquellmarios: problem is that we are running ansible-lint at venv so we don't see system wide packages11:25
quiquell:-/11:25
quiquellOr this is what I think11:25
quiquellwe have to use ansible-lint from RPM not from pip11:27
quiquellhumm not working either11:31
mariospanda|rover: lets talk more on scrum if the consensus is to get new fs we can do it its one more easy review (copy/paste) per job11:37
mariosquiquell: sure, sec11:37
panda|roverchkumar|ruck: how's going ?11:40
*** holser_ is now known as holser|lunch11:41
*** rfolco has joined #oooq11:54
chkumar|ruckpanda|rover, on friday, we have promotion for master and rocky11:55
chkumar|ruckpanda|rover, today container build master failed due to container selinux issue11:55
chkumar|ruckpanda|rover, I am waiting for next run and rest is ok11:55
chkumar|ruckpanda|rover, oh pike got promoted also11:57
chkumar|ruckpanda|rover, please have a look at this bug https://bugs.launchpad.net/tripleo/+bug/1805102/11:57
openstackLaunchpad bug 1805102 in tripleo "[master][rocky][fs02] ERROR! Unexpected Exception, this is probably a bug: No module named tripleo_common in upload job while converting image" [Critical,Triaged]11:57
chkumar|ruckpanda|rover, currently I am dealing with this one https://bugs.launchpad.net/tripleo/+bug/180509411:57
openstackLaunchpad bug 1805094 in tripleo "[master][rocky] No kolla logs getting collected in periodic-tripleo-centos-7-master-containers-build " [Critical,Triaged] - Assigned to chandan kumar (chkumar246)11:57
chkumar|ruckpanda|rover, on rocky side fs01/02 blocked on this https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-rocky-upload/9d18325/logs/undercloud/var/log/containers/nova/nova-compute.log.txt.gz?level=ERROR#_2018-11-25_13_54_39_58512:02
panda|roverpromotion bonanza!12:02
chkumar|ruckpanda|rover, I have no idea what to do with this?12:02
chkumar|ruckpanda|rover, vexhost fs01 rocky/master is passing in check queue12:02
*** hubbot1 has quit IRC12:02
panda|roverchkumar|ruck: do we have a bug for that ? I don't think there's much we can do other thanshow it to some nova guy12:04
chkumar|ruckpanda|rover, https://bugs.launchpad.net/tripleo/+bug/180158712:04
openstackLaunchpad bug 1801587 in tripleo "[master/Rocky]Fs035 job fails in promotion becasue of heat stack timeout" [Critical,Triaged]12:04
*** hubbot1 has joined #oooq12:05
panda|rovermmmhh ...12:06
*** ratailor has quit IRC12:10
quiquellmarios: ansible-pacemaker is an RDO thing, so we build it, I think the patch is just wrong12:15
quiquellmarios: as this is not default for ansible12:15
quiquellmarios: going to change the distgit12:16
quiquellmarios: https://github.com/rdo-packages/ansible-pacemaker-distgit/blob/rpm-master/ansible-pacemaker.spec#L4812:16
quiquellmarios: defaults ar https://docs.ansible.com/ansible/2.7/dev_guide/developing_locally.html12:18
chkumar|ruckpanda|rover, did we get a chance to look at rdo phase 1 master jobs?12:21
panda|roverchkumar|ruck: from friday evening to now ?12:22
panda|roverchkumar|ruck: looks like it worked ..12:23
panda|roverchkumar|ruck: https://ci.centos.org/view/rdo/view/promotion-pipeline/job/rdo_trunk-promote-master-current-tripleo/12:23
chkumar|ruckpanda|rover, yes12:23
chkumar|ruckpanda|rover, I think we need to remove this job https://ci.centos.org/view/rdo/view/promotion-pipeline/job/rdo_trunk-promote-ocata-current-tripleo/12:24
chkumar|ruckas we are removing ocata jobs already12:24
*** apetrich has quit IRC12:29
*** apetrich has joined #oooq12:41
marios quiquell lgtm nice on the distgit but still not fully clear on why now12:46
quiquellmarios: is clear now, looks like before standalone we run ansible using mistral12:46
quiquellmarios: and at mistral there is the module_path option that was set12:46
mariosquiquell: ah right so standalone is running ansible directoy12:46
mariosdirectly12:46
quiquellmarios: standalone just run directly ansible-playbook from python-tripleoclient12:46
mariosbypass mistral12:46
mariosthis is kinda 'violation' in the sense that everything else is using mistral12:47
mariosbut ok makes more sense12:47
quiquellmarios: yep, so we chagne distgit, we I think is not going to break anything12:47
quiquellmarios: or we cahnge python-tripleoclient (used also by mistral :-/)12:47
quiquellmarios: --module-path prepend stuff so is not going to override12:47
ykarelpanda|rover, can u check my comment https://review.openstack.org/#/c/618669/3..7/playbooks/tripleo-ci/post.yaml and confirm the issue12:54
*** holser|lunch is now known as holser_12:56
ykarelpanda|rover, i mean if run.yml fails before running collect logs, logs will be missed, i think for that u need to rename the scipt to something else(just after it's successully run in ovb) so it's not run again12:56
panda|roverpeople, still need review on https://review.openstack.org/618669   https://review.openstack.org/607288  https://review.openstack.org/61761712:57
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container @ https://review.openstack.org/56722412:57
*** rlandy has joined #oooq12:57
panda|roverykarel: it complicated12:58
panda|roverykarel: we would really need to split the logs collection in two parts12:58
panda|roverykarel: one to collect only overcloud nodes, the other to collect the rest12:58
panda|roverykarel: I understand wheat you're saying, if the run times out we can run in post so we can at least have undercloud logs12:59
*** dtantsur|afk is now known as dtantsur|mtg12:59
panda|roverwe could try to check if we have the artifacts colelct logs should produce13:02
rfolcoconnection is bad, let me check my wifi points13:03
chkumar|ruckpanda|rover, please have a look at #rhos-ops internal13:03
*** weshay_pto is now known as weshay13:05
mariosquiquell: and rfolco please can one each qe my scen1/413:13
marios?13:13
quiquellmarios: Better panda|rover or sshnaidm so we follow what they have in mind regarding that13:14
mariosquiquell: ack. i added myself https://tree.taiga.io/project/tripleo-ci-board/us/337?milestone=206481 please re-assign if you find someone to sell it to13:15
quiquellmarios: ack13:16
weshaypanda|rover, chkumar|ruck thanks for holding things down13:18
panda|roverweshay: most of the time, chkumar|ruck already did everything before I could even wake up.13:19
weshayI see the board says there was a promotion, however the hash looks old to me13:19
weshaypanda|rover, quiquell 11/19 https://trunk.rdoproject.org/centos7-master/current-tripleo/delorean.repo13:19
weshayer.. sorry quiquell meant chkumar|ruck13:20
panda|roverit was too good to be true ...13:21
* chkumar|ruck is confused13:21
panda|roverchecking the promotion server13:23
*** trown|outtypewww is now known as trown13:27
panda|rovercloud image and container images are updated properly13:30
mariosrlandy: * repro/zuul3: failed like http://pastebin.test.redhat.com/672514 indeed @ toci_gate_test (as far as run-v3) but continue tomorrow.13:31
chkumar|ruckpanda|rover, each of the logs http://38.145.34.55/master.log-20181122 at the end does print success? on successful promotion13:32
weshaypanda|rover, w/ 3ed8ac0e93367a02ad53d9fa93467057724b6621_fd8eb74b13:32
chkumar|ruckweshay, I have not looked at the promotion server13:32
weshaypanda|rover, I don't see that hash here https://trunk.rdoproject.org/centos7-master/report.html13:33
*** gouthamr has quit IRC13:34
panda|roverthese are the promoted images https://images.rdoproject.org/master/rdo_trunk/618d3ab83cd319e03fac86c1d6de510ef4a5134b_be9e0d5c/13:35
panda|roverthis 618d3ab83cd319e03fac86c1d6de510ef4a5134b_be9e0d5c is the promotion hash13:35
panda|roverI don't see any exception in the logs, so I have to assume the call to DLRN api to promote this hash succeded13:35
panda|roverbut there was no change in the repo13:35
weshaypanda|rover, maybe the infra guys did something, because we should not have seen a promote in http://rhos-release.virt.bos.redhat.com:3030/rhosp if the dlrn hash did not update13:39
weshaypanda|rover, oh ya..13:40
*** gouthamr has joined #oooq13:40
weshayhttp://rhos-release.virt.bos.redhat.com:3030/rhosp13:41
weshaypanda|rover, check it out.. master moved back to yellow13:41
weshayso I think something changed13:41
weshayit was green 2days last night iirc13:41
panda|roverweshay: asking int prodinfra channel13:42
chkumar|ruckweshay, panda|rover is it something problem with master only? as rocky looks good from this dashboard http://rhos-release.virt.bos.redhat.com:3030/rhosp13:42
panda|roverchkumar|ruck: to be really sure we have to check if the hash is the same in containers, images, and repo13:43
chkumar|ruckadding a todo, will script it13:43
ykarelweshay, because master became consistent today13:44
*** hubbot1 has quit IRC13:49
ykarelpanda|rover, ack for the robust plan of collecting overcloud, undercloud logs seperately, for the current situation, changing the script name would not help?13:50
*** dmellado has quit IRC13:51
*** hubbot1 has joined #oooq13:51
panda|roverykarel: I don't fully understand the idea of different names, but it would be too implicit13:52
ykarelpanda|rover, i meant post is looking for a file collect_logs.sh to run, if the file not exist(we rename after successful run in run.yml) don't run13:53
*** dmellado has joined #oooq13:53
*** agopi|brb has quit IRC13:54
ykarelso in run.yml success case(collect_logs.sh is run and renamed to collect_logs_ovb.sh), and fail case(collect_logs.sh is not executed in run.yml and then executed in post to collect undercloud logs)13:55
bogdandoquiquell: hi, PTAL https://review.openstack.org/#/q/topic:base-container-reduction+(status:open+OR+status:merged)14:06
bogdandoyou were asking if we can remove puppet things from the base layers14:06
chkumar|ruckpanda|rover, weshay I am heading home now, see ya tomorrow :-)14:14
*** chkumar|ruck has quit IRC14:14
quiquellbogdando: ack give me a sec14:16
*** agopi|brb has joined #oooq14:17
*** agopi|brb is now known as agopi14:17
*** zul has joined #oooq14:25
*** skramaja has quit IRC14:38
*** udesale has joined #oooq14:48
*** gfidente has joined #oooq14:56
weshaymarios, I can meet quickly14:56
weshaymarios, scratch that14:57
weshaymarios, let's chat tomorrow14:57
*** quiquell is now known as quiquell|off14:58
mariosweshay: sure np14:58
mariosweshay: or im around for at least another hour14:58
marioswhatever just ping me14:58
panda|roverweshay: promotion is legit, chandan repeated the tests on a hash taht was crated on the 19, and it was promoted friday, that's why it seems older than the promotion. SO promoted on friday monday's hash15:01
mariosrlandy: want to talk ?15:05
rlandyyep - give me 5 mins to submit review15:05
mariosack np whenever you're ready15:05
panda|roverykarel: sorry was in meeting15:06
ykarelpanda|rover, ack15:07
ykarelpanda|rover, anything to discuss? i have to leave now15:07
panda|roverykarel: we can discuss it tomorrow, to me the rename is an hack15:08
ykarelpanda|rover, ack15:08
ykarelyup agree it's a hack15:08
panda|roverykarel: let's see if I can come up with the same idea but more explicit implementation15:08
panda|roverand conditionals15:08
ykarelpanda|rover, ack15:09
ykarelpanda|rover, and for the promotion master issue, master repo was not consistent from last couple of days, so testing same hash in periodic run and chandan's explicit run:- https://trunk-primary.rdoproject.org/api-centos-master-uc/api/civotes_detail.html?commit_hash=618d3ab83cd319e03fac86c1d6de510ef4a5134b&distro_hash=be9e0d5ccc3bd5a194c7b77587223e48b8469219&offset=015:09
ykarelmaster repo became consistent today15:10
*** ykarel is now known as ykarel|away15:10
ykarel|awaywith the merge of https://review.rdoproject.org/r/#/c/17410/15:10
* ykarel|away out15:11
panda|roveryep15:12
panda|roverok15:12
agopiping panda|rover15:14
*** chkumar|away has joined #oooq15:14
panda|roveragopi: pong15:16
*** ykarel|away has quit IRC15:16
agopihello panda|rover, https://review.rdoproject.org/zuul/builds?job_name=tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset053 hasn't triggered for openstack/browbeat in days and it has been failing for others anyways. I don't see any change to https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/zuul.d/tripleo.yaml#L222 as well. any idea what I should be doing?15:17
panda|roveragopi: you had some job triggered even after we disabled third party OVB jobs globally ?15:22
agopioh okay I hadn't known that the jobs were disabled globally. and yes looks like it wass triggered for tht, oooq and toci15:23
panda|roveragopi: ok, so because of the instability of the jobs, we decided to disable the triggers everywhere, you can always run ovb job by explicitly commenting check-rdo15:24
panda|roverbut until we have the jobs stable again, you may see some false negatives15:25
agopioh sweet thnks for letting me know panda|rover, that helps. That explains why the job has builds recently.15:26
agopipanda|rover++15:27
hubbot1agopi: panda|rover's karma is now 315:27
weshaypanda|rover, can we chat about ruck/rover and next sprint briefly?15:33
panda|roverweshay: ok15:41
*** ykarel|away has joined #oooq15:41
weshayk cool in my blue15:41
*** gfidente has quit IRC15:42
weshaypanda|rover, https://hub.docker.com/r/tripleomaster/centos-binary-keystone/tags/15:44
*** saneax has quit IRC15:47
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container @ https://review.openstack.org/56722415:51
*** ykarel|away has quit IRC15:55
*** udesale has quit IRC16:10
*** chkumar|away has quit IRC16:11
*** rlandy_ has joined #oooq16:25
*** gkadam has quit IRC16:27
*** rlandy has quit IRC16:27
*** dsneddon has joined #oooq16:41
*** rlandy_ is now known as rlandy16:49
rlandysshnaidm: what rechck will trigger rdo ovb jobs now?16:50
rlandyrecheck16:50
*** ykarel|away has joined #oooq16:53
*** quiquell|off has quit IRC16:59
*** kopecmartin is now known as kopecmartin|off17:00
*** bogdando has quit IRC17:11
*** ykarel|away has quit IRC17:15
sshnaidmrlandy, "check-rdo"17:28
*** agopi is now known as agopi|food17:29
rlandypanda|rover: how would we pass a drln_hash_tag_newest now?17:37
*** jfrancoa has quit IRC17:37
rlandywith the v3 workflow17:37
rlandyEXTRA_VARS is not longer included17:38
panda|rovervia featureset override maybe ? :)17:40
weshayarxcruz, rfolco fyi.. https://tree.taiga.io/project/tripleo-ci-board/task/304 https://tree.taiga.io/project/tripleo-ci-board/issue/32317:50
weshayare both close enough that we can have one task17:50
rfolcoweshay, ok will close 304 and point to 323 which has already something on it17:51
rfolcothanks for clarifying17:51
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container @ https://review.openstack.org/56722417:51
*** derekh has quit IRC17:52
rfolcoactually 304 is closed17:53
rfolcoso will comment on 32317:53
*** apetrich has quit IRC17:57
weshaypanda|rover, rlandy sshnaidm https://review.rdoproject.org/r/#/c/17437/17:58
*** apetrich has joined #oooq17:58
weshayarxcruz++17:59
hubbot1weshay: arxcruz's karma is now 917:59
arxcruzthis is confuse, so, folco already have the job in place, just need to add the feature_override right ?17:59
weshayarxcruz, work it w/ rfolco  :)18:02
weshayI'll just be looking at the end result :)18:02
arxcruzlol18:02
arxcruzat least is in portuguese18:02
rfolcoarxcruz, talk to me18:03
rfolcoo/18:03
*** agopi|food is now known as agopi18:10
sshnaidmssbarnea|bkp2, do you set -1 just randomly? :) like https://review.openstack.org/#/c/614633/ and https://review.openstack.org/#/c/565215/18:18
sshnaidmssbarnea|bkp2, or it's a bot18:19
weshayarxcruz, kopecmartin|off need's review https://review.openstack.org/#/c/509728/34/roles/validate-tempest/templates/cleanup-network.sh.j218:28
arxcruzwow...18:30
arxcruzchecking18:30
*** holser_ has quit IRC18:38
*** amoralej is now known as amoralej|off18:41
*** sshnaidm is now known as sshnaidm|afk18:48
weshaypanda|rover, did someone change the keys on the promotion server?18:58
weshayI don't have access atm18:58
*** brault has quit IRC19:01
*** brault has joined #oooq19:04
sshnaidm|afkweshay, you can use your user19:46
sshnaidm|afkweshay, we blocked centos user from login via ssh19:46
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container @ https://review.openstack.org/56722419:51
weshaysshnaidm|afk, ah k20:28
weshayrlandy, need any help w/ anything?20:44
rlandyweshay: there are a few design decisions we need to make20:45
rlandyweshay: I plan to bring them to the community meeting tomorrow20:45
weshayk.. rlandy would you like reviews or discussion?20:45
weshayah k20:45
rlandyweshay: there are two review sets to look at ...20:46
weshayrlandy, I'll review your stuff now20:46
rlandyhttps://review.openstack.org/#/c/61699320:46
rlandyand20:46
rlandyhttps://review.openstack.org/#/c/61865420:46
rlandyweshay" ^^ those two and all the patches below them20:46
rlandyweshay; per marios's suggestion, I am moving the t-q-e patches to a new repo20:47
rlandyand not editing nofepool-setup20:47
rlandynodepool-setup20:47
weshayhrm.. k20:47
rlandythat way we can merge some of these changes without the huge mess20:47
rlandyand the old reproducer will stay working20:48
weshaya new heredoc20:48
rlandyhowever the main work is those two patches20:48
weshayin commit message of https://review.openstack.org/#/c/61699320:48
weshaynot sure what you mean20:48
rlandytha messge is just so we get the reproducer out quickly20:48
rlandyI can remove that depnends-on20:49
rlandyit just edits out the running section20:49
rlandyso that we get results quickly20:49
rlandyI don't plan to merge that whole set of reviews20:49
rlandyat least not until we go through the design discussion tomorrow20:49
rlandythere is a diff now that we run ansible playbooks20:50
rlandynot straight shell scripts20:50
rlandyzuul is slow today though20:50
weshayk20:52
rlandyweshay: you can start by looking at nodepool setup work20:52
* weshay rearranges some mtgs20:52
weshayrlandy, you have a DNM test review I can follow and try?20:52
rlandyweshay: ack ... you can try this ...20:53
rlandyyou can get the reproducer and inventory from any job run with https://review.openstack.org/#/c/61699320:53
rlandythen patch the reproducer git fetch https://git.openstack.org/openstack/tripleo-quickstart-extras refs/changes/93/616993/25 && git checkout FETCH_HEAD20:54
rlandyjust after t-q nad t-q-e are clones20:54
rlandyjust after t-q nad t-q-e are cloned20:54
rlandyand run20:55
rlandyyou should get as far as running toci-gate_test20:55
rlandyto run-toci-gate-test20:55
rlandyyou will need patch https://review.openstack.org/#/c/61865420:55
weshayoff20:56
weshayoof20:56
rlandywhich you can apply and again20:56
weshaymaybe we need an etherpad again20:56
* weshay tries20:56
rlandyweshay: ok20:56
weshayhttp://logs.openstack.org/93/616993/25/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/1dfceb8/logs/reproducer-quickstart/20:58
rlandyrfolco: if you have time ^^20:58
rlandyweshay: yep - you need the reproducer script and the inventory20:59
rlandythat is all20:59
rfolcorlandy, reproduce it on ovb by getting the script and inventory files ?20:59
rlandyrfolco: we would need an ovb job run21:00
rfolcorlandy, so where this should be tested ?21:01
rfolcorlandy, libvirt ?21:01
rlandyrfolco: you can use rdocloud with multinode/singlenode/standalone21:01
rlandyor libvirt21:01
rlandyor ovb21:01
rlandyI will kick an ovb run21:01
rlandyit's juts not kicked by default21:01
rfolcorlandy, standalone would help ?21:02
rlandyrfolco: sure ...21:03
rfolcorlandy, ok will test standalone reproducer on rdo cloud21:03
rfolcothanks rlandy21:03
rlandyyou will need this in the reproducer file after you t-qe- is cloned ...21:03
rlandycd tripleo-quickstart-extras21:03
rlandygit fetch https://git.openstack.org/openstack/tripleo-quickstart-extras refs/changes/93/616993/25 && git checkout FETCH_HEAD21:03
rlandycd ..21:03
rlandysed -i "s#git+https://git.openstack.org/openstack/tripleo-quickstart-extras#file:///$WORKSPACE/tripleo-quickstart-extras#1" $WORKSPACE/tripleo-quickstart/quickstart-extras-requirements.txt21:03
rlandyrfolco: weshay: that is not the latest work but it should get  you to run the pres and see what's happening21:04
rlandyrfolco: weshay: you would need the latest patch plus https://review.openstack.org/#/c/618654/ to finish and run toci-gate-test21:06
rlandyrfolco: weshay: I will put together an etherpad for the design decisions and testing flow21:06
weshayk21:06
rfolcorlandy, please... I am still confused on how to gather the Frankenstein bits21:07
rlandyrfolco: weshay: if you want to bj - I can walk you though it quickly21:07
rlandymay save you some time in testing21:08
rfolcorlandy, I have 12 min before ubering my son from english class21:08
weshaylet me know if you guys join21:09
weshayif not.. I'll keep poking21:09
rlandyrfolco: k - let's run through this quickly - your bj?21:09
rfolcosure21:09
rlandyweshay: ^^ if you want to join21:09
*** apetrich has quit IRC21:22
*** agopi is now known as agopi|brb21:28
*** agopi|brb has quit IRC21:28
*** jtomasek has quit IRC21:39
*** apetrich has joined #oooq21:42
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container @ https://review.openstack.org/56722421:51
*** vinaykns has joined #oooq23:08
*** rlandy has quit IRC23:30
*** tosky has quit IRC23:32
*** tosky has joined #oooq23:32
hubbot1FAILING CHECK JOBS on master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/604298, master: tripleo-ci-fedora-28-standalone @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container @ https://review.openstack.org/56722423:51
*** vinaykns has quit IRC23:51

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!